Fault-Tolerant Closed-Loop Controller Using Online Fault Detection by Neural Networks

Alanis, Alma Y.; Alvarez, Jesus G.; Sanchez, Oscar D.; Hernandez, Hannia M.; Valdivia-G, Arturo

doi:10.3390/machines12120844

Open AccessArticle

Fault-Tolerant Closed-Loop Controller Using Online Fault Detection by Neural Networks

by

Alma Y. Alanis

,

Jesus G. Alvarez

,

Oscar D. Sanchez

^*,

Hannia M. Hernandez

and

Arturo Valdivia-G

University Center of Exact Sciences and Engineering, University of Guadalajara, Marcelino Garcia Barragan 1421, Guadalajara 44430, Mexico

^*

Author to whom correspondence should be addressed.

Machines 2024, 12(12), 844; https://doi.org/10.3390/machines12120844

Submission received: 17 October 2024 / Revised: 21 November 2024 / Accepted: 22 November 2024 / Published: 25 November 2024

(This article belongs to the Special Issue Computational Intelligence for Fault Detection and Classification)

Download

Browse Figures

Review Reports Versions Notes

Abstract

This paper presents an online model-free sensor fault-tolerant control scheme capable of tolerating the most common faults affecting an induction motor. This approach involves using neural networks for fault detection to provide the controller with sufficient information to counteract adverse consequences due to sensor faults, such as degradation in performance, reliability, and even failures in the control system. The proposed approach does not consider the knowledge of the nominal model of the system or when the fault may occur. Therefore, a high-order recurrent neural network trained online by the Extended Kalman Filter is used to obtain a mathematical model of the system. The obtained model is used to synthesize a discrete-time sliding mode control. Then, the fault-detection and -isolation stage is performed by independent neural networks, which have as input the signal from the current sensor and the position sensor, respectively. In this way, the neural classifiers continuously monitor the sensors, showing the ability to know the sensor status. The combination of controller and fault detection maintains the operation of the motor during the time of the fault occurrence, whether due to sensor disconnection, degradation, or connection failure. In fact, the MLP neural network achieves an accuracy between 95% and 99% and shows an AUC of 97% to 99%, and this neural network correctly classifies true positives with acceptable performance. The Recall value is high, between 97% and 99%, and the F1 score confirms a good performance. In contrast, the CNN shows a higher accuracy, between 96% and 99% in accuracy and 98% to 99% in AUC. In addition, its Recall and F1 reflect a better balance and capacity to handle complex data, demonstrating its superiority to MLP in fault classification. Therefore, neural networks are a promising approach in areas such as fault-tolerant control.

Keywords:

deep neural network; fault-tolerant control; fault detection and isolation; induction motor; data streams

1. Introduction

In recent years, there has been increasing interest in fault-tolerant control systems, motivated by the complexity of the systems to be managed and by the fact that the various components of the system can generate uncertainties, disturbances, or risks [1,2]. In general, a failure in a system can be defined as a deviation of a parameter from its acceptable value that can cause a reduction in, or loss of, the capability of a functional unit to perform a required function. In this way, fault tolerance is interpreted as the ability of a system to continue functioning despite the presence of faults. The presence of faults is inevitable in any real system, and this can cause impairments in the stability and performance of the system.

The operation of faulty systems mitigates system performance, but the degradation may be acceptable up to a certain level of confidence. Such approaches that are capable of providing system safety and reliability by mitigating the effects of system failures or breakdowns are called fault-tolerant control (FTC) systems [3,4]. Fault-tolerant control schemes are an integral part of all safety-critical systems for many of the systems with real-world applications such as engines. FTC consists of the following phases.

The primary stage is fault detection and isolation (FDI), which is to process the system’s input–output data to find faults. In addition to detecting the fault, FDI is responsible for isolating it from other possible faults present to facilitate precise and efficient intervention [5]. Then, an adjustment is made to the fault-tolerant control algorithm to compensate for the effects caused by the detected fault, using the information obtained during the detection and isolation phase. In this way, it is guaranteed that the system can continue to operate safely and efficiently, despite the presence of faults, maintaining its performance within acceptable margins [3].

FDI is indispensable for fault-tolerant control systems because it provides information about faults, which enables control capable of reducing or eliminating negative impacts on system performance. Therefore, FDI is an important field of research in all types of systems. Numerous studies have been dedicated to fault detection and isolation. However, most FDI schemes are model-based, using nominal, fault-free mathematical representations of the system.

Model-based schemes use the residual generated by the difference between plant sensor measurements and signals generated by a mathematical model. Therefore, if this residual is too large and exceeds a predefined threshold, then an alarm is triggered. In order to prevent false alarms brought on by signal noise or disruptions, this threshold was carefully chosen. An example of a model-based technique in the paper [6] addresses the problem of isolation, diagnosis, and fault-tolerant control in quadrotor UAV systems with uncertainties. A general dynamic model is proposed that considers uncertainties in the system state and input. The work uses an observer method to identify faults, followed by an adaptive observer that accurately estimates the magnitude of the faults. The estimate is then used by a fault-tolerant controller based on sliding modes to compensate for the faults and ensure that the system output follows the desired reference signals, even in the face of external disturbances and actuator failures. The effectiveness of the methodology is validated through simulations. Also, in the article [7], model-based fault detection and isolation are addressed. It focuses on distributed parameter systems (DPSs) modeled by parabolic partial differential equations (PDEs). For fault detection, a filter-based scheme is proposed that allows the detection of faults in actuators, sensors, and states in linear and non-linear systems based on Luenberger-type observers with filters to generate residuals to detect the fault. The paper [8] addresses the problem of fault detection and isolation (FDI) of sensors in networks of linear process systems with a loop structure. A dynamic model based on a two-input, single-output LTI state-space model is proposed that incorporates faults as additive linear terms. The fault isolation algorithm is able to detect faulty measurements in the network. Furthermore, a proportional-integral (PI)-based observer is developed to estimate the magnitude of the faults. The proposed method is validated in MATLAB/Simulink.

In model-free schemes, the system’s state variables are monitored by employing measured data without establishing explicit dependency laws. From these principles, an accurate model is obtained. The development of this type of approach is based on measurements of the input and output variables of the process. Data are collected from the plant, both under normal conditions and in the presence of faults [9].

In this work, a real-time fault diagnosis and isolation scheme is proposed using neural networks to provide information to control systems about the presence of faults. There are various classification techniques to solve fault-detection and -isolation problems in control systems, such as probabilistic methods and “black box” approaches [10,11], as well as artificial neural networks (ANNs), support vector machines (SVMs), and fuzzy inference [12]. A promising approach is using deep neural networks (DNNs) for fault classification and isolation. By employing currents, vibrations, or voltages as input to the model, DNNs have been used to detect faults in circuits or motors, improve maintenance and detect early faults in machinery gears [13,14,15].

In model-free adaptive control approaches, the controller is designed using observed input and output data of a plant. Assumptions like unmodeled dynamics or theoretical preconceptions regarding plant dynamics are eliminated because nominal plant models are not necessary [16]. Under this circumstance, controller design is considered a controller-parameter-identification problem.

For this purpose, artificial neural networks (ANNs) have gained relevance in identifying mathematical models accurately, allowing them to predict, simulate, emulate, and even design [17] control systems. These models identified by neural networks approximate nonlinear dynamics, along with perturbations, which are then used to synthesize conventional [18] controllers.

This work presents a fault-tolerant control scheme that fully utilizes neural networks for its implementation, from system identification to fault detection and isolation, using sliding mode control for discrete-time nonlinear systems. In general, works related to fault-tolerant control (FTC) are model-based. However, in this work, both the controller and the fault classifier are considered model-free, making it an attractive approach within artificial intelligence.

The proposed methodology for neural classification on faulty sensor data streams and fault-tolerant control is applied to an induction motor. The fault-tolerant control and fault detection and isolation schemes are implemented in a closed loop using deep neural network architectures without knowledge of a nominal motor model. The contributions of the work are highlighted below.

Sensor fault classification is performed in real time using two deep neural networks.
The classification is performed using neural classifiers without pre-processing the data.
The neural classifiers and the controller are tested in a closed loop in MATLAB/Simulink Version: 24.2.0.2790852 (R2024b) Update 2.
A real-world application is included in the fault-tolerant control (FTC) scheme.
The fault classification results between a multilayer perceptron (MLP) neural network and a convolutional neural network (CNN) are compared.
A high-order recurrent neural network (RHONN) was used to identify the model of an induction motor, and the Extended Kalman Filter (EKF) was used as a training algorithm to obtain an adequate model despite unexpected conditions such as disturbances or sensor failures.
A sliding mode control for discrete-time nonlinear systems was used to control the motor in the presence of faults.

The paper is organized as follows: first, a review of the analyzed system is included. Next, the proposed neural networks that will be used as fault classifiers are described. Then, the proposed scheme for fault detection and isolation is presented. Then, the RHONN for system identification is described, together with the sliding mode controller. Next, the results obtained are presented and discussed, and finally, conclusions and future work are presented.

2. Review of the Analyzed System

Induction motors (IMs) have diverse applications in industry and in real-world applications because they are reliable, low-cost, and efficient tools [4,19,20]. Electric motor applications subject them to long periods of work under electrical and mechanical stress conditions, which can cause failures or malfunctions, negatively affecting their stability and efficiency. For this reason, it is crucial to detect failures quickly and accurately to take corrective measures that preserve safety and performance and then prevent major failures [21,22]. There are three types of failures in induction motors: converter, machine, and sensor. Among them, the sensor has the greatest potential for failure, and since its function is to monitor the system’s state variables, it is crucial for control applications.

Control of induction motors (IMs) often relies on feedback signals such as rotor position and currents in stator coordinates

α

-

β

. The transformed voltage vector rotates with the power supply frequency, and its components are projected onto the

α

-

β

axes. However, faults caused in these sensors negatively affect system performance, which could lead to total system failure. Current sensors are usually the most prone to failure, but malfunctions can also occur in other speed or position sensors.

The state variables considered measurable are

θ

for the rotor position,

ω

for its speed,

i_{α}

and

i_{β}

for the alpha and beta currents, respectively, and

ϕ_{α}

and

ϕ_{β}

, representing the alpha and beta fluxes, respectively [23].

For this work, faults in current and position sensors are considered, and multilayer perceptron neural networks and convolutional neural networks perform fault detection. Then, system identification is carried out by a high-order recurrent neural network (RHONN), which is trained online; the obtained model synthesizes a discrete-time sliding-mode controller. The configuration used is shown in Figure 1. The proposed scheme is simulated and tested in MATLAB/Simulink Version: 24.2.0.2790852 (R2024b) Update 2, under various operating conditions.

3. Deep Neural Networks

Online sensor fault detection requires methods capable of analyzing large volumes of data in real time to achieve rapid and accurate detection. Common methods for sensor fault detection rely on observers, who rely on accurate mathematical models of the system being observed. However, in real-world applications, system parameters tend to vary over time. In addition, unknown perturbations can cause false alarms, making this approach ineffective [24].

There are FDI methods that have used neural networks, but they have shallow architectures, which limits their ability to learn complex non-linear relationships [11].

In contrast, deep neural networks (DNNs) have more complex architectures due to multiple layers of nonlinear operations, enabling them to capture complex functions through the training of those layers. [25]. In fault detection and isolation problems, DNNs can learn the characteristics of healthy or defective sensors, employing only the observed data from the sensors.

For FDI problems, deep neural networks have the ability to learn the characteristics of a healthy state or a fault state using only data observed over time by sensors. These characteristics allow the development of more reliable and effective fault diagnosis systems, overcoming the shortcomings of current methods for FDI.

Therefore, in this work, two neural networks are proposed to carry out online fault classification and isolation using the observed data from current sensors (

α

and

β

) and the position sensor in a fast and accurate manner. The proposed neural networks are the MLP neural network and the convolutional neural network, which are described below.

3.1. Multilayer Perceptron Networks

One of the widely recognized and used neural networks in various research fields is the MLP. It has applications such as in the problems published in [26,27], and it especially stands out for its efficiency and flexibility in classification tasks [27]. Its basic architecture generally consists of an input layer, one or more hidden layers, and an output layer in feed-forward mode. Dense layers comprise two or more hidden layers, each composed of several nodes, which are connected to the nodes of the adjacent layers through their corresponding weights.

The output of the network

y_{p}

is obtained by adding the m neurons of the hidden layer, each multiplied by the inputs

u_{n}

and the weights W, as shown in the following equation:

y_{p} = σ (\sum W_{n m} σ (\sum W_{n n} u_{n} + b_{1}) + b_{2})

(1)

where m is the number of neurons,

u_{n}

represents the input vector,

σ

is the activation function,

W_{n n}

corresponds to the weight matrix of the hidden layer,

W_{n m}

is the weight matrix of the second layer, and

b_{1}

and

b_{2}

are the bias values.

3.2. Convolutional Neural Network

Convolutional neural networks (CNNs) have proven to be effective and widely used tools in various applications such as object recognition and image classification due to their ability to extract complex features from visual data. In recent years, their use has also been extended to the field of fault detection and classification, where CNNs help to identify anomalous patterns and improve the accuracy of problem diagnosis [28,29,30,31,32].

Although CNN applications are focused on 2D images or where information can be represented in two-dimensional matrices, 1D convolution architectures have also been used. These architectures use one-dimensional kernels and filters on the input signal. However, there are not many works that use this architecture with online applications in classification tasks.

A CNN is built from three main layers: a filter bank layer, a nonlinearity layer, and a feature pooling layer [33].

The filter bank layer is responsible for applying various filters to the input data. The output of the layer is generated by convolving the weights of the neurons (also known as filters or kernels) with the input data, resulting in an activation map. The resulting activation maps are stacked to form an output volume [34].

The nonlinearity layer uses the Rectified Linear Unit (ReLU) nonlinear function to adjust the generated output. The ReLU function is expressed by the following equation:

R e L U = \{\begin{matrix} 0, & i f v < 0, \\ v, & i f v \geq 0 . \end{matrix}

(2)

In the pooling layer, the dimension of the input data is reduced. The most frequently applied methods are max pooling and mean pooling. Finally, in the output layer, the softmax function is used to maximize the probability of the output classes [34]. The equation of the Softmax function is expressed as follows:

o_{j} = \frac{e^{v_{j}}}{\sum_{j = 1}^{M} e^{v_{j}}},

(3)

where M represents the total number of output nodes, v is the output of the network before applying the softmax function, and o is the output of the network after applying the softmax function.

4. Neural Identifier

Figure 2 shows the methodology implemented in the identification of the nonlinear model with uncertainties, external disturbances, and sensor failures. In addition, the neural controller used to control the system in the presence of sensor failures is shown. The structure of a model identification system that integrates a Recurrent High-Order Neural Network (RHONN) and an Extended Kalman Filter (EKF). This scheme is used to estimate the state of a nonlinear dynamical system.

Let us consider an unknown nonlinear system that is subject to sensor failures. The complete system can be described as follows:

\bar{x} (k + 1) = G (\bar{x} (k), \bar{u} (k)) + Δ (k)

(4)

y (k) = C \bar{x} (k)

(5)

where

\bar{x} \in R^{n}

represents the state vector of the system,

\bar{u} \in R^{m}

is the control signal applied,

y \in R^{p}

is the output vector,

C \in R^{p \times m}

is a known output matrix,

G \in R^{n} \times R^{m} \to R^{n}

is a nonlinear function, and

Δ \in R^{n}

is the disturbance vector. Equation (4) can be rewritten in component-wise form:

{\bar{x}}_{i} (k + 1) = G_{i} (\bar{x} (k), \bar{u} (k)) + Δ_{i} (k) i = 1, 2, \dots, n

(6)

Sensor faults can be defined as

\bar{x} (k) = ψ_{i} (x_{i} (k), γ_{i} (k))

(7)

where

ψ_{i}

is an unknown nonlinear function that depends on the sensor signal and is considered unknown but bounded. This function reflects the loss of sensor efficiency due to external inputs not measurable by the system, such as biases, drifts, or loss of accuracy over time

k_{i}^{*}

.

x_{i} (k)

is assumed to be a measurable variable, and its measurement is denoted as

{\bar{x}}_{i} (k)

. The identification of the system is given in Equation (7) using a high-order recurrent neural network (RHONN).

In the context of sensor fault detection, the characteristics of the vectors

\bar{x}

representing the internal dynamics of the system (4) can result in two distinct outcomes: either “failed” or “healthy”. Consequently, this scenario can be viewed as a classification problem, characterized by

ψ_{i} (x_{i} (k), γ_{i} (k)) = \{\begin{matrix} f a i l u r e, ψ_{i} (*) \in Γ \\ n o n - f a i l u r e, ψ_{i} (*) \notin Γ \end{matrix}

(8)

where

Γ

represents the set of all possible failure modes. This set

Γ

is then considered partially known due to the difficulty of completely defining the nonlinear system.

If the complete state

x (k)

of the system is available and there is a data structure in the dynamics of (8), they can be recognized as features in a time series.

So, we can say that it is feasible to identify a “failure scenario” in the system by combining all input signals into a univariate time series.

4.1. Recurrent High-Order Neural Networks

The RHONN is useful for control tasks [35]. The ideal RHONN is

{\bar{x}}_{i} (k + 1) = w_{i}^{* T} z_{i} (\bar{x} (k), \bar{u} (k)) + ϵ_{z i} i = 1, 2, \dots, n

(9)

where

w^{*} i

is the vector of ideal weights and

z_{i} (\bar{x} (k), \bar{u} (k))

is a vector containing

L_{i}

high-order terms, as described by [36]. Also,

ϵ_{z i}

is the bounded approximation error that depends on the number of adjustable weights [36].

z_{i} (\bar{x} (k), \bar{u} (k)) = [\begin{matrix} z_{i_{1}} \\ z_{i_{2}} \\ ⋮ \\ z_{i_{L_{i}}} \end{matrix}] = [\begin{matrix} \prod_{j \in I_{1}} ξ_{i j}^{r_{i j}^{(1)}} \\ \prod_{j \in I_{2}} ξ_{i j}^{r_{i j}^{(2)}} \\ ⋮ \\ \prod_{j \in I_{L_{i}}} ξ_{i j}^{r_{i j}^{(L_{i})}} \end{matrix}]

(10)

with

r_{i j} (k)

being non-negative integers and

ξ_{i}

is defined as follows:

ξ_{i} = [\begin{matrix} ξ_{i_{1}} \\ ⋮ \\ ξ_{i_{n}} \\ ξ_{i_{n + 1}} \\ ⋮ \\ z_{i_{n + m}} \end{matrix}] = [\begin{matrix} S (x_{1}) \\ ⋮ \\ S (x_{n}) \\ {\bar{u}}_{1} \\ ⋮ \\ {\bar{u}}_{m} \end{matrix}]

(11)

in Equation (11),

\bar{u} = [{\bar{u}}_{1}, {\bar{u}}_{2}, \dots, {\bar{u}}_{m}]

is the input vector to the neural network, and

S (\cdot)

is:

S (ς) = μ_{i} t a n h (β_{i} ς)

(12)

where

ς

is a real value variable and

μ

,

β

are positive constants.

The ideal weight vector

w^{*}

is assumed to be minimized together with

| ϵ_{z i} |

on a compact set

Ω_{z i} \supset R^{L_{i}}

. For the purposes of analysis, the vector

w_{i}^{T}

is assumed to exist and is constant and unknown. Therefore, by defining

w_{i}

as an estimate of

w_{i}^{*}

, one can approximate the ideal RHONN as follows:

{\hat{x}}_{i} (k + 1) = w_{i}^{T} z_{i} (\bar{x} (k), \bar{u} (k)) + ϵ_{z i} i = 1, 2, \dots, n

(13)

Then, the estimation error of the weights is defined as

\begin{matrix} {\tilde{w}}_{i} (k) = [w_{i}^{*} (k) - w_{i} (k)] \end{matrix}

(14)

Then, from (9) and (13), the identification error is defined as

\begin{matrix} {\tilde{x}}_{i} (k) = [{\bar{x}}_{i} (k) - {\hat{x}}_{i} (k)] \end{matrix}

(15)

where

\hat{x}

corresponds to the output generated by the neural network and n represents the number of state variables.

4.2. RHONN Training Algorithm

The Extended Kalman Filter (EKF) algorithm is a widely used algorithm for RHONN weight optimization due to its fast convergence. The fundamental objective is to minimize the difference between the state variables of the system and those produced by the neural network. The EKF-based algorithm is described below:

\begin{matrix} w_{i} (k + 1) = w_{i} (k) + η_{i} K_{i} (k) {\tilde{x}}_{i} \\ K_{i} (k) = P_{i} (k) + H_{i} (k) {[R_{i} (k) + H_{i}^{T} (k) P_{i} (k) H_{i}]}^{- 1} \\ P_{i} (k + 1) = P_{i} (k) - K_{i} (k) H_{i}^{T} P_{i} (k) + Q_{i} (k) \end{matrix}

(16)

where

L_{i}

denotes the total number of weights in the neural network,

w_{i} \in R^{L_{i}}

is the vector containing these weights, and

P_{i} \in R^{L_{i} \times L_{i}}

is the covariance matrix associated with the estimation error of these weights. The learning rate is represented by

η_{i}

, while

K_{i} \in R^{L_{i} \times m}

is the Kalman gain matrix.

Q_{i} \in R^{L_{i} \times L_{i}}

is the noise covariance matrix related to the estimation of the weights, and

R_{i} \in R^{m \times m}

corresponds to the measurement noise covariance matrix. Finally,

H_{i} \in R^{L_{i} \times m}

is defined as follows:

H_{i} (k) = {[\begin{matrix} \frac{\partial {\bar{x}}_{i} (k)}{\partial w_{i} (k)} \end{matrix}]}^{T}

(17)

5. Fault-Tolerant Sliding Mode Neural Controller

This study employs the RHONN neural model to develop an adaptive control law, based on the discrete-time sliding mode technique, applied to unknown nonlinear systems affected by sensor failures. According to [37], a discrete-time nonlinear system is described as follows:

\bar{x} (k + 1) = f (x (k)) + B (k) \bar{u} (k)

(18)

where

\bar{x} \in R^{n}

, f is a nonlinear function,

B \in R^{n x m}

is the input matrix, and

\bar{u}

is the applied control law defined in Equation (23). Then,

u \in R^{m}

is the computed control law, defined as follows:

u (k) = \{\begin{matrix} u_{t} (k) & f o r & ∥ u_{t} (k) ∥ \leq u_{m a x} \\ u_{m a x} \frac{u_{t} (k)}{∥ u_{t} (k) ∥} & f o r & ∥ u_{t} (k) ∥ > u_{m a x} \end{matrix}

(19)

with

u_{t} (k) = u_{s}^{e q} (k) + u_{s n} (k)

(20)

where

u_{m a x}

is the control bound. Then, the equivalent control

u_{s}^{e q}

and the stabilizing term to asymptotically reach the sliding surface

u_{s n}

are defined as

\begin{matrix} u_{s}^{e q} (k) = - B^{- 1} (k) (f (k) - x^{d e s} (k + 1)) \\ u_{s n} (k) = - B^{- 1} (k) K_{s} S_{m o d} (k) \end{matrix}

(21)

where

K_{s}

is a Schur matrix,

x^{d e s}

is the desired time-varying trajectory for the state x,

B^{- 1}

is the generalized inverse of B, and the sliding surface

S_{m o d}

defined as the neural trajectory tracking error is described by:

S_{m o d} (k) = \hat{x} (k) - x^{d e s} (k)

(22)

It is important to note that the controller described in (23) has the ability to handle noise, uncertainties, unmodeled dynamics, and even measurement errors. For this reason, certain aspects of the control proposed in [37] have been considered to implement a fault-tolerant control scheme. First, it is assumed that faults occur in the sensors, which affect the state variables. As a result, faulty state variables replace the original measured variables. In the neural model, each

x_{i}

(i = 1, 2, \dots, n)

is replaced by

{\bar{x}}_{i}

(i = 1, 2, \dots, n)

, where

{\bar{x}}_{i}

corresponds to the sensor output, under computed normal or faulty conditions. With this fine-tuned neural model, the control law is synthesized as

\bar{u} (k) = \{\begin{matrix} u (\bar{x} (k)) for \bar{x} (k) healthy \\ u (\bar{x} (τ)) for \bar{x} (k) faulty \end{matrix}

(23)

where fault occurrence is on

t = τ

. This implies that the last value of the state variable is on hold when a fault that starts at instant

τ

is presented.

6. Neural-Network-Based Fault-Detection Strategy

The purpose of the FTC is to identify, correct, or reduce the impact of a fault. To this end, a controller capable of responding appropriately has been designed. In addition, an FDI scheme that works in real time and provides sufficient information to the controller is proposed. Figure 3 presents the proposed FTC scheme, which includes the induction motor, the current and position sensors, the controller, and the diagnostic process. With this configuration, the controller operates following a specific decision logic, oriented according to the fault detection and classification scheme, which allows for generating an FTC action when a fault is detected.

In industrial applications, sensors are the most susceptible to failure. However, traditional methods are affected in their efficiency by real-time processing requirements and the variable nature of the measured data. Therefore, new fault classification methods are needed that can extract valuable information from sensors, even in the presence of uncertainties, non-linear behaviors, and variability in the signals. In addition, these methods must be computationally efficient and achieve high accuracy in fault classification.

In this work, the problem of fault detection and diagnosis is addressed in two phases: a signal isolation phase and a fault classification phase. The methodology is detailed below.

For control tasks for induction motors, three sensors are usually used, two for current and one for position. In this way, fault detection and diagnosis can be approached as a multivariate time series classification problem.

In the literature, there are several methods for classifying multivariate time series, such as Dynamic Time Warping [38], whose purpose is to extract features from time series. The only drawback is that this approach presents a high computational cost when working with large data sets, which makes it unviable for real-time implementations.

For fault detection, each sensor is monitored individually by a neural fault classifier, as shown in Figure 4. The classification results are achieved by separating the multivariate time series into univariate series, and then the univariate series are supervised by a neural classifier. The main advantage of this approach is that the neural classifier can simultaneously differentiate between failures that occur in one or more sensors without much context.

Sliding window-based strategies have been used to solve the data stream mining problem, where more relevance is given to recent information compared to historical data. In this technique, old samples are iteratively replaced with newly observed samples. This strategy is effective for fault detection, as it allows identifying faults that can occur at any time with the least number of data possible.

In this work, we propose the use of this approach since the continuous update of the current time window facilitates online fault detection to adapt to changes in system behavior over time. Data collected in overlapping time windows are used to provide additional context to [39] neural networks.

The sliding window technique divides the data into segments of length d, establishing a relationship between past and future information [40], as detailed below.

Here, we consider a univariate time series

X = [\bar{x} (0), \bar{x} (1), \bar{x} (2), \dots, \bar{x} (k)]

of finite length, where k is the number of samples observed so far. We obtain a sliding window

X (k)

with length d, which captures the local information of the time series X. This window is updated iteratively as the sensor acquires new samples in real-time, i.e.,

X_{k} = [\bar{x} (k - d - 1), \dots, \bar{x} (k - 1), \bar{x} (k)]

(24)

where

{1, 2, \dots, d - 1}

are lags. A graphic representation is shown in Figure 5.

The dataset D then consists of the time series

X_{m}

and its corresponding class label vector

Y_{m}

.

D = (X_{0}, Y_{0}), (X_{1}, Y_{1}), \dots, (X_{k}, Y_{k})

(25)

where q represents the number of classes of

Y_{m}

, such that each element

j \in [1, q]

is 1 if it corresponds to a class of

X_{m}

and 0 otherwise.

7. Results

In this section, we present the results obtained with two neural classifiers, applying the previously described sensor-fault-tolerant controller scheme in neural sliding mode. The complete induction motor model, together with the controller and the neural classifier, is implemented in Matlab/SIMULINK to carry out the closed-loop system tests.

7.1. Neural Identifier

The proposed neural model for a three-phase induction motor is designed as in (13). In order to define the neural model, only the previously determined state variables are considered. The proposed RHONN model is defined to obtain a block control form [23,41] as follows:

\begin{matrix} {\hat{x}}_{1} (k + 1) & = & w_{11} (k) S (θ (k)) + w_{12} (k) S (ω (k)) \\ {\hat{x}}_{2} (k + 1) & = & w_{21} S (ω (k)) - w_{21}^{'} S (ϕ_{β} (k)) i_{α} (k) \\ + & w_{22}^{'} S (ϕ_{α} (k)) i_{β} (k) \\ {\hat{x}}_{3} (k + 1) & = & w_{31} (k) S (Φ_{m} (k)) ω (k) \\ + & w_{32} (k) S (ϕ_{β} (k)) i_{α} (k) \\ + & w_{33} (k) S (ϕ_{α} (k)) i_{β} (k) \\ - w_{31}^{'} (k) S (ϕ_{β} (k)) u_{α} (k) \\ + & w_{32}^{'} (k) S (ϕ_{α} (k)) u_{β} (k) \\ {\hat{x}}_{4} (k + 1) & = & w_{41} (k) S (Φ_{m} (k)) + w_{41}^{'} S (ϕ_{α} (k)) i_{α} (k) \\ + & w_{42}^{'} S (ϕ_{β} (k)) i_{β} (k) \\ {\hat{x}}_{5} (k + 1) & = & w_{51} (k) S (ϕ_{α} (k)) ω (k) \\ + & w_{52} (k) S (ϕ_{β} (k)) ω (k) \\ + & w_{53} (k) S (i_{α} (k)) + w_{51}^{'} u_{α} (k) \\ {\hat{x}}_{6} (k + 1) & = & w_{61} (k) S (ϕ_{α} (k)) ω (k) \\ + & w_{62} (k) S (ϕ_{β} (k)) ω (k) \\ + & w_{63} (k) S (i_{β} (k)) + w_{61}^{'} u_{β} (k) \end{matrix}

(26)

where

θ

represents the rotor position measurement;

ω

is its speed;

i_{α}

and

i_{β}

are the alpha and beta current measurements, respectively;

ϕ_{α}

and

ϕ_{β}

represent the alpha and beta fluxes;

{\hat{x}}_{1}

represents the rotor position;

{\hat{x}}_{2}

is the rotor speed;

{\hat{x}}_{3}

is the electromechanical torque

τ e

;

{\hat{x}}_{4}

is the magnitude of the squared rotor flux

Φ m = ϕ_{α}^{2} + ϕ_{β}^{2}

; and

{\hat{x}}_{5}

and

{\hat{x}}_{6}

are the currents in the alpha and beta axes, respectively. The state variables of the neural network are randomly initialized, as are the weight vectors. Additionally, specific weights are set as

w_{21}^{'} = w_{22}^{'} = 0.1

,

w_{31}^{'} = w_{32}^{'} = 0.092

,

w_{41}^{'} = w_{42}^{'} = 0.00092

and

w_{51}^{'} = w_{61}^{'} = 0.0092

. The covariance matrices are heuristically initialized to minimize the identification error.

7.2. Fault-Tolerant Control

Now, based on the neural network model (26), an FTC is designed (23). The variables to be controlled are

τ_{e}

and

Φ_{m}

, assuming complete access to state variables for measurement. Let us consider

z (k) = [\begin{matrix} z_{1} (k) \\ z_{2} (k) \end{matrix}] = [\begin{matrix} {\hat{x}}_{3} (k) - τ_{e}^{d e s} (k) \\ {\hat{x}}_{4} (k) - Φ_{m}^{d e s} (k) \end{matrix}]

(27)

where

τ_{e}^{d e s}

is the desired electromechanical torque and

Φ_{m}^{d e s}

is the desired square rotor flux magnitude. Then, the dynamic of (27) is defined as

z (k + 1) = [\begin{matrix} F_{z_{1}} (k) + B_{z_{1}} (k) u (k) \\ F_{z_{2}} (k) \end{matrix}]

with

\begin{matrix} F_{z_{1}} (k) & = & w_{31} (k) S (Φ_{m} (k)) ω (k) \\ + & w_{32} (k) S (ϕ_{β} (k)) i_{α} (k) \\ + & w_{33} (k) S (ϕ_{α} (k)) i_{β} (k) - τ_{e}^{d e s} (k + 1) \end{matrix}

(28)

\begin{matrix} F_{z_{2}} (k) & = & w_{41} (k) S (Φ_{m} (k)) + w_{41}^{'} S (ϕ_{α} (k)) i_{α} (k) \\ + & w_{42}^{'} S (ϕ_{β} (k)) i_{β} (k) - Φ_{m}^{d e s} (k + 1) \end{matrix}

(29)

\begin{matrix} B_{z_{1}} (k) & = & {[\begin{matrix} - w_{31}^{'} S (ϕ_{β} (k)) & w_{32}^{'} S (ϕ_{α} (k)) \end{matrix}]}^{T} \end{matrix}

(30)

Given this, the control law (23) is applied, and introducing a desired dynamic for

z_{2} (k)

as

k_{2} z_{2} (k)

with

|k_{2}| < 0

, it is then possible to define

z_{3} (k)

as

z_{3} (k) = F_{z_{2}} (k) - k_{2} z_{2} (k)

(31)

with

z (k)

defined as

z (k + 1) = [\begin{matrix} z_{1} (k + 1) \\ z_{2} (k + 1) \\ z_{3} (k + 1) \end{matrix}] = [\begin{matrix} F_{z_{1}} (k) + B_{z_{1}} (k) u (k) \\ k_{2} z_{2} (k) + z_{3} (k) \\ F_{z_{3}} (k) + B_{z_{3}} (k) u (k) \end{matrix}]

(32)

where

\begin{matrix} F_{z_{3}} (k) & = & w_{41} (k + 1) S (F_{z_{2}} (k) Φ_{m}^{d e s} (k + 1)) \\ + & w_{41}^{'} S (ϕ_{α} (k + 1)) {\hat{x}}_{5} (k + 1) \\ + & w_{42}^{'} S (ϕ_{β} (k + 1)) {\hat{x}}_{6} (k + 1) - Φ_{m}^{d e s} (k + 2) \\ - & k_{2} z_{2} (k + 1) \\ B_{z_{3}} (k) & = & {[\begin{matrix} w_{41}^{'} w_{51}^{'} S (ϕ_{α} (k)) & w_{42}^{'} w_{61}^{'} S (ϕ_{β} (k)) \end{matrix}]}^{T} \end{matrix}

(33)

Sliding mode surface is selected as

S_{D} (k) = {[z_{1} (k), z_{3} (k)]}^{T}

, whose dynamic is

S_{D} (k + 1) = [\begin{matrix} F_{z_{1}} (k) \\ F_{z_{3}} (k) \end{matrix}] + [\begin{matrix} B_{z_{1}} (k) \\ B_{z_{3}} (k) \end{matrix}] u (k)

(34)

then, from

S_{D} (k + 1) = 0

, the following is obtained:

u_{e q} (k) = - B_{s}^{- 1} (k) F_{s} (k)

(35)

7.3. Neural Classifier

Due to the complexity of deep neural networks, both the CNN and the MLP are trained offline using data other than the test data. MLP contains an input layer with 10 neurons representing the dimension of the delay vector, then two hidden layers with 20 neurons in each layer, and finally a neuron at the output; all layers have a Sigmoid activation function. The convolutional neural network architecture contains an input layer with 10 neurons. Then, the convolutional layer is followed by a ReLU activation function and a pooling layer, which generate 20 feature maps. Afterward, the network includes a first dense layer with 180 neurons, which allows the combining of the features extracted by the convolutional layer, followed by a second dense layer with 100 neurons. The network output only generates a single output representing the failed or healthy states.

7.4. Sensor Faults Description

Figure 6 illustrates the behavior of the currents

α

and

β

, as well as the position of the closed-loop system. Different faults are applied to the signals from the current sensors

i_{α}

,

i_{β}

and the position sensor

θ

, as can be seen in the same figure.

The data presented above were obtained from the Simulink diagram shown in Figure 7. The Simulink model represents a rotational induction motor. The model includes blocks used to simulate fault conditions that may occur in the sensors due to disconnections. Faults are synthetically introduced in the sensors

θ

and the current components

i_{α}

and

i_{β}

, which are essential to control and monitor the motor’s performance.

Three faults are introduced, one for each of the sensors mentioned above. The first fault occurs in the current component

i_{α}

at

t = 2

s. This introduces an interruption in one of the key current measurements that drive the motor dynamics. At

t = 5

s, the second fault appears in the component

i_{β}

affecting the other current measurement. Finally, at

t = 7

s, a fault is introduced in the rotor position (

θ

). The progressive nature of these faults allows for a step-by-step investigation of the engine’s ability to maintain functionality and performance despite these faults. This setup provides a test environment to simulate real-world engine failure scenarios and allows for the evaluation of fault detection and diagnostic methods, as well as fault-tolerant controllers under controlled conditions.

7.5. Evaluation Criteria

The performance of the neural classifiers was then evaluated using the following evaluation criteria.

Classification accuracy (A) measures the proportion of correct predictions relative to the total number of samples and is calculated as follows:

A = \frac{T_{P} + F_{P}}{T_{P} + T_{N} + F_{P} + F_{N}}

(36)

The precision (P) metric evaluates how effective the classification of true positives is compared to false positives:

P = \frac{T_{P}}{T_{P} + F_{P}}

(37)

Recall (R) provides information about the proportion of correctly identified positive cases:

R = \frac{T_{P}}{T_{P} + F_{N}}

(38)

Finally, the F1 score combines the precision and recall metrics into a single value, which is especially useful when the class distribution is unbalanced:

F 1 = 2 * \frac{P * R}{P + R}

(39)

where

T_{P}, F_{P}, T_{N}, F_{N}

, are true positive, false positive, true negative, and false negative, respectively.

The ROC curve is a widely used criterion to evaluate the discrimination capacity of classification methods, showing the relationship between the rate of true positives and false positives according to different thresholds. In this curve, a diagonal line indicates random classification performance. The most effective classification methods are those whose curve approaches the upper left edge, indicating high performance. The ability of a model to differentiate between classes is summarized in the area under the curve (AUC); the higher the AUC, the better the classifier’s performance [42].

7.6. Fault-Tolerant Control and Fault-Detection Results

The classification results of the proposed classification methods on the different sensors are presented in Table 1. The MLP neural network shows good results, achieving a CA of between 95% and 99% for the evaluated sensors. Regarding the ROC curve, the area under the curve (AUC) varies between 97% and 99%. However, the precision criterion indicates that the MLP neural network makes errors in the classification of true positives, this could be an indicator of acceptable, although not optimal, performance in this metric. On the other hand, the recall value shows that the MLP neural network is capable of correctly classifying false positives, reaching values between 97% and 99%, so the performance in this aspect is acceptable. In the case of the F1 score, which combines precision and recall, it confirms a solid performance of the MLP overall. It is important to note that all samples were classified online without any preprocessing of the data.

On the other hand, the CNN shows higher performance in the evaluation criteria according to the CA; CNN achieves results between 96% and 99%, while the area under the curve (AUC) is between 98% and 99%. The case of precision shows values ranging from 79% to 92%, indicating an acceptable classification of true positives compared to false positives. According to recall, CNN obtains consistent results between 98% and 99%, so CNN has the ability to correctly identify positive cases. Finally, the F1 score is in a range of 88% to 96%, demonstrating an effective balance between precision and recall. In general, CNN outperforms MLP in fault classification and its greater ability to handle complex and non-linear data.

It is noteworthy that the data used for online classification are completely different from those used during the training of deep neural networks. Despite this, the results obtained by both neural classifiers show robust and reliable performance.

The objective is to achieve the tracking of the stator power reference; for this simulation, a time-varying reference is used. The neural controller was subjected to three different faults that occurred in the three sensors throughout the simulation. The fault events are shown in Figure 8 and are described as follows.

At time

t = 2

s, the current sensor

i_{α}

presents a fault. Then, the neural classifier immediately detects and isolates the fault, which generates an alert that is provided to the controller. In this event, the neural identifier compensates for the disturbance generated by the fault. Although the fault temporarily alters the motor’s performance, the neural sliding mode fault-tolerant controller has the ability to reject the disturbance and restore system stability in a short period of time.

Then, another failure occurs in the current sensor

i_{β}

at time

t = 5

s; in the same way, the neural classifier has the ability to detect the failure, and this time, the motor control performance is affected to a lesser extent than for the first failure. Thus, despite the disturbance caused by the sensor failure, the stability remains stable despite the failure of the sensor

i_{β}

.

Finally, the last fault occurs at time

t = 7

in the position sensor. After the fault identification, the controller takes action to reduce the problems caused by the fault. In fact, the fault does not have a major impact on the reference track; the performance is kept almost unchanged, which demonstrates the controller’s ability to efficiently manage this type of event without compromising the stability or accuracy of the system.

8. Discussion

Overall, the results obtained from the neural sliding mode’s fault-tolerant controller and the neural classifier show robust performance in maintaining engine performance even in the presence of sensor failures.

The neural classifier, based on deep neural networks, was able to accurately differentiate between the different classes (the fault state or the normal state). This allowed the classifier to provide the necessary information to both the controller and the neural identifier, facilitating the compensation of the effects caused by each detected fault.

Furthermore, it can be noted that although the first fault produces a slight disturbance in the system, it manages to stabilize quickly. Under the presence of the second fault, the impact is drastically reduced. Finally, the effect is practically reduced to zero when the third sensor failure occurs. The results show the ability of the neural controller to adapt and effectively mitigate the effects of sensor failures as they occur, thus maintaining system stability and performance over time.

The proposed method stands out from other fault detection approaches due to its accuracy (96–99%), low sensitivity to noise, and a response time of 1.67647 ns, and it does not require a nominal model of the system for its implementation. Unlike methods such as the state observer and Kalman Filters, which are effective for monitoring but require a model and are not currently used in closed-loop control tasks, the proposed method is suitable for both real-time detection and applications in closed-loop control systems. This makes it versatile and efficient in environments where immediate response and accuracy in fault identification are needed, overcoming the limitations of traditional methods that often rely on specific models or are too sensitive to noise. A summary of the comparison is added in Table 2.

9. Conclusions

In this study, an online fault-tolerant control scheme was proposed that does not depend on a nominal model of the system. A high-order recurrent neural network (RHONN) trained using the Extended Kalman Filter (EKF) was used to identify the behavior of an induction motor. The identification generated a nonlinear model that allowed the synthesis of a sensor fault-tolerant controller with neural sliding mode, capable of maintaining system stability and minimizing the effects of successive sensor failures, thus achieving a small tracking error. It is important to note that the proposed methodology inserted three faults in different sensors, which were effectively absorbed and compensated by the RHONN model.

Additionally, a neural classifier was implemented to detect and isolate the online faults. Two types of deep neural networks were used, thus demonstrating the potential of deep learning in detecting sensor faults without the need for a nominal model of the system. Typically, in fault-tolerant control schemes, fault detection and isolation are performed by observers comparing the sensor output to continuous estimates. However, these methods rely heavily on nominal models, which are not always easy to obtain and can become inaccurate when system parameters change over time.

For example, methods such as the Luenberger State Observer are based on a mathematical model of the system to estimate its state and detect deviations that could indicate a failure. The Kalman Filter is also popular in fault detection, but this method assumes linearity in the system, which can limit its adaptability and accuracy in complex systems. Other techniques use Analytical Redundancy Analysis, which compares redundant signals in the system to identify discrepancies that could indicate failures. Although it is a useful technique in systems where redundancy is easy to implement, it is inefficient when faced with multiple failures and presents problems in systems without redundant sensors.

Deep neural networks have drawbacks in real-time fault detection, including processing time when the neural network contains too many hidden layers, or the need for large amounts of labeled data to achieve adequate performance, which can be a challenge in problems where data, especially those with faults, are scarce. In this work, these limitations were addressed by proposing improvements in the network structure and optimization to reduce inference times.

The comparative results showed that both deep neural networks were able to classify online with a high level of confidence. The proposed CNN obtained a superior performance in this work. As a future line of research, it is suggested that the use of other specialized time-series neural network architectures, such as long-short-term memory (LSTM) networks, be explored either in their unidirectional or bidirectional (BiLSTM) versions, to further improve fault detection in dynamic environments.

Author Contributions

Investigation, O.D.S.; Methodology, A.Y.A.; Software, O.D.S. and A.V.-G.; Supervision, A.V.-G. and A.Y.A.; Visualization, O.D.S.; Writing—original draft, O.D.S., H.M.H. and J.G.A. Writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received external funding from CONAHCyT from CONAHCYT FOPl6-2021-03 number 319608.

Data Availability Statement

Data are available upon request to the corresponding author.

Acknowledgments

The authors thank the University of Guadalajara for giving us the support to develop this research. We also thank CONACyT for the financing provided in the project.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Alanis, A.Y.; Alvarez, J.G. Real-time model-free resilient control for discrete nonlinear systems. Asian J. Control. 2021, 23, 2097–2111. [Google Scholar] [CrossRef]
Wang, Z.; Liu, L.; Zhang, H. Neural network-based model-free adaptive fault-tolerant control for discrete-time nonlinear systems with sensor fault. IEEE Trans. Syst. Man Cybern. Syst. 2017, 47, 2351–2362. [Google Scholar] [CrossRef]
Bonivento, C.; Isidori, A.; Marconi, L.; Paoli, A. Implicit fault-tolerant control: Application to induction motors. Automatica 2004, 40, 355–371. [Google Scholar] [CrossRef]
Abid, A.; Khan, M.T.; Lang, H.; de Silva, C.W. Adaptive system identification and severity index-based fault diagnosis in motors. IEEE/ASME Trans. Mechatronics 2019, 24, 1628–1639. [Google Scholar] [CrossRef]
Xia, M.; Li, T.; Xu, L.; Liu, L.; De Silva, C.W. Fault diagnosis for rotating machinery using multiple sensors and convolutional neural networks. IEEE/ASME Trans. Mechatronics 2017, 23, 101–110. [Google Scholar] [CrossRef]
Zuo, L.; Yao, L. Fault Isolation and Fault-Tolerant Control for the Uncertain Quadrotor UAV System. Int. J. Control. Autom. Syst. 2024, 22, 301–310. [Google Scholar] [CrossRef]
Ferdowsi, H.; Cai, J.; Jagannathan, S. Filter-based fault detection and isolation in distributed parameter systems modeled by parabolic partial differential equations. IEEE Access 2023, 11, 45011–45027. [Google Scholar] [CrossRef]
Kurniawan, W.; Hangos, K.M.; Márton, L. Fault Isolation and Estimation in Networks of Linear Process Systems. Entropy 2023, 25, 862. [Google Scholar] [CrossRef]
Nozari, H.A.; Nazeri, S.; Banadaki, H.D.; Castaldi, P. Model-free fault detection and isolation of a benchmark process control system based on multiple classifiers techniques—A comparative study. Control Eng. Pract. 2018, 73, 134–148. [Google Scholar] [CrossRef]
Kuncheva, L.I. Combining Pattern Classifiers: Methods and Algorithms; John Wiley & Sons: Hoboken, NJ, USA, 2014. [Google Scholar]
Jia, F.; Lei, Y.; Lin, J.; Zhou, X.; Lu, N. Deep neural networks: A promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data. Mech. Syst. Signal Process. 2016, 72, 303–315. [Google Scholar] [CrossRef]
Kumar, P.; Hati, A.S. Review on machine learning algorithm based fault detection in induction motors. Arch. Comput. Methods Eng. 2021, 28, 1929–1940. [Google Scholar] [CrossRef]
Abdul, Z.K.; Al-Talabani, A.K.; Ramadan, D.O. A Hybrid Temporal Feature for Gear Fault Diagnosis Using the Long Short Term Memory. IEEE Sens. J. 2020, 20, 14444–14452. [Google Scholar] [CrossRef]
Chu, R.; Zhang, R.; Huang, Q.; Yang, K. TDV-LSTM: A New Methodology for Series Arc Fault Detection in Low Power AC Systems. In Proceedings of the 2020 IEEE Sustainable Power and Energy Conference (iSPEC), Chengdu, China, 23–25 November 2020; pp. 2319–2324. [Google Scholar] [CrossRef]
Sabir, R.; Rosato, D.; Hartmann, S.; Guehmann, C. LSTM Based Bearing Fault Diagnosis of Electrical Machines using Motor Current Signal. In Proceedings of the 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA), Boca Raton, FL, USA, 16–19 December 2019; pp. 613–618. [Google Scholar] [CrossRef]
Hou, Z.; Jin, S. A novel data-driven control approach for a class of discrete-time nonlinear systems. IEEE Trans. Control Syst. Technol. 2010, 19, 1549–1558. [Google Scholar] [CrossRef]
Nguyen, V.T.; Lin, C.Y.; Su, S.F.; Tran, Q.V. Adaptive chattering free neural network based sliding mode control for trajectory tracking of redundant parallel manipulators. Asian J. Control 2019, 21, 908–923. [Google Scholar] [CrossRef]
Castaneda, C.E.; Loukianov, A.G.; Sanchez, E.N.; Castillo-Toledo, B. Discrete-time neural sliding-mode block control for a DC motor with controlled flux. IEEE Trans. Ind. Electron. 2011, 59, 1194–1207. [Google Scholar] [CrossRef]
Toma, R.N.; Prosvirin, A.E.; Kim, J.M. Bearing fault diagnosis of induction motors using a genetic algorithm and machine learning classifiers. Sensors 2020, 20, 1884. [Google Scholar] [CrossRef]
Glowacz, A.; Glowacz, W.; Kozik, J.; Piech, K.; Gutten, M.; Caesarendra, W.; Liu, H.; Brumercik, F.; Irfan, M.; Khan, Z.F. Detection of deterioration of three-phase induction motor using vibration signals. Meas. Sci. Rev. 2019, 19, 241–249. [Google Scholar] [CrossRef]
Glowacz, A. Diagnostics of rotor damages of three-phase induction motors using acoustic signals and SMOFS-20-EXPANDED. Arch. Acoust. 2016, 41, 507–515. [Google Scholar] [CrossRef]
AlShorman, O.; Alkahatni, F.; Masadeh, M.; Irfan, M.; Glowacz, A.; Althobiani, F.; Kozik, J.; Glowacz, W. Sounds and acoustic emission-based early fault diagnosis of induction motor: A review study. Adv. Mech. Eng. 2021, 13, 1687814021996915. [Google Scholar] [CrossRef]
Alanis, A.Y.; Sanchez, E.N.; Loukianov, A.G.; Perez-Cisneros, M.A. Real-time discrete neural block control using sliding modes for electric induction motors. IEEE Trans. Control Syst. Technol. 2009, 18, 11–21. [Google Scholar] [CrossRef]
Zhang, J.; Swain, A.K.; Nguang, S.K. Robust Observer-Based Fault Diagnosis for Nonlinear Systems Using MATLAB®; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
Ogunmolu, O.; Gu, X.; Jiang, S.; Gans, N. Nonlinear systems identification using deep dynamic neural networks. arXiv 2016, arXiv:1610.01439. [Google Scholar]
Zhai, X.; Ali, A.A.S.; Amira, A.; Bensaali, F. MLP neural network based gas classification system on Zynq SoC. IEEE Access 2016, 4, 8138–8146. [Google Scholar] [CrossRef]
Lim, T.; Ratnam, M.; Khalid, M. Automatic classification of weld defects using simulated data and an MLP neural network. Insight-Non Test. Cond. Monit. 2007, 49, 154–159. [Google Scholar] [CrossRef]
Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 580–587. [Google Scholar]
Zheng, Y.; Liu, Q.; Chen, E.; Ge, Y.; Zhao, J.L. Time series classification using multi-channels deep convolutional neural networks. In Proceedings of the International Conference on Web-Age Information Management, Macau, China, 16–18 June 2014; Springer: Berlin/Heidelberg, Germany, 2014; pp. 298–310. [Google Scholar]
Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 2012, 25, 1097–1105. [Google Scholar] [CrossRef]
Pang, X.; Xue, X.; Jiang, W.; Lu, K. An Investigation into Fault Diagnosis of Planetary Gearboxes using a Bispectrum Convolutional Neural Network. IEEE/ASME Trans. Mechatronics 2020, 26, 2027–2037. [Google Scholar] [CrossRef]
Chen, S.; Meng, Y.; Tang, H.; Tian, Y.; He, N.; Shao, C. Robust deep learning-based diagnosis of mixed faults in rotating machinery. IEEE/ASME Trans. Mechatronics 2020, 25, 2167–2176. [Google Scholar] [CrossRef]
LeCun, Y.; Kavukcuoglu, K.; Farabet, C. Convolutional networks and applications in vision. In Proceedings of the 2010 IEEE International Symposium on Circuits and Systems, Paris, France, 30 May–2 June 2010; pp. 253–256. [Google Scholar]
Albawi, S.; Bayat, O.; Al-Azawi, S.; Ucan, O.N. Social touch gesture recognition using convolutional neural network. Comput. Intell. Neurosci. 2018, 2018, 6973103. [Google Scholar] [CrossRef]
Sanchez, E.N.; Alanís, A.Y.; Loukianov, A.G. Discrete-Time High Order Neural Control; Springer: Berlin, Heidelberg, 2008. [Google Scholar]
Rovithakis, G.A.; Christodoulou, M.A. Adaptive Control with Recurrent High-Order Neural Networks: Theory and Industrial Applications; Springer Science & Business Media: Berlin, Heidelberg, 2012. [Google Scholar]
Ruiz, R.; Sanchez, E.N.; Loukianov, A.G. Discrete time block control of a double fed induction generator using sliding modes. In Proceedings of the 2009 IEEE Control Applications, (CCA) & Intelligent Control, (ISIC), St. Petersburg, Russia, 8–10 July 2009; pp. 1483–1488. [Google Scholar]
Xing, Z.; Pei, J.; Keogh, E. A brief survey on sequence classification. ACM Sigkdd Explor. Newsl. 2010, 12, 40–48. [Google Scholar] [CrossRef]
Graves, A.; Schmidhuber, J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 2005, 18, 602–610. [Google Scholar] [CrossRef]
Mozaffari, L.; Mozaffari, A.; Azad, N.L. Vehicle speed prediction via a sliding-window time series analysis and an evolutionary least learning machine: A case study on San Francisco urban roads. Eng. Sci. Technol. Int. J. 2015, 18, 150–162. [Google Scholar] [CrossRef]
Quintero-Manriquez, E.; Sanchez, E.N.; Felix, R.A. Induction motor torque control via discrete-time sliding mode. In Proceedings of the 2016 World Automation Congress (WAC), Rio Grande, PR, USA, 31 July–4 August 2016. [Google Scholar] [CrossRef]
Cerda, J.; Cifuentes, L. Uso de curvas ROC en investigación clínica: Aspectos teórico-prácticos. Rev. Chil. Infectol. 2012, 29, 138–141. [Google Scholar] [CrossRef] [PubMed]
Kiranyaz, S.; Gastli, A.; Ben-Brahim, L.; Al-Emadi, N.; Gabbouj, M. Real-time fault detection and identification for MMC using 1-D convolutional neural networks. IEEE Trans. Ind. Electron. 2018, 66, 8760–8771. [Google Scholar] [CrossRef]
Verma, A.K.; Nagpal, S.; Desai, A.; Sudha, R. An efficient neural-network model for real-time fault detection in industrial machine. Neural Comput. Appl. 2021, 33, 1297–1310. [Google Scholar] [CrossRef]
Leão, L.S.; Cavalini, A.A., Jr.; Morais, T.S.; Melo, G.P.; Steffen, V., Jr. Fault detection in rotating machinery by using the modal state observer approach. J. Sound Vib. 2019, 458, 123–142. [Google Scholar] [CrossRef]
Yan, K.; Ji, Z.; Shen, W. Online fault detection methods for chillers combining extended kalman filter and recursive one-class SVM. Neurocomputing 2017, 228, 205–212. [Google Scholar] [CrossRef]
Kajmakovic, A.; Diwold, K.; Römer, K.; Pestana, J.; Kajtazovic, N. Degradation Detection in a Redundant Sensor Architecture. Sensors 2022, 22, 4649. [Google Scholar] [CrossRef]
Basri, H.M.; Lias, K.; Abidin, W.W.Z.; Tay, K.; Zen, H. Fault detection using dynamic parity space approach. In Proceedings of the 2012 IEEE International Power Engineering and Optimization Conference, Melaka, Malaysia, 6–7 June 2012; pp. 52–56. [Google Scholar]
Lekie, A.J.; Idoniboyeobu, D.; Braide, S. Fault detection on distribution line using fuzzy logic. Int. J. Sci. Eng. Res. 2018, 9, 490–503. [Google Scholar]

Figure 1. Rotary induction motor test bench setup.

Figure 2. Neural identification of unknown system under sensors failure.

Figure 3. The scheme of the control and identification system based on machine learning techniques that integrates a High-Order Recurrent Neural Network (RHONN), an Extended Kalman Filter (EKF), and a neural classifier is shown. This scheme improves the performance in the presence of sensor failures.

Figure 4. Deep neural network architecture for online fault classification.

Figure 5. The sliding-window extraction from the time series X.

Figure 6. Data set from induction motor sensors.

Figure 7. Simulink diagram of induction motor.

Figure 8. Tracking results of the neural sliding-mode sensor’s fault-tolerant controller with the CNN-based neural classifier.

Table 1. Classification results obtained by the MLP and CNN neural classifiers.

Neural Network	Sensor	Accuracy	AUC	Precision	Recall	F1 Score	Execution Time
MLP	Position	0.9914	0.9949	0.9269	0.9949	0.9597
	Current $i_{α}$	0.9539	0.9750	0.7601	0.9750	0.8542	1.67647 × 10⁻⁹ s.
	Current $i_{β}$	0.9560	0.9765	0.7659	0.9765	0.8585
CNN	Postion	0.9914	0.9955	0.9269	0.9955	0.9610
	Current $i_{α}$	0.9653	0.9809	0.7951	0.9809	0.8783	4.6236 × 10⁻⁸ s.
	Current $i_{β}$	0.9688	0.9828	0.8080	0.9828	0.8869

Table 2. Incorporation of methods in the fault-detection literature and the proposed method.

Method	Required Model	Noise Sensitivity	Response Time	Real-Time Detection	Accuracy	Application in Control Tasks	Citations
Deep Neural Network	No (free model)	High	<100 ms	Yes	>99%	Not directly (in monitoring)	[43]
Artificial neural networks	No (free model)	Moderate	28 μs	Yes	94.73% unprocessed, 98.43% processed	Not directly (in monitoring)	[44]
State Observer Approach	Yes	High	Not specified	Yes	High (Not specified)	With potential use	[45]
Kalman Filter/EKF	Yes	High	Not specified	Yes	80–95%	Not directly (in monitoring)	[46]
Redundancy Analysis	No	High	Not specified	Yes	80–97%	Not directly (in monitoring)	[47]
Signal Parity Methods	Yes	High	Not specified	Yes	Effective in diagnosis and isolation	Not directly (in monitoring)	[48]
Fuzzy Logic	No (requires rules)	High	Not specified	Yes	High (based on defuzzification)	Not directly (in monitoring)	[49]
Observer-Based Approach	Yes	High	Not specified	Yes	High (accurate fault determination)	Yes, with fault compensation	[6]
Filter-Based	Yes	High	Not specified (high computational cost)	Yes	High (in well-defined systems)	Not directly (in diagnosis)	[7]
Model-based approach	Yes	Moderate	Not specified	Yes	High in well-defined networks	Not directly (in diagnosis)	[8]
Proposed Method	No	Low	1.67647 × 10⁻⁹ s	Yes	96–99%	Applied in closed loop with a controller

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Alanis, A.Y.; Alvarez, J.G.; Sanchez, O.D.; Hernandez, H.M.; Valdivia-G, A. Fault-Tolerant Closed-Loop Controller Using Online Fault Detection by Neural Networks. Machines 2024, 12, 844. https://doi.org/10.3390/machines12120844

AMA Style

Alanis AY, Alvarez JG, Sanchez OD, Hernandez HM, Valdivia-G A. Fault-Tolerant Closed-Loop Controller Using Online Fault Detection by Neural Networks. Machines. 2024; 12(12):844. https://doi.org/10.3390/machines12120844

Chicago/Turabian Style

Alanis, Alma Y., Jesus G. Alvarez, Oscar D. Sanchez, Hannia M. Hernandez, and Arturo Valdivia-G. 2024. "Fault-Tolerant Closed-Loop Controller Using Online Fault Detection by Neural Networks" Machines 12, no. 12: 844. https://doi.org/10.3390/machines12120844

APA Style

Alanis, A. Y., Alvarez, J. G., Sanchez, O. D., Hernandez, H. M., & Valdivia-G, A. (2024). Fault-Tolerant Closed-Loop Controller Using Online Fault Detection by Neural Networks. Machines, 12(12), 844. https://doi.org/10.3390/machines12120844

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Fault-Tolerant Closed-Loop Controller Using Online Fault Detection by Neural Networks

Abstract

1. Introduction

2. Review of the Analyzed System

3. Deep Neural Networks

3.1. Multilayer Perceptron Networks

3.2. Convolutional Neural Network

4. Neural Identifier

4.1. Recurrent High-Order Neural Networks

4.2. RHONN Training Algorithm

5. Fault-Tolerant Sliding Mode Neural Controller

6. Neural-Network-Based Fault-Detection Strategy

7. Results

7.1. Neural Identifier

7.2. Fault-Tolerant Control

7.3. Neural Classifier

7.4. Sensor Faults Description

7.5. Evaluation Criteria

7.6. Fault-Tolerant Control and Fault-Detection Results

8. Discussion

9. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI