Next Article in Journal
A Dynamic Reserve Evaluation Method for an Ultra-Deep Fractured Tight Sandstone Gas Reservoir
Previous Article in Journal
Sustainable Transport in the European Union: Exploring the Net-Zero Transition through Confirmatory Factor Analysis and Gaussian Graphical Modeling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Multistage Physics-Informed Neural Network for Fault Detection in Regulating Valves of Nuclear Power Plants

1
Energy Department, Politecnico di Milano, 20156 Milan, Italy
2
MINES Paris, PSL University, CRC, 06904 Sophia Antipolis, France
3
China Nuclear Power Engineering Co., Ltd., Beijing 100822, China
4
School of Mechanical Engineering & Automation, Beihang University (BUAA), Beijing 100191, China
*
Authors to whom correspondence should be addressed.
Energies 2024, 17(11), 2647; https://doi.org/10.3390/en17112647
Submission received: 2 April 2024 / Revised: 21 May 2024 / Accepted: 27 May 2024 / Published: 30 May 2024

Abstract

:
In Nuclear Power Plants (NPPs), online condition monitoring and the fault detection of structures, systems and components (SSCs) can aid in guaranteeing safe operation. The use of data-driven methods for these tasks is limited by the requirement of physically consistent outcomes, particularly in safety-critical systems. Considering the importance of regulating valves (e.g., safety relief valves and main steam isolation valves), this work proposes a multistage Physics-Informed Neural Network (PINN) for fault detection in such components. Two stages of the PINN are built by developing the process model of the regulating valve, which integrates the basic valve sizing equation into the loss function to jointly train the two stages of the PINN. In the 1st stage, a shallow Neural Network (NN) with only one hidden layer is developed to estimate the equivalent flow coefficient (a key performance indicator of regulating valves) using the displacement of the valve as input. In the 2nd stage, a Deep Neural Network (DNN) is developed to estimate the flow rate expected in normal conditions using inputs such as the estimated flow coefficient from the 1st stage, the differential pressure, and the fluid temperature. Then, the residual, i.e., the difference between the estimated and measured flow rates, is fed into a Deep Support Vector Data Description (DeepSVDD) to detect the occurrence of faults. Moreover, the deviation between the estimated flow coefficients of normal and faulty conditions is used to interpret the consistency of the detection result with physics. The proposed method is, first, applied to a simulation case implemented to emulate the operating characteristics of regulating the valves of NPPs and then validated on a real-world case study based on the DAMADICS benchmark. Compared to state-of-the-art fault detection methods, the obtained results from the proposed method show effective fault detection performance and reasonable flow coefficient estimation, thus guaranteeing the physical consistency of the detection results.

1. Introduction

In general, a Nuclear Power Plant (NPP) consists of structures, systems and components (SSCs), some of which are critical for safety and operational thermal efficiency of the NPP [1]. Operational interruptions of these SSCs can result in a million-dollar loss a day [2]. Therefore, accurate online condition monitoring and anomaly detection are necessary and crucial for these SSCs of a NPP [3]. One of the important SSCs is the regulating valve, which is widely used in different parts of the NPP for various purposes to ensure the safe and reliable operation of the whole system [4]. Such valves include safety/relief valves used in the pressurizer for regulating the pressure of the nuclear reactor coolant inventory, main steam isolation valves used in the main steam system for the rapid and tight closure of primary containment isolation, and turbine regulating/control valves used in the turbine system for regulating the steam flow to the turbine [5]. It is highly desirable to detect faults in regulating valves as early as possible before they become inoperable.
Over the past decades, a wide range of methods for Fault Detection and Diagnosis (FDD) has been proposed within NPP applications, thus ranging from model-based and signal processing-based approaches to artificial intelligence-based and machine learning-based techniques. In [6], a model-based method using a subspace identification model was developed to detect faulty sensors. In [7], a signal-processing method based on time–frequency analysis was proposed, thus focusing on vibration signals to identify the operational status of various components in NPPs. In [8], a novel method based on the transformation of impact signals using the Wigner–Ville distribution was proposed to estimate the location of loose parts. In [9], a Long Short-Term Memory (LSTM) network was developed to perform sensor fault detection using labeled simulation data. In [10], a convolutional gated recurrent network aided by particle swarm optimization for hyperparameters tuning was proposed to perform fault diagnosis using a simulation dataset. A detailed review of the methods developed for FDD in NPPs can be found in [2].
Despite these developments, there are few reported works on the detection of faults in regulating valves. In [11], a method was proposed for the fault-critical point prediction of a gate valve based on the characteristic analysis of the operational process variables using experimental data. In [12], a fault prediction method combining Principal Component Analysis (PCA) and a neural network using experimental data was developed. In [13], a gate recurrent unit-based method was developed for the fault diagnosis of electric gate valves using features extracted from vibration signals. In [14], a fault diagnosis method for electric isolation valves was developed by combining a knowledge-based rule reasoning model and a data-driven gate recurrent unit model.
The fault detection and diagnostic of regulating valves in other industries are also receiving attention. In [15], a model-based method for fault detection and isolation was proposed using fuzzy modeling to derive nonlinear models for the processes under normal and faulty conditions. The residuals obtained from these models were then used for fault detection. In [16], an online fault detection method was proposed for monitoring the signal values of the valve supply current and spool position in relation to the spool positioning control signal. In [17], a fault detection method was proposed based on canonical variate analysis using the simulation data of a benchmark model for a pneumatic control valve used in the power, food processing, and chemical industries. In [18], a method based on Support Vector Machines (SVMs) was developed for fault detection in a pneumatic control valve used in the manufacturing industry. In [19], a fault diagnostic and prognostic method based on SVMs and an Adaptive Neurofuzzy Inference System (ANFIS) was developed for a hydrocontrol valve used in the aerospace industry.
On the other hand, with the recent advancements in artificial intelligence, deep learning-based approaches have attracted attention in the field of Prognostics and Health Management (PHM), thanks to their powerful automatic feature extraction and nonlinear fitting capabilities [20,21]. In [22], a valve stiction detection method based on the Convolutional Neural Network (CNN) was developed. In [23], an internal leakage detection method based on the CNN was developed using the power spectral density images of acoustic emission signals. While these methods have performed well with large amounts of data, pure data-driven deep learning approaches generally violate physics-based laws with a lack of explainability of the detection and diagnostic results due to their black box nature. Hence, with the Physics-Informed Neural Network (PINN), a new variant of the Neural Network (NN) has been proposed in recent research works to make the black box model physically meaningful, interpretable, and applicable to different industrial scenarios by incorporating physical constraints from domain knowledge.
The demand for model interpretability and physical consistency in the PHM field has led to the recognition of the PINN as a highly promising method to improve the performance and interpretability of maintenance decision making [24]. In [25], a physics-informed deep learning approach based on the CNN was developed using a physics-driven loss function derived from physical knowledge to improve fault detection accuracy. In [26], a PINN method based on Autoencoders (AEs) was developed by integrating the physics law of the current–voltage relation into the loss function to perform high impedance fault detection. In [27], a physics-guided CNN was developed in which the CNN kernel was designed using the physical knowledge of the faulty signal of rolling element bearings. With respect to industrial actuators, only a few approaches of the PINN have been reported in the literature. In [28], the PINN was used for system identification in servohydraulic shaking tables. In [29], the PINN was used for the prediction of displacement in soft pneumatic actuators. In [30], the PINN was used for distinguishing anomalies caused by the faults of actuators and sensors. Notice that these applications of the PINN are for specific objectives other than for fault detection in regulating valves.
In this context, a multistage PINN combined with a Deep Support Vector Data Description (SVDD) for fault detection in regulating valves is proposed in this work. Two stages of the PINN model are built by developing the process model of the regulating valves to estimate the flow rate expected under normal conditions. In the 1st stage, a shallow NN with only one hidden layer is used to estimate the equivalent flow coefficient (a key performance indicator of the regulating valve) using the displacement of the valve as input. In the 2nd stage, a Deep Neural Network (DNN) with multiple hidden layers is used to estimate the flow rate, thus using as inputs the estimated flow coefficient from the 1st stage, the differential pressure of the regulating valve, and the liquid temperature. The development of the two stages for the PINN instead of a single deep PINN is motivated by the relevance of the flow coefficient, which serves as an input to the 2nd stage. The physics-informed loss function must also take into account this parameter (see Section 3.1.2), given its relevance in determining the flow capacity of the regulating valves [31]. Thus, by implementing the two stages of the PINN, the model can well capture the underlying physical relationships and dependencies within the system. To jointly and effectively train the multistage PINN, the basic valve sizing equation is incorporated into the PINN using a novel physical loss function. In this way, the PINN model can estimate the flow coefficient without the need for real labels. Note that the addition of the physical loss function is the primary difference between the PINN and conventional NN, which allows for training the PINN so as to conform as much as possible to the relevant physical principles under consideration. Finally, the residual of the estimation, i.e., the difference between the estimated and the measurement of flow rate, is fed into the DeepSVDD [32], which is a deep learning variant of the SVDD that uses a DNN as a mapping function instead of a kernel function to detect the anomaly in regulating valves. In the DeepSVDD, the classifier learns to recognize the boundaries or characteristics of a single class based on the training data containing samples of only that class. The objective is to obtain a classifier that enables recognition of that class, thereby allowing the detection of all other samples not from the class as anomalies or outliers. This approach is commonly used in scenarios where samples from other classes are rare, unknown, or unavailable during the training process. In the present work, a strategy is also developed for online updating of the 1st stage of the multistage PINN for the estimation of the online flow coefficient.
The proposed method is applied to a simulation case study and then validated on a real-world case study based on the “Development and Application of Methods for Actuator Diagnosis in Industrial Control Systems” (DAMADICS) benchmark [31]. The obtained results, in comparison to other state-of-the-art methods, demonstrate the effectiveness of the proposed method for fault detection in regulating valves.
The main contributions of this work are as follows:
(1)
The development of a multistage PINN model for the estimation of the flow rate signal of regulating valves;
(2)
The formulation of a physically consistent loss function for training the multistage PINN to estimate the flow coefficient based on the online updating strategy in the absence of real labeled input data;
(3)
The use of DeepSVDD to analyze residuals for fault detection, without the need to manually set the threshold.
The remainder of the paper is organized as follows. Section 2 formulates the problem. In Section 3, the proposed method is presented, and the metrics used to assess its performance are outlined in Section 4. The application of the proposed method to both simulation and real-world case studies is discussed in Section 5. Finally, Section 6 presents the conclusions of the work.

2. Problem Statement

We consider a control/regulating valve, which is one of the important components of an NPP in which multiple degradation modes can occur. The valve is typically monitored by measuring a set of N physical quantities at a generic time, t, which can be formalized into the vector:
x ( t ) = [ x t , 1 , , x t , j , , x t , N ]
Such quantities include the fluid temperature, the displacement of the valve, and the differential pressure, which are expected to contain information correlated to the flow rate, Q ( t ) , of the regulating valve, which is a crucial sensory measurement at time, t.
Without loss of generality, the dataset D t r a i n = [ x ( t ) , Q ( t ) ] t = 1 , , T collected over a period of time during normal operating conditions of the regulating valve is assumed to be available, which contains the following:
(1)
The measurement X R T × N , which is historically collected during the past operation of the regulating valve, which is an N -dimensional variable signal matrix, with  x t , j , t = 1 , T , j = 1 , N being the measurement of the physical quantity j at time t;
(2)
The flow rate measurement Q R T collected during the past operation of the regulating valve, which is a vector, with  Q ( t ) , t = 1 , T being the measurement of the flow rate at time t.
Based on the above assumption, and considering a new test input x t e s t ( t ) and its corresponding flow rate Q t e s t ( t ) measured at current time t, the objective of the present work is to develop a data-driven dynamic process model, f ( · ) , that represents the behavior of regulating valves in normal condition, for which we have the following:
(1)
It receives in input the online measurement x t e s t ( t ) at the current time t and produces in output the estimate Q ^ t e s t ( t ) of the observation Q t e s t ( t ) expected in the normal condition;
(2)
It detects the occurrence of a fault using the analysis of the residual, Q ^ t e s t t Q t e s t t , between the estimate Q ^ t e s t ( t ) and the observation Q t e s t ( t ) at the current time t.

3. Proposed Method

The proposed method for the detection of faults in regulating valves is sketched in Figure 1. It consists of a multistage PINN for the estimation of the flow coefficient and flow rate (Section 3.1), as well as a DeepSVDD for the analysis of residuals for fault detection (Section 3.2). The implementation procedure of the proposed method for fault detection is described in Section 3.3.

3.1. Multistage PINN Model

3.1.1. Basic Valve Sizing Equation

Regulating valves aim to prevent, allow, and properly limit the flow of fluids through the systems to ensure reliable operation. Generally, the correct sizing of a regulating valve is crucial to maintain a desired set point. Based on fluid mechanics theory, a basic control/regulating valve sizing equation can be derived [33]:
Q = C v Δ p ρ
c v = C v ρ
where Q is the flow rate; C v is the liquid sizing coefficient indicating the flow capacity of the valve; Δ p is the differential pressure across the valve; and ρ is the liquid specific gravity. c v is defined here as the equivalent to C v (equivalent flow coefficient, a key performance indicator of regulating valves), as the value of ρ is unknown and always assumed to be constant during operation [34]. This equation is used for formulating the physical loss function for the multistage PINN training. The motivation of using the PINN to estimate c v is that the values of the flow coefficient (liquid sizing coefficient) are unknown in this work, so the flow rate cannot be computed even if the equation is in a closed form. In some cases, flow coefficients can be obtained from some manufacturing specifications, but the coefficient gradually changes over the period of operation as the valve ages, which makes it difficult to obtain the accurate coefficient values during operation.

3.1.2. Definition of the Multistage PINN Loss Function

The proposed multistage PINN aims to estimate the flow rate signal at time t using the input signals, x ( t ) = [ Δ p ( t ) , d i s ( t ) , t e m ( t ) ] , where d i s ( t ) is the displacement of the regulating valve, and  t e m ( t ) is the fluid temperature. To estimate c v without using real labels, the two-stage PINN is designed such that the 1st stage is modeled with a shallow NN, f θ A , which enables efficient online updating, and the 2nd stage is modeled with a DNN, f θ D , which is a deeper network used to capture the complex relationships in the data for estimating the flow rate signal (Figure 1):
c ^ ( t ) = f θ A ( d i s ( t ) )
Q ^ ( t ) = f θ D ( Δ p ( t ) , c ^ ( t ) , t e m ( t ) )
where θ A and θ D are the vectors of the shallow NN and DNN parameters, respectively. The set of trainable parameters [ θ A , θ D ] are optimized jointly during the supervised training phase, whose objective is to minimize the estimation error of Q ^ via a physically consistent loss function:
argmin θ A , θ D L t o t a l
with
L t o t a l = ( 1 μ ) L d a t a + μ L p h y
where L d a t a is the data-driven loss function, L p h y is the physical loss function, and  μ is the hyperparameter that indicates the importance given to L p h y with respect to L d a t a during model training. The data-driven loss function is formulated using the mean squared error:
L d a t a = 1 T t = 1 T ( Q ^ ( t ) Q ( t ) ) 2
where Q ( t ) is the actual value of the flow rate, and T is the number of patterns in D t r a i n . Conversely, the physical loss function is derived from the derivative of Q with respect to c v :
d Q d c v = Δ p
L p h y = 1 T t = 1 T ( d Q ^ ( t ) d c ^ v ( t ) Δ p ( t ) )
The formulation of the physical loss function is designed such that its reliance on the actual labels of C v can be eliminated, which is an achievable task through the utilization of the Automatic Differentiation feature inherent in Neural Networks [35]. Consequently, the estimation of c ^ ( t ) is rendered independent of the necessity for real labels, thereby establishing it as a viable surrogate for the Health Indicator (HI). This, in turn, facilitates the interpretation of fault detection results.
The hyperparameter μ in Equation (7) can be optimized based on the model performance on the validation set. In this work, a two-step process is adopted for determining the hyperparameter μ by optimization. First, the value of μ is set to 0 to determine the optimal model architecture; then, a grid search approach is applied, where the value of μ is varied between 0 and 1 to obtain the best physically consistent trained model.

3.2. Analysis of Residuals by DeepSVDD for Fault Detection

Following the development of the multistage PINN model for predicting Q, an unsupervised method based on DeepSVDD is developed to analyze the residuals for detecting the occurrence of a fault. DeepSVDD is used instead of deterministic threshold approaches, which require the proper setting of the threshold. This is an application-specific complex task that can result in either large false alarm rates if the threshold is too small or large missed alarm rates if the threshold is too large. SVDD is a technique related to the One Class-SVM (OC-SVM) that uses a kernel-based method for mapping the data into a high-dimensional feature space to find the smallest hypersphere, instead of a hyperplane, with center m and radius R > 0 that enclose the majority of the data in normal conditions. DeepSVDD is an extension of SVDD that uses deep learning, like DNNs, to learn effective representations of the data with an SVDD objective. DeepSVDD does not rely on predefined kernel functions, thus allowing for more effective feature extraction and making it suitable for complex and large datasets.
Assume that a validation dataset D v a l = [ x ( k ) , Q ( k ) ] k = 1 , , N collected during normal operating conditions is also available, which can be arranged into sequences of n matrices, with each having a time window length of w with s s sliding steps and ( w s s ) overlapping between two consecutive time windows. The anomaly indicator ( A I ) for the ith time window is defined as follows:
A I i = r i L 2 2 ; i = 1 , 2 , , n
where r i = Q ^ i Q i is the residual of the ith time window predicted by f θ A , f θ D . The anomaly indicators A I v a l , computed from D v a l , are mapped onto n s d feature representations using a DNN, f θ s d , of the DeepSVDD. The aim of f θ s d is to define a more compact space where A I v a l falls within, whereas the A I from abnormal conditions does not. The boundary of the discovered compact space is the hypersphere of the minimal volume characterized by center m and radius R of the minimal volume (Figure 2). The center m of the hypersphere can be calculated as the mean of the f θ s d representations that result from the f θ s d initialization during the first forward iteration of the procedure. The training of the DeepSVDD is achieved through the optimization of the following objective function:
argmin 1 n i = 1 n f θ s d ( A I i ) m 2 + λ 2 l = 1 L θ s d l F 2
where f θ s d is the DNN of the DeepSVDD contains L hidden layers, and θ s d = θ s d 1 , , θ s d L are the parameters of each layer. Specifically, f θ s d ( A I i ) is the learned representation of the input A I i from the network f θ s d with parameters θ s d . The primary goal of the DeepSVDD is to learn θ s d by identifying a hypersphere of the minimum volume with center m. Minimizing f θ s d ( A I i ) m 2 minimizes the volume of the hypersphere. The second term in the objective function represents a weight decay regularizer with a hyperparameter λ > 0 , where · F 2 denotes the Frobenius norm. To bring the data as close to the center m as possible, the DNN must extract the common factors of variation. Once trained, the radius R is defined as the 90th percentile of f θ s d ( A I i ) m i = 1 , , n calculated across all training instances. During testing, the DeepSVDD model determines the outcome by comparing R with the distance calculated between m and f θ s d ( A I t e s t ) (i.e., f θ s d ( A I t e s t ) m i = 1 , , n ), where a fault is identified if the latter exceeds R.

3.3. Implementation Procedure of the Proposed Method

The implementation procedure of the proposed method described above is depicted in Figure 3, thus comprising two parts: offline modeling and online monitoring.

3.3.1. Offline Modelling

The offline modeling utilizes historical normal operating data, divided into training set D t r a i n = [ x ( t ) , Q ( t ) ] t = 1 , , T and validation set D v a l = [ x ( k ) , Q ( k ) ] k = 1 , , K . In the offline modeling, the training of the proposed method consists of two phases: the training of the multistage PINN and the training of the DeepSVDD.
The multistage PINN is, first, trained by minimizing the multistage loss function (Equation (7)). The hyperparameter, μ in Equation (7), is set using a grid search approach on the validation set.
To train the DeepSVDD, the following sequential steps are first implemented:
(1)
Rearrange the validation set into n = ( K w ) / s s matrices, with each having a time window length w with ( w s s ) overlapping between consecutive time windows, as described in Section 3.2. This results in sequences of an array of n matrices, A v a l R n × w × ( N + 1 ) , with  A i v a l = [ x ( l ) , Q ( l ) ] l = 1 : w being the matrix of the ith time window, i = 1 , 2 , , n , in the array;
(2)
Predict the flow rates for each ith time window in A v a l using the trained multi stage PINN, f θ A , f θ D ;
(3)
Compute the corresponding anomaly indicator A I i for the predicted flow rates in step (2) using Equation (11), thus resulting in A I v a l = [ A I i ] i = 1 , , n .
Then, the DeepSVDD is trained using the data A I v a l by optimizing Equation (12) to obtain a decision function f s d ( A I v a l ) .
Finally, a baseline for the equivalent flow coefficient curve, c v , indicates the normal operating condition of a regulating valve and is established through the following sequential steps:
(1)
The incremental displacement input, d i s c = [ d i s i c ] i = 1 , , N d , is artificially generated:
d i s i c = i N d , i = 1 , , N d
This represents the operational process of regulating valves from fully open to closed, as described in [31]. N d is set to 100 in this work;
(2)
The generated d i s c is used as input to the 1st stage of the trained multistage PINN, f θ A , to estimate c v , which is a health indicator reflecting the flow capacity at different opening statuses of the regulating valve.
A plot of c v against the artificially generated displacement, d i s c , establishes a baseline for the equivalent flow coefficient curve of the regulating valve operating under normal conditions.

3.3.2. Online Monitoring

The online monitoring of the proposed method requires w successive online observations, X t e s t ( t ) = [ x t e s t ( t ) ] t = 1 , , w , to reconstruct the measurements Q t e s t ( t ) = [ Q t e s t ( t ) ] t = 1 , , w expected in normal conditions. After w successive online observations, for any subsequent s s consecutive online observations, the matrix X t e s t and the corresponding vector Q t e s t ( t ) are retroactively updated from the current measurement backward to the size of the moving time window w. The updated X t e s t is then used for predictions. The online monitoring comprises two parts:
(1)
The estimation of the flow rates, Q ^ t e s t ( t ) , expected in normal conditions for fault detection;
(2)
The estimation of the online equivalent flow coefficient, c v o n , which is a health indicator used for interpreting the consistency of the detection outcomes with physics.
With respect to (1), the trained multistage PINN f θ A , f θ D in Section 3.3.1 is stored as f θ A , θ D h 1 and is utilized to predict the Q ^ t e s t expected in normal conditions. The predictions are used by the anomaly indicator calculator, thus providing input A I t e s t to the trained DeepSVDD, which outputs the prediction f θ s d ( A I t e s t ) . The regulating valve condition is then considered "normal" if f θ s d ( A I t e s t ) m 2 R and “abnormal” otherwise.
With respect to (2), the trained multistage PINN f θ A , f θ D in Section 3.3.1 is separately stored as f θ A , θ D h 2 , which is fine-tuned online using X t e s t . Fine-tuning involves updating only the 1st stage of the PINN, f θ A h 2 , by freezing the parameters θ D of the 2nd stage f θ D h 2 and updating the parameters θ A of f θ A h 2 through the optimization of the multistage loss function (Equation (7)). Subsequently, the equivalent flow coefficient curve, c v o n , is established by the updated model f θ A h 2 using d i s c as input. The difference between the baseline c v and the online estimation c v o n can reflect potential degradation modes by analyzing flow capacity variation, and this can be used to interpret the physical consistency of the fault detection results.
In this work, the response time of the proposed fault detection method depends on the size, w, of the time window that is used to compute the anomaly indicators: it has been set to 100 s with a delay of 10 s (sliding step, ss). After the initialization of the model for online application, w is updatedevery 10 s. If a faulty transient process is shorter than the s s , the fault cannot be detected instantly.

4. Performance Metrics

The performance evaluation of the proposed method for fault detection employs various metrics, including accuracy, precision, recall, F1-score, and the Area Under the receiver operating characteristic Curve ( A U C ) [36]. Specifically, the correct classification of abnormal condition patterns is referred to as the true positive ( t p ), whereas the correct classification of normal condition patterns is referred to as the true negative ( t n ). The false positive ( f p ) represents the misclassification of normal condition patterns as abnormal, and the false negative ( f n ) represents the misclassification of abnormal condition patterns as normal.
The A c c u r a c y is defined as the fraction of correctly classified patterns among all patterns:
A c c u r a c y = t p + t n t p + t n + f p + f n
The A c c u r a c y is used to evaluate the overall classification performance on both normal and abnormal patterns. It is effective when there is no significant class imbalance.
The P r e c i s i o n is defined as the fraction of abnormal condition patterns correctly classified among those patterns identified as abnormal:
P r e c i s i o n = t p t p + f p
The P r e c i s i o n is used to evaluate the performance only on the detected abnormal condition patterns: a high P r e c i s i o n value implies a low false alarm rate.
The R e c a l l , or sensitivity, is defined as the fraction of abnormal condition patterns correctly classified among the actual abnormal condition patterns:
R e c a l l = t p t p + f n
The R e c a l l is used to evaluate the performance only on actual abnormal condition patterns: a high R e c a l l value implies a low missed alarm rate.
The S p e c i f i c i t y is defined as the fraction of normal condition patterns correctly classified among the actual normal condition patterns:
S p e c i f i c i t y = t n t n + f p
The S p e c i f i c i t y is used to evaluate the performance only on actual normal condition patterns: a high S p e c i f i c i t y value implies a low false alarm rate.
The F 1 s c o r e ( F 1 ) is the harmonic mean of P r e c i s i o n and R e c a l l :
F 1 = 2 1 P r e c i s i o n + 1 R e c a l l
The F 1 s c o r e is used to evaluate the comprehensive performance considering the false and missed alarm rate. It provides a single value that captures the trade-off between P r e c i s i o n and R e c a l l , thus making it particularly useful when dealing with uneven class distributions.
The R e c e i v e r O p e r a t i n g C h a r a c t e r i s t i c ( R O C ) c u r v e is a curve generated by plotting the recall against the false positive rate ( F P R ):
F P R = f p f p + t n
The A r e a U n d e r t h e R O C C u r v e ( A U C ) is calculated using an average of trapezoidal approximations. It particularly shows the performance on distinguishing normal and abnormal conditions. The values of the performance metrics A c c u r a c y , P r e c i s i o n , R e c a l l , F 1 s c o r e , and A U C , range between 0 and 1, with a larger value indicating better performance.

5. Applications

Considering the unavailability of real data collected from regulating valves in NPPs, two case studies based on the DAMADICS benchmark [31] have been considered: (1) a synthetic case study designed to emulate the real industrial application of a control/regulating valve (Section 5.1) and (2) a real case study concerning the time series data collected from the operational activities of regulating valves installed in actual industrial plants (Section 5.2).

5.1. Synthetic Case Study

5.1.1. Benchmark Description

The model of an industrial control valve within the DAMADICS benchmark [31] is adopted to simulate operational data. This type of control/regulating valve is commonly used in various industrial systems, including NPPs, and therefore, it is employed in this work to simulate the operational scenarios of regulating valves, thus addressing the real data scarcity issue prevalent in NPPs. The structure of the control valve considered is depicted in Figure 4. During its operation, an electrical signal from the external controller is converted into a pressure signal via a diaphragm. The positioner ensures the generation of the correct pressure, following which a pneumatic servo motor adjusts the valve plug to the desired position for controlling/regulating the flow rate. The DAMADICS benchmark model, denoted as f m ( · ) , is shown in Figure 5. The input signals to the model are the command signal, c o m , which is provided by the user or the operational schedule to maintain the optimal functioning of the industrial component; the upstream and downstream fluid pressures p 1 and p 2 , which represent the uncertainties in working conditions; and the fluid temperature t e m c , which influences the flow principles of the fluid. The output signals, i.e., the measured signals, consist of the displacement signal d i s , the differential pressure of the valve Δ p , the flow rate Q , and the liquid temperature t e m .

5.1.2. Data Generation

Data generation is performed using the DAMADICS model f m ( · ) . Given the flexibility of the benchmark, the user can define appropriate simulation inputs for specific applications. To obtain the data of the variable commands and working conditions that reflect the operational activities of regulating valves in NPPs, the relevant preset inputs to f m ( · ) (Figure 5) have to be properly defined. In this work, they are defined as follows:
(1)
The command signal, c o m , is defined as a chirp signal, wherein a chirp represents a signal with a varying frequency over time. By employing a chirp signal as the command signal, the dynamic behavior of the component under diverse input frequencies can be comprehensively simulated. This is motivated by the fact that the frequency response of an NPP regulating valve is an important performance indicator, which can be tested by subjecting the valve to input signals of varying frequencies [37]. The vector c o m is expressed as follows:
c o m ( t ) = a c · sin ( φ ( t ) · t s f ) + b c ,
φ ( t ) = 2 · π · ( 0.1 + t s f ) .
Here, φ ( · ) is a linear function that samples the instantaneous phase to emulate sufficiently diverse commands in realistic conditions. The parameters a c and b c are randomly sampled for each simulation from Uniform distributions, with  a c U ( 0 , 0.5 ) and b c U ( 0.2 , 0.8 ) . Additionally, s f serves as the scaling factor for t, thus following s f U ( 200 , 600 ) ;
(2)
The upstream and downstream fluid pressures, p 1 and p 2 , are defined as follows:
p 1 ( t ) = a p 1 · sin ( f p 1 · t ) + b p 1 ,
p 2 ( t ) = a p 2 · sin ( f p 2 · t ) + b p 2
where a p 1 , b p 1 , a p 2 , and  b p 2 are randomly sampled from their respective Uniform distributions: a p 1 U ( 0 P a , 5 e 5 P a ) , b p 1 U ( 3 e 6 P a , 4 e 6 P a ) , a p 2 U ( 0 P a , 2 e 5 P a ) , and  b p 2 U ( 1.5 e 6 P a , 2 e 6 P a ) . The pressure values are set considering the actual conditions in NPPs [38]. This ensures that the parameters are simulated within a realistic envelop of conditions. The frequencies f p 1 and f p 2 are defined frequencies of the sinusoidal signal, thus following f p 1 U ( 0.01 , 0.15 ) and f p 2 U ( 0.01 , 0.15 ) ;
(3)
The temperature of each simulation is defined as follows:
t e m c ( t ) = t c o n + W ( t )
where t c o n is sampled from a Gaussian distribution t c o n N ( 50   ° C , 20   ° C ) , and  W ( t ) is the white noise function accounting temperature fluctuations. The value of the temperature is set considering the real environment of the safety regulating valves, which is linked to the cooling systems in NPPs [39].
The simulation procedure for generating data is detailed in Algorithm 1 below. Subsequently, the generated normal operating condition dataset, D n c , containing K s i m = 200 time series trajectories with each having a length of T = 500 time series observations, is used for model development (with D t r a i n representing 75% of the data and D v a l representing the remaining 25%). Similarly, the test dataset, D t e s t , is generated such that a fault is injected at a specific time instant in each time series (Figure 6). In this case, three fault modes, which commonly manifest in the operation of regulating valves, are considered: (1) the sedimentation of the valve plug or valve seat (denoted as “f1”); (2) the erosion of the valve plug or valve seat (denoted as “f2”); and (3) internal leakage or valve tightness (denoted as “f3”). For each fault mode, K f s i m = 30 time series trajectories are generated, with each one having a length of T = 1500 time series observations.
Algorithm 1: Data generation using the benchmark model.
Energies 17 02647 i001

5.1.3. Results and Discussion

The PINN model, f θ A , f θ D , was built using a shallow NN, f θ A , with one hidden layer made by five neurons and a DNN, f θ D , with three hidden layers made by ten neurons each. The activation function T a n h was used in all layers except the output layers of f θ A and f θ D in which the S i g m o i d function was employed to guarantee the scale of c [ 0 , 1 ] [40]. To train the PINN, the Adam optimizer [41] was utilized with a learning rate η = 0.001 , a batch size of L b a t c h = 512 , and a number of epochs of N e p o c h = 200 . Other architectures, e.g., a shallow NN with 10 neurons and a DNN with 20 neurons in each of the three hidden layers, have also been explored. However, these alternative architectures considered resulted in no improvement in performance while significantly increasing the computational burden of the model training. Thus, the relatively simple architecture presented in this work was preferred. The weight of the physical loss, μ , was set to 0.02 through a grid search approach based on the reconstruction accuracy of the model on the validation dataset. Figure 7 shows an example of the curve that contains the estimated equivalent flow coefficient generated by the trained f θ A under normal conditions, which follows the expected behavior of the flow coefficient with respect to displacement [31], As shown in Figure 7, the flow coefficient decreased monotonically as the displacement increased. Although the lack of real flow coefficients made it impossible to validate the exact values in Figure 7, the obtained results follow the expected physical trend of the flow coefficient of the valve. In this work, this trend was compared with another curve that contained the estimated equivalent flow coefficients in abnormal conditions. Through this comparison, the relative difference between the two curves was used to observe the degradation in the flow capacity of the regulating valve. Figure 8 shows an example of the reconstructed flow rate signals in the validation set along with the corresponding residuals. The Root Mean Squared Error (RMSE) of the flow rate signals reconstructed by the PINN came out to 1.735.
For the DeepSVDD model, the architecture of the DNN f θ s d was composed of one hidden layer with 64 neurons and an output layer with 32 neurons ( n s d = 32 feature representations), which were activated by the R e l u function. The batch size was set to 32, the number of epochs was set to 100, and the A d a m optimizer was once again employed for model training. The 32 extracted feature representations were then used to build a hypersphere that enclosed the majority of the data in normal conditions, thus enabling the detection of faults outside the hypersphere. To compute the robust anomaly indicators with the minimal false and missed alarms, the sizes of the time window w and the sliding step s s were set to 100 and 10, respectively, with w s s (i.e., 100 − 10 = 90) overlapping between the two consecutive time windows.
To verify and evaluate the performance of the proposed method, the state-of-the-art fault detection methods presented in Table 1 were considered for comparison.
To ensure a fair comparison, each of the aforementioned methods for signal reconstruction was combined with DeepSVDD for fault detection.
Performance comparisons for the simulated fault modes are reported in Table 2, Table 3 and Table 4. Both the proposed method and M 1 method showed the satisfactory performance (above 0.95 ) across all metrics for the three simulated fault modes of regulating valves, with the proposed method demonstrating the best performance. Although, with respect to the fault f3 (internal leakage, Table 4), the model M 2 provided slightly larger value of S p e c i f i c i t y (a measure of how well a model is able to detect normal condition cases), and its value of Recall (a measure of how well a model is able to detect the abnormal condition cases) with respect to all of the three simulated faults was significantly smaller than that of the proposed method. Due to the strongly nonlinear relationships in the data, both the PCA and AAKR methods exhibited limitations in accurately and robustly reconstructing the signals. Consequently, these limitations led to a decrease in their fault detection performances.
Regarding the estimated equivalent flow coefficient curves, Figure 9 shows a set of c v generated from the fine-tuned models f θ A h 2 using a validation dataset. c v represents the healthy region of c v under normal conditions, thus considering uncertainties and noise in the dataset. Figure 10 compares the c v of the normal conditions and the c v f 1 of fault f1, where c v f 1 was lower at all d i s points, thus indicating a decrease in the flow capacity due to the occurrence of the f1 fault mode (valve seat sedimentation), which is exactly consistent with the mechanism of f1 (valve seat sedimentation can cause the possible shrinkage of the opening area, which in turn decreases the flow capacity). In contrast, Figure 11 indicates an increase in the flow capacity for the f2 fault mode (valve seat erosion), as the opening area expands in relation to the nominal area. Similarly, Figure 12 shows an increase in the flow capacity for the f3 fault mode (internal leakage). Consequently, the estimated flow coefficient curves generated by the proposed multistage PINN model follow the physics-based mechanisms of different fault modes, thus offering insights useful for maintenance engineers.
The proposed method demonstrates satisfactory fault detection performance and provides consistent interpretations for the detected faults based on the physical mechanism. This aligns with the physics-based understanding of different fault modes, thus making it useful for potentially assisting maintenance engineers in comprehending the detected faults.

5.2. Real Case Study

5.2.1. Dataset Description

The time series data collected from a real industrial control valve within the DAMADICS benchmark project [31] were considered for the validation of the proposed method. The normal condition dataset consists of 55,000 time series observations of four signals, Δ p , d i s , t e m , Q , which were partitioned into training and validation sets containing 45,000 and 10,000 observations, respectively. The training set was used for training the model, and the validation set was used for hyperparameter setting. Figure 13 shows the time series evolutions of Q and d i s under normal operating conditions. The test dataset used to assess the fault detection performance of the proposed method contains 2095 observations in the normal condition and two fault modes of 415 observations in P a r t l y o p e n e d b y p a s s f a u l t (denoted as “f4”) and 205 observations in P o s i t i o n e r s u p p l y p r e s s u r e d r o p (denoted as “f5”).

5.2.2. Results and Discussion

The implementation of the proposed method with respect to the real case study was similar to those of the synthetic case except that the learning rate, η , and the weight of physical loss function, μ , were set to 0.003 and 0.3, respectively. Figure 14 shows an example of the reconstructed flow rate signals in the validation set along with the corresponding residuals. The RMSE of the flow rate signals reconstructed by the PINN was 4.184.
The performance of the proposed method was again compared with that of the state-of-the-art methods presented in Section 5.1.3. Table 5 and Table 6 present the performance metrics for faults f4 and f5, respectively. With respect to fault f4 (Table 5), model M 2 demonstrated the highest performance in terms of the accuracy, F 1 s c o r e , and A U C . Model M 2 and the proposed method showed comparable S p e c i f i c i t y values, thus indicating that both models are capable of detecting normal conditions. On the other hand, the proposed method had the highest Precision among all the models, thus suggesting its superior ability to detect abnormal conditions, which implies the lowest false alarm rate. With respect to fault f5 (Table 6), the proposed method consistently outperformed all the other models across the A c c u r a c y , P r e c i s i o n , R e c a l l , S p e c i f i c i t y , F 1 s c o r e , and A U C , which demonstrates its effectiveness in detecting both normal and abnormal conditions. It is important to note that, whereas model M 4 achieved a perfect R e c a l l value indicating perfect performance in detecting abnormal conditions, its performance concerning other metrics was significantly lower than that of the proposed method. This also highlights the importance of considering multiple performance metrics to assess the overall effectiveness of a fault detection method. Comparing these results to the synthetic case, the performances of models M 2 , M 3 , M 4 , and M 5 in this case study surpassed those previously obtained. This can be attributed to the relatively stable working conditions observed in the real case, as shown in Figure 13 (where the d i s signal varied within a limited range of 40% to 60%). In summary, the proposed method demonstrates superior overall performance for fault detection in regulating valves.
The trained proposed method was used to generate the equivalent flow coefficient curve. Figure 15 shows a set of c v curves generated from the fine-tuned models f θ A h 2 using the validation dataset, thus representing the healthy region of c v . For the p a r t l y o p e n e d b y p a s s fault f4, the estimated c v f 4 was significantly above c v (Figure 16), thus indicating an equivalent increase in flow capacity due to the open bypass of the regulating valves. For the p o s i t i o n e r s u p p l y p r e s s u r e d r o p fault f5, the estimated c v f 5 was partially overlapped with c v (Figure 17), thus indicating that the flow capacity of the regulating valve remained approximately equal to nominal. This observation aligns with the fault mechanism, thus indicating that the flow capacity of the regulating valve remained unaffected by the p o s i t i o n e r s u p p l y p r e s s u r e d r o p .
In summary, the proposed method shows superior fault detection performance and provides physical insights into the detected fault modes that are consistent with the physics-based mechanisms.

6. Conclusions

In this paper, a novel method based on the multistage PINN has been developed for the detection of faults in regulating the valves of NPPs. The method consists of a muiltistage PINN proposed to estimate the flow rate signal of a regulating valve expected under normal conditions and a DeepSVDD employed to analyze the residuals (the differences between the estimates and measurements of the flow rate) for the detection of faults, thus eliminating the need for manual threshold setting. Additionally, the fundamental valve sizing equation was integrated into the multistage PINN, thus enabling the estimation of the equivalent flow coefficient curve Cv, which in turn facilitates the interpretation of the fault detection results. The proposed method was applied to two case studies from the DAMADICS benchmark: (1) a synthetic simulation case implemented to emulate the operations of regulating valves in NPPs and (2) a real dataset collected from operating industrial control valves. The obtained results show that the performance of the proposed method is more satisfactory than other state-of-the-art methods, particularly in scenarios with varying working conditions. The results also show that the deviation of the estimated C v from its nominal values can provide a rational interpretation of the occurred fault mode from the perspective of the flow capacity, which intuitively can assist maintenance engineers in subsequent maintenance decisions.
Future work will be devoted to (1) the exploration of more advanced NN techniques, such as recurrent NN, Bayesian NN, and PINNs, integrated with the physical loss function proposed in this work to enhance the model capabilities in capturing temporal dependencies and complex patterns in the data and (2) the use of real data collected from operational regulating/safety valves in NPPs to calibrate the developed model and further validate its performance.

Author Contributions

Conceptualization, C.L., I.A., E.Z., W.L., Y.Z., W.Y. and J.C.; Methodology, C.L., I.A. and E.Z.; Software, C.L.; Validation, C.L., I.A. and E.Z.; Formal analysis, C.L., I.A. and E.Z.; Investigation, C.L., I.A. and E.Z.; Writing—original draft, C.L. and I.A.; Writing—review & editing, C.L., I.A., E.Z., W.L., Y.Z., W.Y. and J.C.; Visualization, C.L., I.A. and E.Z.; Supervision, I.A. and E.Z.; Project administration, E.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are publicly available from https://iair.mchtr.pw.edu.pl/Damadics.

Acknowledgments

Chenyang Lai gratefully acknowledges the financial support from the China Scholarship Council (No. 202006290009).

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AAKRAuto Associative Kernel Regression
AEAutoencoder
A I Anomaly Indicator
A U C Area Under the receiver operating characteristic Curve
CNNConvolutional Neural Network
DAMADICSDevelopment and Application of Methods for Actuator Diagnosis
in Industrial Control Systems
DNNDeep Neural Network
F P R False Positive Rate
HIHealth Indicator
LSTMLong Short-Term Memory
NNNeural Network
NPPNuclear Power Plant
PCAPrincipal Component Analysis
PHMPrognostics and Health Management
PINNPhysics-Informed Neural Network
RMSERoot Mean Squared Error
SSCsStructures, Systems, and Components
SVDDSupport Vector Data Description
SVMSupport Vector Machine
t p True Positive
t n True Negative
f p False Positive
f n False Negative
NNumber of physical quantities
x ( t ) Physical quantities measured at time t
Q ( t ) Flow rate measurement at time t
D t r a i n Training dataset
D v a l Validation dataset
f ( · ) The data-driven model that represents the behaviour of the regulating valve
in normal condition
Δ p Differential pressure across the valve
d i s Displacement of the regulating valve
ρ Fluid specific gravity
t e m Fluid temperature
c v Physical loss function
θ A Shallow NN parameters of the multistage PINN
θ D DNN parameters of the multistage PINN
L t o t a l Loss function of the multistage PINN
L d a t a Data-driven loss function
L p h y Physical loss function
μ Importance given to L p h y
wLength of time window
s s Number of sliding steps
r i Residuals of ith time window
nNumber of time windows
f θ s d Neural network of DeepSVDD
mCenter of the hypersphere of DeepSVDD
RRadius of the hypersphere of DeepSVDD
TNumber of patterns in the training dataset
KNumber of patterns in the validation dataset
A v a l Array of time windows generated from D v a l
c v o n Equivalent flow coefficients estimated online
f m ( · ) DAMADICS model of control valve
c o m                  Command signal for data generation
p 1 Upstream fluid pressure
p 2 Downstream fluid pressure

References

  1. Zio, E. Advancing nuclear safety. Front. Nucl. Eng. 2024, 2, 1346555. [Google Scholar] [CrossRef]
  2. Ma, J.; Jiang, J. Applications of fault detection and diagnosis methods in nuclear power plants: A review. Prog. Nucl. Energy 2011, 53, 255–266. [Google Scholar] [CrossRef]
  3. Baek, S.; Heo, G. Application of dynamic fault tree analysis to prioritize electric power systems in nuclear power plants. Energies 2021, 14, 4119. [Google Scholar] [CrossRef]
  4. Lin, Z.H.; Hou, C.W.; Zhang, L.; Guan, A.Q.; Jin, Z.J.; Qian, J.Y. Fluid-structure interaction analysis on vibration characteristics of sleeve control valve. Ann. Nucl. Energy 2023, 181, 109579. [Google Scholar] [CrossRef]
  5. Qian, J.Y.; Hou, C.W.; Mu, J.; Gao, Z.X.; Jin, Z.J. Valve core shapes analysis on flux through control valves in nuclear power plants. Nucl. Eng. Technol. 2020, 52, 2173–2182. [Google Scholar] [CrossRef]
  6. Qin, S.; Li, W. Detection and identification of faulty sensors in dynamic processes. AIChE J. 2001, 47, 1581–1593. [Google Scholar] [CrossRef]
  7. Park, G.; Lee, C.; Kim, J.; Ryu, J.; Jung, H. Identification of Nuclear Components Degradation by Time-Frequency Ridge Pattern; American Nuclear Society: La Grange Park, IL, USA, 2006. [Google Scholar]
  8. Kim, Y.; Kim, S.; Chung, H.; Park, Y.; Park, J. A study on technique to estimate impact location of loose part using Wigner-Ville distribution. Prog. Nucl. Energy 2003, 43, 261–266. [Google Scholar] [CrossRef]
  9. Choi, J.; Lee, S. Consistency index-based sensor fault detection system for nuclear power plant emergency situations using an LSTM network. Sensors 2020, 20, 1651. [Google Scholar] [CrossRef] [PubMed]
  10. Wang, H.; Peng, M.; Ayodeji, A.; Xia, H.; Wang, x.; Li, Z. Advanced fault diagnosis method for nuclear power plant based on convolutional gated recurrent network and enhanced particle swarm optimization. Ann. Nucl. Energy 2021, 151, 107934. [Google Scholar] [CrossRef]
  11. Liu, Z.; Liu, J.; Huang, Y.; Li, T.; Nie, C.; Xia, Y.; Zhan, L.; Tang, Z.; Zhang, L. Fault Critical Point Prediction Method of Nuclear Gate Valve with Small Samples Based on Characteristic Analysis of Operation. Materials 2022, 15, 757. [Google Scholar] [CrossRef]
  12. An, Z.; Cheng, L.; Guo, Y.; Ren, M.; Feng, W.; Sun, B.; Ling, J.; Chen, H.; Chen, W.; Luo, Y.; et al. A Novel Principal Component Analysis-Informer Model for Fault Prediction of Nuclear Valves. Machines 2022, 10, 240. [Google Scholar] [CrossRef]
  13. Huang, X.Y.; Xia, H.; Yin, W.Z.; Liu, Y.K.; Miyombo, M.E. Research on fault diagnosis method of electric gate valve under strong background noise. Ann. Nucl. Energy 2023, 194, 110055. [Google Scholar] [CrossRef]
  14. Ai, X.; Yongkuo, L.; Longfei, S.; Chunli, X.; Zhou, H. A concurrent fault diagnosis method for electric isolation valves in nuclear power plants based on rule-based reasoning and data-driven methods. Prog. Nucl. Energy 2024, 170, 105190. [Google Scholar] [CrossRef]
  15. Mendonça, L.F.; Sousa, J.M.C.; Costa, J.M.G. An architecture for fault detection and isolation based on fuzzy methods. Expert Syst. Appl. 2009, 36, 1092–1104. [Google Scholar] [CrossRef]
  16. Raduenz, H.; Mendoza, Y.; Ferronatto, D.; Souza, F.; Bastos, P.; Soares, J.; Negri, V. Online fault detection system for proportional hydraulic valves. J. Braz. Soc. Mech. Sci. Eng. 2018, 40, 331. [Google Scholar] [CrossRef]
  17. Han, X.; Jiang, J.; Xu, A.; Huang, X.; Pei, C.; Sun, Y. Fault detection of pneumatic control valves based on canonical variate analysis. IEEE Sensors J. 2021, 21, 13603–13615. [Google Scholar] [CrossRef]
  18. Venkata, S.K.; Rao, S. Fault detection of a flow control valve using vibration analysis and support vector machine. Electronics 2019, 8, 1062. [Google Scholar] [CrossRef]
  19. Kordestani, M.; Zanj, A.; Orchard, M.E.; Saif, M. A modular fault diagnosis and prognosis method for hydro-control valve system based on redundancy in multisensor data information. IEEE Trans. Reliab. 2018, 68, 330–341. [Google Scholar] [CrossRef]
  20. Qi, B.; Liang, J.; Tong, J. Fault diagnosis techniques for nuclear power plants: A review from the artificial intelligence perspective. Energies 2023, 16, 1850. [Google Scholar] [CrossRef]
  21. Yang, Z.; Baraldi, P.; Zio, E. A method for fault detection in multi-component systems based on sparse autoencoder-based deep neural networks. Reliab. Eng. Syst. Saf. 2022, 220, 108278. [Google Scholar] [CrossRef]
  22. Zhang, K.; Liu, Y.; Gu, Y.; Ruan, X.; Wang, J. Multiple-timescale feature learning strategy for valve stiction detection based on convolutional neural network. IEEE/ASME Trans. Mechatronics 2021, 27, 1478–1488. [Google Scholar] [CrossRef]
  23. Zhu, S.B.; Li, Z.L.; Li, X.; Xu, H.h.; Wang, X.m. Convolutional neural networks-based valve internal leakage recognition model. Measurement 2021, 178, 109395. [Google Scholar] [CrossRef]
  24. Yucesan, Y.A.; Viana, F.A. A physics-informed neural network for wind turbine main bearing fatigue. Int. J. Progn. Health Manag. 2020, 11, 17. [Google Scholar] [CrossRef]
  25. Shen, S.; Lu, H.; Sadoughi, M.; Hu, C.; Nemani, V.; Thelen, A.; Webster, K.; Darr, M.; Sidon, J.; Kenny, S. A physics-informed deep learning approach for bearing fault detection. Eng. Appl. Artif. Intell. 2021, 103, 104295. [Google Scholar] [CrossRef]
  26. Li, W.; Deka, D. Physics-informed learning for high impedance faults detection. In Proceedings of the 2021 IEEE Madrid PowerTech, Madrid, Spain, 28 June–2 July 2021; pp. 1–6. [Google Scholar]
  27. Sadoughi, M.; Hu, C. Physics-based convolutional neural network for fault diagnosis of rolling element bearings. IEEE Sensors J. 2019, 19, 4181–4192. [Google Scholar] [CrossRef]
  28. Guo, W.; He, C.; Shao, P. A novel system identification method for servo-hydraulic shaking table using physics-guided long short-term memory network. Mech. Syst. Signal Process. 2022, 178, 109277. [Google Scholar] [CrossRef]
  29. Sun, W.; Akashi, N.; Kuniyoshi, Y.; Nakajima, K. Physics-informed recurrent neural networks for soft pneumatic actuators. IEEE Robot. Autom. Lett. 2022, 7, 6862–6869. [Google Scholar] [CrossRef]
  30. Guc, F.; Chen, Y. Fault Cause Assignment with Physics Informed Transfer Learning. IFAC-PapersOnLine 2021, 54, 53–58. [Google Scholar] [CrossRef]
  31. Bartyś, M.; Patton, R.; Syfert, M.; de las Heras, S.; Quevedo, J. Introduction to the DAMADICS actuator FDI benchmark study. Control Eng. Pract. 2006, 14, 577–596. [Google Scholar] [CrossRef]
  32. Yi, J.; Yoon, S. Patch svdd: Patch-level svdd for anomaly detection and segmentation. In Proceedings of the Asian Conference on Computer Vision 2020, Kyoto, Japan, 30 November–4 December 2020. [Google Scholar]
  33. Bartyś, M. Specification of Actuators Intended to Use for Benchmark Definition; Warsaw University of Technology: Warszawa, Poland, 2001. [Google Scholar]
  34. Qiu, C.; Jiang, C.H.; Zhang, H.; Wu, J.Y.; Jin, Z.J. Pressure drop and cavitation analysis on sleeve regulating valve. Processes 2019, 7, 829. [Google Scholar] [CrossRef]
  35. Baydin, A.G.; Pearlmutter, B.A.; Radul, A.A.; Siskind, J.M. Automatic differentiation in machine learning: A survey. J. Marchine Learn. Res. 2018, 18, 1–43. [Google Scholar]
  36. Xu, M.; Baraldi, P.; Lu, X.; Zio, E. Generative Adversarial Networks With AdaBoost Ensemble Learning for Anomaly Detection in High-Speed Train Automatic Doors. IEEE Trans. Intell. Transp. Syst. 2022, 23, 23408–23421. [Google Scholar] [CrossRef]
  37. Bucho, L.; Palomo, M.; Vaquer, J.; Lopez, B.; Ruiz, G.; Mora, J.; Verdu, G. Study of the Characteristic Response of Pressure Control System in Order to Obtain the Design Parameters of the New Control System MARK VI Turbine in Cofrentes Nuclear Power Plant. Int. Conf. Nucl. Eng. 2010, 49293, 835–840. [Google Scholar]
  38. Sapra, M.; Kundu, S.; Pal, A.; Vijayan, P.; Vaze, K.; Sinha, R. Design and development of innovative passive valves for Nuclear Power Plant applications. Nucl. Eng. Des. 2015, 286, 195–204. [Google Scholar] [CrossRef]
  39. Benbella, S. A Mixture loss coefficient of safety valves used in nuclear plants. Nucl. Eng. Des. 2009, 239, 1779–1788. [Google Scholar] [CrossRef]
  40. Sharma, S.; Sharma, S.; Athaiya, A. Activation functions in neural networks. Towards Data Sci 2017, 6, 310–316. [Google Scholar] [CrossRef]
  41. Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
  42. Ahmed, I.; Heo, G.; Zio, E. On-line process monitoring during transient operations using weighted distance Auto Associative Bilateral Kernel Regression. ISA Trans. 2019, 92, 191–212. [Google Scholar] [CrossRef]
  43. Hines, J.; Uhrig, R.; Wrest, D. Use of autoassociative neural networks for signal validation. J. Intell. Robot. Syst. 1998, 21, 143–154. [Google Scholar] [CrossRef]
Figure 1. Proposed method for fault detection in regulating valves.
Figure 1. Proposed method for fault detection in regulating valves.
Energies 17 02647 g001
Figure 2. Flowchart of DeepSVDD model.
Figure 2. Flowchart of DeepSVDD model.
Energies 17 02647 g002
Figure 3. Implementation procedure of the proposed method.
Figure 3. Implementation procedure of the proposed method.
Energies 17 02647 g003
Figure 4. Structure of the control valve in DAMADICS benchmark; adapted with permission from [31], Elsevier, 2006.
Figure 4. Structure of the control valve in DAMADICS benchmark; adapted with permission from [31], Elsevier, 2006.
Energies 17 02647 g004
Figure 5. DAMADICS module for data generation.
Figure 5. DAMADICS module for data generation.
Energies 17 02647 g005
Figure 6. An example of faulty time series: (a) the fault is injected at 500 s; (b) the evolution of the flow signal.
Figure 6. An example of faulty time series: (a) the fault is injected at 500 s; (b) the evolution of the flow signal.
Energies 17 02647 g006
Figure 7. An equivalent flow coefficient curve, c v , indicating the relationship between the displacement of regulating valve, d i s , and the corresponding flow coefficient, C v , that represents the flow capacity.
Figure 7. An equivalent flow coefficient curve, c v , indicating the relationship between the displacement of regulating valve, d i s , and the corresponding flow coefficient, C v , that represents the flow capacity.
Energies 17 02647 g007
Figure 8. Flow rate signal reconstructions of normal condition (synthetic case): (a) the reconstructed signal; (b) the residuals.
Figure 8. Flow rate signal reconstructions of normal condition (synthetic case): (a) the reconstructed signal; (b) the residuals.
Energies 17 02647 g008
Figure 9. Set of equivalent flow coefficient curves, c v , generated from the fine-tuned model f θ A h 2 under normal conditions.
Figure 9. Set of equivalent flow coefficient curves, c v , generated from the fine-tuned model f θ A h 2 under normal conditions.
Energies 17 02647 g009
Figure 10. Set of equivalent flow coefficient curves, c v f 1 , generated from the fine-tuned model f θ A h 2 under f1 fault mode.
Figure 10. Set of equivalent flow coefficient curves, c v f 1 , generated from the fine-tuned model f θ A h 2 under f1 fault mode.
Energies 17 02647 g010
Figure 11. Set of equivalent flow coefficient curves, c v f 2 , generated from the fine-tuned model f θ A h 2 under f2 fault mode.
Figure 11. Set of equivalent flow coefficient curves, c v f 2 , generated from the fine-tuned model f θ A h 2 under f2 fault mode.
Energies 17 02647 g011
Figure 12. Set of equivalent flow coefficient curves, c v f 3 , generated from the fine-tuned model f θ A h 2 under f3 fault mode.
Figure 12. Set of equivalent flow coefficient curves, c v f 3 , generated from the fine-tuned model f θ A h 2 under f3 fault mode.
Energies 17 02647 g012
Figure 13. Time series evolutions of measurement signals under normal operating conditions: (a) flow signal Q ( t ) and (b) displacement of the valve d i s ( t ) .
Figure 13. Time series evolutions of measurement signals under normal operating conditions: (a) flow signal Q ( t ) and (b) displacement of the valve d i s ( t ) .
Energies 17 02647 g013
Figure 14. Flow rate signal reconstructions of normal condition (real case): (a) the reconstructed signal and (b) the residuals.
Figure 14. Flow rate signal reconstructions of normal condition (real case): (a) the reconstructed signal and (b) the residuals.
Energies 17 02647 g014
Figure 15. Set of equivalent flow coefficient curves, c v , generated from the fine-tuned model f θ A h 2 using validation dataset under normal conditions (real case).
Figure 15. Set of equivalent flow coefficient curves, c v , generated from the fine-tuned model f θ A h 2 using validation dataset under normal conditions (real case).
Energies 17 02647 g015
Figure 16. Set of equivalent flow coefficient curves, c v f 4 , generated from the fine-tuned model f θ A h 2 using test dataset under p a r t l y o p e n e d b y p a s s fault (f4).
Figure 16. Set of equivalent flow coefficient curves, c v f 4 , generated from the fine-tuned model f θ A h 2 using test dataset under p a r t l y o p e n e d b y p a s s fault (f4).
Energies 17 02647 g016
Figure 17. Set of equivalent flow coefficient curves, c v f 5 , generated from the fine-tuned model f θ A h 2 using test dataset under p o s i t i o n e r s u p p l y p r e s s u r e d r o p fault (f5).
Figure 17. Set of equivalent flow coefficient curves, c v f 5 , generated from the fine-tuned model f θ A h 2 using test dataset under p o s i t i o n e r s u p p l y p r e s s u r e d r o p fault (f5).
Energies 17 02647 g017
Table 1. Comparison methods.
Table 1. Comparison methods.
MethodDescriptionNotation
DNNDNN model with three hidden layers and ten neurons in each layer. M 1
AAKRThe Auto Associative Kernel Regression (AAKR) [42] method for reconstructing all signals, Δ p , d i s , t e m , Q . The optimal bandwidth parameter h is set to 0.15 using a grid search approach on the validation dataset, and the anomaly indicator defined in Equation (11) is used. M 2
AAKRThe AAKR model in M 2 is considered, but the anomaly indicator is defined as A I i M 3 = R i A L 2 2 , where R i A is the residual matrix between predictions and measurements of Δ p , d i s , t e m , Q . M 3
PCAThe PCA method [43] is developed to reconstruct the signals Δ p , d i s , t e m , Q . The number of principal components selected for signal reconstruction is 3, which explains at least 80% of the data variability, and the anomaly indicator defined in Equation (11) is used. M 4
PCAThe PCA method in M 4 is used, but the anomaly indicator is defined as A I i M 5 = R i P L 2 2 . M 5
Table 2. Detection performance on f1: Sedimentation of valve plug or valve seat.
Table 2. Detection performance on f1: Sedimentation of valve plug or valve seat.
M1M2M3M4M5Proposed
Accuracy0.96720.65370.40130.31320.30710.9702
Precision0.95440.92030.76250.50150.46510.9585
Recall1.00000.54300.18670.06190.05671.0000
Specificity0.89510.89680.87240.86500.85690.9049
F1-score0.97670.68300.29990.11010.10100.9788
AUC0.94760.71990.52950.46350.45680.9524
Table 3. Detection performance on f2: Erosion of valve plug or valve seat.
Table 3. Detection performance on f2: Erosion of valve plug or valve seat.
M1M2M3M4M5Proposed
Accuracy0.96670.46160.41910.38070.34730.9715
Precision0.95370.84850.78520.72240.63860.9602
Recall1.00000.26330.21260.16000.11521.0000
Specificity0.89350.89680.87240.86500.85690.9089
F1-score0.97630.40190.33460.26200.19520.9799
AUC0.94680.58000.54250.51250.48610.9545
Table 4. Detection performance on f3: Internal leakage or valve tightness.
Table 4. Detection performance on f3: Internal leakage or valve tightness.
M1M2M3M4M5Proposed
Accuracy0.98030.91050.38820.32850.33710.9820
Precision0.97090.97630.65300.36360.45420.9732
Recall1.00000.88500.14300.03200.05451.0000
Specificity0.94290.95910.85520.89330.87520.9476
F1-score0.98520.92840.23460.05880.09730.9864
AUC0.97140.92200.49910.46270.46490.9738
Table 5. Detection performance on f4: Partly opened bypass.
Table 5. Detection performance on f4: Partly opened bypass.
M1M2M3M4M5Proposed
Accuracy0.87740.98110.87740.79250.84910.9623
Precision0.45830.84620.45830.33330.40740.8889
Recall1.00001.00001.00001.00001.00000.7273
Specificity0.86320.97900.86320.76840.83160.9895
F1-score0.62860.91670.62860.50000.57900.8000
AUC0.93160.98950.93160.88420.91580.8584
Table 6. Detection performance on f5: Positioner supply pressure drop.
Table 6. Detection performance on f5: Positioner supply pressure drop.
M1M2M3M4M5Proposed
Accuracy0.85940.88280.82810.71090.67970.9063
Precision0.75000.77420.61360.46380.43480.8333
Recall0.65630.75000.84381.00000.93750.7813
Specificity0.92710.92710.82290.61460.59380.9479
F1-score0.70000.76190.71050.63370.59410.8065
AUC0.79170.83850.83330.80730.76560.8646
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lai, C.; Ahmed, I.; Zio, E.; Li, W.; Zhang, Y.; Yao, W.; Chen, J. A Multistage Physics-Informed Neural Network for Fault Detection in Regulating Valves of Nuclear Power Plants. Energies 2024, 17, 2647. https://doi.org/10.3390/en17112647

AMA Style

Lai C, Ahmed I, Zio E, Li W, Zhang Y, Yao W, Chen J. A Multistage Physics-Informed Neural Network for Fault Detection in Regulating Valves of Nuclear Power Plants. Energies. 2024; 17(11):2647. https://doi.org/10.3390/en17112647

Chicago/Turabian Style

Lai, Chenyang, Ibrahim Ahmed, Enrico Zio, Wei Li, Yiwang Zhang, Wenqing Yao, and Juan Chen. 2024. "A Multistage Physics-Informed Neural Network for Fault Detection in Regulating Valves of Nuclear Power Plants" Energies 17, no. 11: 2647. https://doi.org/10.3390/en17112647

APA Style

Lai, C., Ahmed, I., Zio, E., Li, W., Zhang, Y., Yao, W., & Chen, J. (2024). A Multistage Physics-Informed Neural Network for Fault Detection in Regulating Valves of Nuclear Power Plants. Energies, 17(11), 2647. https://doi.org/10.3390/en17112647

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop