A Neural-Network-Based Nonlinear Adaptive State-Observer for Pressurized Water Reactors

Dong, Zhe

doi:10.3390/en6105382


Zhe Dong

Institute of Nuclear and New Energy Technology, Tsinghua University, Beijing 100084, China

Energies 2013, 6(10), 5382-5401; https://doi.org/10.3390/en6105382

Submission received: 29 August 2013 / Revised: 12 October 2013 / Accepted: 14 October 2013 / Published: 18 October 2013


Abstract:

Although there have been severe nuclear accidents such as Three Mile Island (USA), Chernobyl (Ukraine) and Fukushima (Japan), nuclear fission remains a commercially available and economically competitive source of clean energy that can substitute for fossil fuels centrally and at large scale. Since the pressurized water reactor (PWR) is the most widely used nuclear fission reactor, its safe, stable and efficient operation is important to the current rebirth of the nuclear fission energy industry. Power-level regulation is an important technique that strongly affects the operational stability and efficiency of PWRs. Compared with classical power-level controllers, advanced power-level regulators can strengthen both closed-loop stability and control performance by feeding back the internal state-variables. However, not all of the internal state-variables of a PWR can be obtained directly by measurement. To implement an advanced PWR power-level control law, it is necessary to develop a state-observer that reconstructs the unmeasurable state-variables. Since a PWR is naturally a complex nonlinear system with parameters varying with power-level, fuel burnup, xenon isotope production, control rod worth, etc., it is meaningful to design a nonlinear observer for the PWR that adapts to system uncertainties. Motivated by this and by the strong learning capability of the multi-layer perceptron (MLP) neural network, an MLP-based nonlinear adaptive observer is given for PWRs. Based upon Lyapunov stability theory, it is proved theoretically that this newly-built observer can provide bounded and convergent state-observation. This observer is then applied to the state-observation of a special PWR, i.e., the nuclear heating reactor (NHR), and numerical simulation results not only verify its feasibility but also give the relationship between the observation performance and observer parameters.

Keywords:

PWR; state-observation; nonlinear observer; multilayer neural network

1. Introduction

The growing demand for electricity and the pollution caused by burning fossil fuels have led to a renaissance of the nuclear energy industry, despite severe accidents such as Three Mile Island (USA), Chernobyl (Ukraine) and Fukushima (Japan). Since power-level control is a crucial technique for guaranteeing the operational stability and efficiency of nuclear reactors, developing high-performance power-level regulators is meaningful for the current rebirth of the nuclear energy industry. Compared with classical static output-feedback power-level control laws, advanced power regulation strategies have the potential to strengthen both closed-loop stability and control performance by feeding back the internal system state-variables. Due to the absence of adequate sensors, some state-variables associated with the dynamics of a nuclear reactor are not available for measurement. In order to implement advanced power-level control strategies for stronger dynamic performance, some observation structure should be used to reconstruct the state-variables that cannot be obtained directly through measurement. In this case, a simple solution is to utilize linear observers such as the Luenberger observer [1] and the Kalman filter [2,3]. However, the dynamic behavior of a given nuclear reactor exhibits strong nonlinearity and depends on many factors such as power-level, fuel burnup, etc. Linear observers can only provide satisfactory performance in a small neighborhood of an operating point. Thus, if large variations of the system state-variables are required, especially in the case of load following, linear observers are no longer effective, and nonlinear observers should be developed. Shtessel gave a sliding mode observer to construct a dynamic output-feedback loop with a static state-feedback sliding mode controller for regulating the power-level of the space nuclear reactor TOPAZ II [4]. Etchepareborda applied the high gain observer to design a nonlinear model predictive power-level control for a pressurized water reactor (PWR)-like research reactor [5]. Dong proposed the dissipation-based high gain filter (DHGF) for the state-observation of PWRs [6], and then applied the DHGF to build dynamic output-feedback power-level control laws [7,8]. However, the precondition for applying these nonlinear observers is knowledge of an accurate lumped-parameter dynamic model of the given nuclear reactor. Although some schemes have been introduced to strengthen the adaptation of nonlinear observers to system uncertainties, they impose strong constraints on the form of those uncertainties [9]. Therefore, more advanced schemes are needed to further improve the adaptability of nonlinear observation.

Artificial neural networks (ANNs), inspired by biological neural networks, are composed of simple processing elements called neurons, normally arranged in layers and interconnected by weighted connections. This architecture, along with a learning algorithm for adjusting the connection weights, exhibits useful properties such as learning, approximation and parallel distributed processing capability. The radial basis function (RBF) network and the multi-layer perceptron (MLP) network are two widely utilized ANNs. It has been proven theoretically that both the RBF [10,11] and MLP [12,13,14] networks can approximate a wide range of nonlinear functions to any desired degree of accuracy under certain conditions. In recent years, ANNs have also been applied to nuclear engineering, particularly to reactor control. Ku, Lee and Edwards applied the diagonal recurrent neural network (DRNN) to a nuclear reactor model to improve its temperature response; here the DRNNs must be trained offline by a linearized reactor model and a pre-designed optimal temperature control [15]. Arab-Alibeik and Setayeshi designed a neural adaptive inverse controller for regulating the power-level of a PWR, and here the ANN was also trained offline by a reactor model [16]. These works show that the identification must be sufficiently accurate before control action is initiated. However, in practical control applications, it is desirable to have a systematic method of ensuring the stability and robustness of the overall system. In the past few years, several ANN-based control laws for nonlinear systems have been proposed based upon Lyapunov stability theory. One main advantage of these schemes is that the adaptive laws are derived by the Lyapunov synthesis method and thus can guarantee closed-loop stability. Ge et al. proposed an adaptive state-feedback control law for a large class of nonlinear systems based on the RBF network, and the regulating error was proved to converge to a small neighborhood of the origin by using Lyapunov stability theory [17]. Moreover, state-feedback control design methods based on the MLP network were also studied for nonlinear systems in Brunovsky, pure-feedback and lower-triangular forms by using Lyapunov stability theory and the techniques of feedback linearization and backstepping [18,19,20,21,22]. It is clear that designing a satisfactory state-observer is the precondition for implementing advanced state-feedback control laws. Since system dynamics uncertainties usually exist, adaptive observer design based upon ANNs is another active research topic. Vargas and Hemerly proposed an adaptive observer for unknown general nonlinear systems based upon both RBF networks and Lyapunov stability theory, and the adaptation laws of the weights provide bounded-error performance [23]. By the use of the adaptive bounding technique, Stepanyan and Hovakimyan gave an RBF-based adaptive observer which could provide asymptotically convergent state estimation for a class of uncertain nonlinear systems [24]. Very recently, Yang et al. also designed a stable RBF-based observer to build a model referenced adaptive controller (MRAC) for an electrohydraulic system [25]. Since the MLP network is nonlinear in its parameters and can be applied to many systems with arbitrary degrees of nonlinearity and complexity, it has already been used to design adaptive observers. Abdollahi et al. gave an MLP-based observer for nonlinear systems by the Lyapunov direct method, and then applied it to the state-estimation of flexible-joint manipulators [26]. Pérez-Cruz and Poznyak gave a stable observer for estimating the precursor power and internal reactivity of a nuclear reactor by combining the MLP network and the sliding mode technique [27]. Talebi et al. designed a recurrent neural-network-based state-observer for sensor and actuator fault detection of a satellite's attitude control subsystem [28].

Since a nuclear fission reactor is by nature a complex nonlinear system with parameters varying with time as a function of power-level, fuel burnup, xenon isotope production, control rod worth, etc., it is necessary to design nonlinear observers for nuclear reactors that adapt to those parameter uncertainties. In this paper, a nonlinear adaptive observer is developed for PWRs by the use of the MLP network. Based upon Lyapunov stability theory, both the boundedness and the convergence of the observation error are first proved. Then, this observer is applied to the state-observation of a nuclear heating reactor (NHR), which is a special type of PWR with properties such as natural circulation and self-pressurization. Numerical simulation results not only verify the feasibility of this newly-built observer but also show the relationship between its parameters and performance.

2. Problem Formulation

2.1. Dynamic Model for Observer Design

The reactor model for observer design in this paper is the point kinetics with one equivalent delayed neutron group and temperature feedback from both the fuel and coolant temperature, which is given as follows [6,7,8,29]:

(1)

where n_r is the relative nuclear power, c_r is the relative concentration of delayed neutron precursor, β is the fraction of delayed neutrons, Λ is the effective prompt neutron lifetime, λ is the decay constant of delayed neutron precursor, α_f and α_c are respectively the temperature reactivity feedback coefficients of the fuel and the coolant, T_f is the fuel temperature, T_cav and T_cin are respectively the average and inlet coolant temperatures of the reactor core, T_f,m and T_cav,m are respectively the initial equilibrium values of T_f and T_cav, Ω is the heat transfer coefficient between fuel and coolant, M is the mass flow rate times the heat capacity of the coolant, P₀ is the rated thermal power, ρ_r is the reactivity induced by the control rods, μ_f is the total heat capacity of the fuel elements, μ_c is the total heat capacity of the reactor coolant, G_r is the total reactivity worth of control rods, and z_r is the control input, i.e., the speed signal of control rods.
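To make the structure of this model concrete, the following is a minimal Python sketch, assuming the standard one-delayed-group point kinetics form with lumped fuel and coolant heat balances. The right-hand sides are an assumption consistent with the symbol definitions above rather than a transcription of Equation (1). The names `ReactorParams`, `equilibrium` and `pwr_rhs` are hypothetical, the default `G_r` is a placeholder, and M is taken in MW/°C so that its units match Ω and P₀.

```python
from dataclasses import dataclass


@dataclass
class ReactorParams:
    """Lumped parameters; defaults follow Table 1 unless marked as assumptions."""
    beta: float = 0.0069       # delayed neutron fraction
    Lambda: float = 4.18e-5    # effective prompt neutron lifetime (s)
    lam: float = 0.08          # precursor decay constant (1/s)
    alpha_f: float = -2.48e-5  # fuel temperature feedback coefficient (1/degC)
    alpha_c: float = -2.71e-4  # coolant temperature feedback coefficient (1/degC)
    mu_f: float = 5.01         # fuel heat capacity (MW s/degC)
    mu_c: float = 69.23        # coolant heat capacity (MW s/degC)
    Omega: float = 1.06        # fuel-coolant heat transfer coefficient (MW/degC)
    M: float = 4.29            # flow rate times heat capacity, ASSUMED in MW/degC
    P0: float = 200.0          # rated thermal power (MW)
    G_r: float = 1.0e-3        # rod worth, HYPOTHETICAL placeholder value


def equilibrium(n_r0, T_cin, p):
    """Reference steady state [n_r, c_r, T_f, T_cav, rho_r].

    The returned temperatures are meant to be used as T_f,m and T_cav,m,
    so the rod reactivity at this equilibrium is zero.
    """
    T_cav0 = T_cin + p.P0 * n_r0 / (2.0 * p.M)
    T_f0 = T_cav0 + p.P0 * n_r0 / p.Omega
    return [n_r0, n_r0, T_f0, T_cav0, 0.0]


def pwr_rhs(state, z_r, T_cin, T_f_m, T_cav_m, p):
    """Assumed right-hand side of the one-group point kinetics model."""
    n_r, c_r, T_f, T_cav, rho_r = state
    # Total reactivity: control rods plus fuel and coolant temperature feedback.
    rho = rho_r + p.alpha_f * (T_f - T_f_m) + p.alpha_c * (T_cav - T_cav_m)
    dn_r = (rho - p.beta) / p.Lambda * n_r + p.beta / p.Lambda * c_r
    dc_r = p.lam * (n_r - c_r)
    dT_f = (p.P0 * n_r - p.Omega * (T_f - T_cav)) / p.mu_f
    dT_cav = (p.Omega * (T_f - T_cav) - 2.0 * p.M * (T_cav - T_cin)) / p.mu_c
    drho_r = p.G_r * z_r
    return [dn_r, dc_r, dT_f, dT_cav, drho_r]
```

Evaluating `pwr_rhs` at the state returned by `equilibrium`, with zero rod speed, should give derivatives that vanish up to rounding, which is a quick consistency check on any parameter set.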

Suppose that n_r0, c_r0, T_f0, T_cav0, T_cin0 and ρ_r0 are respectively the steady values of n_r, c_r, T_f, T_cav, T_cin and ρ_r, which satisfy:

{\dot{n}}_{r0} = {\dot{c}}_{r0} = {\dot{T}}_{f0} = {\dot{T}}_{cav0} = {\dot{T}}_{cin0} = {\dot{ρ}}_{r0} = 0

(2)

Define the deviations between the actual and the steady values of n_r, c_r, T_f, T_cav, T_cin and ρ_r as:

\begin{cases} δ n_{r} = n_{r} - n_{r0} \\ δ c_{r} = c_{r} - c_{r0} \\ δ T_{f} = T_{f} - T_{f0} \\ δ T_{cav} = T_{cav} - T_{cav0} \\ δ T_{cin} = T_{cin} - T_{cin0} \\ δ ρ_{r} = ρ_{r} - ρ_{r0} \end{cases}

(3)

Moreover, let:

x = {[\begin{matrix} δ n_{r} & δ c_{r} & δ T_{f} & δ T_{cav} \end{matrix}]}^{T}

(4)

ξ = δ ρ_{r}

(5)

and:

u = G_{r} z_{r}

(6)

Based on Equations (1) and (2), the nonlinear state-space model for observer design can be written as:

(7)

where:

(8)

(9)

(10)

and the bounded vector θ ∈ R⁴ denotes other modeling uncertainty.

2.2. Approximating System Uncertainty by MLP Network

The MLP network with one hidden layer can be expressed as:

G_{MLP} (z) = W^{T} S (V^{T} z)

(11)

where z ∈ R^{n} is the input vector, V ∈ R^{n×l} and W ∈ R^{l×n} are respectively the first-to-second layer and second-to-third layer interconnection matrices, l is the number of neurons in the hidden layer, and:

S (V^{T} z) = {[\begin{matrix} s (v_{1}^{T} z) & \cdots & s (v_{l}^{T} z) \end{matrix}]}^{T}

(12)

Here, vector v_i (i = 1, …, l) is the ith column of interconnection matrix V, and activation function s is chosen as the continuous and differentiable nonlinear sigmoidal function, i.e.,:

s (v) = \frac{1}{1 + e^{- v}}

(13)

It has been proved in [12] that if the node number l of the hidden layer is large enough, then MLP network Equation (11) can approximate any continuous function to arbitrary accuracy on a compact set, from which we can see that there must exist proper weight matrices W and V such that:

(14)

and:

{‖ d_{e} ‖}_{2} < ε

(15)

where ε is a bounded positive scalar, U is a given positive definite matrix and vector σ is defined by Equation (10). In practical engineering, σ is usually a norm-bounded system uncertainty, so there is no loss of generality in assuming that:

{‖ W ‖}_{F} \leq w_{m}

(16)

and:

{‖ V ‖}_{F} \leq v_{m}

(17)

where, for a matrix A = (a_ij) ∈ R^{m×n}, the Frobenius norm ‖·‖_F is defined as:

{‖ A ‖}_{F} = \sqrt{tr (A^{T} A)} = \sqrt{\sum_{i = 1}^{m} \sum_{j = 1}^{n} a_{ij}^{2}}

(18)
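As an illustration of this section, the following is a minimal Python sketch of a one-hidden-layer network with the sigmoid activation of Equation (13) and of the Frobenius norm used in Assumptions (16) and (17). It assumes the output is formed as W^T S(V^T z), with V of size n×l and W of size l×n as stated above. The function names are hypothetical and matrices are plain lists of rows.

```python
import math


def sigmoid(v):
    """Logistic activation s(v) = 1 / (1 + exp(-v)), evaluated without overflow."""
    if v >= 0.0:
        return 1.0 / (1.0 + math.exp(-v))
    ev = math.exp(v)
    return ev / (1.0 + ev)


def mlp_output(W, V, z):
    """Return W^T S(V^T z) for V (n x l), W (l x n_out) and input z (length n)."""
    n, l = len(V), len(V[0])
    # Hidden layer: the j-th neuron sees the inner product of column v_j with z.
    hidden = [sigmoid(sum(V[i][j] * z[i] for i in range(n))) for j in range(l)]
    n_out = len(W[0])
    return [sum(W[j][k] * hidden[j] for j in range(l)) for k in range(n_out)]


def frobenius_norm(A):
    """Square root of the sum of squared entries of A."""
    return math.sqrt(sum(a * a for row in A for a in row))
```

With all entries of V equal to zero, every hidden neuron outputs s(0) = 0.5, so the network output reduces to half the column sums of W, which gives a simple sanity check.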

2.3. Theoretic Problem Formulation

Usually, δn_r and δT_cav can be obtained directly from measurement, and the output of system Equation (7) can be defined as:

(19)

where:

C = [\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}]

(20)

Choose the state-observer of system Equation (7) as:

\begin{cases} \dot{\hat{x}} = f (\hat{x}) + g (\hat{x}) \hat{ξ} + K_{O} e_{O} + U {\hat{G}}_{MLP} (\hat{x}) \\ \dot{\hat{ξ}} = - k_{O ξ} (n_{r0} + x_{1}) e_{1} + u \end{cases}

(21)

where \hat{x} \in R^{4} and \hat{ξ} \in R are respectively the estimates of x and ξ, vector-valued functions f and g are determined by Equations (8) and (9), respectively:

(22)

(23)

\hat{W} and \hat{V} are weighting matrices of MLP network \hat{G}_MLP, and both K_O and k_Oξ are observer gains. Then, the theoretic problem to be solved in this paper is summarized as follows.

Problem 1. How to design observer gains K_O and k_Oξ and the learning algorithms of weighting matrices \hat{W} and \hat{V} of \hat{G}_MLP so that nonlinear adaptive observer Equation (21) is bounded and convergent?
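The structure of observer Equation (21) can be sketched in Python as one integration step. This is only an illustration under stated assumptions: explicit Euler integration is chosen here for simplicity, the functions f and g, the gain matrix K_O, the matrix U and the network output are supplied by the caller, and the output error is taken as e_O = C\hat{x} - y, which follows from e = \hat{x} - x and e_O = Ce. All function and argument names are hypothetical.

```python
def mat_vec(A, v):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * b for a, b in zip(row, v)) for row in A]


def observer_step(x_hat, xi_hat, y, u, dt, f, g, K_O, k_Oxi, n_r0, U, G_hat):
    """One explicit Euler step of the observer structure in Equation (21).

    x_hat : list of 4 state estimates, xi_hat : estimate of xi
    y     : measured outputs [delta n_r, delta T_cav]
    f, g  : callables returning length-4 lists (Equations (8) and (9))
    K_O   : 4x2 gain matrix, U : 4x4 matrix, G_hat : callable network output
    """
    # Output error e_O = C e with e = x_hat - x, i.e. C x_hat - y.
    e_O = [x_hat[0] - y[0], x_hat[3] - y[1]]
    fx, gx = f(x_hat), g(x_hat)
    correction = mat_vec(K_O, e_O)
    compensation = mat_vec(U, G_hat(x_hat))
    x_dot = [fx[i] + gx[i] * xi_hat + correction[i] + compensation[i]
             for i in range(4)]
    # x_1 is the measured delta n_r, so n_r0 + x_1 is the measured relative power.
    xi_dot = -k_Oxi * (n_r0 + y[0]) * e_O[0] + u
    x_next = [x_hat[i] + dt * x_dot[i] for i in range(4)]
    return x_next, xi_hat + dt * xi_dot
```

Passing f, g and the network as callables keeps the sketch independent of the particular model and learning law, so the same step function can be reused once those are fixed.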

3. Observer Design

It is clear that solving Problem 1 is equivalent to giving the tuning approach for both feedback gains K_O and k_Oξ and weighting matrices \hat{W} and \hat{V} of \hat{G}_MLP. In this section, this tuning approach, which provides bounded and convergent observation, will be given based on Lyapunov stability theory. Before giving the main result of this paper, a useful lemma is first introduced as follows.

Lemma 1. The approximation error of \hat{G}_MLP to G_MLP defined by:

(24)

satisfies:

(25)

where:

(26)

(27)

(28)

and d_r is the residual term. Moreover, d_r satisfies:

(29)

where c_i (i = 0,1,2,3) are certain positive scalars.

Proof: It is easy to see that the Taylor expansion of S(V^Tx) about {\hat{V}}^{T} \hat{x} can be written as:

(30)

where:

(31)

and O (e_{r}) denotes the sum of the high order terms in the Taylor series expansion. Based on Equation (30), we can derive that:

\begin{array}{ll} δ G_{MLP} & = {\hat{W}}^{T} S ({\hat{V}}^{T} \hat{x}) - W^{T} S (V^{T} x) \\ & = {(W + \tilde{W})}^{T} \hat{S} - W^{T} [\hat{S} - {\hat{S}}^{'} ({\hat{V}}^{T} \hat{x} - V^{T} x) + O (e_{r})] \\ & = {\tilde{W}}^{T} \hat{S} + {(\hat{W} - \tilde{W})}^{T} {\hat{S}}^{'} ({\hat{V}}^{T} \hat{x} - V^{T} x) - W^{T} O (e_{r}) \\ & = {\tilde{W}}^{T} (\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x}) + {\hat{W}}^{T} {\hat{S}}^{'} {\tilde{V}}^{T} \hat{x} + d_{r} \end{array}

(32)

where:

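d_{r} = {\tilde{W}}^{T} {\hat{S}}^{'} V^{T} x + {\hat{W}}^{T} {\hat{S}}^{'} V^{T} e - W^{T} O (e_{r})

(33)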

\tilde{W} = \hat{W} - W

(34)

\tilde{V} = \hat{V} - V

(35)

and:

e = \hat{x} - x

(36)

Then, we can clearly see from Equation (32) that Equation (25) is well satisfied.

Moreover, since we have assumed that activation function s takes the form as Equation (13), it is clear that:

(37)

and we can also derive that:

(38)

From Equation (38), it is easy to check that, for all v \in R:

0 \leq s^{'} (v) \leq 0.25

(39)

and:

| v s^{'} (v) | \leq 0.2239

(40)
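The two bounds in Inequalities (39) and (40) can be checked numerically. The sketch below assumes only the logistic activation of Equation (13) and its well-known derivative identity s'(v) = s(v)[1 − s(v)]; it scans a grid of v values and records the largest s'(v) and |v s'(v)| found. The function names, grid range and step are arbitrary illustrative choices.

```python
import math


def sigmoid(v):
    """Logistic activation of Equation (13), evaluated without overflow."""
    if v >= 0.0:
        return 1.0 / (1.0 + math.exp(-v))
    ev = math.exp(v)
    return ev / (1.0 + ev)


def sigmoid_prime(v):
    """Derivative of the logistic function, s'(v) = s(v) * (1 - s(v))."""
    s = sigmoid(v)
    return s * (1.0 - s)


def scan_bounds(v_max=20.0, step=1e-3):
    """Largest s'(v) and |v s'(v)| on a uniform grid over [-v_max, v_max]."""
    n = int(round(v_max / step))
    max_sp, max_vsp = 0.0, 0.0
    for k in range(-n, n + 1):
        v = k * step
        sp = sigmoid_prime(v)
        max_sp = max(max_sp, sp)
        max_vsp = max(max_vsp, abs(v * sp))
    return max_sp, max_vsp


if __name__ == "__main__":
    print(scan_bounds())
```

The maximum of s'(v) is attained at v = 0, while the maximum of |v s'(v)| is attained where v tanh(v/2) = 1, i.e., near |v| ≈ 1.54, and stays just below the constant 0.2239 used in Inequality (40).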

Based on Inequalities (39) and (40), we have:

(41)

and:

(42)

Moreover, from Taylor expansion Equation (30), we can know that:

(43)

Then, based on Assumption (17) and Inequalities (37) and (41)–(43), we have:

(44)

By Equation (33), it can be seen that:

\begin{array}{ll} {‖ d_{r} ‖}_{2} & \leq {‖ {\tilde{W}}^{T} {\hat{S}}^{'} V^{T} x ‖}_{2} + {‖ {(W + \tilde{W})}^{T} {\hat{S}}^{'} V^{T} e ‖}_{2} + {‖ W^{T} O (e_{r}) ‖}_{2} \\ & \leq v_{m} {‖ {\tilde{W}}^{T} ‖}_{F} {‖ {\hat{S}}^{'} ‖}_{F} {‖ x ‖}_{2} + v_{m} {‖ {\tilde{W}}^{T} ‖}_{F} {‖ {\hat{S}}^{'} ‖}_{F} {‖ e ‖}_{2} + w_{m} {‖ {\hat{S}}^{'} ‖}_{F} {‖ e ‖}_{2} + w_{m} {‖ O (e_{r}) ‖}_{2} \\ & \leq v_{m} l {‖ {\tilde{W}}^{T} ‖}_{F} ({‖ x ‖}_{2} + {‖ e ‖}_{2}) + w_{m} l {‖ e ‖}_{2} + w_{m} (1.2239 l + 0.25 v_{m} l {‖ x ‖}_{2}) \\ & \leq 1.2239 w_{m} l + 0.25 w_{m} v_{m} l {‖ x ‖}_{2} + w_{m} l {‖ e ‖}_{2} + v_{m} l {‖ {\tilde{W}}^{T} ‖}_{F} ({‖ x ‖}_{2} + {‖ e ‖}_{2}) \end{array}

(45)

By choosing:

\begin{cases} c_{0} = 1.2239 w_{m} l \\ c_{1} = 0.25 w_{m} v_{m} l \\ c_{2} = w_{m} l \\ c_{3} = v_{m} l \end{cases}

(46)

we can see that Inequality (29) holds. This completes the proof of Lemma 1.
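The constants of Equation (46) are straightforward to evaluate for given bounds w_m and v_m and hidden-layer size l. The Python sketch below computes them and, as an assumption, evaluates the residual bound in the form suggested by the third line of Inequality (45), i.e., c_0 + c_1‖x‖ + c_2‖e‖ + c_3‖\tilde{W}‖_F(‖x‖ + ‖e‖). The function names are hypothetical.

```python
def lemma1_constants(w_m, v_m, l):
    """Constants (c0, c1, c2, c3) of Equation (46)."""
    return (1.2239 * w_m * l, 0.25 * w_m * v_m * l, w_m * l, v_m * l)


def residual_bound(consts, x_norm, e_norm, w_tilde_norm):
    """Assumed form of the bound on the residual term d_r.

    x_norm, e_norm : Euclidean norms of the state x and observation error e
    w_tilde_norm   : Frobenius norm of the weight error W_tilde
    """
    c0, c1, c2, c3 = consts
    return c0 + c1 * x_norm + c2 * e_norm + c3 * w_tilde_norm * (x_norm + e_norm)
```

For example, with l = 4 and unit bounds w_m = v_m = 1, the constants are c_0 = 4.8956, c_1 = 1, c_2 = 4 and c_3 = 4, which shows that all four grow linearly with the number of hidden neurons.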

Remark 1. From Lemma 1, the norm of residual term d_r is influenced by the norms of system state x, observation error e and weight estimation error \tilde{W}.

The following Theorem 1, which is the main result of this paper, proposes the design of nonlinear adaptive state-observer based on the MLP neural network.

Theorem 1. Consider state observer Equation (21) of PWR dynamics Equation (7), and suppose that observer gain k_Oξ is positive and system state-vector x is bounded. Let observer gain matrix K_O take the form:

(47)

where observer gains k_ON, k_OF and k_OC are all positive. Furthermore, choose the learning algorithms of weighting matrices \hat{W} and \hat{V} of multilayer network \hat{G}_MLP as:

(48)

and:

(49)

respectively, where both Γ_W and Γ_V are diagonal positive-definite matrices, both scalars δ_W and δ_V are positive:

(50)

(51)

(52)

δ is a positive scalar and matrix C is defined by Equation (20). Then observation errors e and e_ξ defined by:

e = \hat{x} - x

(53)

and:

e_{ξ} = \hat{ξ} - ξ

(54)

are convergent and bounded.

Proof: From Equations (7), (21), (47), (53) and (54), the dynamics of observation error e satisfies:

\begin{cases} \dot{e} = ϒ^{- 1} f_{e} (e, e_{ξ}) + U [{\hat{W}}^{T} S ({\hat{V}}^{T} \hat{x}) - W^{T} S (V^{T} x) - d_{e}] \\ {\dot{e}}_{ξ} = - k_{O ξ} (n_{r0} + x_{1}) e_{1} \end{cases}

(55)

where:

(56)

(57)

and approximation error d_e is defined by Equation (14).

Moreover, from Equations (19) and (22):

e_{O} = C e

(58)

from which we have:

e = N_{δ} (H e_{O} + δ e)

(59)

Choose the Lyapunov function of the observation error dynamics Equation (55) as:

(60)

where:

(61)

Differentiate V_e1 along the trajectory given by Equation (55):

(62)

where matrix Σ given by Equation (50) is diagonal and positive-definite.

Moreover, from Equation (59), we can derive that:

\begin{array}{l} e^{T} Σ [{\tilde{W}}^{T} (\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x}) + {\hat{W}}^{T} {\hat{S}}^{'} {\tilde{V}}^{T} \hat{x}] \\ = {(H e_{O} + δ e)}^{T} N_{δ}^{T} Σ [{\tilde{W}}^{T} (\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x}) + {\hat{W}}^{T} {\hat{S}}^{'} {\tilde{V}}^{T} \hat{x}] \\ = e_{O}^{T} H^{T} N_{δ}^{T} Σ {\tilde{W}}^{T} (\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x}) + δ e^{T} N_{δ}^{T} Σ {\hat{W}}^{T} {\hat{S}}^{'} {\tilde{V}}^{T} \hat{x} + e_{O}^{T} H^{T} N_{δ}^{T} Σ {\hat{W}}^{T} {\hat{S}}^{'} {\tilde{V}}^{T} \hat{x} + \\ \quad δ e^{T} N_{δ}^{T} Σ {\tilde{W}}^{T} (\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x}) \end{array}

(63)

Further, since:

(64)

and:

(65)

from Equation (63), we have:

\begin{array}{l} e^{T} Σ [{\tilde{W}}^{T} (\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x}) + {\hat{W}}^{T} {\hat{S}}^{'} {\tilde{V}}^{T} \hat{x}] \\ \leq e_{O}^{T} H^{T} N_{δ}^{T} Σ {\tilde{W}}^{T} (\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x}) + e_{O}^{T} H^{T} N_{δ}^{T} Σ {\hat{W}}^{T} {\hat{S}}^{'} {\tilde{V}}^{T} \hat{x} + \frac{1}{2} e^{T} (Γ_{1} + Γ_{2}) e + \\ \quad \frac{1}{2} δ^{2} {(\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x})}^{T} Σ N_{δ} Γ_{1}^{- 1} N_{δ}^{T} Σ (\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x}) tr \{ \tilde{W} {\tilde{W}}^{T} \} + \\ \quad \frac{1}{2} δ^{2} {\hat{x}}^{T} {\hat{S}}^{'} \hat{W} Σ N_{δ} Γ_{2}^{- 1} N_{δ}^{T} Σ {\hat{W}}^{T} {\hat{S}}^{'} \hat{x} tr \{ \tilde{V} {\tilde{V}}^{T} \} \\ = tr \{ {\tilde{W}}^{T} (\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x}) e_{O}^{T} H^{T} N_{δ}^{T} Σ \} + tr \{ {\tilde{V}}^{T} \hat{x} e_{O}^{T} H^{T} N_{δ}^{T} Σ {\hat{W}}^{T} {\hat{S}}^{'} \} + \\ \quad \frac{1}{2} δ^{2} {(\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x})}^{T} Σ N_{δ} Γ_{1}^{- 1} N_{δ}^{T} Σ (\hat{S} - {\hat{S}}^{'} {\hat{V}}^{T} \hat{x}) tr \{ \tilde{W} {\tilde{W}}^{T} \} + \\ \quad \frac{1}{2} δ^{2} {\hat{x}}^{T} {\hat{S}}^{'} \hat{W} Σ N_{δ} Γ_{2}^{- 1} N_{δ}^{T} Σ {\hat{W}}^{T} {\hat{S}}^{'} \hat{x} tr \{ \tilde{V} {\tilde{V}}^{T} \} + \frac{1}{2} e^{T} (Γ_{1} + Γ_{2}) e \end{array}

(66)

Substitute Inequality (66) into Equation (62):

(67)

where:

(68)

(69)

d = d_{c} + d_{r}

(70)

and:

(71)

Then, differentiate V_e along the trajectory given by observation error dynamics Equation (55):

(72)

From Inequality (72), if we choose the learning algorithms of the weighting matrices as Equations (48) and (49), then it is clear that:

\begin{array}{ll} {\dot{V}}_{e} (e, e_{ξ}) & = {\dot{V}}_{e1} (e, e_{ξ}) + tr \{ \tilde{W} Γ_{W}^{- 1} {\dot{\tilde{W}}}^{T} \} + tr \{ \tilde{V} Γ_{V}^{- 1} {\dot{\tilde{V}}}^{T} \} \\ & \leq - (β + Λ k_{ON}) e_{1}^{2} - β e_{2}^{2} - \frac{| α_{f} | Ω}{k_{OF} P_{0}} [e_{3}^{2} + (1 + \frac{2 M}{Ω} + \frac{μ_{c}}{Ω} k_{OC}) e_{4}^{2}] + \frac{1}{2} e^{T} Γ e + \frac{1}{2} d^{T} Σ Γ^{- 1} Σ d + \\ & \quad γ_{W} tr \{ \tilde{W} {\tilde{W}}^{T} \} + γ_{V} tr \{ \tilde{V} {\tilde{V}}^{T} \} - δ_{W} tr \{ \tilde{W} {\hat{W}}^{T} \} - δ_{V} tr \{ \tilde{V} {\hat{V}}^{T} \} \\ & \leq - e^{T} Ξ e - υ_{W} tr \{ \tilde{W} {\tilde{W}}^{T} \} - υ_{V} tr \{ \tilde{V} {\tilde{V}}^{T} \} + \frac{δ_{W}}{2} tr \{ W W^{T} \} + \frac{δ_{V}}{2} tr \{ V V^{T} \} + \frac{1}{2} d^{T} Σ Γ^{- 1} Σ d \end{array}

(73)

where:

(74)

(75)

υ_{W} = \frac{δ_{W}}{2} - γ_{W}

(76)

and:

υ_{V} = \frac{δ_{V}}{2} - γ_{V}

(77)

Here, scalars δ_W and δ_V should be chosen so that both υ_W and υ_V are positive.
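The last step of Inequality (73) relies on a standard bound on the leakage terms. A short derivation for the \hat{W} term is sketched below, using only \tilde{W} = \hat{W} - W from Equation (34), the Cauchy-Schwarz inequality for the trace inner product, and the elementary inequality ab ≤ (a² + b²)/2:

```latex
\begin{aligned}
-\delta_{W}\,\mathrm{tr}\{\tilde{W}\hat{W}^{T}\}
  &= -\delta_{W}\,\mathrm{tr}\{\tilde{W}(W+\tilde{W})^{T}\} \\
  &= -\delta_{W}\,\|\tilde{W}\|_{F}^{2} - \delta_{W}\,\mathrm{tr}\{\tilde{W}W^{T}\} \\
  &\le -\delta_{W}\,\|\tilde{W}\|_{F}^{2}
      + \frac{\delta_{W}}{2}\left(\|\tilde{W}\|_{F}^{2} + \|W\|_{F}^{2}\right) \\
  &= -\frac{\delta_{W}}{2}\,\mathrm{tr}\{\tilde{W}\tilde{W}^{T}\}
      + \frac{\delta_{W}}{2}\,\mathrm{tr}\{WW^{T}\}.
\end{aligned}
```

Adding the term γ_W tr{\tilde{W}\tilde{W}^T} then gives the coefficient −(δ_W/2 − γ_W) = −υ_W of Equation (76), and the same argument with V in place of W gives υ_V of Equation (77).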

Based on the assumption about the boundedness of system state x and Inequalities (15)–(17) and (29), it is clear from Inequality (73) that the observation errors e and e_ξ are convergent and bounded. This completes the proof of Theorem 1.

Remark 2. The MLP-based nonlinear adaptive observer determined by Equations (21) and (47)–(49) does not need any matching condition on system uncertainty σ, whereas existing adaptive observers for nuclear reactors, such as the observer presented in [9], still need some matching condition on the system uncertainty. This means that the neural observer given in this paper is able to deal with general bounded system uncertainties, which is the key advantage of this neural observer design technique. Moreover, from Equations (21), (48) and (49), \hat{x}, \hat{W} and \hat{V} are updated simultaneously. If the number of neurons in the hidden layer is not large, the simultaneous updating of state-estimate \hat{x} and weighting matrices \hat{W} and \hat{V} does not impair the real-time performance of the algorithm.

4. Simulation Results with Discussions

To verify the feasibility of this newly-built neural observer, in this section it is applied to the state-observation of an NHR, which is a small PWR developed by the Institute of Nuclear and New Energy Technology (INET) at Tsinghua University. The NHR has many advanced safety features such as integrated arrangement, natural circulation at all power-levels, self-pressurization, hydraulic control rod driving, and passive residual heat removal [30,31,32], and it can be applied to fields such as district heating, seawater desalination and electricity production. The structure of the NHR is illustrated in Figure 1. Since the NHR dynamics have both strong nonlinearity and high uncertainty, adaptive state-observation is very meaningful for implementing advanced power-level controllers with higher operational performance.

Figure 1. Structure and cross section of the NHR: (1) Primary heat exchanger; (2) Riser; (3) Biological shield; (4) Containment; (5) Pressure vessel; (6) Core; (7) Fuel elements and (8) Control rods.

4.1. Description of the Numerical Simulation

The simulation model of the NHR is composed of the point kinetics model with six delayed neutron groups and lumped dynamic models of the reactor thermal-hydraulics, primary heat exchanger, U-tube steam generator (UTSG), feedwater pump of the UTSG and the necessary pipe or volume cells [33]. The parameters of the NHR at the middle of the fuel cycle at 100% power-level are shown in Table 1. The output-feedback-dissipation power-level control strategy given in [34] is adopted here. Moreover, in this simulation, we choose l = 4, k_ON = k_OC = 0.0001, k_OF = 10.0, k_Oξ = 1.0, and:

δ_{W} = δ_{V} = δ_{wv}

(78)

(79)

where both δ_wv and r_p are given positive scalars. The initial values of interconnection matrices \hat{W} and \hat{V}, i.e., {\hat{W}}_{0} and {\hat{V}}_{0}, are set to {\hat{W}}_{0} = O and {\hat{V}}_{0} = O, respectively.

**Table 1.** NHR Parameters at the Middle of the Fuel Cycle in 100% Power-Level.

| Symbol | Value | Symbol | Value |
|---|---|---|---|
| β | 0.0069 | α_f | −2.48 × 10⁻⁵ (1/°C) |
| Λ | 4.18 × 10⁻⁵ (s) | α_c | −2.71 × 10⁻⁴ (1/°C) |
| λ | 0.08 (1/s) | M | 4.29 (kW/°C) |
| μ_f | 5.01 (MWs/°C) | Ω | 1.06 (MW/°C) |
| μ_c | 69.23 (MWs/°C) | P₀ | 200 (MW) |

Case A (large load increase): The load signal changes linearly from 20% to 100% in a minute.

Case A1: δ_wv = 0.01, and different values of r_p are adopted in the simulation.
Case A2: r_p = 1.0, and different values of δ_wv are adopted.

Case B (large load decrease): The power demand decreases linearly from 100% to 20% in a minute.

Case B1: δ_wv = 0.01, and different values of r_p are adopted in the simulation.
Case B2: r_p = 1.0, and different values of δ_wv are adopted.
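The load signals of Cases A and B are linear ramps between two power levels. A minimal Python sketch of such a demand profile is given below; the function name, the argument names and the choice of holding the demand constant before and after the ramp are illustrative assumptions, with the demand expressed as a fraction of rated power.

```python
def load_demand(t, start=0.2, end=1.0, t_start=0.0, duration=60.0):
    """Linear ramp of the load demand from `start` to `end`.

    t        : time in seconds
    start    : demand before the ramp (fraction of rated power)
    end      : demand after the ramp (fraction of rated power)
    t_start  : time at which the ramp begins (s)
    duration : ramp length (s)
    """
    if t <= t_start:
        return start
    if t >= t_start + duration:
        return end
    return start + (end - start) * (t - t_start) / duration
```

Case A corresponds to the default arguments, while Case B is obtained with `start=1.0` and `end=0.2`.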

4.2. Simulation Results

In this numerical simulation, the following two case studies are carried out to show the state-observing performance of the MLP-based nonlinear adaptive observer determined by Equations (21) and (47)–(49).

4.2.1. Large Load Increase

This case represents a demanding operation for the NHR. The power demand increases linearly from 20% to 100% in 60 s.

The observation errors of variations of the relative nuclear power, the relative precursor concentration, and the average temperatures of the fuel and coolant, i.e., the observation errors of state-variables δn_r, δc_r, δT_f and δT_cav with constant δ_wv and different r_p are all illustrated in Figure 2. Furthermore, the observation errors of these state-variables with different δ_wv and constant r_p are shown in Figure 3.

Figure 2. Observation errors of (a) δn_r; (b) δc_r; (c) δT_f and (d) δT_cav in case of A1.

Figure 3. Observation errors of (a) δn_r; (b) δc_r; (c) δT_f and (d) δT_cav in case of A2.

4.2.2. Large Load Decrease

This case also represents a stressed operation for the NHR. The load signal changes linearly from 100% to 20% in a minute. The observation errors of state-variables δn_r, δc_r, δT_f and δT_cav with constant δ_wv and different r_p are all shown in Figure 4, and the responses of these observation errors with different δ_wv and constant r_p are given in Figure 5.

Figure 4. Observation errors of (a) δn_r; (b) δc_r; (c) δT_f and (d) δT_cav in case of B1.

Figure 5. Observation errors of (a) δn_r; (b) δc_r; (c) δT_f and (d) δT_cav in case of B2.

4.3. Discussion

During the load increase, the load rises rapidly from 20% to 100% in 60 s. Since the actual power level cannot vary so quickly, δn_r decreases, which indicates that the actual power level of the NHR is lower than the load set by the operator in the initial phase of the process. Under the action of the power-level controller, δn_r then increases and finally returns to zero. This power-level mismatch causes variations in the precursor concentration and in the average temperatures of the fuel and coolant inside the reactor core. Similarly, in the case of a load decrease from 100% to 20% in a minute, the actual power level also cannot change so fast, and therefore δn_r becomes larger, which indicates that the actual power level of the NHR is higher than the load in the initial stage. The power-level then decreases under the action of the power controller, and finally reaches the demanded 20% power-level.

From Figures 2–5, the MLP-based state-observer developed in this paper can provide bounded and convergent state-observations. The load variation leads to variation of the state-variables, which causes variation of the system output. The variation of the system output then drives both the observer and the learning algorithms of the MLP connection weights to generate a convergent state-observation. It is also clear from these figures that varying the observer parameters does not change the boundedness and convergence of the state-observation. Further, from Figure 2 and Figure 4, a larger positive scalar r_p gives higher observation performance. Indeed, from Equation (79), when r_p is larger, the influence of e_O on the connection weights that correspond to the state-observation of the thermal-hydraulic loop is stronger, which leads to higher observation performance for δT_cav. From Equations (55) and (57), since e₄, i.e., the observation error of δT_cav, affects the state-observation of the neutron kinetics, higher observation performance for δT_cav helps to improve the observation quality of the neutron kinetics. Moreover, from Figure 3 and Figure 5, it is easy to see that if positive scalar δ_wv is larger, the observation performance of δT_cav is worse, although there is a slight improvement in the observation performance of δc_r and δT_f. Based upon the above discussion, the MLP-based nonlinear state-observer composed of Equations (21) and (47)–(49) provides both bounded and convergent observation of the system state-variables, and the parameters of this observer should be properly adjusted.

From the curves plotted in Figures 2–5, both the overshoots and settling periods of the estimation errors of the unmeasurable states δc_r and δT_f can be reduced to acceptable limits with properly selected scalars r_p and δ_wv, which supports the practical feasibility of this newly-built observer. Usually, r_p should be chosen large, and δ_wv should be selected based upon the trade-off between the observation performance of δT_cav and that of δc_r and δT_f. Moreover, compared with the sliding mode observer [4], high gain observer [5] and DHGF [6], the main virtue of the MLP-based nonlinear observer proposed in this paper is its strong adaptation capability to system uncertainties, which those observers do not provide.
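The overshoot and settling period of an observation-error curve can be quantified from sampled data. The following Python sketch shows one possible way to do so; the definitions used here (peak absolute error, and the earliest sample time after which the error stays inside a tolerance band) and the function names are assumptions for illustration, not the criteria used to produce the figures.

```python
def peak_abs_error(errors):
    """Largest absolute value of a sampled observation-error curve."""
    return max(abs(e) for e in errors)


def settling_time(times, errors, band):
    """Earliest sample time after which |error| stays within `band`.

    Returns None if the error is outside the band at the last sample.
    """
    settle = None
    for t, e in zip(times, errors):
        if abs(e) > band:
            settle = None      # left the band, so restart the search
        elif settle is None:
            settle = t         # first sample of the current in-band run
    return settle
```

For instance, for errors `[0.0, 0.5, -0.3, 0.05, 0.01]` sampled at times `[0, 1, 2, 3, 4]` and a band of 0.1, the peak absolute error is 0.5 and the settling time is 3, so two parameter settings can be compared by these two numbers.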

Finally, due to the wide use of advanced digital control system platforms, there is no difficulty in realizing the MLP-based observer presented in this paper. Furthermore, since mature MLP network programs already exist, it is easy for engineers to implement both observer Equation (21) and learning Algorithms (48) and (49) as software running on a digital platform.

5. Conclusions

Power-level control is an important technique that guarantees the operation stability and efficiency of the pressurized water reactor (PWR), which is the most widely utilized nuclear fission reactor. Compared with classical static output feedback power-level control, advanced power-level regulators have the potential to improve closed-loop stability and dynamic performance by feeding back the internal state-variables. However, since not all of these internal states can be measured directly, it is necessary to develop state-observers that reconstruct the unmeasurable state-variables in order to implement the advanced power-level controllers. It is well known that each PWR is naturally a complex nonlinear dynamic system with parameters varying with the power-level, fuel burnup, xenon isotope production, control rod worth, etc., which makes it necessary to design a nonlinear observer for the PWR with adaptability to the system uncertainties. Motivated by this, an MLP-based nonlinear adaptive observer is proposed for the PWR. Based upon Lyapunov stability theory, it is proved theoretically that this new observer can provide bounded and convergent state-observation. Numerical simulation results not only verify its feasibility, but also show the relationship between observation performance and tuning parameters.

Acknowledgments

The work in this paper is jointly supported by the Natural Science Foundation of China (NSFC) (Grant Nos. 61374045 and 61004016), the Tsinghua University Initiative Scientific Research Program (Grant No. 20121087992) and the National S&T Major Project (Grant No. ZX06901). Moreover, the author would like to thank Huang Xiao-Jin deeply for consistent support, valuable discussions and constructive suggestions.

Conflicts of Interest

The author declares no conflict of interest.

References

1. Edwards, R.M.; Lee, K.Y.; Schultz, M.A. State-feedback assisted classical control: An incremental approach to control modernization of existing and future nuclear reactors and power plants. Nucl. Technol. 1990, 92, 167–185.
2. Ben-Abdennour, A.; Edwards, R.M.; Lee, K.Y. LQR/LTR robust control of nuclear reactors with improved temperature performance. IEEE Trans. Nucl. Sci. 1992, 39, 2286–2294.
3. Arab-Alibeik, H.; Setayeshi, S. Improved temperature control of a PWR nuclear reactor using LQG/LTR based controller. IEEE Trans. Nucl. Sci. 2003, 50, 211–218.
4. Shtessel, Y.B. Sliding mode control of the space nuclear reactor system. IEEE Trans. Aerosp. Electron. Syst. 1998, 34, 579–589.
5. Etchepareborda, A.; Lolich, J. Research reactor power controller design using an output feedback nonlinear receding horizon control method. Nucl. Eng. Des. 2007, 237, 268–276.
6. Dong, Z.; Feng, J.; Huang, X.; Zhang, L. Dissipation-based high gain filter for monitoring nuclear reactors. IEEE Trans. Nucl. Sci. 2010, 57, 328–339.
7. Dong, Z. Nonlinear state-feedback dissipation power level control for nuclear reactors. IEEE Trans. Nucl. Sci. 2011, 58, 241–257.
8. Dong, Z.; Huang, X.; Zhang, L. Output feedback power-level control of nuclear reactors based on a dissipative high gain filter. Nucl. Eng. Des. 2011, 241, 4783–4793.
9. Dong, Z. Nonlinear adaptive power-level control for modular high temperature gas-cooled reactors. IEEE Trans. Nucl. Sci. 2013, 60, 1332–1345.
10. Girosi, F.; Poggio, T. Networks and the best approximation property. Biol. Cybern. 1990, 63, 169–176.
11. Poggio, T.; Girosi, F. Networks for approximation and learning. Proc. IEEE 1990, 78, 1481–1497.
12. Funahashi, K.I. On the approximate realization of continuous mappings by neural networks. Neural Netw. 1989, 2, 183–192.
13. Cybenko, G. Approximation by superpositions of a sigmoidal function. Math. Control Signals Syst. 1989, 2, 303–314.
14. Hornik, K.; Stinchcombe, M.; White, H. Multilayer feedforward networks are universal approximators. Neural Netw. 1989, 2, 359–366.
15. Ku, C.-C.; Lee, K.Y.; Edwards, R.M. Improved nuclear reactor temperature control using diagonal recurrent neural networks. IEEE Trans. Nucl. Sci. 1992, 39, 2298–2308.
16. Arab-Alibeik, H.; Setayeshi, S. Adaptive control of a PWR core power using neural networks. Ann. Nucl. Energy 2005, 32, 588–605.
17. Ge, S.S.; Hang, C.C.; Zhang, T. Adaptive neural network control of nonlinear systems by state and output feedback. IEEE Trans. Syst. Man Cybern. Part B Cybern. 1999, 29, 818–828.
18. Ge, S.S.; Hang, C.C.; Zhang, T. Nonlinear adaptive control using neural networks and its application to CSTR systems. J. Process Control 1998, 9, 313–323.
19. Zhang, T.; Ge, S.S.; Hang, C.C. Design and performance analysis of a direct adaptive controller for nonlinear systems. Automatica 1999, 35, 1809–1817.
20. Ge, S.S.; Hang, C.C.; Zhang, T. Stable adaptive control for nonlinear multivariable systems with a triangular control structure. IEEE Trans. Autom. Control 2000, 45, 1221–1225.
21. Zhang, T.; Ge, S.S.; Hang, C.C. Adaptive neural network control for strict-feedback nonlinear systems using backstepping design. Automatica 2000, 36, 1835–1846.
22. Ge, S.S.; Wang, C. Adaptive NN control of uncertain nonlinear pure-feedback systems. Automatica 2002, 38, 671–682.
23. Ruiz Vargas, J.A.; Hemerly, E.M. Adaptive observers for unknown general nonlinear systems. IEEE Trans. Syst. Man Cybern. Part B Cybern. 2001, 31, 683–690.
24. Stepanyan, V.; Hovakimyan, N. Robust Adaptive Observer Design for Uncertain Systems with Bounded Disturbances. In Proceedings of the 44th IEEE Conference on Decision and Control, 2005 and 2005 European Control Conference (CDC-ECC’05), Seville, Spain, 12–15 December 2005.
25. Yang, Y.; Balakrishnan, S.N.; Tang, L.; Landers, R.G. Electrohydraulic control using neural MRAC based on a modified state-observer. IEEE/ASME Trans. Mechatron. 2013, 18, 867–877.
26. Abdollahi, F.; Talebi, H.A. A stable neural network-based observer with application to flexible-joint manipulators. IEEE Trans. Neural Netw. 2006, 17, 118–129.
27. Pérez-Cruz, J.H.; Poznyak, A. Estimation of the Precursor Power and Internal Reactivity in a Nuclear Reactor by a Neural Observer. In Proceedings of the 4th International Conference on Electrical and Electronics Engineering (ICEEE 2007), Mexico City, Mexico, 5–7 September 2007; pp. 310–313.
28. Talebi, H.A.; Khorasani, K.; Tafazoli, S. A recurrent neural-network-based sensor and actuator fault detection and isolation for nonlinear systems with application to the satellite’s attitude control subsystem. IEEE Trans. Neural Netw. 2009, 20, 45–60.
29. Schultz, M.A. Control of Nuclear Reactors and Power Plants, 2nd ed.; McGraw-Hill: New York, NY, USA, 1961.
30. Wang, D.Z.; Ma, C.W.; Dong, D.; Lin, J.G. Chinese nuclear heating test reactor and demonstration plant. Nucl. Eng. Des. 1992, 136, 91–98.
31. Wang, D.Z. The design characteristics and construction experiences of the 5 MW_t nuclear heating reactor. Nucl. Eng. Des. 1993, 143, 19–24.
32. Wang, D.Z.; Gao, Z.Y.; Zheng, W.X. Technical design features and safety analysis of the 200 MW_t nuclear heating reactor. Nucl. Eng. Des. 1993, 143, 1–7.
33. Dong, Z.; Huang, X.; Feng, J.; Zhang, L. Dynamic model for control system design and simulation of a low temperature nuclear reactor. Nucl. Eng. Des. 2009, 239, 2141–2151.
34. Dong, Z.; Feng, J.; Huang, X.; Zhang, L. Power-level control of nuclear reactors based on feedback dissipation and backstepping. IEEE Trans. Nucl. Sci. 2010, 57, 1577–1588.

© 2013 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).

