Next Article in Journal
A Mathematical Model of Universal Basic Income and Its Numerical Simulations
Next Article in Special Issue
Towards Bio-Inspired Anomaly Detection Using the Cursory Dendritic Cell Algorithm
Previous Article in Journal
Zero-Crossing Point Detection of Sinusoidal Signal in Presence of Noise and Harmonics Using Deep Neural Networks
Previous Article in Special Issue
The GM-BP Neural Network Prediction Model for International Competitiveness of Computer Information Service Industry
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Autoencoder-Based Reduced Order Observer Design for a Class of Diffusion-Convection-Reaction Systems

Automization and Control Group, Kaiserstr. 2, 24143 Kiel, Germany
Algorithms 2021, 14(11), 330; https://doi.org/10.3390/a14110330
Submission received: 13 October 2021 / Revised: 8 November 2021 / Accepted: 10 November 2021 / Published: 11 November 2021
(This article belongs to the Special Issue Computer Science and Intelligent Control)

Abstract

:
The application of autoencoders in combination with Dynamic Mode Decomposition for control (DMDc) and reduced order observer design as well as Kalman Filter design is discussed for low order state reconstruction of a class of scalar linear diffusion-convection-reaction systems. The general idea and conceptual approaches are developed following recent results on machine-learning based identification of the Koopman operator using autoencoders and DMDc for finite-dimensional discrete-time system identification. The resulting linear reduced order model is combined with a classical Kalman Filter for state reconstruction with minimum error covariance as well as a reduced order observer with very low computational and memory demands. The performance of the two schemes is evaluated and compared in terms of the approximated L 2 error norm in a numerical simulation study. It turns out, that for the evaluated case study the reduced-order scheme achieves comparable performance with significantly less computational load.

1. Introduction

The problem of system state reconstruction from local and partial state measurements has important impacts on the ability of controlling the system state and monitoring system health and performance [1,2,3]. This becomes particularly involved in the framework of spatially continuously distributed dynamical systems, which are typically modeled using partial differential equations (pdes). The so called associated state estimation or observer design problem has been studied over the last decades and already reached a state of maturity in particular for linear systems (see, e.g., [4,5]). Additionally, for some classes of nonlinear or semilinear systems, design approaches have been obtained, which differ in design complexity and computational load in their implementation. A particular approach, which is simple and can be implemented with small computational load is the pointwise measurement injection (or pointwise innovation) observer design, which has been employed for solving the problem for some types of semilinear parabolic problems [6,7,8,9,10]. This approach basically extends the idea of a reduced order observer design [1,2] to the considered class of infinite-dimensional systems. The reduced order property is to be understood in relation to the number of internal states of the observer in comparison to the number of states in the system to be monitored. Basically the idea consists in providing a non-redundant state estimation scheme, which does not need to estimate the measured output, but only non-measured state variables.
Considering pde models leads to the additional problem that for implementing any observer in a real-time monitoring scheme, the associated set of pdes has to be solved numerically. This is done using some kind of discretization in space and time, so that a finite-dimensional discrete-time system representation is obtained, which represents a digital twin of the original pde system. The number of internal state variables of this approximation model is typically very high (from about hundred to several thousands of variables, e.g., for finite-element approximations) and thus requires model-order reduction approaches for real-time applicability. For this purpose, typically balanced truncation or proper orthogonal decomposition approaches or similar are employed (see, e.g., [11] and references therein). Besides these, in recent years a focus was led to approaches using dynamic mode decomposition (DMD) [11,12] and its extension to DMDc to include external (control) inputs [13]. The usage of DMD (or DMDc) is particularly motivated by the Koopman theory, according to which nonlinear finite-dimensional systems can be embedded into an infinite-dimensional space on which the linear Kooppman operator provides a complete description of the time evolution of the system observables. The (Koopman) eigenfunctions or the associated Koopman operator form a basis of this space and can be used in principle to determine a finite-dimensional approximation of the dominant modes of this operator (associated to the dominant eigenvalues of the operator). As this embedding can in most cases not be carried out analytically (see, e.g., the discussion in [3,11,12]) one can employ DMD (or DMDc) as a data-based identification approach using snapshots of the state observables as training data. This idea has been combined with the use of autoencoders (AEs) to learn from the training data a low-dimensional basis of the associated dominant Koopman eigenfunction subspace in which a linear dynamics can then be identified using classical DMDc [14]. The use of autoencoders and Koopman theory for automatic learning of finite-dimensional system dynamics is also mainstream in contemporary control systems applications in general (see, e.g., [15,16]).
Having finite-dimensional approximations of the infinite-dimensional pde models, one can also directly address the observer design problem in the finite-dimensional setup. This approach is also called early lumping (first approximate and then design), while the results discussed before belong to the so-called late lumping (first design then approximate) approaches.
Early lumping based on the combination of DMDc with state estimation has already been shown in several application scenarios to yield satisfactory performance [3,17,18,19,20]. In these studies the Kalman Filter [21,22] has been used on the basis of the obtained finite-dimensional linear discrete-time model equations. The Kalman Filter is known to provide satisfactory state estimation results, as it ensures a minimum estimation error covariance taking into account a priori known knowledge on the perturbation statistics, with the perturbations on the states and measurement considered as white noise. One potential drawback for implementations of the Kalman Filter in real-time applications relies on the fact that it requires n 2 + n dynamic variables, where n is the number of states in the reduced-order model, given that the n × n error covariance matrix must be calculated online to provide the optimal filter performance. In comparison to this approach, alternative design methods, such as the reduced-order [1,2] or the geometric observer [23] do actually require less than n dynamic variables.
Motivated by the above considerations, in the present study the autoencoder-based approximation of finite-dimensional low order basis representations is employed in combination with DMDc to identify a reduced-order model for a linear parabolic diffusion-convection-reaction system for which a reduced order observer and a Kalman Filter are implemented and compared. The training data is generated on the basis of a high-fidelity numerical approximation of the solutions of the pde model and the comparison of the approaches carried out on the basis of solutions obtained for different input signals. Given the early lumping nature of this appproach, it somehow represents a complementary counterpart of the late-lumping reduced-order design studies in [6,7,8,9,10]. To the author’s knowledge, this is the first time such a comparison with reduced order observers is carried out, in particular for this class of systems. The result is an inductive step towards the consideration of distributed parameter models involving transport-reaction mechanisms as typically found in chemical engineering applications [7,24,25,26,27,28,29].
The paper is organized as follows. In Section 2 the problem statement is provided. The model-order reduction using autoencoder-based DMDc (ae-dmdc) is discussed in Section 3. In Section 4 the state estimation problem is addressed using the reduced order observer (roo) and the Kalman Filter (kf). The obtained results are discussed in Section 5. Conclusions and an outlook are provided in Section 6.

2. Problem Statement

Consider the following diffusion-convection-reaction system with Danckwert’s boundary conditions
t x ( s , t ) = D s 2 x ( s , t ) v s x ( s , t ) + r ( s ) x ( s , t ) , s ( 0 , 1 ) , t > 0
D s x ( 0 , t ) v x ( 0 , t ) = v u ( t ) , t > 0
s x ( 1 , t ) = 0 , t > 0
x ( s , 0 ) = x 0 ( s ) , s [ 0 , 1 ]
y ( t ) = x ( s m , t ) , t 0
with the diffusion coefficient D > 0 , (superficial) flow velocity v > 0 , reaction coefficient function r L 2 ( 0 , 1 ) , state x ( · , t ) L 2 ( 0 , 1 ) at time t 0 with initial value x 0 L 2 ( 0 , 1 ) , input u ( t ) R , and output y ( t ) R corresponding to a pointwise measurement of x at the position s m [ 0 , 1 ] . The structure of the model is motivated by chemical engineering applications, in particular tubular reactor models (with length L = 1 ) [7,24,25,26,27,28,29] linearized along equilibrium solutions (see Appendix A) (Note that in such applications the flow velocity v might be varying over the spatial extension of what is assumed to be neglectable here).
In the sequel, the developments are illustrated and tested for a case study with
D = 0.1 , v = 0.5 , r ( s ) = k 0 s / ( 1 + k 1 s 2 ) , k 0 = 1 , k 1 = 5
with a reaction rate maximum at s = 0.2 , and measurement locations
s m { 0.5 , 1 }
motivated by related studies on approximate observability of the pde model [29] suggesting s m = 1 and recent studies on the pointwise measurement injection observer [6,7,8,9,10] suggesting an in-domain measurement (here chosen as s m = 0.5 ). Giving the transport-reaction character of the considered system model, the associated system specific time scales can best be captured using the Péclet ( P e ) and Damköhler ( D a ) numbers [24,28]
P e = L v D = 0.5 0.1 = 5 , D a = k 0 L v = 1 0.5 = 2
with P e giving the relation between convective and diffusive characteristic times and D a the relation between reaction and convection characteristic times. Given a moderate pair ( P e , D a ) the solution behavior is expected to be relatively smooth without steep spatial gradients, so that different numerical solution schemes can be implemented. The pde model is solved numerically using a finite-difference approximation with N = 100 collocation points, implying a spatial step of d z = 0.01 yielding a finite-dimensional approximation
x ˙ ( t ) = F x ( t ) + g u ( t ) , t > 0 , x ( 0 ) = x 0
y ( t ) = h T x ( t ) , t 0
with state vector x ( t ) R N at time t 0 , and F R N × N , g , h R N . For this purpose, central differences ( O 2 ( d z ) ) are used for the advection and the diffusion parts.
The finite-dimensional ode model (4) is subsequently solved using the standard Euler forward solver ( O 2 ( d t ) ) in Matlab. Note that for solving the ode system (4) in Matlab a discrete-time system approximation of the form
x ( t + 1 ) = Φ x ( t ) + γ u ( t ) , t 0 , x ( 0 ) = x 0
with Φ R N × N and γ R N is used. In summary, the approximation error is O 2 ( d t , d z ) . To achieve an accurate simulation result, the step size in time was set to d t = 2 × 10 5 .
The solution of the resulting equations for the input and initial value given by
u ( t ) = 3 σ ( t ) + 7 cos ( 3 t ) ( σ ( t 5 ) 2 σ ( t 8 ) + σ ( t 12 ) sin ( 3 t ) )
with σ denoting the Heaviside function, and initial condition x 0 ( s ) = 0.2 sin ( π / 2 s ) 2 is shown in Figure 1.

3. Model-Oder Reduction

For the purpose of reducing the computational load and having a low-dimensional model for observer design, which can also be implemented, e.g., on low-cost devices for system monitoring, or which is fast enough for time-critical real-time applications, in the following a machine-learning based model-order reduction is employed, which is finally motivated by a low-order approximation of the Koopman operator for the discrete-time system (5) (for details, see, e.g., [14] and also [3,12,19]).

3.1. Machine Learning for Koopman Basis Identification

Recall that x R N and consider the simple three-layer autoencoder network, as shown in Figure 2 for which it holds that
ξ = η 1 ( W 1 x + b 1 ) , x ˜ = η 2 ( W 2 ξ + b 2 ) , ξ R q , q < N .
The use of a single hidden layer is basically motivated by the fact that using model order reduction techniques based on simple matrix-vector multiplications such as balanced truncation or proper orthogonal decomposition (see, e.g., [11]) it is known that a satisfactory approximation accuracy can be obtained.
The weight matrices W 1 , W 2 and bias vectors b 1 , b 2 are determined using a standard backpropagation algorithm so that
( W 1 , W 2 , b 1 , b 2 ) = argmin x η 2 ( W 2 η 1 ( W 1 x + b 1 ) + b 2 ) = x ˜ 2 .
The result obtained using the trainAutoencoder function in Matlab with N = 100 inputs, one hidden layer with q = 40 nodes (later associated to a model approximation using q internal states), 5000 snapshots, maximum epochs set to 15,000 and choosing the activation function for the input layer as logsig and of the output layer as purelin can be seen in Figure 3, showing on the left side the comparison between the reference solution (see also Figure 1) and the corresponding output (corresponding to x ˜ ) of the autoencoder. The associated approximated L 2 norm of the approximation error x ˜ x is shown on the right side. It can be seen that a rather good correspondence is obtained.

3.2. Reduced Model Based DMDc

To identify the dynamics in terms of the low-dimensional state representation ξ R q ( q N ) by means of DMDc [13], consider the snapshot and input matrices
Ξ = ξ ( 0 ) ξ ( 1 ) ξ ( m ) R q × m , Ξ + = ξ ( 1 ) ξ ( m + 1 ) R p × m , U = u ( 0 ) u ( m ) R 1 × m , Ξ u = Ξ U R ( q + 1 ) × m
with their respective singular value decompositions
Ξ u = U 1 Σ 1 V 1 T , Ξ + = U 2 Σ 2 V 2 T , U k = U k , r U k , 0 , Σ k = Σ k , r 0 0 Σ k , 0 , V k = V k , r V k , 0 , k = 1 , 2 ,
where
U 1 R ( q + 1 ) × ( q + 1 ) , U 1 , r R ( q + 1 ) × r 1 , U 1 , 0 R ( q + 1 ) × ( q + 1 r 1 ) Σ 1 R ( q + 1 ) × ( q + 1 ) , Σ 1 , r R r 1 × r 1 , Σ 1 , 0 R ( q + 1 r 1 ) × ( q + 1 r 1 ) V 1 R m × ( q + 1 ) , V 1 , r R m × r 1 , V 1 , 0 R m × ( q + 1 r 1 ) U 2 R q × q , U 2 , r R q × r 2 , U 2 , 0 R q × ( q r 2 ) Σ 2 R q × q , Σ 2 , r R r 2 × r 2 , Σ 2 , 0 R ( q r 2 ) × ( q r 2 ) V 2 R m × q , V 2 , r R m × r 2 , V 2 , 0 R m × ( q r 2 ) .
Considering that
ξ ( t + 1 ) = A ξ ( t ) + b u ( t ) = A b ξ ( t ) u ( t )
one obtains
Ξ + = A b Ξ u
or equivalently
U 2 , r U 2 , 0 Σ 2 , r 0 0 Σ 2 , 0 V 2 , r T V 2 , 0 T = A b U 1 , r U 1 , 0 Σ 1 , r 0 0 Σ 1 , 0 V 1 , r T V 1 , 0 T
Taking into account only the dominant singular values σ 2 , i , i = 1 , , r 2 and σ 1 , j , j = 1 , , r 1 this becomes
U 2 , r Σ 2 , r V 2 , r T A b U 1 , r Σ 1 , r V 1 , r T ,
yielding
A b U 2 , r Σ 2 , r V 2 , r T V 1 , r Σ 1 , r 1 U 1 , r T .
Introducing the reduced order state
z ( t ) = U 2 , r T ξ ( t ) R r 2 , ξ ( t ) = U 2 , r z ( t )
and noticing that U 2 , r T U 2 , r = I q it holds that
z ( t + 1 ) = U 2 , r T A b ξ ( t ) u ( t ) Σ 2 , r V 2 , r T V 1 , r Σ 1 , r 1 U 1 , r T ξ ( t ) u ( t ) .
Thus the low–dimensional dynamic model is obtained in the form
z ( t + 1 ) = A ˜ z ( t ) + b ˜ u ( t )
with
A ˜ = U 2 , r T A U 2 , r Σ 2 , r V 2 , r T V 1 , r Σ 1 , r 1 U 1 , r T ( : , 1 : q ) U 2 , r , b ˜ = U 2 , r T b Σ 2 , r V 2 , r T V 1 , r Σ 1 , r 1 U 1 , r T ( : , q + 1 )
The resulting network-based model order reduction scheme is shown in Figure 4.
The obtained results with r 2 = 17 , r 1 = 18 are shown in Figure 5. On the left of the figure, the comparison of the reference solution and the autoencoder-DMDc (ae-dmdc) based approximation can be seen. On the right part of the figure, the associated approximation error norm is shown.
Next, the identification of a suitable approximation of the output (i.e., measurement) operator is addressed. In terms of the numerical approximation (5) of the pde model (1), the output at time t R is given by
y ( t ) = x ( 1 , t ) c T x ( t ) c T η 2 ( W 2 U 2 , r z ( t ) + b 2 ) = c ˜ T z ( t ) .
For the purpose of providing an optimal minimum squared error approximation of c ˜ T the following snapshot matrices have been introduced
Y = y ( 0 ) y ( m ) , Z = z ( 0 ) z ( m ) = U 2 , r T Ξ
so that with the pseudo–inverse Z one obtains
Y = c ˜ T Z , c ˜ T = Y Z , Z = Z T ( Z Z T ) 1 .
The resulting absolute error of the output approximation is shown in Figure 6 for the two different sensor locations s m = 1 and s m = 0.5 .
It can be seen that a good correspondence is achieved for both sensor locations.

4. Full State Reconstruction Using the Ae-Dmdc Reduced-Order Model

In this section, the reduced-order model with state z ( t ) R r 2 (with r 2 N ) is employed to design algorithms for the reconstruction of the complete state x ( t ) R N by combining the available measurement information, the known input u ( t ) with the model using two different observer design approaches and the decoder of the previously designed autoencoder. In general, consider an estimate z ^ with the associated full-state estimate x ^ = η 2 ( W 2 z ^ + b 2 ) it holds for the estimation error e = x ^ x that
e = η 2 ( W 2 z ^ + b 2 ) x = η 2 ( W 2 z ^ + b 2 ) η 2 ( W 2 z + b 2 )
so that for a linear decoder mapping η 2 one obtains
e = η 2 ( W 2 ( z ^ z ) + b 2 ) ,
implying that e 0 if and only if z ^ z 0 . Thus, providing an estimate in the z -coordinates is sufficient for obtaining a reliable estimate in the original coordinates, as long as the autoencoder is adequately trained (i.e., in particular with η 2 ( 0 + b 2 ) 0 ).

4.1. Reduced-Order Observer

The main idea of the reduced order observer is to prevent redundant estimates and reduce model-dependency of the state estimation scheme [1,2]. For this purpose, the measurements (or parts of them) are directly included unfiltered in the estimator, which is only possible if no significant sensor noise is at place. To introduce the approach, formally consider the state transformation
ζ = V z , V = c ˜ T T
with T such that V is invertible, i.e., r a n k ( T ) = r 2 1 (as there is only 1 measurement) and T c = 0 , meaning that the rows of T corresponds to basis vectors of the orthogonal complement of the space spanned by the row vector c T (i.e., the measurement subspace). It should be noted that there exist effective numerical algorithms for determining the matrix T, e.g., using null in Matlab.
Next, consider the partition
ζ = ζ m ζ r , A ¯ = V A ˜ V 1 = A ¯ m A ¯ m r A ¯ r m A ¯ r , b ¯ = V b ˜ = b ¯ m b ¯ r
and the associated reduced order observer
ζ ^ r ( t + 1 ) = A ¯ r m y ( t ) + A ¯ r ζ ^ r ( t ) + b ¯ r u ( t )
z ^ ( t ) = V 1 y ( t ) ζ ^ r ( t )
x ^ ( t ) = η 2 ( W 2 z ^ ( t ) + b 2 ) .
It is straight forward to verify that the associated observation error z ˜ = z ^ z exponentially converges to zero with time if and only if all eigenvalues of the matrix A ¯ r are contained in the open unit disk U 1 C . If some eigenvalues are contained on the unit circle there are estimation error components (in ζ coordinates!) that will remain constant but not grow in time, i.e., for small initial errors, small error evolutions can still be achieved. Given that z = 0 is mapped onto x = 0 the convergence in the original state coordinates can be ensured within some accuracy limitation associated to the approximation performance of the autoencoder.

4.2. Kalman Filter

For comparison purposes, the state estimation by using the Kalman Filter [21,22] is considered. The combination of DMDc models with Kalman Filters has recently been shown to yield quite convincing results [3,17,18,19,20]. One big potential of the Kalman Filter relies on the fact that it provides a minimum estimation error covariance based on a priori statistics of the model and measurement uncertainties, which are considered as white noise processes with covariances Q R r 2 × r 2 and r R in the present context, i.e.,
z ( t + 1 ) = A ˜ z ( t ) + b ˜ u ( t ) + w ( t ) , w ( t ) N ( 0 , Q )
y ( t ) = c ˜ T z ( t ) + v ( t ) , v ( t ) N ( 0 , r ) .
Under these assumptions, the minimum estimation error covariance state estimator is given by the following prediction-correction (innovation) scheme (see, e.g., [21])
Prediction z ^ p ( t + 1 ) = A ˜ z ^ ( t ) + b ˜ u ( t )
P p ( t + 1 ) = A ˜ P ( t ) A ˜ T + Q Kalman gain
L ( t + 1 ) = P p ( t + 1 ) c ˜ ( c ˜ T P p ( t + 1 ) c ˜ + r ) Innovation
z ^ ( t + 1 ) = ( I L ( t + 1 ) c T ) z ^ p ( t + 1 ) + L y ( t + 1 )
P ( t + 1 ) = ( I L ( t + 1 ) c T ) P p ( t + 1 ) Complete state reconstruction
x ^ ( t + 1 ) = η 2 ( W 2 z ^ ( t + 1 ) + b 2 )
with predicted state z ^ p , error covariance P p , Kalman (correction) gain L, state estimate z ^ and error covariance P.
Note that the implementation of the Kalman Filter implies the memory usage of a total of r 2 2 + r 2 floating point variables for storing (and for updating) the error covariance matrix P ( t ) R r 2 × r 2 and the state estimate z ^ ( t ) R r 2 . For the case study with r 2 = 17 this yields 306 dynamic variables, while the original pde approximation is based on only 100. On the other hand it should be noticed that the Kalman Filter provides a minimum estimation error covariance, which is in particular useful if one takes into account the stochastic process and measurement noise perturbations. Given the approximative nature of the model (in full-scale with N = 100 state variables as well as in reduced scale with r 2 = 17 state variables, this uncertainty rejection property can eventually compensate modeling errors up to some point.

4.3. Observer Evaluation

In this section both observer design approaches are compared for two different input scenarios
u 1 ( t ) = σ ( t 5 ) 2 σ ( t 10 ) + σ ( t 15 )
u 2 ( t ) = cos ( 0.5 t ) ( σ ( t 5 ) 2 σ ( t 10 ) + σ ( t 15 ) sin ( 3 t ) )
giving rise to four different scenarios
(S1)
u = u 1 and s m = 1 , (S2) u = u 1 and s m = 0.5
(S3)
u = u 2 and s m = 1 , (S4) u = u 2 and s m = 0.5 .
The associated input and state profiles are shown in Figure 7.
For the comparison of the appproaches, the associated approximated evolution of the L 2 norm of the observation error profiles is evaluated. Further, to evaluate the associated computational load, the mean computation time per time step t ¯ and its variance ν are evaluated for both approaches on a notebook computer with Intel Pentium CPU 4415U with 2.3 GHz.
The results of the evaluation are shown in Figure 8. It can be seen that both observer schemes yield similar estimation performance, with the Kalman Filter performing slightly better in terms of precision for s m = 1 and the reduced order observer for s m = 0.5 . The associated computation times clearly show that the reduced order observer achieves its core calculations about four times as fast as the Kalman Filter, which is due to the higher dimensionality of the Kalman Filter with the required calculation of the covariance matrix.

5. Discussion

As commented above, the implementation of the reduced order observer actually requires only r 2 1 (i.e., for the case study with r 2 = 17 this corresponds to only 16) variables, while the Kalman Filter requires the additional information about the error covariance matrix at each time step, yielding a total of r 2 2 + r 2 variables (i.e., for the case study 306). This implies a significant reduction (by a factor about 19 for the case study) in comparison to the Kalman Filter in terms of memory storage. In addition, as discussed in the previous section, in the performed simulation studies the core calculations of the reduced-order observer were about four times faster than the ones for the Kalman Filter. It should be mentioned anyway, that in the implementation in Matlab, when taking into account the time for the decoding, i.e., the last step in both observer schemes, this time advantage is basically lost and both approaches perform similarly.
As it can be seen, the performance of the Kalman Filter and the reduced order observer are quite comparable for this particular case study. As commented in the introductory discussion for the reduced order observer, this is clearly to be attributed to an output approximation with low noise level. For systems with more sensor noise it is not recommendable to use the unfiltered output directly in the observer scheme.
As a consequence of these results, as long as computation times are below system specific constraints in terms of time scales, both approaches can be employed, while in the case that the computation times should be held as low as possible and that the measurement noise is neglectable, the reduced order observer has clear advantages. This holds true in particular if in addition to the observer scheme some complex calculations or real-time parallel optimizations are performed.

6. Conclusions and Outlook

The combination of autoencoders and dynamic mode decomposition has been employed as a basis for low-dimensional observer design for a class of diffusion-convection-reaction systems using the reduced order observer and Kalman Filter approaches. Using a case study it was shown that both approaches yield satisfactory and comparable results, with the Kalman Filter having a slightly better precision on the cost of significantly higher memory costs, while the reduced order observer provides a comparable performance with very low computational demand and a high degree of simplicity in the design.
Future studies will focus on different, more complex applications, including networks of systems, semilinear pde systems, pdes on higher-dimensional domains (2D, 3D), spatially varying diffusion and convection mechanisms, and study of the impact of the autoencoder structure with more detail (number and size of hidden layers, activation and loss functions, etc.). Comparisons with other model order reduction approaches will be carried out for the respective cases.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Tubular Reactor Model

Consider the (single-species axial-dispersion) tubular reactor model [26,28,29]
t T = D s 2 T v s T η ( T T c ) + δ ρ ( c ) e γ / T
t c = D s 2 c v s c ρ ( c ) e γ / T
with Danckwert’s boundary conditions
D s T ( 0 , t ) v T ( 0 , t ) = v T e , D s T ( 0 , t ) = 0
D s c ( 0 , t ) v c ( 0 , t ) = v c e , D s c ( 0 , t ) = 0 .
In (A1) T , c : [ 0 , 1 ] × R + R with T ( s , t ) representing the reactor temperature at s [ 0 , 1 ] and time t 0 , c ( s , t ) the concentration, D being the diffusion coefficient, v the superficial velocity (assumed to be constant), η the heat interchange coefficient between the cooling jacket with temperature T c ( s , t ) (that can be used as a control input [27]), δ the adiabatic temperature rise, ρ : R + R + the reaction rate function, γ the Arrhenius coefficient, and T e , c e the (constant) feed temperature and concentration, respectively.
In the following, we focus on the reactor behavior close to the equilibrium temperature and concentration profiles T * , c * : [ 0 , 1 ] R + satisfying
0 = D s 2 T * v s T * η ( T * T c ) + δ ρ ( c * ) e γ / T * 0 = D s 2 c * v s c * ρ ( c * ) e γ / T *
with the associated boundary conditions. Introducing the deviation variables
τ = T T * , x = c c *
and using a first order approximation of the nonlinear terms it holds that
ρ ( c ) e γ / T = ρ ( c * + x ) e γ / ( T * + τ ) e γ / T * ρ ( c * ) + ρ ( c * ) γ T * 2 τ + ρ ( c * ) x .
Considering in a first approximation that the temperature deviation can be neglected (e.g., due to some appropriate control through the cooling jacket temperature [27]) one obtains for the concentration dynamics in a neighborhood of the equilibrium profile c * the following dynamics
t x = D s 2 x v x r x
with r ( s ) = ρ ( c * ( s ) ) e γ / T * ( s ) for s ( 0 , 1 ) and the associated boundary conditions. This model is equivalent to the one in (1), assuming a particular reaction rate distribution r with reaction rate maximum determined by the coefficients k 1 .

References

  1. Luenberger, D.G. Observing the state of a linear system. IEEE Trans. Milit. Elect. 1964, 8, 74–80. [Google Scholar] [CrossRef]
  2. Luenberger, D.G. An introduction to observers. IEEE Trans. Autom. Control 1971, 16, 596–602. [Google Scholar] [CrossRef]
  3. Otto, S.E.; Rowley, C.W. Koopman Operators for Estimation and Control of Dynamical Systems. Annu. Rev. Control Robot. Auton. Syst. 2021, 4, 59–87. [Google Scholar] [CrossRef]
  4. Curtain, R.F.; Zwart, H. An Introduction to Infinite-Dimensional Linear Systems Theory; Springer: New York, NY, USA, 1995. [Google Scholar]
  5. Krstic, M.; Smyshlyaev, A. Boundary Control of PDEs: A Course on Backstepping Designs; SIAM: Philadelphia, PA, USA, 2008. [Google Scholar]
  6. Schaum, A.; Moreno, J.A.; Alvarez, J.; Meurer, T. A simple observer scheme for a class of 1-D semi-linear parabolic distributed parameter systems. In Proceedings of the 2015 European Control Conference, Linz, Austria, 15–17 July 2015; pp. 49–54. [Google Scholar]
  7. Schaum, A.; Alvarez, J.; Meurer, T.; Moreno, J.A. State-estimation for a class of tubular reactors using a pointwise innovation scheme. J. Proc. Control 2017, 60, 104–114. [Google Scholar] [CrossRef]
  8. Schaum, A. An unknown input observer for a class of diffusion-convection-reaction systems. at-Automatisierungstechnik 2018, 66, 548–557. [Google Scholar] [CrossRef]
  9. Schaum, A. Dissipativity in Control Engineering. Applications to Finite and Infinite Dimensional Systems; DeGruyter: Berlin, Germany, 2021. [Google Scholar]
  10. Schaum, A.; Koch, S.; Reichhartinger, M.; Meurer, T.; Moreno, J.A.; Horn, M. Nonlinear Observer Design for a 1D Heat Conduction Process. In Proceedings of the 60th IEEE Conference on Decision and Control (CDC), Austin, TX, USA, 13 December 2021. in press. [Google Scholar]
  11. Brunton, S.L.; Kutz, J.N. Data-Driven Science and Engineering; Cambridge University Press: Cambridge, UK, 2019. [Google Scholar]
  12. Williams, M.O.; Kevrekidis, I.G.; Rowley, C.W. A Data–Driven Approximation of the Koopman Operator: Extending Dynamic Mode Decomposition. J. Nonlinear Sci. 2015, 25, 1307–1346. [Google Scholar] [CrossRef] [Green Version]
  13. Proctor, J.L.; Brunton, S.L.; Kutz, J.N. Generalizing Koopman Theory to Allow for Inputs and Control. SIAM J. Appl. Dyn. Syst. 2018, 17, 909–930. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Lusch, B.; Kutz, J.N.; Brunton, S.L. Deep learning for universal linear embeddings of nonlinear dynamics. Nat. Commun. 2018, 9, 4950. [Google Scholar] [CrossRef] [PubMed]
  15. Maksakov, A.; Palis, S. Koopman-based data-driven control for continuous fluidized bed spray granulation with screen-mill-cycle. J. Process Control 2021, 103, 48–54. [Google Scholar] [CrossRef]
  16. Masti, D.; Bemporad, A. Learning nonlinear state–space models using autoencoders. Automatica 2021, 129, 109666. [Google Scholar] [CrossRef]
  17. Gomez, D.F.; Lagor, F.D.; Kirk, P.B.; Lind A., H.; Jones, A.R.; Paley, D.A. Data-driven estimation of the unsteady flowfield near an actuated airfoil with embedded pressure sensors. J. Guid. Control Dyn. 2019, 42, 2279. [Google Scholar] [CrossRef]
  18. Surana, A. Koopman Operator Based Observer Synthesis for Control-Affine Nonlinear Systems. In Proceedings of the IEEE 55th Conference on Decision and Control (CDC), Las Vegas, NV, USA, 12–14 December 2016; pp. 6492–6499. [Google Scholar]
  19. Surana, A.; Banaszuk, A. Linear observer synthesis for nonlinear systems using Koopman Operator framework. IFAC-PapersOnLine 2016, 49, 716–723. [Google Scholar] [CrossRef]
  20. Vijayshankar, S.; Nabi, S.; Chakrabarty, A.; Grover, P.; Benosman, M. Dynamic Mode Decomposition and Robust Estimation: Case Study of a 2D Turbulent Boussinesq Flow. In Proceedings of the 2020 American Control Conference, Denver, CO, USA, 1–3 July 2020; pp. 2351–2356. [Google Scholar]
  21. Gelb, A. Applied Optimal Estimation; M.I.T. Press: Cambridge, UK, 1978. [Google Scholar]
  22. Kalman, R. A New Approach to Linear Filtering and Prediction Problems. Trans. ASME–J. Basic Eng. 1960, 82, 35–45. [Google Scholar] [CrossRef] [Green Version]
  23. Alvarez, J.; Lopez, T. Robust dynamic state estimation of nonlinear plants. AIChE J. 1999, 45, 107–123. [Google Scholar] [CrossRef]
  24. Aris, R. Introduction to the Analysis of Chemical Reactors; Prentice-Hall: Englewood Cliffs, NJ, USA, 1969. [Google Scholar]
  25. Bastin, G.; Dochain, D. On-Line Estimation and Adaptive Control of Bioreactors; Elsevier: Amsterdam, The Netherlands, 1990. [Google Scholar]
  26. Dochain, D. State observers for tubular reactors with unknown kinetics. J. Proc. Control 2000, 45, 259–268. [Google Scholar] [CrossRef]
  27. Franco-de los Reyes, H.A.; Schaum, A.; Meurer, T.; Alvarez, J. Stabilization of an unstable tubular reactor by nonlinear passive output feedback control. J. Process Control 2020, 93, 83–96. [Google Scholar] [CrossRef]
  28. Varma, A.; Aris, R. Stirred pots and empty tubes. In Chemical Reactor Theory; Prentice Hall: Hoboken, NJ, USA, 1977; pp. 79–154. [Google Scholar]
  29. Winkin, J.J.; Dochain, D.; Ligarius, P. Dynamical analysis of distributed parameter tubular reactors. Automatica 2000, 36, 349–361. [Google Scholar] [CrossRef]
Figure 1. Solution of the pde model (1) using finite-difference approximation with N = 100 collocation points (top) and corresponding input (bottom).
Figure 1. Solution of the pde model (1) using finite-difference approximation with N = 100 collocation points (top) and corresponding input (bottom).
Algorithms 14 00330 g001
Figure 2. Autoencoder network for empirical basis identification using machine learning.
Figure 2. Autoencoder network for empirical basis identification using machine learning.
Algorithms 14 00330 g002
Figure 3. (Left): Comparison of the original data set (top) and the approximated one obtained with the trained autoencoder (bottom). (Right): Approximation error norm.
Figure 3. (Left): Comparison of the original data set (top) and the approximated one obtained with the trained autoencoder (bottom). (Right): Approximation error norm.
Algorithms 14 00330 g003
Figure 4. Network-based model-order reduction scheme consisting of an autoencoder network in combination with a linear DMDc-based network.
Figure 4. Network-based model-order reduction scheme consisting of an autoencoder network in combination with a linear DMDc-based network.
Algorithms 14 00330 g004
Figure 5. (Left): Comparison of the original solution (top) and the approximated one obtained with the low order ae-dmdc model obtained by combining the trained autoencoder with the DMDc approximation (bottom). (Right): Approximation error norm.
Figure 5. (Left): Comparison of the original solution (top) and the approximated one obtained with the low order ae-dmdc model obtained by combining the trained autoencoder with the DMDc approximation (bottom). (Right): Approximation error norm.
Algorithms 14 00330 g005
Figure 6. Output approximation error for the training data for the two different sensor locations s m .
Figure 6. Output approximation error for the training data for the two different sensor locations s m .
Algorithms 14 00330 g006
Figure 7. Input scenarios for S1, S2 (left, u = u 1 ) and S3, S4 (right, u = u 2 ).
Figure 7. Input scenarios for S1, S2 (left, u = u 1 ) and S3, S4 (right, u = u 2 ).
Algorithms 14 00330 g007
Figure 8. Comparison of the approximated L 2 observation error norm evolutions for the four different scenarios (S1) to (S4) with the respective mean computation times per unit step ( t ¯ r o o , t ¯ k f and the associated variances ν r o o , ν k f obtained in the simulation study.
Figure 8. Comparison of the approximated L 2 observation error norm evolutions for the four different scenarios (S1) to (S4) with the respective mean computation times per unit step ( t ¯ r o o , t ¯ k f and the associated variances ν r o o , ν k f obtained in the simulation study.
Algorithms 14 00330 g008
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Schaum, A. Autoencoder-Based Reduced Order Observer Design for a Class of Diffusion-Convection-Reaction Systems. Algorithms 2021, 14, 330. https://doi.org/10.3390/a14110330

AMA Style

Schaum A. Autoencoder-Based Reduced Order Observer Design for a Class of Diffusion-Convection-Reaction Systems. Algorithms. 2021; 14(11):330. https://doi.org/10.3390/a14110330

Chicago/Turabian Style

Schaum, Alexander. 2021. "Autoencoder-Based Reduced Order Observer Design for a Class of Diffusion-Convection-Reaction Systems" Algorithms 14, no. 11: 330. https://doi.org/10.3390/a14110330

APA Style

Schaum, A. (2021). Autoencoder-Based Reduced Order Observer Design for a Class of Diffusion-Convection-Reaction Systems. Algorithms, 14(11), 330. https://doi.org/10.3390/a14110330

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop