Model Predictive Control of Non-Linear Systems Using Tensor Flow-Based Models

Antão, Rómulo; Antunes, José; Mota, Alexandre; Escadas Martins, Rui

doi:10.3390/app10113958


¹ IEETA, Institute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, 3810-193 Aveiro, Portugal

² DETI, Department of Electronics, Telecommunications and Informatics, University of Aveiro, 3810-193 Aveiro, Portugal

* Author to whom correspondence should be addressed.

Appl. Sci. 2020, 10(11), 3958; https://doi.org/10.3390/app10113958

Submission received: 25 April 2020 / Revised: 22 May 2020 / Accepted: 3 June 2020 / Published: 7 June 2020

(This article belongs to the Special Issue Control and Soft Computing)


Abstract


The present paper proposes an approach for the development of a non-linear model-based predictive controller (NMPC) using a non-linear process model based on Artificial Neural Networks (ANNs). This work exploits recent trends in the ANN literature using a TensorFlow implementation and shows how they can be efficiently used as support for closed-loop control systems. Furthermore, it evaluates how the generalization capability problems of neural networks can be efficiently overcome when the model that supports the control algorithm is used outside of its initial training conditions. The process’s transient response performance and steady-state error are the parameters under focus and are evaluated using a MATLAB Simulink implementation of a Coupled Tank Liquid Level controller and a Yeast Fermentation Reaction Temperature controller, two well-known benchmark systems for non-linear control problems.

Keywords:

TensorFlow; artificial neural network; system identification; model-based predictive control

1. Introduction

Nowadays, control systems have an important role in the automation industry due to the increasingly tight requirements placed on the precision, performance, efficiency, and safety of automatic systems. Moreover, they have become ubiquitous in many aspects of our daily life, such as home heating systems, household appliances, and many other “intelligent” products that we rely on every day. While this “intelligence” is still far from meeting the same challenges humans can, the close coupling between advanced control algorithms and good domain-specific process models has significantly expanded the application scenarios of autonomous computational systems. Model Predictive Control algorithms and the TensorFlow variant of Artificial Neural Networks (ANN) have many proven concepts and advantages in their specific areas of application (control and system modeling, respectively) but, to the best of the authors’ knowledge, have not been used in synergy.

Model-based Predictive Control (MPC) is considered an advanced control technique and is widely used in several industrial applications [1]. The concept of MPC does not refer to a specific control strategy. Instead, it is a set of control techniques (such as Dynamic Matrix Control (DMC) [2] and Generalized Predictive Control [3]) which make use of a model of the system to calculate the control actions over a time horizon by minimizing a cost function. Thus, the core of the MPC strategy is the mathematical model which describes the dynamics of the system to be controlled. MPC is a computationally expensive and time-consuming algorithm [4], which poses a challenge for low-resource systems. Because of that, it was first used in systems with slow dynamics [5,6], whose required control rate it could keep up with.

Due to the tremendous technological evolution, computer processing units have become much faster allowing a larger number of operations per second, which facilitated using the MPC algorithm in more demanding systems. For example, Mohanty [7] proposes an MPC controller to control a flotation column while other works use MPC controllers for systems with faster dynamics [8,9].

On the other hand, machine learning has been applied successfully in different research fields such as image classification [10], speech recognition [11], natural language processing [12] and system behavior prediction [13] due to its capability to build models that can learn non-linear mappings from data. ANNs cover a small subset of machine learning implementations, some of which are openly debated and freely available to foster the advancement of science. Some examples of machine learning frameworks that have been gathering a great amount of interest from the community are TensorFlow [14], Keras [15] and Caffe [16], while Ensmallen [17] has provided efficient mathematical optimization methods supporting the referred frameworks. More recently, new opportunities for machine learning have been emerging in the domain of small portable devices using the TensorFlow Lite implementation [18], and even in embedded systems and other devices with only kilobytes of memory, using ported versions of TensorFlow Lite such as TinyML [19] or CMSIS-NN [20].

This work uses NeuralNetPlayground [21], a framework created by Raymond Phan (https://github.com/rayryeng) and inspired by the TensorFlow Neural Networks Playground interface, to build small deep learning systems in MATLAB for regression and classification of non-linear data. These models, based on neural network structures, are trained during an initial open-loop actuation of a non-linear process to learn its relevant dynamic features. After the model extraction procedure, they are integrated in an indirect closed-loop controller synthesis procedure in order to manipulate the process according to a set of required operating conditions. The control law implementation follows the MPC principles, optimizing the control actions within a prediction horizon. The methodology is evaluated against a Coupled Tank Liquid Level control problem and a Yeast Fermentation Reaction Temperature control problem, as two non-linear benchmark scenarios. The system architecture of the proposed solution is depicted in Figure 1.

2. Developing a System Model with Tensor Flow

The successful identification of a discrete model of a process is highly dependent on the choice of its structure and number of variables. This task becomes even more challenging when it is necessary to employ non-linear models to approximate the system’s input/output behavior. In the present work, an ANN based on the TensorFlow implementation is used, with a regression vector that follows the Auto-Regressive with eXogenous inputs (ARX) linear model structure, an approach referred to in the literature as Neural Network Auto-Regressive with eXogenous inputs (NNARX) [22]. The ANN is organized as a Multilayer Perceptron (MLP), an architecture known to be a proper choice for black-box modeling and system identification due to its scalability and universal approximation capability.

Regarding model structure, the activation functions of the hidden layer nodes are set to the hyperbolic tangent and that of the output node to a linear function. This setup enables a good balance between model dimensionality and the fitness of the model to non-linear systems. Parameter tuning is accomplished by solving an optimization problem. Optimization is the task of minimizing or maximizing a cost function $f (x)$ by varying $x$. When training neural networks, it is common to use the mean squared error as the cost function, as defined by $J$ in Equation (1), and the goal is to find the set of parameters that minimizes its value:

J (W, Z^{N}) = \frac{1}{2 N} \sum_{n = 1}^{N} {[y (n) - \hat{y} (n | W)]}^{2} + λ W^{⊤} W,

(1)

where $N$ is the number of samples, $Z^{N}$ is the set of data containing the inputs, $y (n)$ is the expected output, $\hat{y} (n | W)$ is the output calculated by the ANN, $W$ is a vector of all ANN weights and $λ$ is the regularization parameter that penalizes weights with high values. This parameter must be positive, and its magnitude must be selected so that it does not become the driving factor of the cost function, as the main objective is to minimize the error between the training data set and the output generated by the neural network.
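As an illustration of Equation (1), the following minimal Python sketch evaluates the regularized cost for a small MLP with hyperbolic tangent hidden units and a linear output node. It is not the NeuralNetPlayground code: the function names, the single hidden layer, the toy data and the choice to include the biases in $W$ are assumptions made here to keep the example short.

```python
import math
import random

def mlp_predict(phi, W1, b1, w2, b2):
    """One hidden layer of tanh units followed by a linear output node."""
    hidden = [math.tanh(sum(w * x for w, x in zip(row, phi)) + b)
              for row, b in zip(W1, b1)]
    return sum(w * h for w, h in zip(w2, hidden)) + b2

def cost(data, W1, b1, w2, b2, lam):
    """Equation (1): squared error divided by 2N, plus lam * W^T W."""
    n = len(data)
    sq_err = sum((y - mlp_predict(phi, W1, b1, w2, b2)) ** 2 for phi, y in data)
    # W gathers every network parameter (biases included, an assumption here).
    weights = [w for row in W1 for w in row] + b1 + w2 + [b2]
    return sq_err / (2 * n) + lam * sum(w * w for w in weights)

# Toy setup: 4 regressors, 5 hidden neurons, two made-up samples.
rng = random.Random(0)
W1 = [[rng.uniform(-0.5, 0.5) for _ in range(4)] for _ in range(5)]
b1 = [0.0] * 5
w2 = [rng.uniform(-0.5, 0.5) for _ in range(5)]
b2 = 0.0
data = [([0.1, 0.2, 0.3, 0.4], 0.5), ([0.0, -0.1, 0.2, 0.1], -0.2)]

J_plain = cost(data, W1, b1, w2, b2, lam=0.0)
J_reg = cost(data, W1, b1, w2, b2, lam=1e-3)
```

With the same weights, `J_reg` exceeds `J_plain` by exactly the penalty term, which is why $λ$ has to stay small relative to the prediction error.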

Gradient descent is often the chosen iterative optimization algorithm to find the minimum of a non-linear function [23]. However, one of the disadvantages of this method is that it becomes time-consuming if the data set is large and the network has multiple internal layers. At the same time, an ANN with good generalization requires a large amount of data, which makes training a heavy task. To overcome this limitation, Stochastic Gradient Descent (SGD) was developed as an extension of gradient descent [24]. The main difference is that, instead of using all the samples available in the training data set, it uses only a small portion of this data in each step of the algorithm, making the training faster. The samples of each mini-batch are drawn uniformly from the training data set. We use the SGD algorithm to train our neural network.
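The mini-batch idea can be sketched in a few lines of Python. To keep the gradient readable, a linear-in-parameters model stands in for the neural network here; this is a simplification for illustration, not the training code used in this work, and the learning rate, batch size and toy data are hypothetical values.

```python
import random

def sgd_fit(samples, n_params, lr=0.1, lam=1e-4, batch_size=8, epochs=100, seed=0):
    """Mini-batch SGD on the cost (1/2B) * sum(err^2) + lam * w^T w."""
    rng = random.Random(seed)
    w = [rng.uniform(-0.1, 0.1) for _ in range(n_params)]
    steps_per_epoch = max(1, len(samples) // batch_size)
    for _ in range(epochs):
        for _ in range(steps_per_epoch):
            # Draw the mini-batch uniformly from the training set.
            batch = rng.sample(samples, batch_size)
            grad = [2.0 * lam * wi for wi in w]  # gradient of the penalty term
            for x, y in batch:
                err = sum(wi * xi for wi, xi in zip(w, x)) - y
                for i, xi in enumerate(x):
                    grad[i] += err * xi / batch_size
            w = [wi - lr * gi for wi, gi in zip(w, grad)]
    return w

# Toy data generated from y = 2*x1 - 1*x2 (made up for this example).
data_rng = random.Random(42)
samples = []
for _ in range(200):
    x = [data_rng.uniform(-1.0, 1.0), data_rng.uniform(-1.0, 1.0)]
    samples.append((x, 2.0 * x[0] - 1.0 * x[1]))

w_fit = sgd_fit(samples, n_params=2)
```

Each update touches only `batch_size` samples, so the cost per step does not grow with the size of the training set.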

3. Benchmark Models

To assess the control strategy to be presented, two non-linear processes will be used. The Fermentation Reactor [25] and the Coupled Tanks systems [26] are two classical benchmark frameworks frequently used in the literature to evaluate the performance and robustness of non-linear modeling and control methodologies.

3.1. Coupled Tanks System

In several industrial processes, it is often required to process liquids within storage devices. Often they are simply pumped between reservoirs, but they can also take part in chemical reactions where sudden volumetric changes can happen. In either scenario, the level of a fluid within a storage tank must be controlled according to its capacity limits.

In the present scenario, a system consisting of two tanks is used, each having an independent pump to control the inflow of liquid ($q_{1}$ and $q_{2}$) and an outlet at the bottom through which the liquid leaks. The tanks are interconnected by a channel which allows the liquid to flow between them, and the variables under control are the liquid heights in each tank ($h_{1}$ and $h_{2}$) [27], as depicted in Figure 2.

The dynamics of this system can be described by the set of non-linear differential Equations (2) and (3) [27].

a_{1} \frac{d h_{1}}{d t} = q_{1} - α_{1} \sqrt{h_{1}} - sgn (h_{1} - h_{2}) α_{3} \sqrt{h_{1} - h_{2}}

(2)

a_{2} \frac{d h_{2}}{d t} = q_{2} - α_{2} \sqrt{h_{2}} + sgn (h_{1} - h_{2}) α_{3} \sqrt{h_{1} - h_{2}}

(3)

where $a_{1}$ and $a_{2}$ denote the cross-sectional areas of tanks 1 and 2, $h_{1}$ and $h_{2}$ are the liquid levels in tanks 1 and 2, $q_{1}$ and $q_{2}$ are the volumetric flow rates (cm$^{3}$ s$^{-1}$) of pumps 1 and 2, and $α_{1}$, $α_{2}$ and $α_{3}$ are the proportionality coefficients of the $\sqrt{h_{1}}$, $\sqrt{h_{2}}$ and $\sqrt{h_{1} - h_{2}}$ terms, which depend on the discharge coefficients of each outlet and on the gravitational constant. In the present evaluation, $q_{2}$ will be used as an unmeasured external disturbance for the control system. The reservoir model parameters were obtained from the setup described in [28], and are presented in Table 1.
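Equations (2) and (3) can be explored numerically with a short forward-Euler simulation in Python. This is only a sketch under stated assumptions: the parameter values below are hypothetical placeholders and not the Table 1 values, the step size is arbitrary, and the square root of the level difference is taken of its absolute value so that the expression stays real when $h_{2} > h_{1}$.

```python
import math

# Hypothetical parameters for illustration only (not the Table 1 values).
A1 = A2 = 36.0          # cross-sectional areas, cm^2
ALPHA1 = ALPHA2 = 5.6   # outlet coefficients
ALPHA3 = 10.0           # inter-tank channel coefficient

def tank_derivatives(h1, h2, q1, q2):
    """Right-hand sides of Equations (2) and (3), divided by the tank areas."""
    diff = h1 - h2
    # sgn(h1 - h2) * alpha3 * sqrt(|h1 - h2|): flow from tank 1 into tank 2.
    flow12 = math.copysign(ALPHA3 * math.sqrt(abs(diff)), diff)
    dh1 = (q1 - ALPHA1 * math.sqrt(max(h1, 0.0)) - flow12) / A1
    dh2 = (q2 - ALPHA2 * math.sqrt(max(h2, 0.0)) + flow12) / A2
    return dh1, dh2

def simulate(q1, q2, h1=0.0, h2=0.0, dt=0.1, t_end=2000.0):
    """Forward-Euler integration with constant pump flows; returns final levels."""
    for _ in range(int(t_end / dt)):
        dh1, dh2 = tank_derivatives(h1, h2, q1, q2)
        h1 = max(h1 + dt * dh1, 0.0)  # levels cannot become negative
        h2 = max(h2 + dt * dh2, 0.0)
    return h1, h2

h1_40, h2_40 = simulate(q1=40.0, q2=0.0)
h1_80, h2_80 = simulate(q1=80.0, q2=0.0)
```

Because the outflows scale with the square root of the level, doubling $q_{1}$ in this sketch roughly quadruples the steady-state value of $h_{2}$, which is the kind of operating-point-dependent gain the benchmark is chosen for.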

As evidenced by Figure 3a,b, the system’s steady-state gain is non-linear and its incremental gain is highly dependent on the current operating point.

3.2. Yeast Fermentation Reaction

Yeast fermentation is a biochemical process which, having ethanol and carbon dioxide as by-products, has significant value for several branches of the food industry as well as for other domains such as the pharmaceutical and chemical industries. The yeast fermentation reaction is itself a composition of several interdependent physical and chemical processes that occur simultaneously within a reactor. This reactor is often modelled as a stirred tank with a constant substrate feed flow and a constant outlet flow containing the product (ethanol), substrate (glucose), and biomass (suspension of yeast). Given its large structure and number of parameters, the details of the model are not presented here and can be found in [29].

Fermentation reactions are exothermic and, since they depend on living organisms whose growth rate is highly sensitive to temperature variations, it is important to avoid temperature runaway of the reactor. Temperature control is therefore a key factor to ensure the stability of the reaction and, for that purpose, cooling jackets are often employed [30]. Thus, from the perspective of a control algorithm, the reactor is a single-input single-output process: the coolant flow rate ($F_{ag}$) is the input (the manipulated variable) and the reactor’s temperature ($T_{r}$) is the output (the controlled variable). In the present evaluation, the substrate temperature will act as an external disturbance to the system. The continuous fermentation reactor that will serve as an evaluation scenario for the developed control strategy is depicted in Figure 4.

The dynamic behavior of the process is non-linear and highly dependent on the current operating point, as evinced by the steady-state gain curve depicted in Figure 5a and the incremental gain represented in Figure 5b.

4. Process Identification

System identification is the task of mathematically describing a model of a dynamic system from a set of measurements made on the real system (black-box modeling).

System identification can be divided into four steps, as described by Nørgaard et al. [22]: (i) experiment, (ii) model structure selection, (iii) model estimation, and (iv) model validation. Details about these steps follow.

4.1. Experiment

This is the first step, and one of the most important, in system identification. Open-loop tests are made to gain insight into the system and to gather the data that describes its behavior. Some choices need to be made carefully, such as the sampling frequency and the input signal, which must excite the system over its entire operating range.

In the Coupled Tanks Liquid Level control scenario, by evaluating the system response to several step actuations of the flow rate of pump 1, a sampling interval of 2 s was chosen as appropriate to capture the plant’s behavior. Regarding the Yeast Fermentation Reactor Temperature control scenario, this process has a relatively slow dynamic behavior, which is mainly imposed by the glucose decomposition rate [29]. Consequently, when the reaction’s operating point is changed, the settling time is in the scale of hours and, as such, one sample per hour is enough to capture the process’s relevant dynamics.

For the analysis of both models, train and test sets were created with 20 thousand samples each. This data collection is made using MATLAB, choosing a pseudo-random input signal to manipulate the system.
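One common way to build such an excitation signal is a sequence of random levels held for random durations. The Python sketch below is an assumption about the shape of the signal, not the MATLAB script used for the experiments; the amplitude range and hold times are hypothetical, and only the set size of 20 thousand samples comes from the text.

```python
import random

def pseudo_random_steps(n_samples, u_min, u_max, min_hold, max_hold, seed=0):
    """Piecewise-constant signal: random amplitude, random hold time per step."""
    rng = random.Random(seed)
    signal = []
    while len(signal) < n_samples:
        level = rng.uniform(u_min, u_max)       # amplitude covers the input range
        hold = rng.randint(min_hold, max_hold)  # number of samples the level is held
        signal.extend([level] * hold)
    return signal[:n_samples]

# Separate seeds give independent train and test excitations.
u_train = pseudo_random_steps(20000, 0.0, 100.0, 20, 200, seed=1)
u_test = pseudo_random_steps(20000, 0.0, 100.0, 20, 200, seed=2)
```

Varying both amplitude and hold time lets the data contain fast transients as well as near steady-state segments across the operating range.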

4.2. Model Structure Selection

One important step of the model identification procedure is the definition of the model structure. Since the coupled tanks system has two storage tanks, it can be approximated as a second-order system. Therefore, we use an NNARX model with two past output signals, $y (k - 1)$ and $y (k - 2)$, and two past input signals, $u (k - 1)$ and $u (k - 2)$, where the output is the liquid height in tank 2 ($h_{2}$) and the input is the flow rate of pump 1 ($q_{1}$). Other structures, as reported by Nørgaard et al. [22], could also be adopted.

Concerning the Fermentation Reactor modeling, the related literature [25] finds second-order regressive models with no dead-time to be adequate for this task. Once again, we use an NNARX model with two past output signals, $y (k - 1)$ and $y (k - 2)$, and two past input signals, $u (k - 1)$ and $u (k - 2)$, where the output is the reactor internal temperature $T_{r}$ and the input is the coolant flow rate $F_{ag}$.

The structure of both models is generically depicted in Figure 6.
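The second-order NNARX structure amounts to pairing each output sample with a regression vector of two past outputs and two past inputs. A minimal Python sketch of that bookkeeping follows; the function name and the ordering of the regressors inside the vector are choices made here for illustration.

```python
def build_regressors(u, y, n_y=2, n_u=2):
    """Return (phi(k), y(k)) pairs with phi(k) = [y(k-1), y(k-2), u(k-1), u(k-2)]."""
    start = max(n_y, n_u)  # first index with a complete history
    pairs = []
    for k in range(start, len(y)):
        phi = ([y[k - i] for i in range(1, n_y + 1)]
               + [u[k - i] for i in range(1, n_u + 1)])
        pairs.append((phi, y[k]))
    return pairs

# Tiny made-up record to show the alignment of inputs and targets.
u_demo = [0.0, 1.0, 2.0, 3.0, 4.0]
y_demo = [10.0, 11.0, 12.0, 13.0, 14.0]
pairs_demo = build_regressors(u_demo, y_demo)
```

For the demo record, the first pair is `([11.0, 10.0, 1.0, 0.0], 12.0)`: the target $y (2)$ is explained by $y (1)$, $y (0)$, $u (1)$ and $u (0)$.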

4.3. Model Estimation

With the chosen structure and using the gathered data, $Z^{N} = \{[u (k), y (k)], k = 1, \dots, N\}$, the next step is to train the neural network. This process starts by randomly initializing the weights, which are then updated with the SGD method.

The model training framework allows one to specify several input parameters, such as the number of hidden layers, the number of neurons in each layer, the learning rate ($ϵ$), the regularization factor ($λ$), the number of epochs and the mini-batch size, defined in Table 2. Several neural networks with two hidden layers and different numbers of neurons (5, 8, and 10 neurons) were trained individually.

4.4. Model Validation

In this final step, the trained model is evaluated to assess whether it can properly represent the system behavior. As these models are biased towards good performance on the training data set, they are further validated against a different test data set. For each model, the estimation Mean Squared Error (MSE) is measured on the test data set. The neural network with the lowest MSE is then chosen as the system predictor used by the MPC controller.
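The selection rule can be sketched as follows in Python. The candidate predictors are hypothetical stand-ins (simple functions rather than trained networks) and the test data is made up; only the rule of picking the lowest test MSE comes from the text.

```python
def mean_squared_error(y_true, y_pred):
    return sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true)

def select_best(candidates, test_inputs, test_targets):
    """Score every candidate on the test set and return the lowest-MSE one."""
    scores = {
        name: mean_squared_error(test_targets, [predict(phi) for phi in test_inputs])
        for name, predict in candidates.items()
    }
    best = min(scores, key=scores.get)
    return best, scores

# Made-up test set and stand-in predictors, for illustration only.
test_inputs = [[0.0], [1.0], [2.0]]
test_targets = [0.0, 2.0, 4.0]
candidates = {
    "candidate_a": lambda phi: 1.5 * phi[0],
    "candidate_b": lambda phi: 2.1 * phi[0],
    "candidate_c": lambda phi: 1.9 * phi[0] + 0.3,
}
best_name, scores = select_best(candidates, test_inputs, test_targets)
```

Scoring on data that was not used for training is what guards against picking a network that has merely memorized the training set.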

4.5. Results

Table 3 and Table 4 present the MSE on both the training and test data sets for the differently trained neural networks.

For the Coupled Tanks Liquid Level model, the lowest value in Table 3 occurs in simulation number 6, with 10 neurons. In the Yeast Fermentation Reactor Temperature model case, the lowest value in Table 4 occurs in simulation number 3, with 10 neurons. Figure 7 and Figure 8 depict the results for the “best” neural network, comparing the response calculated by the neural network and the real output for a given test data set. We verify that our procedure is capable of identifying a non-linear model of the system even when the measured signal is corrupted by noise. The remaining question is whether the obtained models are suitable for control purposes.

5. Process Control

Control of non-linear systems is one of the many applications of neural networks, and its goal is to drive the system behavior in a pre-defined, intended manner. The development of a controller based on neural networks can be addressed in two ways: (i) direct methods, in which the neural network itself is trained to be the controller according to some criterion, and (ii) indirect methods, in which the controller is based on a model of the system to be controlled (in this case the controller is not a neural network) [22]. In this work, we use MPC, which is an indirect method.

5.1. Model-Based Predictive Control

MPC is a control strategy that uses the model to predict the output. Using these predictions, the aim is to find the control signal that minimizes the cost function that is dependent on those predicted outputs, desired trajectories, and control actions. As these controllers depend on the system’s model, their performance heavily relies on the identified model. Figure 9 represents the basic structure of an MPC algorithm.

The idea of this approach is to minimize the criterion presented in (4).

J (t, U (t)) = \sum_{k = N_{1}}^{N_{2}} {[r (t + k) - \hat{y} (t + k)]}^{2} + ρ \sum_{k = 1}^{N_{u}} {[∆ u (t + k - 1)]}^{2}

(4)

Subject to:

∆ u (t + k) = 0, \quad N_{u} \leq k \leq N_{2} - d

(5)

u_{\min} \leq u (t) \leq u_{\max}

(6)

with respect to the first $N_{u}$ future control inputs:

U (t) = {[u (t) \dots u (t + N_{u} - 1)]}^{⊤}

(7)

Here, $d$ is the system time delay (assumed equal to 1), $r (t + k)$ is the signal with the future reference samples, $\hat{y} (t + k)$ is the signal with the predicted output samples based on the model, $∆ u (t + k - 1)$ is the signal with the changes in the control signal, $N_{1}$ is the minimum prediction horizon, $N_{2}$ is the prediction horizon, $N_{u}$ is the control horizon and $ρ$ is the weighting factor that penalizes changes in the control actions.
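To make criterion (4) concrete, the Python sketch below evaluates it for one candidate control sequence. It assumes the second-order NNARX regressor $[y (t + k - 1), y (t + k - 2), u (t + k - 1), u (t + k - 2)]$ with $d = 1$, and `predict` is a hypothetical stand-in for the trained network; the bound constraint (6) is left to the optimizer and is not checked here.

```python
def mpc_cost(U, y_now, y_prev, u_prev, reference, predict, N1, N2, rho):
    """Criterion (4) for U = [u(t), ..., u(t+Nu-1)]; reference[k-1] holds r(t+k)."""
    Nu = len(U)
    # Constraint (5): beyond the control horizon the input is held constant.
    u_seq = [U[min(k, Nu - 1)] for k in range(N2)]
    y1, y2, u_older = y_now, y_prev, u_prev
    tracking = 0.0
    for k in range(1, N2 + 1):
        u_new = u_seq[k - 1]
        y_hat = predict([y1, y2, u_new, u_older])  # recursive k-step prediction
        if k >= N1:
            tracking += (reference[k - 1] - y_hat) ** 2
        y1, y2, u_older = y_hat, y1, u_new
    moves = [U[0] - u_prev] + [U[k] - U[k - 1] for k in range(1, Nu)]
    return tracking + rho * sum(m * m for m in moves)

# Stand-in linear predictor (an assumption, not the identified model).
toy_predict = lambda phi: 0.5 * phi[0] + 0.5 * phi[2]
J_demo = mpc_cost([1.0, 1.0], 0.0, 0.0, 0.0, [1.0, 1.0, 1.0],
                  toy_predict, N1=1, N2=3, rho=0.1)
```

In the demo the predicted outputs are 0.5, 0.75 and 0.875, so the tracking term is 0.328125 and the single control move of size 1 adds $ρ = 0.1$.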

The minimization of this criterion, when the predictions are determined by a non-linear relationship, constitutes a complex non-linear programming problem. The problem becomes more demanding when real-time implementation is required since, under this condition, it is necessary to impose an upper bound on the solution time of the control law synthesis. In this implementation, we use the Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm to minimize the cost function (4) [23], as implemented in Brian Granzow’s (https://github.com/bgranzow) framework [31]. BFGS is a quasi-Newton method: it builds an approximation of the Hessian matrix from successive gradient evaluations instead of computing second derivatives, and the limited-memory variant provided by [31] (L-BFGS-B) additionally reduces the memory requirements of the optimization problem.
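For reference, the textbook form of the BFGS update (see [23]) can be written as follows. The symbols $B_{k}$ (Hessian approximation at iteration $k$), $s_{k}$, $q_{k}$ and $p_{k}$ are introduced here for illustration only; $q_{k}$ denotes the gradient difference so that it does not clash with the output $y$.

```latex
s_{k} = U_{k+1} - U_{k}, \qquad q_{k} = \nabla J (U_{k+1}) - \nabla J (U_{k})

B_{k+1} = B_{k} - \frac{B_{k} s_{k} s_{k}^{\top} B_{k}}{s_{k}^{\top} B_{k} s_{k}} + \frac{q_{k} q_{k}^{\top}}{q_{k}^{\top} s_{k}}

p_{k} = - B_{k}^{-1} \nabla J (U_{k})
```

Only gradients of $J$ enter the update, and $p_{k}$ is the search direction along which the next iterate $U_{k+1}$ is sought.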

The MPC strategy can be summarized in the following steps [1]:

1. The future outputs, $\hat{y} (t + k)$ , are calculated over the prediction horizon at each sampling time using the model of the system, which in this case is a neural network. These values of $\hat{y}$ depend on the past input and output samples and the future control samples, $u (t + k)$ .
2. The values of $U (t)$ are calculated by an optimization algorithm in order to minimize the cost function (4). This criterion tries to approximate the future outputs to the future reference signal.
3. After optimization, the first sample of the signal $U (t)$ is applied to the system and the other samples of this signal are ignored. When a new sampling time is available the described cycle starts over.
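The three steps above can be sketched as a receding-horizon loop in Python. Several simplifications are assumed and should not be read as the implementation used in this work: a hypothetical first-order non-linear function plays the role of both the trained network and the plant, $N_{1} = 1$, and a projected gradient descent with finite-difference gradients replaces the BFGS optimizer to keep the example dependency-free.

```python
import math

def model(y, u):
    """Hypothetical first-order non-linear predictor (stand-in for the network)."""
    return 0.8 * y + 0.2 * math.tanh(u)

def horizon_cost(U, y_now, u_prev, setpoint, N2, rho):
    """Step 1: predict over the horizon and accumulate criterion (4)."""
    y, last_u, J = y_now, u_prev, 0.0
    for k in range(N2):
        u = U[min(k, len(U) - 1)]  # input held after the control horizon
        y = model(y, u)
        J += (setpoint - y) ** 2 + rho * (u - last_u) ** 2
        last_u = u
    return J

def optimize(U0, args, u_min, u_max, iters=100, step=0.3, eps=1e-6):
    """Step 2: projected gradient descent (swapped in for BFGS in this sketch)."""
    U = list(U0)
    for _ in range(iters):
        base = horizon_cost(U, *args)
        grad = []
        for i in range(len(U)):
            shifted = list(U)
            shifted[i] += eps
            grad.append((horizon_cost(shifted, *args) - base) / eps)
        # Projection keeps every input inside the bounds of constraint (6).
        U = [min(max(u - step * g, u_min), u_max) for u, g in zip(U, grad)]
    return U

def run_closed_loop(setpoint=0.5, n_steps=60, N2=5, Nu=2, rho=0.01):
    y, u_prev, U = 0.0, 0.0, [0.0] * Nu
    outputs, inputs = [], []
    for _ in range(n_steps):
        U = optimize(U, (y, u_prev, setpoint, N2, rho), 0.0, 1.0)
        u_prev = U[0]          # Step 3: apply only the first sample
        y = model(y, u_prev)   # the plant equals the model in this sketch
        outputs.append(y)
        inputs.append(u_prev)
        U = U[1:] + U[-1:]     # warm start for the next sampling time
    return outputs, inputs

outputs, inputs = run_closed_loop()
```

Re-solving the problem at every sample is what lets the controller correct for prediction errors, at the price of one optimization run per sampling period.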

5.2. Disturbance Rejection

Besides accurately following a desired setpoint, a good controller must be able to react to unexpected external disturbances. This is a difficulty for model-based control systems, because models of the disturbance distribution add significant complexity to the control synthesis problem and are often limited in their validity. Therefore, one must rely on other methods to deal with external disturbances and model mismatches and so avoid steady-state error in the control system [32].

In our model-based application, we address this problem with an approach similar to that of Fatehi et al. [32]. That work suggests adding to the future predictions, $\hat{y} (t + k)$, a quantity $d_{m}$ representing the disturbance, which is assumed to be constant over the horizon:

\hat{y} (t + k) = f [φ (t + k), W] + d_{m}

(8)

where $f [φ (t + k), W]$ is the output of the neural network and $d_{m}$ is computed as:

d_{m} (t) = w_{d} (t) e (t) + b (t)

(9)

The parameter $e (t)$ is the difference between the real output of the system and the output of the neural network; $b$ and $w_{d}$ are weights that are adapted at each sampling time according to (10) and (11), respectively.

b (t) = b (t - 1) + η e (t) + k_{p} [e (t) - e (t - 1)]

(10)

w_{d} (t) = w_{d} (t - 1) + η {e (t)}^{2}

(11)

The constants $η$ and $k_{p}$ are chosen as $η = k_{p} = 0.1$.
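Equations (9) to (11) translate directly into a small stateful estimator. The Python sketch below is illustrative: the class name is hypothetical and the zero initial values of $b$, $w_{d}$ and $e (t - 1)$ are an assumption, since the text does not state them.

```python
class DisturbanceEstimator:
    """Adaptive disturbance term d_m(t) following Equations (9)-(11)."""

    def __init__(self, eta=0.1, k_p=0.1):
        self.eta = eta
        self.k_p = k_p
        self.b = 0.0       # assumed initial value
        self.w_d = 0.0     # assumed initial value
        self.e_prev = 0.0  # assumed initial value of e(t-1)

    def update(self, y_measured, y_model):
        e = y_measured - y_model  # prediction error e(t)
        self.b += self.eta * e + self.k_p * (e - self.e_prev)  # Equation (10)
        self.w_d += self.eta * e ** 2                          # Equation (11)
        self.e_prev = e
        return self.w_d * e + self.b                           # Equation (9)

estimator = DisturbanceEstimator()
d_first = estimator.update(y_measured=1.0, y_model=0.0)
d_second = estimator.update(y_measured=1.0, y_model=0.0)
```

With a constant unit error, the returned correction grows from 0.3 to 0.5 over the first two samples: the $η e (t)$ term keeps integrating the error, while the $k_{p}$ term only reacts when the error changes.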

This scheme deals with the two problems mentioned above: (1) model mismatches and (2) the occurrence of external disturbances. Both cause prediction errors but must be treated separately. External disturbances require a faster adaptation in order to follow fast variations in the system. When there are no external disturbances, the adaptation should be slowed down because it may degrade the predictions made by the model. According to Fatehi et al. [32], this is done by applying a high-pass filter to the error signal $e (t)$.

6. Results

In this section, the implemented closed-loop controller is analyzed under several operating scenarios. For each system, the effect of the controller design weight $ρ$ on the transient response of the control system is analyzed, the setpoint-following capability is evaluated for several operating conditions and, finally, the robustness of the control system to unmeasured external disturbances is evaluated.

6.1. Coupled Tanks Liquid Level Control

For this setup, the parameters of the MPC controller are presented in Table 5.

According to Figure 10a,b, higher values of $ρ$ correspond to smoother control and output signals, while lower values of $ρ$ correspond to more oscillatory control and output signals. This coefficient thus becomes a design parameter, chosen according to the project specification in terms of performance requirements such as settling time and overshoot.

Figure 11a,b presents the closed-loop performance of the control system in two distinct scenarios: in the first, the strategy that compensates for the steady-state error caused by external disturbances is not used, whereas in the second it is. In both evaluations, the output signal is corrupted by Gaussian white noise with zero mean and variance of 0.05 cm. Furthermore, a static disturbance is added to the system at sample 1300 through the input $q_{2}$, with a constant flow of 10 cm$^{3}$ s$^{-1}$.
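The test conditions described above can be reproduced with two small helpers, sketched here in Python. The function names are hypothetical, and it is assumed that $q_{2}$ is zero before the disturbance is switched on; note that the Gaussian generator takes a standard deviation, so the variance of 0.05 enters through its square root.

```python
import math
import random

NOISE_VARIANCE = 0.05      # measurement noise variance quoted in the text
DISTURBANCE_SAMPLE = 1300  # sample at which the disturbance starts
DISTURBANCE_FLOW = 10.0    # constant q2 flow, cm^3/s

def q2_disturbance(k):
    """Static disturbance on q2 (assumed to be zero before sample 1300)."""
    return DISTURBANCE_FLOW if k >= DISTURBANCE_SAMPLE else 0.0

def noisy_measurement(h2, rng):
    """Add zero-mean Gaussian noise with the stated variance to the level h2."""
    return h2 + rng.gauss(0.0, math.sqrt(NOISE_VARIANCE))

rng = random.Random(0)
sample = noisy_measurement(15.0, rng)
```

Passing the generator `rng` explicitly keeps the noise sequence reproducible between runs with and without the disturbance compensation.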

6.2. Yeast Fermentation Reactor Temperature Control

For this evaluation scenario, the parameters of the MPC controller are presented in Table 6.

Likewise, the influence of $ρ$ on the control signal is analyzed in Figure 12a,b. As depicted, higher values of $ρ$ correspond to smoother control and output signals, while lower values of $ρ$ correspond to more oscillatory control and output signals. To conclude the controller performance analysis, Figure 13a presents the performance of the temperature control system under setpoint variations, while Figure 13b presents its response after the raw material’s temperature is increased from 25 °C to 27 °C at sample 250. This variation acts as an unmeasured external disturbance to the control system. The controlled variable measurement is also corrupted by Gaussian white noise with zero mean and variance of 0.05 °C.

7. Conclusions

This paper presented a technique for the identification and control of non-linear systems with ANNs based on the TensorFlow implementation. According to the results obtained in the example scenarios, the controller can follow the desired setpoint (within the operating range of the system) and successfully overcome the influence of unmeasured external disturbances.

One advantage of this controller implementation is the ease with which it is tuned. Based on several training epochs that covered the nominal system operating conditions, it was possible to develop a control loop capable of manipulating the controlled variable according to the desired reference. It is important to highlight that this result was achieved with a limited dimensionality model of the process, without even considering the influence of external disturbances. Higher dimensionality models could certainly be obtained with a TensorFlow structure, but a smaller Single-Input/Single-Output model enabled the use of well-known control algorithms without entering the domain of multivariable control problems.

However, one of the disadvantages of obtaining the controller output based on non-linear numerical optimization methods is the time it takes to calculate the control signal. For a slow dynamic system, this problem is not troublesome but, for faster systems the control signal must be calculated within the sampling period. This constraint may introduce a trade-off between model dimensionality/controller performance and the computational resources available to solve the problem.

Nevertheless, the proposed scheme stands as a generic and robust controller synthesis approach that can be applied to a multitude of application scenarios that may require an escalation on the model dimensionality, number of neurons and hidden layers.

Author Contributions

R.A. was responsible for the idea conceptualization, provided scientific supervision and was responsible for technical manuscript writing. J.A. was responsible for technical implementation. A.M. and R.E.M. provided scientific supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially funded by National Funds through FCT—Foundation for Science and Technology, in the context of the projects: ID/CEC/00127/2019.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Camacho, E.F.; Bordons, C. Model Predictive Control, 2nd ed.; Springer: London, UK, 2007.
2. Cutler, C.R.; Ramaker, B.L. Dynamic matrix control - A computer control algorithm. In Proceedings of the Joint Automatic Control Conference, San Francisco, CA, USA, 13–15 August 1980; Volume 17, p. 72.
3. Clarke, D.; Mohtadi, C.; Tuffs, P. Generalized predictive control - Part I. The basic algorithm. Automatica 1987, 23, 137–148.
4. Lee, J.H. Model predictive control: Review of the three decades of development. Int. J. Control Autom. Syst. 2011, 9, 415.
5. Sheta, A.; Braik, M.; Al-Hiary, H. Identification and Model Predictive Controller Design of the Tennessee Eastman Chemical Process Using ANN. In Proceedings of the 2009 International Conference on Artificial Intelligence, Las Vegas, NV, USA, 13–16 July 2009.
6. Eaton, J.W.; Rawlings, J.B. Model-predictive control of chemical processes. Chem. Eng. Sci. 1992, 47, 705–720.
7. Mohanty, S. Artificial neural network based system identification and model predictive control of a flotation column. J. Process Control 2009, 19, 991–999.
8. Bolognani, S.; Bolognani, S.; Peretti, L.; Zigliotto, M. Design and implementation of model predictive control for electrical motor drives. IEEE Trans. Ind. Electron. 2008, 56, 1925–1936.
9. Stogiannos, M.; Alexandridis, A.; Sarimveis, H. Model predictive control for systems with fast dynamics using inverse neural models. ISA Trans. 2018, 72, 161–177.
10. Guillaumin, M.; Verbeek, J.; Schmid, C. Multimodal semi-supervised learning for image classification. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, 13–18 June 2010.
11. Deng, L.; Li, X. Machine learning paradigms for speech recognition: An overview. IEEE Trans. Audio Speech Lang. Process. 2013, 21, 1060–1089.
12. Collobert, R.; Weston, J. A unified architecture for natural language processing: Deep neural networks with multitask learning. In Proceedings of the 25th International Conference on Machine Learning, Helsinki, Finland, 5–9 July 2008.
13. Narendra, K.S.; Parthasarathy, K. Identification and control of dynamical systems using neural networks. IEEE Trans. Neural Netw. 1990, 1, 4–27.
14. Toleubay, Y.; James, A.P. Getting Started with TensorFlow Deep Learning. In Deep Learning Classifiers with Memristive Networks. Modeling and Optimization in Science and Technologies; James, A., Ed.; Springer: Cham, Switzerland, 2020; Volume 14.
15. Chollet, F. Keras. 2015. Available online: https://keras.io (accessed on 5 June 2020).
16. Jia, Y.; Shelhamer, E.; Donahue, J.; Karayev, S.; Long, J.; Girshick, R.; Guadarrama, S.; Darrell, T. Caffe: Convolutional architecture for fast feature embedding. In Proceedings of the ACM International Conference on Multimedia - MM ’14, Orlando, FL, USA, 3–7 November 2014.
17. Bhardwaj, S.; Curtin, R.R.; Edel, M.; Mentekidis, Y.; Sanderson, C. Ensmallen: A flexible C++ library for efficient function optimization. arXiv 2018, arXiv:1810.09361.
18. Tensor Flow Lite. 2019. Available online: https://www.tensorflow.org/lite/ (accessed on 5 June 2020).
19. Banbury, C.R.; Reddi, V.J.; Lam, M.; Fu, W.; Fazel, A.; Holleman, J.; Huang, X.; Hurtado, R.; Kanter, D.; Lokhmotov, A.; et al. Benchmarking TinyML Systems: Challenges and Direction. arXiv 2020, arXiv:2003.04821.
20. Lai, L.; Suda, N.; Chandra, V. CMSIS-NN: Efficient Neural Network Kernels for Arm Cortex-M CPUs. arXiv 2018, arXiv:1801.06601.
21. Phan, R. NeuralNetPlayground: A MATLAB implementation of the TensorFlow Neural Network Playground. GitHub. Available online: https://github.com/StackOverflowMATLABchat/NeuralNetPlayground (accessed on 5 June 2020).
22. Nørgaard, M.; Ravn, O.; Poulsen, N.; Hansen, L.K. Neural Networks for Modelling and Control of Dynamic Systems: A Practitioner’s Handbook; Springer: London, UK, 2000.
23. Nocedal, J.; Wright, S.J. Numerical Optimization, 2nd ed.; Springer Series in Operations Research and Financial Engineering; Springer: New York, NY, USA, 2006.
24. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; The MIT Press: Cambridge, MA, USA, 2016.
25. Ławryńczuk, M. Computationally Efficient Model Predictive Control Algorithms - A Neural Network Approach; Studies in Systems, Decision and Control; Springer International Publishing: Cham, Switzerland, 2014; Volume 3.
26. Ogata, K. Modern Control Engineering, 5th ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2009.
27. Antão, R. Type-2 Fuzzy Logic: Uncertain Systems’ Modeling and Control; Springer: Singapore, 2017.
28. Wu, D.; Tan, W. A Simplified Type-2 Fuzzy Logic Controller for Real-Time Control. ISA Trans. 2006, 45, 503–516.
29. Nagy, Z. Model Based Control of a Yeast Fermentation Bioreactor Using Optimally Designed Artificial Neural Networks. Chem. Eng. J. 2007, 127, 95–109.
30. Luyben, W. Chemical Reactor Design and Control; AIChE Wiley: Hoboken, NJ, USA, 2007.
31. Granzow, B. L-BFGS-B. 2017. Available online: https://github.com/bgranzow/L-BFGS-B (accessed on 5 June 2020).
32. Fatehi, A.; Sadjadian, H.; Khaki-Sedigh, A.; Jazayeri, A. Disturbance Rejection in Neural Network Model Predictive Control. IFAC Proc. 2008, 41, 3527–3532.

Figure 1. System Architecture for the implementation of a Tensor Flow model-based control system.

Figure 2. Diagram of the coupled tanks system. Adapted from [27].

Figure 3. Steady-state gain and incremental gain variation in the Coupled Tanks Liquid Level Control problem.

Figure 4. Setup of the continuous fermentation reactor.

Figure 5. Steady-state gain and Incremental gain variation in the Yeast Fermentation Reaction Temperature Control problem.

Figure 6. NNARX model structure.

Figure 7. Comparison between the real output ($y$) and the predicted output ($\hat{y}$) by the neural network for the tank 2 liquid level.

Figure 8. Comparison between the real output ($y$) and the predicted output ($\hat{y}$) by the neural network for the Yeast Fermentation Reactor Temperature.

Figure 9. Structure of MPC algorithm.

Figure 10. Influence of the parameter $ρ$ (control action attenuation factor) on the dynamic behavior of the control system.

Figure 11. Closed-loop performance of the Coupled Tanks Liquid Level Control system for several operation setpoints, with $ρ = 5$.

Figure 12. Influence of the parameter $ρ$ (control action attenuation factor) on the dynamic behavior of the control system.

Figure 13. Closed-loop performance of the Yeast Fermentation Reaction Temperature Control system.

Table 1. Parameters of the simulated Coupled Tanks System.

$a_{1}$	$a_{2}$	$α_{1}$	$α_{2}$	$α_{3}$
36.52 cm$^{2}$	36.52 cm$^{2}$	5.6186	5.6182	10

Table 2. Parameters used in the model training framework.

$ϵ$	$λ$	Minibatch Size	Epochs
0.2	$1 \times 10^{-6}$	20,000	10,000

Table 3. Mean Squared Error of the Coupled Tanks Liquid Level model for several modeling approaches with two hidden layers and a different number of neurons per layer.

Neurons per Layer	5		8		10
Simulation	MSE_Train	MSE_Test	MSE_Train	MSE_Test	MSE_Train	MSE_Test
1	0.002631	0.003017	0.001692	0.001852	0.002480	0.002759
2	0.002736	0.003170	0.002395	0.002700	0.002085	0.002271
3	0.002670	0.002945	0.001942	0.002243	0.002124	0.002390
4	0.002401	0.002648	0.002711	0.002969	0.001854	0.002125
5	0.003406	0.003835	0.002577	0.003015	0.002848	0.002979
6	0.002717	0.003131	0.002081	0.002418	0.001347	0.001503
7	0.003076	0.003466	0.002196	0.002379	0.002607	0.002854
8	0.002661	0.003206	0.002134	0.002444	0.001932	0.002175
9	0.004027	0.004568	0.001849	0.002119	0.002365	0.002592
10	0.002977	0.003258	0.002090	0.002337	0.002045	0.002316
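
One way to read Table 3 is to pick, for each network size, the simulation with the lowest test-set error. The short Python sketch below does this with the MSE_Test columns copied from the table. The selection rule (lowest test MSE) and the helper name `best_run` are assumptions made for illustration; they are not necessarily the criterion the authors applied.

```python
# Test-set MSE per simulation, copied from Table 3.
# Keys are the number of neurons per hidden layer.
mse_test = {
    5: [0.003017, 0.003170, 0.002945, 0.002648, 0.003835,
        0.003131, 0.003466, 0.003206, 0.004568, 0.003258],
    8: [0.001852, 0.002700, 0.002243, 0.002969, 0.003015,
        0.002418, 0.002379, 0.002444, 0.002119, 0.002337],
    10: [0.002759, 0.002271, 0.002390, 0.002125, 0.002979,
         0.001503, 0.002854, 0.002175, 0.002592, 0.002316],
}


def best_run(values):
    """Return (1-based simulation index, MSE) of the lowest-error run."""
    index = min(range(len(values)), key=values.__getitem__)
    return index + 1, values[index]


# Best simulation for each network size (hypothetical selection rule).
best = {neurons: best_run(values) for neurons, values in mse_test.items()}

# Network size whose best simulation has the lowest test MSE overall.
overall = min(best, key=lambda n: best[n][1])

for neurons, (run, mse) in best.items():
    print(f"{neurons} neurons: best simulation {run}, MSE_Test = {mse:.6f}")
print(f"Lowest test MSE overall: {overall} neurons per layer")
```

Under this rule the lowest test error in the table belongs to simulation 6 of the 10-neuron network (0.001503).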

Table 4. Mean Squared Error of the Yeast Fermentation Reactor Temperature model for several modeling approaches with two hidden layers and a different number of neurons per layer.

Neurons per Layer	5		8		10
Simulation	MSE_Train	MSE_Test	MSE_Train	MSE_Test	MSE_Train	MSE_Test
1	0.006504	0.006682	0.005682	0.005494	0.004456	0.004262
2	0.006416	0.006292	0.005577	0.005460	0.004349	0.004266
3	0.006536	0.006303	0.005566	0.005347	0.004039	0.003952
4	0.006218	0.006984	0.005653	0.005407	0.006213	0.005869
5	0.006062	0.005963	0.005937	0.005694	0.005686	0.005519
6	0.006915	0.006563	0.005995	0.005849	0.005170	0.005014
7	0.007280	0.006389	0.005434	0.005325	0.004458	0.004384
8	0.006787	0.006527	0.005518	0.005355	0.004130	0.004032
9	0.006470	0.006265	0.006642	0.006216	0.004323	0.004200
10	0.006535	0.006423	0.005171	0.005100	0.004625	0.004351

Table 5. Parameters of the model-based predictive controller for the Coupled Tanks Liquid Level control.

$N_{1}$	$N_{2}$	$N_{u}$	$ρ$
1	6	3	5
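
To show how the four parameters of Table 5 interact, the sketch below evaluates a quadratic predictive-control cost for one candidate control sequence. It assumes the usual GPC-style form, in which the squared tracking error is summed over the prediction horizon $N_{1}$ to $N_{2}$ and the squared control increments over the control horizon $N_{u}$ are weighted by $ρ$. The function name `mpc_cost` and all signal values are hypothetical; the predictions would normally come from the neural network model.

```python
def mpc_cost(y_pred, reference, u_future, u_prev, n1, n2, rho):
    """Quadratic predictive-control cost (assumed GPC-style form).

    y_pred[i - 1] and reference[i - 1] hold the values for step t + i,
    with i = 1..n2. u_future holds the candidate control moves
    u(t), ..., u(t + n_u - 1); u_prev is the last applied control u(t - 1).
    """
    # Tracking term: squared error over the prediction horizon n1..n2.
    tracking = sum(
        (reference[i - 1] - y_pred[i - 1]) ** 2 for i in range(n1, n2 + 1)
    )
    # Effort term: squared control increments over the control horizon.
    moves = [u_prev] + list(u_future)
    effort = sum((moves[k + 1] - moves[k]) ** 2 for k in range(len(u_future)))
    return tracking + rho * effort


# Horizons and weight taken from Table 5 (Coupled Tanks controller).
N1, N2, NU, RHO = 1, 6, 3, 5

# Hypothetical signals, for illustration only.
reference = [10.0] * N2
y_pred = [9.0, 9.5, 10.0, 10.0, 10.0, 10.0]
u_future = [2.0, 2.5, 2.5]  # NU candidate control moves

cost = mpc_cost(y_pred, reference, u_future, u_prev=1.0,
                n1=N1, n2=N2, rho=RHO)
print(cost)
```

With these numbers the tracking and effort terms are both 1.25, so the weighted effort term dominates the cost. A larger $ρ$ therefore pushes an optimizer toward smaller control increments, which is consistent with its description as a control action attenuation factor.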

Table 6. Parameters of the model-based predictive controller for the Yeast Fermentation Reactor Temperature control.

$N_{1}$	$N_{2}$	$N_{u}$	$ρ$
1	5	2	0.1

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Antão, R.; Antunes, J.; Mota, A.; Escadas Martins, R. Model Predictive Control of Non-Linear Systems Using Tensor Flow-Based Models. Appl. Sci. 2020, 10, 3958. https://doi.org/10.3390/app10113958

