Article

A Study of Explorative Moves during Modifier Adaptation with Quadratic Approximation

Biochemical and Chemical Engineering Department, TU Dortmund, Emil-Figge-Str. 70, 44221 Dortmund, Germany
*
Author to whom correspondence should be addressed.
Processes 2016, 4(4), 45; https://doi.org/10.3390/pr4040045
Submission received: 31 October 2016 / Revised: 21 November 2016 / Accepted: 22 November 2016 / Published: 26 November 2016
(This article belongs to the Special Issue Real-Time Optimization)

Abstract

Modifier adaptation with quadratic approximation (in short MAWQA) can adapt the operating condition of a process to its economic optimum by combining the use of a theoretical process model with the data that are collected during process operation. The efficiency of the MAWQA algorithm can be attributed to a well-designed mechanism which ensures the improvement of the economic performance by taking necessary explorative moves. This paper gives a detailed study of the mechanism of performing explorative moves during modifier adaptation with quadratic approximation. The necessity of the explorative moves is theoretically analyzed. Simulation results for the optimization of a hydroformylation process are used to illustrate the efficiency of the MAWQA algorithm over the finite-difference-based modifier adaptation algorithm.

1. Introduction

In the process industries, performing model-based optimization to obtain economically optimal operation usually implies the need to handle plant-model mismatch. An optimum that is calculated using a theoretical model seldom represents the plant optimum. As a result, real-time optimization (RTO) is attracting considerable industrial interest. RTO is a model-based upper-level optimization system that is operated iteratively in closed loop and provides set-points to the lower-level regulatory control system in order to maintain the process operation as close as possible to the economic optimum. RTO schemes usually estimate the process states and some model parameters or disturbances from the measured data but employ a fixed process model, which leads to problems if the model does not represent the plant accurately.
Several schemes have been proposed for combining theoretical models with the data collected during process operation, in particular the model adaptation or two-step scheme [1]. It handles plant-model mismatch in a sequential manner via an identification step followed by an optimization step. Measurements are used to estimate the uncertain model parameters, and the updated model is used to compute the decision variables via model-based optimization. The model adaptation approach is expected to work well when the plant-model mismatch is only of parametric nature and the operating conditions provide sufficient excitation for the estimation of the model parameters from the plant outputs. In practice, however, both parametric and structural mismatch are typically present and, furthermore, the excitation provided by the previously visited operating points is often not sufficient to accurately identify the model parameters.
For an RTO scheme to converge to the plant optimum, it is necessary that the gradients of the objective as well as the values and gradients of the constraints of the optimization problem match those of the plant. Schemes that directly adapt the model-based optimization problem by using the update terms (called modifiers) which are computed from the collected data have been proposed [2,3,4]. The modifier-adaptation schemes can handle considerable plant-model mismatch by applying bias- and gradient-corrections to the objective and to the constraint functions. One of the major challenges in practice, as shown in [5], is the estimation of the plant gradients with respect to the decision variables from noisy measurement data.
Gao et al. [6] combined the idea of modifier adaptation with the quadratic approximation approach that is used in derivative-free optimization and proposed the modifier adaptation with quadratic approximation (in short MAWQA) algorithm. Quadratic approximations of the objective function and of the constraint functions are constructed based on screened data which are collected during process operation. The plant gradients are computed from the quadratic approximations and are used to adapt the objective and the constraint functions of the model-based optimization problem. Simulation studies for the optimization of a reactor benchmark problem with noisy data showed that, by performing some explorative moves, the true optimum can be reliably obtained. However, neither the generation of the explorative moves nor their necessity for the convergence of the set-points to the optimum was theoretically studied. Because the estimation of the gradients using the quadratic approximation approach requires more data than a finite-difference approach, the efficiency of the MAWQA algorithm, in terms of the number of plant evaluations needed to reach the optimum, has been questioned, in particular for the case of several decision variables. In addition, in practice, it is crucial for plant operators to be confident of the necessity of taking explorative moves which may lead to a temporary deterioration of plant performance.
This paper reports a detailed study of the explorative moves during modifier adaptation with quadratic approximation. It first describes how the explorative moves are generated and then presents the factors that influence the generation of these moves. The causality between the factors and the explorative moves is depicted in Figure 1, where the blocks with a yellow background represent the factors. The use of a screening algorithm to optimize the regression set for the quadratic approximations is shown to ensure that an explorative move is only performed when the past collected data cannot provide accurate gradient estimates. Simulation results for the optimization of a hydroformylation process with four optimization variables are used to illustrate the efficiency of the MAWQA algorithm, which takes necessary explorative moves, over the finite-difference-based modifier adaptation algorithm.

2. Modifier Adaptation with Quadratic Approximation

Let $J_m(u)$ and $C_m(u)$ represent the objective and the vector of constraint functions of a static model-based optimization problem, assumed to be twice differentiable with respect to the vector of decision variables $u \in \mathbb{R}^{n_u}$:
$$\min_{u} \; J_m(u) \quad \mathrm{s.t.} \quad C_m(u) \le 0. \tag{1}$$
At each iteration of the modifier adaptation algorithm, bias- and gradient-corrections of the optimization problem are applied as
$$\begin{aligned} \min_{u} \;\; & J_m(u) + \left( \nabla J_p^{(k)} - \nabla J_m^{(k)} \right)^T \left( u - u^{(k)} \right) \\ \mathrm{s.t.} \;\; & C_m(u) + \left( C_p^{(k)} - C_m^{(k)} \right) + \left( \nabla C_p^{(k)} - \nabla C_m^{(k)} \right)^T \left( u - u^{(k)} \right) \le 0. \end{aligned} \tag{2}$$
The symbols are explained in Table 1. $\nabla J_p^{(k)}$ and $\nabla C_p^{(k)}$ are usually approximated by the finite-difference approach
$$\nabla J_p^{(k)} \approx \begin{bmatrix} u_1^{(k)} - u_1^{(k-1)} & \cdots & u_{n_u}^{(k)} - u_{n_u}^{(k-1)} \\ \vdots & & \vdots \\ u_1^{(k)} - u_1^{(k-n_u)} & \cdots & u_{n_u}^{(k)} - u_{n_u}^{(k-n_u)} \end{bmatrix}^{-1} \begin{bmatrix} J_p^{(k)} - J_p^{(k-1)} \\ \vdots \\ J_p^{(k)} - J_p^{(k-n_u)} \end{bmatrix}, \tag{3}$$
where $n_u$ is the number of dimensions of $u$, $J_p^{(k-i)}$, $i = 0, \ldots, n_u$, are the plant objective values at the set-points $u^{(k-i)}$, $i = 0, \ldots, n_u$, and $\nabla C_p^{(k)}$ is approximated similarly. The accuracy of the finite-difference approximations is influenced by both the step-sizes between the set-points and the presence of measurement noise. In order to acquire accurate gradient estimates, small step-sizes are preferred. However, the use of small step-sizes leads to a high sensitivity of the gradient estimates to measurement noise.
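To make the construction in (3) concrete, the following short sketch (Python with NumPy; the function and variable names are illustrative and not taken from the original implementation) assembles the difference matrix from the last $n_u + 1$ set-points and solves the resulting linear system for the gradient estimate.

```python
import numpy as np

def finite_difference_gradient(u_hist, J_hist):
    """Estimate the plant gradient at the newest set-point u_hist[-1]
    from the last n_u + 1 set-points and objective values, cf. Eq. (3).

    u_hist : array of shape (n_u + 1, n_u), oldest point first
    J_hist : array of shape (n_u + 1,), plant objective at those points
    """
    u_hist = np.asarray(u_hist, dtype=float)
    J_hist = np.asarray(J_hist, dtype=float)
    # Differences between the current point u^(k) and the n_u previous points
    dU = u_hist[-1] - u_hist[:-1][::-1]      # rows: u^(k) - u^(k-1), ..., u^(k) - u^(k-n_u)
    dJ = J_hist[-1] - J_hist[:-1][::-1]      # corresponding objective differences
    return np.linalg.solve(dU, dJ)           # gradient estimate of J_p at u^(k)

# Hypothetical usage with a quadratic test function of two inputs
if __name__ == "__main__":
    f = lambda u: (u[0] - 1.0) ** 2 + 2.0 * (u[1] + 0.5) ** 2
    pts = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1]])
    vals = np.array([f(p) for p in pts])
    print(finite_difference_gradient(pts, vals))
```

With the step-size of 0.1 used in this toy example, the estimate deviates visibly from the exact gradient, which illustrates the trade-off between step-size and noise sensitivity discussed above.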
In the MAWQA algorithm, the gradients are computed analytically from quadratic approximations of the objective function and of the constraint functions that are regressed on a screened set (represented by $U^{(k)}$ at the $k$th iteration) of all the collected data (represented by $U$). The screened set consists of near and distant points: $U^{(k)} = U_n \cup U_d$, where $U_n = \{ u : \| u - u^{(k)} \| < \Delta u,\; u \in U \}$, and $U_d$ is determined by
$$\begin{aligned} \min_{U_d} \;\; & \frac{\sum_{u \in U_d} \| u - u^{(k)} \|}{\theta(U_d)} \\ \mathrm{s.t.} \;\; & \mathrm{size}(U_d) = C_{n_u+2}^2 - 1, \\ & U_d \subseteq U \setminus U_n, \end{aligned} \tag{4}$$
where $\Delta u$ is sufficiently large so that $U_d$ guarantees robust quadratic approximations with noisy data, $\theta(U_d)$ is the minimal angle between all possible vectors defined by $u - u^{(k)}$, and $C_{n_u+2}^2 = (n_u+2)(n_u+1)/2$ is the number of data points required to uniquely determine the quadratic approximations.
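A brute-force sketch of the screening step (4) is given below (Python/NumPy; the helper names are ours, not from the original implementation). It enumerates candidate distant sets, scores each by the summed distance to $u^{(k)}$ divided by the minimal pairwise angle of the vectors $u - u^{(k)}$, and keeps the best one; a practical implementation would avoid the full enumeration, which grows combinatorially with the number of collected points.

```python
import numpy as np
from itertools import combinations

def min_pairwise_angle(vectors):
    """Smallest angle (radians) between any two vectors in the list."""
    angles = []
    for v1, v2 in combinations(vectors, 2):
        c = np.dot(v1, v2) / (np.linalg.norm(v1) * np.linalg.norm(v2))
        angles.append(np.arccos(np.clip(c, -1.0, 1.0)))
    return min(angles) if angles else np.pi

def screen_regression_set(U_all, u_k, delta_u, n_u):
    """Select the regression set U^(k) = U_n ∪ U_d according to Eq. (4)
    by brute-force enumeration (illustrative only)."""
    U_all = np.asarray(U_all, dtype=float)
    dist = np.linalg.norm(U_all - u_k, axis=1)
    U_near = U_all[dist < delta_u]                 # near set U_n
    candidates = U_all[dist >= delta_u]            # points eligible for U_d
    n_d = (n_u + 2) * (n_u + 1) // 2 - 1           # required size of U_d
    best_score, best_set = np.inf, None
    for idx in combinations(range(len(candidates)), n_d):
        subset = candidates[list(idx)]
        vecs = subset - u_k
        theta = min_pairwise_angle(vecs)
        if theta <= 0.0:
            continue                               # collinear points are useless
        score = np.sum(np.linalg.norm(vecs, axis=1)) / theta
        if score < best_score:
            best_score, best_set = score, subset
    return np.vstack([U_near, best_set]) if best_set is not None else U_near
```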
In the MAWQA algorithm, the regression set $U^{(k)}$ is also used to define a constrained search space $B^{(k)}$ for the next set-point move
$$B^{(k)}: \; \left( u - u^{(k)} \right)^T M^{-1} \left( u - u^{(k)} \right) \le \gamma^2, \tag{5}$$
where $M = \mathrm{cov}(U^{(k)})$ is the covariance matrix of the selected points (inputs) and $\gamma$ is a scaling parameter. $B^{(k)}$ is an $n_u$-axial ellipsoid centered at $u^{(k)}$. The axes of the ellipsoid are thus aligned with the eigenvectors of the covariance matrix, and the semi-axis lengths of the ellipsoid are related to the eigenvalues of the covariance matrix by the scaling parameter $\gamma$. The adapted optimization (2) is augmented by the search space constraint as
$$\begin{aligned} \min_{u} \;\; & J_{ad}^{(k)}(u) \\ \mathrm{s.t.} \;\; & C_{ad}^{(k)}(u) \le 0, \\ & u \in B^{(k)}, \end{aligned} \tag{6}$$
where $J_{ad}^{(k)}(u)$ and $C_{ad}^{(k)}(u)$ represent the adapted objective and constraint functions in (2).
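The search-space constraint (5) only requires the covariance matrix of the regression set. A minimal sketch (Python/NumPy; the names are illustrative) that builds $M$ and checks whether a candidate set-point lies inside $B^{(k)}$ could look as follows.

```python
import numpy as np

def in_search_space(u, u_k, U_reg, gamma):
    """Check the ellipsoidal constraint (u - u^(k))^T M^{-1} (u - u^(k)) <= gamma^2
    of Eq. (5), with M the covariance matrix of the regression set.
    Assumes that the regression set is rich enough for M to be nonsingular."""
    M = np.cov(np.asarray(U_reg, dtype=float), rowvar=False)
    d = np.asarray(u, dtype=float) - np.asarray(u_k, dtype=float)
    return float(d @ np.linalg.solve(M, d)) <= gamma ** 2
```

In a nonlinear programming solver such as `scipy.optimize.minimize`, the same quadratic form would simply be supplied as an additional inequality constraint of (6).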
In the application of modifier adaptation with quadratic approximation, it can happen that the nominal model is inadequate for the modifier-adaptation approach and that it is better to use only the quadratic approximations to compute the next plant move. In order to ensure convergence, it is necessary to monitor the performance of the adapted optimization and possibly to switch between model-based and data-based optimizations. In each iteration of the MAWQA algorithm, a quality index of the adapted optimization, $\rho_m^{(k)}$, is calculated and compared with the quality index of the quadratic approximations, $\rho_\phi^{(k)}$, where
$$\rho_m^{(k)} = \max\left\{ \left| 1 - \frac{J_{ad}^{(k)} - J_{ad}^{(k-1)}}{J_p^{(k)} - J_p^{(k-1)}} \right|,\; \left| 1 - \frac{C_{ad,1}^{(k)} - C_{ad,1}^{(k-1)}}{C_{p,1}^{(k)} - C_{p,1}^{(k-1)}} \right|,\; \ldots,\; \left| 1 - \frac{C_{ad,n_c}^{(k)} - C_{ad,n_c}^{(k-1)}}{C_{p,n_c}^{(k)} - C_{p,n_c}^{(k-1)}} \right| \right\} \tag{7}$$
and
$$\rho_\phi^{(k)} = \max\left\{ \left| 1 - \frac{J_{\phi}^{(k)} - J_{\phi}^{(k-1)}}{J_p^{(k)} - J_p^{(k-1)}} \right|,\; \left| 1 - \frac{C_{\phi,1}^{(k)} - C_{\phi,1}^{(k-1)}}{C_{p,1}^{(k)} - C_{p,1}^{(k-1)}} \right|,\; \ldots,\; \left| 1 - \frac{C_{\phi,n_c}^{(k)} - C_{\phi,n_c}^{(k-1)}}{C_{p,n_c}^{(k)} - C_{p,n_c}^{(k-1)}} \right| \right\} \tag{8}$$
where $J_\phi^{(k)}$ and $C_\phi^{(k)}$ are the quadratic approximations of the objective and the constraint functions. If $\rho_m^{(k)} \le \rho_\phi^{(k)}$, the predictions of the adapted model-based optimization are more accurate than those of the quadratic approximations and (6) is solved to determine the next set-point. Otherwise, an optimization based on the quadratic approximations is performed:
$$\begin{aligned} \min_{u} \;\; & J_\phi^{(k)}(u) \\ \mathrm{s.t.} \;\; & C_\phi^{(k)}(u) \le 0, \\ & u \in B^{(k)}. \end{aligned} \tag{9}$$
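Both (6) and (9) are solved over the same ellipsoidal search space; the choice between them follows from the comparison of the quality indices (7) and (8). A compact sketch of this comparison (Python; all function and variable names are ours, not from the original implementation) is given below.

```python
def quality_index(pred_new, pred_old, plant_new, plant_old):
    """Quality index of Eq. (7)/(8): worst relative prediction error of the
    change in the objective and in each constraint between two iterations.

    pred_*  : lists [J, C_1, ..., C_nc] predicted by the adapted model (rho_m)
              or by the quadratic approximations (rho_phi)
    plant_* : corresponding plant values (the observed changes are assumed nonzero)
    """
    errors = []
    for p_new, p_old, y_new, y_old in zip(pred_new, pred_old, plant_new, plant_old):
        errors.append(abs(1.0 - (p_new - p_old) / (y_new - y_old)))
    return max(errors)

def choose_optimization(rho_m, rho_phi):
    """Return which problem to solve next: (6) if the adapted model predicts the
    observed changes at least as well as the quadratic surrogate, else (9)."""
    return "adapted-model (6)" if rho_m <= rho_phi else "data-based (9)"
```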
The MAWQA algorithm is given as follows:
Step 1.
Choose an initial set-point $u^{(0)}$ and probe the plant at $u^{(0)}$ and $u^{(0)} + h e_i$, where $h$ is a suitable step size and $e_i \in \mathbb{R}^{n_u}$ ($i = 1, \ldots, n_u$) are mutually orthogonal unit vectors. Use the finite-difference approach to calculate the gradients at $u^{(0)}$ and run the IGMO approach [3] until $k \ge C_{n_u+2}^2$ set-points have been generated. Run the screening algorithm to define the regression set $U^{(k)}$. Initialize $\rho_m^{(k)} = 0$ and $\rho_\phi^{(k)} = 0$.
Step 2.
Calculate the quadratic functions $J_\phi^{(k)}$ and $C_\phi^{(k)}$ based on $U^{(k)}$. Determine the search space $B^{(k)}$ by (5).
Step 3.
Compute the gradients from the quadratic functions. Adapt the model-based optimization problem and determine the optimal set-point $\hat{u}^{(k)}$ as follows:
(a)
If $\rho_m^{(k)} \le \rho_\phi^{(k)}$, run the adapted model-based optimization (6).
(b)
Otherwise, perform the data-based optimization (9).
Step 4.
If $\| \hat{u}^{(k)} - u^{(k)} \| < \Delta u$ and there exists a point $u^{(j)} \in U^{(k)}$ such that $\| u^{(j)} - u^{(k)} \| > 2 \Delta u$, set $\hat{u}^{(k)} = \left( u^{(j)} + u^{(k)} \right) / 2$.
Step 5.
Evaluate the plant at $\hat{u}^{(k)}$ to acquire $J_p(\hat{u}^{(k)})$ and $C_p(\hat{u}^{(k)})$. Prepare the next step as follows:
(a)
If $\hat{J}_p^{(k)} < J_p^{(k)}$, where $\hat{J}_p^{(k)} = J_p(\hat{u}^{(k)})$, this is a performance-improvement move. Define $u^{(k+1)} = \hat{u}^{(k)}$ and run the screening algorithm to define the next regression set $U^{(k+1)}$. Update the quality indices $\rho_m^{(k+1)}$ and $\rho_\phi^{(k+1)}$. Increase $k$ by one and go to Step 2.
(b)
If $\hat{J}_p^{(k)} \ge J_p^{(k)}$, this is an explorative move. Run the screening algorithm to update the regression set for $u^{(k)}$. Go to Step 2.
Note that the index of iteration of the MAWQA algorithm is increased by one only when a performance-improvement move is performed. Several explorative moves may be required at each iteration. The number of plant evaluations is the sum of the numbers of both kinds of moves. The next section studies why the explorative moves are required and how they contribute to the improvement of the performance on a longer horizon.
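The control flow of Steps 2–5 can be summarized in the following schematic sketch (Python; all helper callables such as `screen`, `fit_quadratics`, and `solve_adapted` are placeholders for the operations described above and are not part of the original publication). Only the distinction between performance-improvement and explorative moves is reproduced; the iteration counter $k$ is advanced only on improvement, as noted above.

```python
def mawqa_loop(plant_eval, screen, fit_quadratics, solve_adapted, solve_data_based,
               quality_indices, U0, J0, delta_u, max_evals=50):
    """Schematic sketch of the MAWQA iteration (Steps 2-5).  All callables are
    placeholders for the operations described in Section 2; only the control
    flow (performance-improvement vs. explorative moves) is reproduced here."""
    U, J = list(U0), list(J0)            # all collected data (set-points, costs)
    u_k, J_k = U[-1], J[-1]              # current set-point and its plant cost
    rho_m = rho_phi = 0.0                # quality indices, cf. Eqs. (7) and (8)
    k, n_evals = 0, 0
    while n_evals < max_evals:
        U_reg = screen(U, u_k, delta_u)                   # Step 2: regression set U^(k)
        J_phi, C_phi, B_k = fit_quadratics(U_reg, u_k)    # surrogates and search space B^(k)
        if rho_m <= rho_phi:                              # Step 3(a): adapted model-based problem (6)
            u_hat = solve_adapted(J_phi, C_phi, B_k, u_k)
        else:                                             # Step 3(b): data-based problem (9)
            u_hat = solve_data_based(J_phi, C_phi, B_k, u_k)
        J_hat = plant_eval(u_hat)                         # Step 5: plant evaluation
        n_evals += 1
        U.append(u_hat)
        J.append(J_hat)
        if J_hat < J_k:                                   # Step 5(a): performance-improvement move
            u_k, J_k = u_hat, J_hat
            rho_m, rho_phi = quality_indices(U, J, u_k)
            k += 1
        # Step 5(b): otherwise an explorative move -- keep u_k and refit with
        # the enlarged data set (Step 4 is omitted in this sketch).
    return u_k, J_k, k
```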

3. Analysis of the Explorative Moves

In the MAWQA algorithm, the quadratic approximations of the objective and the constraint functions are started once $C_{n_u+2}^2$ data points have been collected. It can happen that the distribution of the set-points is not "well-poised" [7] to ensure that the gradients are accurately estimated via the quadratic approximations, especially when the initial set-point is far away from the optimum and the following set-point moves are all along some search direction. Interpolation-based derivative-free optimization algorithms rely on a model-improvement step that generates additional set-point moves to ensure the well-poisedness of the interpolation set. Although the MAWQA algorithm was designed without an explicit model-improvement step, the generation of explorative moves can be considered as an implicit step to improve the poisedness of the regression set for the quadratic approximations. This section gives a theoretical analysis of the explorative moves. We start with some observations from the simulation results in [6] and relate the explorative moves to the estimation error of the gradients. The factors that influence the accuracy of the estimated gradients are analyzed. It is shown that the screening of the regression set leads to very pertinent explorative moves which, on the one hand, are sufficient to improve the accuracy of the gradient estimations, and, on the other hand, are less expensive than the model-improvement step in derivative-free optimization algorithms.
The generation of the explorative moves is presented in Figure 2, where one MAWQA iteration for the optimization of the steady-state profit of the Williams-Otto reactor with respect to the flow rate and the reaction temperature [6] is illustrated. Here the blue surface represents the real profit mapping, and the mesh represents the quadratic approximation which was computed based on the regression set ( Processes 04 00045 i001: set-point moves, : measured profit values). The bottom part shows the contours of the profit as predicted by the uncorrected model (blue lines), the constrained search space (dash-dot line), and the contours of the modifier-adapted profit (inside, magenta lines). Comparing the surface plot and the mesh plot, we can see that the gradient along the direction of the last set-point move is estimated well. However, a large error can be observed in the perpendicular direction. The gradient error propagates to the modifier-adapted contours and, therefore, the next set-point move ( Processes 04 00045 i003) points in the direction in which the gradient is badly estimated. Even though the move may not improve the objective function, the data collected in that direction can later help to improve the gradient estimation.
The example illustrates how the gradient error in a specific direction may lead to an explorative move along the same direction. In the MAWQA algorithm, the gradients are determined by evaluating the gradients of $J_\phi$ and $C_\phi$ at $u^{(k)}$. In order to be able to quantify the gradient error, we assume that the screened set $U^{(k)}$ is of size $C_{n_u+2}^2$ and the quadratic approximations are interpolated based on $U^{(k)}$. In the application of the MAWQA algorithm, this assumption is valid when the current set-point is far away from the plant optimum. Recall the screening algorithm: $U^{(k)}$ consists of the near set $U_n$ and the distant set $U_d$. From (4), we can conclude that the distant set $U_d$ is always of size $C_{n_u+2}^2 - 1$. Step 4 of the MAWQA algorithm ensures that the near set $U_n$ only consists of $u^{(k)}$ until the optimized next move is such that $\| \hat{u}^{(k)} - u^{(k)} \| < \Delta u$ and there are no points $u^{(j)} \in U^{(k)}$ such that $\| u^{(j)} - u^{(k)} \| > 2 \Delta u$, that is, all the points in $U_d$ keep suitable distances from $u^{(k)}$ for good local approximations. These two conditions imply that $\| u^{(k)} - u^* \| \le \Delta u$, where $u^*$ represents the plant optimum. As a result of Step 4, when $\| u^{(k)} - u^* \| > \Delta u$, $U_n$ is always of size 1. For simplicity, a shift of coordinates that moves $u^{(k)}$ to the origin is performed and the points in $U^{(k)}$ are reordered as $\{ 0, u^{(d_1)}, \ldots, u^{(d_n)} \}$, where $n = C_{n_u+2}^2 - 1$.
Let $\phi = \{ 1, u_1, \ldots, u_{n_u}, u_1^2, \ldots, u_{n_u}^2, 2u_1u_2, 2u_1u_3, \ldots, 2u_{n_u-1}u_{n_u} \}$ represent a natural basis of the quadratic approximation. Let $\underline{\alpha}$ represent a column vector of the coefficients of the quadratic approximation. The quadratic approximation of the objective function is formulated as
$$J_\phi(u) = \alpha_0 + \sum_{i=1}^{n_u} \alpha_i u_i + \sum_{i=1}^{n_u} \alpha_{n_u+i} u_i^2 + 2 \sum_{i=1}^{n_u-1} \sum_{j=i+1}^{n_u} \alpha_{2n_u + I(n_u,i,j)} u_i u_j, \tag{10}$$
where $I(n_u,i,j) = n_u(i-1) - (i+1)i/2 + j$. The coefficients $\alpha_i$, $i = 0, \ldots, n$, are calculated via the interpolation of the $n+1$ data sets $\{ u, J_p(u) \}$:
$$\underbrace{\begin{bmatrix} 1 & 0 & \cdots & 0 \\ 1 & \varphi_1(u^{(d_1)}) & \cdots & \varphi_n(u^{(d_1)}) \\ \vdots & \vdots & & \vdots \\ 1 & \varphi_1(u^{(d_n)}) & \cdots & \varphi_n(u^{(d_n)}) \end{bmatrix}}_{\underline{M}(\phi, U^{(k)})} \underbrace{\begin{bmatrix} \alpha_0 \\ \alpha_1 \\ \vdots \\ \alpha_n \end{bmatrix}}_{\underline{\alpha}} = \underbrace{\begin{bmatrix} J_p(0) \\ J_p(u^{(d_1)}) \\ \vdots \\ J_p(u^{(d_n)}) \end{bmatrix}}_{\underline{J}_p(U^{(k)})} + \underbrace{\begin{bmatrix} \nu_0 \\ \nu_1 \\ \vdots \\ \nu_n \end{bmatrix}}_{\underline{\nu}}, \tag{11}$$
where $\varphi_i(u)$, $i = 1, \ldots, n$, represent the polynomial bases in $\phi$, and $J_p$ and $\nu$ represent the noise-free objective and the measurement noise, respectively. Assuming $\underline{M}(\phi, U^{(k)})$ is nonsingular,
$$\underline{\alpha} = \underline{M}(\phi, U^{(k)})^{-1} \underline{J}_p(U^{(k)}) + \underline{M}(\phi, U^{(k)})^{-1} \underline{\nu}. \tag{12}$$
The computed gradient vector at the origin via the quadratic approximation is
$$\nabla J_\phi(0) = (\alpha_1, \ldots, \alpha_{n_u})^T = \underline{\alpha}_{J_p} + \underline{\alpha}_{\nu}, \tag{13}$$
where $\underline{\alpha}_{J_p}$ represents the noise-free estimate and $\underline{\alpha}_{\nu}$ represents the influence of the measurement noise. From [8], a bound on the error between the noise-free estimate $\underline{\alpha}_{J_p}$ and $\nabla J_p(0)$ can be obtained and simplified to
$$\left\| \underline{\alpha}_{J_p} - \nabla J_p(0) \right\| \le \frac{1}{6} \, G \, \Lambda \sum_{i=1}^{n} \left\| u^{(d_i)} \right\|^3, \tag{14}$$
where $G$ is an upper bound on the third derivative of $J_p(u)$, and $\Lambda$ is a constant that depends on the distribution of the regression set $U^{(k)}$. Note that the bound in (14) is defined for the error between the plant gradients and the estimated gradients. It is different from the lower and upper bounds on the gradient estimates which were studied by Bunin et al. [5]. To simplify the study of $\Lambda$, assume $\| u^{(d_i)} - u^{(k)} \| = \Delta u$. $\Lambda$ is defined as
$$\Lambda \ge \max_{0 \le i \le n} \; \max_{\| u \| \le \Delta u} \left| \ell_i(u) \right|, \tag{15}$$
where $\ell_i(u)$, $i = 0, \ldots, n$, are the Lagrange polynomial functions defined by the matrix determinants
$$\ell_i(u) = \frac{\det \underline{M}(\phi, U_i(u))}{\det \underline{M}(\phi, U^{(k)})} \tag{16}$$
with the set $U_i(u) = U^{(k)} \setminus \{ u^{(d_i)} \} \cup \{ u \}$. The determinant of $\underline{M}(\phi, U^{(k)})$ is computed as
$$\det \underline{M}(\phi, U^{(k)}) = \begin{vmatrix} \varphi_1(u^{(d_1)}) & \cdots & \varphi_n(u^{(d_1)}) \\ \vdots & & \vdots \\ \varphi_1(u^{(d_n)}) & \cdots & \varphi_n(u^{(d_n)}) \end{vmatrix}. \tag{17}$$
Let $\mathrm{vol}(\phi(U^{(k)}))$ represent the volume of the $n$-dimensional convex hull spanned by the row vectors of the matrix in (17); we have
$$\left| \det \underline{M}(\phi, U^{(k)}) \right| = \mathrm{vol}(\phi(U^{(k)})) \cdot n!. \tag{18}$$
Except for the vertex at the origin, all the other vertices of the convex hull lie on an $n$-dimensional sphere with radius $\sqrt{\Delta u^2 + \Delta u^4}$. $\mathrm{vol}(\phi(U^{(k)}))$ reaches its maximal value when the vectors are orthogonal to each other. Let $\underline{v}_i$ and $\underline{v}_j$ represent any two row vectors of the matrix in (17). The angle between them satisfies
$$\cos(\theta_{i,j}^v) = \frac{\underline{v}_i \cdot \underline{v}_j}{\| \underline{v}_i \| \, \| \underline{v}_j \|} = \frac{u_1^{(d_i)} u_1^{(d_j)} + \cdots + u_{n_u}^{(d_i)} u_{n_u}^{(d_j)} + \left( u_1^{(d_i)} u_1^{(d_j)} + \cdots + u_{n_u}^{(d_i)} u_{n_u}^{(d_j)} \right)^2}{\Delta u^2 + \Delta u^4}. \tag{19}$$
Note that the angle $\theta_{i,j}^v$ is different from the angle $\theta_{i,j}$ between the vectors $u^{(d_i)}$ and $u^{(d_j)}$:
$$\cos(\theta_{i,j}) = \frac{u_1^{(d_i)} u_1^{(d_j)} + \cdots + u_{n_u}^{(d_i)} u_{n_u}^{(d_j)}}{\Delta u^2}. \tag{20}$$
The relationship between $\theta_{i,j}^v$ and $\theta_{i,j}$ is illustrated in Figure 3, where the angle between two 3-dimensional unit vectors is varied from 0 to 180 degrees: the angle between the corresponding quadratic interpolation vectors increases roughly proportionally for $\theta \le 90$ degrees and stays within the interval [90, 98] degrees as $\theta$ increases from 90 to 180 degrees. Recalling the screening for the distant set $U_d$ via (4), the minimal angle in the denominator of the objective function ensures that the best set in terms of the orthogonality of the matrix in (17) is chosen from all the collected data. As a result of (14)–(16) and (18), the lowest bound of the error between $\underline{\alpha}_{J_p}$ and $\nabla J_p(0)$ that is achievable with the collected data is obtained.
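The mapping from the angle $\theta$ between two set-point vectors to the angle $\theta^v$ between the corresponding rows of the interpolation matrix (Figure 3) can be reproduced numerically with a few lines (Python/NumPy; the basis construction follows the definition of $\phi$ above, the rest is our illustration).

```python
import numpy as np

def quad_basis(u):
    """Row of the interpolation matrix (without the leading 1):
    phi(u) = (u_1, ..., u_n, u_1^2, ..., u_n^2, 2 u_i u_j for i < j)."""
    u = np.asarray(u, dtype=float)
    n = len(u)
    cross = [2.0 * u[i] * u[j] for i in range(n - 1) for j in range(i + 1, n)]
    return np.concatenate([u, u ** 2, cross])

def angle_deg(v1, v2):
    c = np.dot(v1, v2) / (np.linalg.norm(v1) * np.linalg.norm(v2))
    return np.degrees(np.arccos(np.clip(c, -1.0, 1.0)))

# Two 3-dimensional unit vectors with a prescribed angle theta between them
for theta in (30.0, 60.0, 90.0, 120.0, 180.0):
    t = np.radians(theta)
    u1 = np.array([1.0, 0.0, 0.0])
    u2 = np.array([np.cos(t), np.sin(t), 0.0])
    print(theta, angle_deg(quad_basis(u1), quad_basis(u2)))
```

Running this sketch shows the behavior described above: the angle between the quadratic interpolation vectors grows monotonically with $\theta$ up to 90 degrees and then remains close to 90 degrees.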
The error due to the measurement noise, $\underline{\alpha}_{\nu}$, can be calculated by Cramer's rule:
$$\alpha_i^{\nu} = \frac{\det \underline{M}_i(\phi, U^{(k)}, \underline{\nu})}{\det \underline{M}(\phi, U^{(k)})} = \frac{\delta_{noise}}{\Delta u} \, \frac{\det \underline{M}_i(\phi, U^{(k)}, \tilde{\underline{\nu}})}{\det \underline{M}(\phi, U^{(k)})}, \tag{21}$$
where $\underline{M}_i(\phi, U^{(k)}, \underline{\nu})$ is the matrix formed by replacing the $(i+1)$th column of $\underline{M}(\phi, U^{(k)})$ by the column vector $\underline{\nu}$, $\delta_{noise}$ is the level of the measurement noise, $\nu_i \in (-\delta_{noise}, +\delta_{noise})$, and $\tilde{\underline{\nu}} = \Delta u \, \underline{\nu} / \delta_{noise}$ represents the scaled vector of noises. In order to reduce $\underline{\alpha}_{\nu}$, the value of $\Delta u$ should be large enough. However, from (14) the error bound then increases accordingly. The optimal tuning of $\Delta u$ according to $\delta_{noise}$ and $G$ can be a future research direction. For a given distribution of $U_d$, the relative upper bound of $\alpha_i^{\nu}$ is related to the variance of the elements in the $(i+1)$th column. To show this, we start from the angle between the $(i+1)$th column vector and the first column vector:
$$\cos(\beta_{i+1,1}) = \frac{\sum_{j=1}^{n} u_i^{(d_j)}}{\sqrt{n+1} \, \sqrt{\sum_{j=1}^{n} \left( u_i^{(d_j)} \right)^2}}, \tag{22}$$
where $\beta$ is used to distinguish this angle from the angles formed by the row vectors. The variance of the elements in the $(i+1)$th column is
$$\mathrm{Var}\left( \{ 0, u_i^{(d_1)}, \ldots, u_i^{(d_n)} \} \right) = \frac{\sum_{j=1}^{n} \left( u_i^{(d_j)} \right)^2}{n+1} - \left( \frac{\sum_{j=1}^{n} u_i^{(d_j)}}{n+1} \right)^2. \tag{23}$$
From (22) and (23), we obtain that
$$\cos(\beta_{i+1,1}) = \frac{\sum_{j=1}^{n} u_i^{(d_j)}}{(n+1) \sqrt{ \mathrm{Var} + \left( \dfrac{\sum_{j=1}^{n} u_i^{(d_j)}}{n+1} \right)^2 }}. \tag{24}$$
For the same mean value, the orthogonality of the $(i+1)$th column vector to the first column vector can be quantified by the variance of the elements in the $(i+1)$th column. As discussed before, the absolute value of the determinant of $\underline{M}(\phi, U^{(k)})$ is influenced by this orthogonality. As a result, replacing a column of elements with small variance leads to a higher upper bound of the error than replacing a column of elements with large variance. Note that this is consistent with the constrained search space, which is defined by the covariance matrix of the regression set.
From (14) and (21), three factors determine the gradient estimation errors
  • Distribution of the regression set (quantified by $\Lambda$ and the distances to the current point)
  • Non-quadratic nature of the plant (quantified by $G$, the upper bound on the third derivative)
  • Measurement noise (quantified by $\delta_{noise}$).
Figure 4 depicts how the three factors influence the generation of the explorative moves. Assuming that the plant functions are approximately quadratic in the region $\| u - u^{(k)} \| \le \Delta u$ around the current set-point and that the value of $\Delta u$ is large enough, a well-distributed regression set normally leads to a performance-improvement move. If the regression set is not well-distributed but the plant functions are approximately quadratic, a performance-improvement move is still possible if the level of the measurement noise $\delta_{noise}$ is low. This is illustrated in Figure 5, where noise-free data was considered and the MAWQA algorithm did not perform any explorative moves. Explorative moves are only generated when either the plant functions deviate significantly from quadratic functions or the level of the measurement noise $\delta_{noise}$ is large. In the case shown in Figure 2, the mapping of the plant profit is approximately quadratic but considerable measurement noise was present. As a result, the unfavourable distribution of the regression set leads to an explorative move.
Gao et al. [6] proved that the explorative moves ensure an improvement of the accuracy of the gradient estimation in the following iterations. As an example, Figure 6 shows the MAWQA iteration after two explorative moves for the optimization of the Williams-Otto reactor with noisy data [6]. The mesh of the quadratic approximation represents the real mapping (the blue surface) well around the current point, which lies at the center of the constrained search space (defined by the dash-dot line). The computed gradients based on this quadratic approximation are more accurate than those based on the quadratic approximation in Figure 2. As a result, a performance-improvement move (represented by Processes 04 00045 i003) was obtained.

4. Simulation Studies

The optimization of a hydroformylation process with four optimization variables is used to illustrate the efficiency of the MAWQA algorithm over the finite-difference-based modifier adaptation algorithm. The continuous hydroformylation of 1-dodecene in a thermomorphic multicomponent solvent (in short TMS) system is considered. This process was developed in the context of the collaborative research centre InPROMPT at the universities of Berlin, Dortmund and Magdeburg and was demonstrated on miniplant scale at TU Dortmund [9]. Figure 7 illustrates the simplified flow diagram of the TMS system together with the reaction network. The TMS system consists of two main sections: the reaction part and the separation part (here a decanter). The feed consists of the substrate 1-dodecene, the apolar solvent n-decane, the polar solvent dimethylformamide (in short DMF), and synthesis gas (CO/H$_2$). The catalyst system consists of Rh(acac)(CO)$_2$ and the bidentate phosphite biphephos as ligand. During the reaction step, the system is single-phase, thus homogeneous, so that no mass transport limitation occurs. During the separation step, a lower temperature than that of the reactor is used; the system then exhibits a miscibility gap and separates into two liquid phases, a polar and an apolar phase. The apolar phase contains the organic product, which is purified in a downstream process, while the catalyst mostly remains in the polar phase, which is recycled. The main reaction is the catalyzed hydroformylation of the long-chain 1-dodecene to linear n-tridecanal. Besides the main reaction, isomerization to iso-dodecene, hydrogenation to n-dodecane and formation of the branched iso-aldehyde take place. Hernández and Engell [10] adapted the model developed by Hentschel et al. [11] to the TMS miniplant by considering the material balances of the above components as follows:
$$V_R \frac{dC_i}{dt} = \dot{V}_{in} C_{i,in} - \dot{V}_{out} C_{i,out} + V_R \, C_{cat} M_{cat} \sum_{l=1}^{n_{react}} \nu_{i,l} \, r_l \tag{25}$$
$$V_R \frac{dC_j}{dt} = -k_{eff} \left( C_j - C_j^{eq} \right) + \dot{V}_{in} C_{j,in} - \dot{V}_{out} C_{j,out} + V_R \, C_{cat} M_{cat} \sum_{l=1}^{n_{react}} \nu_{j,l} \, r_l, \tag{26}$$
where (25) applies to the liquid components 1-dodecene, n-tridecanal, iso-dodecene, n-dodecane, iso-aldehyde, decane, and DMF, and (26) applies to the gas components CO and H$_2$. The algebraic equations involved in the model are as follows:
$$C_j^{eq} = \frac{P_j}{H_{j,0} \exp\left( -E_j / (RT) \right)} \tag{27}$$
$$C_{cat} = \frac{C_{Rh,precursor}}{1 + K_{cat,1} C_{CO} + K_{cat,2} \, C_{CO} / C_{H_2}} \tag{28}$$
$$K_i = \exp\left( A_{i,0} + A_{i,1} T_{decanter} + A_{i,2} T_{decanter}^2 \right) \tag{29}$$
$$n_{i,product} = \frac{K_i}{1 + K_i} \, n_{i,decanter} \tag{30}$$
$$n_{i,catalyst} = \frac{1}{1 + K_i} \, n_{i,decanter}. \tag{31}$$
Table 2 lists all the symbols used in the model together with their explanations. The optimization problem is formulated as the minimization of the raw material and operating cost per unit of n-tridecanal produced
$$\min_{u} \; \frac{Pr_{1\text{-}dodecene} \cdot F_{1\text{-}dodecene} + Pr_{Rh} \cdot F_{Rh} + C_{Cooling} + C_{Heating}}{F_{tridecanal}}, \tag{32}$$
where $Pr_{1\text{-}dodecene}$ and $Pr_{Rh}$ represent the prices of 1-dodecene and of the catalyst, $F_{1\text{-}dodecene}$ and $F_{Rh}$ are the corresponding molar flow rates, $C_{Cooling}$ and $C_{Heating}$ are the operating costs of cooling and heating, and $F_{tridecanal}$ is the molar flow rate of n-tridecanal. The vector of optimization variables $u$ consists of the reactor temperature, the catalyst dosage, the pressure, and the composition of the synthesis gas. A sensitivity analysis was performed on the different model parameters and it was found that the gas solubility and the equilibrium constants for the catalyst species have the largest influence on the cost function. In our study of the MAWQA approach, the plant-model mismatch is created by decreasing the Henry coefficients $H_{j,0}$ by 50% and ignoring the influence of CO on the active catalyst ($K_{cat,2} = 0$) in the model that is used by the optimization. Table 3 lists the operating intervals of the optimization variables and compares the real optimum with the model optimum. In order to test the robustness of the approach to noisy data, the real cost data is assumed to be subject to a random error which is normally distributed with standard deviation $\sigma$.
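For illustration, a minimal sketch of how the cost (32) and the additive measurement noise assumed in the study can be evaluated is given below (Python/NumPy; the numerical values of the prices, utility costs, and flow rates are placeholders chosen by us, not data from the paper).

```python
import numpy as np

rng = np.random.default_rng(0)

def cost_per_kmol(flows, prices, utility_costs, sigma=0.0):
    """Raw-material and operating cost per kmol of n-tridecanal, cf. Eq. (32),
    optionally corrupted with normally distributed measurement noise."""
    numerator = (prices["1-dodecene"] * flows["1-dodecene"]
                 + prices["Rh"] * flows["Rh"]
                 + utility_costs["cooling"] + utility_costs["heating"])
    cost = numerator / flows["tridecanal"]
    return cost + rng.normal(0.0, sigma)

# Hypothetical numbers purely for illustration of the call signature
flows = {"1-dodecene": 1.0, "Rh": 1e-4, "tridecanal": 0.8}
prices = {"1-dodecene": 600.0, "Rh": 3.0e6}
utilities = {"cooling": 20.0, "heating": 35.0}
print(cost_per_kmol(flows, prices, utilities, sigma=0.3))
```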
Simulation results of the modifier-adaptation approach using the finite-difference approximation of the gradients are illustrated in Figure 8. The figures on the left show the evolutions of the normalized optimization variables with respect to the index of the RTO iterations. The small pulses which are superimposed on the evolutions represent the additional perturbations required for the finite-difference approximations of the gradients. The star symbols at the right end mark the real optima. The figures on the right show the evolutions of the cost and of the number of plant evaluations with respect to the index of the RTO iteration. The inset figure zooms in on the cost evolution, and the dashed line marks the real optimum of the cost. Three cases with different combinations of the step size of the perturbation and the noise in the data are considered. In the first case, a large step-size, $\Delta h = 0.1$, is used and the data is free of noise ($\sigma = 0.0$). From Figure 8a we can see that three of the four optimization variables are still away from their real optimal values after 16 iterations. The cost evolution in Figure 8b shows an oscillating behavior above the cost optimum. This indicates that the step-size is too large to enable an accurate estimation of the gradients.
In the second case, a reduced step-size ($\Delta h = 0.05$) is tried. From Figure 8c we can see that the optimization variables attain their real optimal values at the 14th iteration. However, numerical errors of the simulation of the plant cause deviations during the following iterations. On the one hand, the use of a small step-size reduces the error of the finite-difference approximation of the gradients. On the other hand, the small step-size leads to a high sensitivity of the gradient approximation to errors. This is illustrated by the third case, in which the data contains a normally distributed error ($\sigma = 0.3$). The optimization variables do not reach the real optima (see Figure 8e,f).
Simulation results of the MAWQA algorithm are illustrated in Figure 9. The parameters of the MAWQA algorithm are listed in Table 4. The figures on the left show the evolutions of the normalized optimization variables with respect to the iterations. The small pulses which are superimposed on the evolutions represent the additional plant evaluations, i.e., initial probes and explorative moves. The real optima are marked by the star symbols. The figures on the right show the evolutions of the cost and of the number of plant evaluations with respect to the iterations. The inset figure zooms in on the cost evolution, and the dashed line marks the real optimum of the cost. In the first 4 iterations, the modifier-adaptation approach using the finite-difference approximation of the gradients is run. Afterwards, enough sampled points (here $(n_u+1)(n_u+2)/2 = 15$) are available for the quadratic approximation and the MAWQA approach is run. In the case of noise-free data, the MAWQA approach takes 8 iterations to approximately reach the real optima (see Figure 9a). The total number of plant evaluations is 30, much less than the 55 plant evaluations needed in the second case of the finite-difference approximation of the gradients to reach a similar accuracy. Note that the additional plant evaluations at the 10th iteration are attributed to the shrinking of the regression region by Step 4 of the MAWQA algorithm. Figure 9c,d show the optimization results in the presence of noise. The MAWQA algorithm takes 10 iterations to reach the real optima.
Finally, the MAWQA algorithm was tested with different values of the parameters. For each parameter, three values are considered and the costs after 30 plant evaluations are compared with the real optimum. The results are summarized in Table 5. The increase of $\Delta u$ leads to a decrease of the accuracy of the computed optima. This is due to the fact that the true mapping is only locally quadratic and therefore the use of too distant points can cause large approximation errors. The value of $\gamma$ influences the rate of convergence since it is directly related to the size of the search space in each iteration. The value of $\Delta h$ determines the accuracy of the finite-difference approximation of the gradients at the starting stage of the MAWQA approach. In the absence of noise, a small $\Delta h$ is preferred. However, the search space is also determined by the distribution of the sampled points. A small $\Delta h$ leads to a more constrained search space and therefore decreases the rate of convergence. The overall influence of $\Delta h$ on the rate of convergence is a combination of both effects. Note that $\Delta h$ does not influence the accuracy of the optima if enough plant evaluations are performed.

5. Conclusions

This paper focuses on the explorative moves that are performed when the MAWQA algorithm is used to optimize a plant. The explorative moves are generated as a result of the estimation errors of the plant gradients. Three factors that influence the estimation errors are analyzed: the non-quadratic nature of the plant mappings, the measurement noise, and the distribution of the regression set. The screening algorithm is shown to take the accuracy of the gradient estimation into consideration and to choose the best set from the collected data. This ensures that the MAWQA algorithm takes only the explorative moves that are necessary to improve the gradient estimations, instead of performing an expensive model-improvement step as in derivative-free optimization algorithms. The simulation results for the optimization of the hydroformylation process with four decision variables demonstrate the favorable performance of the MAWQA algorithm compared with the finite-difference-based modifier adaptation algorithm. Further studies will focus on how to implement a dual control between performance improvement and explorative moves.

Acknowledgments

The research leading to these results has received funding from the European Commission under grant agreement number 291458 (ERC Advanced Investigator Grant MOBOCON) and from the Deutsche Forschungsgemeinschaft (DFG) via the funding of the Collaborative Research Center "Integrated Chemical Processes in Liquid Multiphase Systems-InPROMPT" (TR63). The financial support by the ERC and by the DFG is most gratefully acknowledged.

Author Contributions

W.G. conceived the theoretical analysis, performed the simulation studies and wrote the paper; R.H. provided the TMS model; R.H. and S.E. corrected the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
RTO: Real-time optimization
MAWQA: Modifier adaptation with quadratic approximation
TMS: Thermomorphic multicomponent solvent system
DMF: Dimethylformamide

References

  1. Chen, C.Y.; Joseph, B. On-line optimization using a two-phase approach: An application study. Ind. Eng. Chem. Res. 1987, 26, 1924–1930.
  2. Roberts, P.D. An algorithm for steady-state system optimization and parameter estimation. Int. J. Syst. Sci. 1979, 10, 719–734.
  3. Gao, W.; Engell, S. Iterative set-point optimization of batch chromatography. Comput. Chem. Eng. 2005, 29, 1401–1409.
  4. Marchetti, A.; Chachuat, B.; Bonvin, D. Modifier-adaptation methodology for real-time optimization. Ind. Eng. Chem. Res. 2009, 48, 6022–6033.
  5. Bunin, G.A.; François, G.; Bonvin, D. From discrete measurements to bounded gradient estimates: A look at some regularising structures. Ind. Eng. Chem. Res. 2013, 52, 12500–12513.
  6. Gao, W.; Wenzel, S.; Engell, S. A reliable modifier-adaptation strategy for real-time optimization. Comput. Chem. Eng. 2016, 91, 318–328.
  7. Conn, A.R.; Scheinberg, K.; Vicente, L.N. Introduction to Derivative-Free Optimization; SIAM: Philadelphia, PA, USA, 2009.
  8. Ciarlet, P.G.; Raviart, P.A. General Lagrange and Hermite interpolation in R^n with applications to finite element methods. Arch. Ration. Mech. Anal. 1972, 46, 177–199.
  9. Zagajewski, M.; Behr, A.; Sasse, P.; Wittman, J. Continuously operated miniplant for the rhodium catalyzed hydroformylation of 1-dodecene in a thermomorphic multicomponent solvent system (TMS). Chem. Eng. Sci. 2014, 115, 88–94.
  10. Hernández, R.; Engell, S. Modelling and iterative real-time optimization of a homogeneously catalyzed hydroformylation process. In Proceedings of the 26th European Symposium on Computer Aided Process Engineering, Portorož, Slovenia, 12–15 June 2016.
  11. Hentschel, B.; Kiedorf, G.; Gerlach, M.; Hamel, C.; Seidel-Morgenstern, A.; Freund, H.; Sundmacher, K. Model-based identification and experimental validation of the optimal reaction route for the hydroformylation of 1-Dodecene. Ind. Eng. Chem. Res. 2015, 54, 1755–1765.
  12. Schäfer, E.; Brunsch, Y.; Sadowski, G.; Behr, A. Hydroformylation of 1-dodecene in the thermomorphic solvent system dimethylformamide/decane. Phase behavior-reaction performance-catalyst recycling. Ind. Eng. Chem. Res. 2012, 51, 10296–10306.
Figure 1. Causality between the explorative moves and the influencing factors.
Figure 2. Illustration of one MAWQA iteration with noisy data. Surface plot: real profit mapping, mesh plot: quadratic approximation, Processes 04 00045 i001: regression set-point, Processes 04 00045 i002: not chosen set-point, : measured profit, Processes 04 00045 i003: next set-point move, blue contours: model-predicted profit, magenta contours: modifier-adapted profit, dash-dot line: constrained search space.
Figure 3. Illustration of the relationship between $\theta$ and $\theta^v$.
Figure 4. Illustration of the generation of explorative moves.
Figure 5. Illustration of one MAWQA iteration with noise-free measurements. Surface plot: real profit mapping, mesh plot: quadratic approximation, Processes 04 00045 i001: regression set-point, Processes 04 00045 i002: not chosen set-point, : measured profit, Processes 04 00045 i003: next set-point move, blue contours: model-predicted profit, magenta contours: modifier-adapted profit, dash-dot line: constrained search space.
Figure 6. Illustration of the MAWQA iteration after two explorative moves. Surface plot: real profit mapping, mesh plot: quadratic approximation, Processes 04 00045 i001: regression set-point, Processes 04 00045 i002: not chosen set-point, : measured profit, Processes 04 00045 i003: next set-point move, blue contours: model-predicted profit, magenta contours: modifier-adapted profit, dash-dot line: constrained search space.
Figure 7. (a) Thermomorphic multicomponent solvent (TMS) system; (b) Reaction network of the hydroformylation of 1-dodecene. Adapted from Hernández and Engell [10].
Figure 8. Modifier-adaptation optimization of the thermomorphic solvent system using finite-difference approximation of the gradients; the left figures show the evolutions of the normalized optimization variables u (the additional set-point perturbations are represented by the small pulses which are superimposed on the set-point evolutions; the star symbols at the end mark the real optima), the right figures show the evolutions of the cost and the number of plant evaluations (the inset figure zooms in on the cost evolution, and the dashed line marks the real optimum).
Figure 9. MAWQA optimization of the thermomorphic solvent system; the left figures show the evolutions of the normalized optimization variables (the additional plant evaluations, i.e., initial probes and unsuccessful moves, are represented by the small pulses which are superimposed on the set-point evolutions; the star symbols at the end mark the real optima), the right figures show the evolutions of the cost and the number of plant evaluations (the inset figure zooms in on the cost evolution, and the dashed line marks the real optimum).
Table 1. Symbols used in the modifier adaptation formulation.

Symbol | Description
$k$ | Index of iteration
$u^{(k)}$ | Current set-point
$\nabla J_p^{(k)}$ | Gradient vector of the plant objective function at $u^{(k)}$
$\nabla J_m^{(k)}$ | Gradient vector of the model-predicted objective function at $u^{(k)}$
$C_p^{(k)}$ | Vector of the plant constraint values at $u^{(k)}$
$C_m^{(k)}$ | Vector of the model-predicted constraint values at $u^{(k)}$
$\nabla C_p^{(k)}$ | Gradient matrix of the plant constraint functions at $u^{(k)}$
$\nabla C_m^{(k)}$ | Gradient matrix of the model-predicted constraint functions at $u^{(k)}$
Table 2. Model parameters and variables.

Symbol | Description
$C_i$ | Concentration of 1-dodecene, n-tridecanal, iso-dodecene, n-dodecane, iso-aldehyde, decane, and DMF
$C_j$ | Concentration of CO and H$_2$
$C_j^{eq}$ | Equilibrium concentration of CO and H$_2$ at the G/L interface
$V_R$ | Reactor volume
$\dot{V}_{in}$ | Inflow rate
$\dot{V}_{out}$ | Outflow rate
$C_{cat}$ | Concentration of the active catalyst
$M_{cat}$ | Molar mass of the catalyst
$\nu_{i,l}$ | Coefficients of the stoichiometric matrix [11]
$r_l$ | Reaction rate of the $l$th reaction
$k_{eff}$ | Mass transfer coefficient
$P_j$ | Partial pressure
$T$ | Reaction temperature
$H_{j,0}$ | Henry coefficient
$C_{Rh,precursor}$ | Concentration of the catalyst precursor
$K_{cat,1}$, $K_{cat,2}$ | Equilibrium constants
$T_{decanter}$ | Decanter temperature
$A_{i,0}$, $A_{i,1}$, $A_{i,2}$ | Coefficients regressed from experimental data [12]
$n_{i,product}$ | Molar flows of the components in the product stream
$n_{i,catalyst}$ | Molar flow of the components in the recycled catalyst stream
$n_{i,decanter}$ | Molar flow of the components in the decanter inlet stream
Table 3. Operating variables and optimum.

Operating Variable | Operating Interval | Initial Set-Point | Real Optimum | Model Optimum
Reactor temperature (°C) | 85∼105 | 95.0 | 88.64 | 85.10
Catalyst dosage (ppm) | 0.25∼2.0 | 1.1 | 0.51 | 0.49
Gas pressure (bar) | 1.0∼3.0 | 2.0 | 3.0 | 3.0
CO fraction | 0.0∼0.99 | 0.5 | 0.55 | 0.61
Cost (Euro/kmol) | | 899.04 | 761.33 | 818.88
Table 4. Parameters of MAWQA.

Description | Symbol | Value
Screening parameter | $\Delta u$ | 0.1
Search space parameter | $\gamma$ | 3
Perturbation step size | $\Delta h$ | 0.1
Table 5. Influence of MAWQA parameters.

Varied Parameter | $\Delta u$ | $\Delta h$ | $\gamma$ | Cost after 30 Eval. | Deviation from the True Optimum (%)
Initial | 0.1 | 0.1 | 3 | 761.5 | 0.02%
$\Delta u$ | 0.15 | 0.1 | 3 | 762.5 | 0.15%
$\Delta u$ | 0.2 | 0.1 | 3 | 762.9 | 0.21%
$\Delta h$ | 0.1 | 0.15 | 3 | 761.4 | 0.01%
$\Delta h$ | 0.1 | 0.2 | 3 | 763.3 | 0.26%
$\gamma$ | 0.1 | 0.1 | 2 | 763.2 | 0.24%
$\gamma$ | 0.1 | 0.1 | 1 | 772.1 | 1.14%
