Next Article in Journal
Modifying Nanoporous Carbon through Hydrogen Peroxide Oxidation for Removal of Metronidazole Antibiotics from Simulated Wastewater
Previous Article in Journal
Performance Evaluation Using Multivariate Non-Normal Process Capability
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimal Design of Experiments for Liquid–Liquid Equilibria Characterization via Semidefinite Programming

by
Belmiro P.M. Duarte
1,2,*,
Anthony C. Atkinson
3,
José F.O. Granjo
2 and
Nuno M.C. Oliveira
2
1
Department of Chemical and Biological Engineering, ISEC, Polytechnic Institute of Coimbra, Rua Pedro Nunes, Quinta da Nora, 3030–199 Coimbra, Portugal
2
CIEPQPF, Department of Chemical Engineering, University of Coimbra, Rua Sílvio Lima, Pólo II, 3030–790 Coimbra, Portugal
3
Department of Statistics, London School of Economics, London WC2A 2AE, UK
*
Author to whom correspondence should be addressed.
Processes 2019, 7(11), 834; https://doi.org/10.3390/pr7110834
Submission received: 25 September 2019 / Revised: 27 October 2019 / Accepted: 6 November 2019 / Published: 8 November 2019
(This article belongs to the Section Chemical Processes and Systems)

Abstract

:
Liquid–liquid equilibria (LLE) characterization is a task requiring considerable work and appreciable financial resources. Notable savings in time and effort can be achieved when the experimental plans use the methods of the optimal design of experiments that maximize the information obtained. To achieve this goal, a systematic optimization formulation based on Semidefinite Programming is proposed for finding optimal experimental designs for LLE studies carried out at constant pressure and temperature. The non-random two-liquid (NRTL) model is employed to represent species equilibria in both phases. This model, combined with mass balance relationships, provides a means of computing the sensitivities of the measurements to the parameters. To design the experiment, these sensitivities are calculated for a grid of candidate experiments in which initial mixture compositions are varied. The optimal design is found by maximizing criteria based on the Fisher Information Matrix (FIM). Three optimality criteria (D-, A- and E-optimal) are exemplified. The approach is demonstrated for two ternary systems where different sets of parameters are to be estimated.

1. Motivation

This paper addresses the design of locally optimal experimental plans for liquid–liquid equilibria (LLE) characterization. For a systematic handling of the problem, we propose semidefinite programming (SDP) formulations combined with the Model-based Optimal Designs of Experiments (M-bODE) using thermodynamic models for forecasting the equilibria. The innovations of the paper are twofold: (i) presenting a systematic approach based on mathematical programming strategies to handle the optimal design of experiments for LLE characterization; and (ii) addressing the difficulties in model-based optimal designs of experiments for models described by implicit nonlinear algebraic equations including the factors, the measurements and the parameters to be estimated. Typically, the statistics community focuses on M-bODE for models described by explicit algebraic equations where the measurements are related to the experimental factors and the parameters. The generalization to implicit nonlinear models was, as far as we know, not previously considered. The novelty lies in the computation of the parametric sensitivities which, for the LLE problem, require solving a linear system of equations.
Liquid–liquid extraction is an important separation unit operation that finds application in a variety of areas of the chemical process industries, including those for petrochemicals, food, and pharmaceuticals. Accurate phase equilibrium data are important for the design and optimization of industrial plants for extraction processes. Moreover, ternary systems equilibrium data are essential for the proper understanding of the solvent extraction process and so for solvent selection. Typically, characterizing the LLE for ternary mixtures requires complex experimental plans to obtain data for subsequently fitting thermodynamic models able to simulate and optimize unit operations [1]. Several methods can be used to obtain LLE data [2] but in most cases the task requires considerable resources, both human and economic. The building of adequate mathematical models from LLE data typically involves:
(i)
choosing the appropriate thermodynamic model according to the nature of the mixture and the region of operation;
(ii)
defining the methodology to estimate parameters that are able to describe well the experimental data;
(iii)
iterative computation of LLE and stability analysis to prevent spurious phase predictions;
(iv)
refinement of existing models, by planning further experiments; and
(v)
designing experiments prior to the laboratory experiments.
While Items (i)–(iii) are extensively studied in literature, Items (iv)–(v) are scarcely tackled in the context of chemical thermodynamics. The main reasons are that the models are of large complexity and include highly nonlinear terms. This study contributes to filling the gap detected.
The approach proposed herein requires an approximated knowledge of the binary interaction parameters (BIPs), which can be obtained from data in the literature for similar components to those in the mixture of interest or estimated from group contribution methods, such as UNIFAC. In addition, a prior knowledge is required of the number of immiscible pairs of components in the ternary system, i.e., its type. Using this information, the method is capable of calculating a locally optimal experimental design. The experimental results either will confirm the estimates of the initial BIPS, or will update their values. In the latter case, the method finds another optimal experimental design, now based on the updated BIPs. The method is thus: (i) used for parameter confirmation; or (ii) included in a sequential procedure where the parameters are iteratively refined.
M-bODE generates efficient plans of experiments for parameter estimation and has currently gained importance due to the necessity of judiciously choosing the most informative experiments, given the budgetary constraints on resources. M-bODE thus allows important cost savings in scientific studies by determining the most efficient designs for accurate inference [3].
It is a matter of surprise that M-bODE is not reported for generating experimental plans for LLE characterization. The exception is the work of Dechambre et al. [4], who proposed a strategy combining the discretization of the initial mixture space with an algorithm similar those of the classic exchange methods [5,6,7].
Among the classic approaches, various numerical algorithms based on exchange methods [5,8,9] have been developed for the construction of optimal experimental designs. These algorithms are iterative, requiring a starting design and a stopping criterion in the search for the optimal solution. They iteratively replace current design points by one or more points that may be either new or already in the support of the current design. The rule for selecting the point or points for generating the next design vary depending on the type of algorithm and the design criterion. The numerical efficiency of these Wynn–Fedorov schemes has been improved by several authors, including Wu [10], Wu and Wynn [11], Pronzato [12] and Harman and Pronzato [13]. They were reviewed and compared by  Meyer and Nachtsheim [14] and Pronzato [15], among others. Another approach to finding continuous optimal designs is based on Multiplicative algorithms, which have found broad application due to their simplicity [16]. The basic algorithm was proposed by Titterington [17] and later exploited by Pázman [18], Fellman [19], Pukelsheim and Torsney [20], Torsney and Mandal [21],  Mandal and Torsney [22], Dette et al. [23], Torsney and Martín-Martín [24], Yu [25], and Yu [26]. Recently, cocktail algorithms, which rely on both exchange and multiplicative algorithms, have been proposed [27]. Another algorithm falling in the same class was proposed by Yang et al. [28] where the iterative procedure includes a Newton method-based solver for determining the weights given a set of support points and an optimization step where the dispersion function is maximized by choosing the set of support points. All these approaches find the optimal discrete set of design points constructed from the original continuous design space.
We also discretize the initial mixture domain to produce a set of candidate one-point experiments, but instead propose a mathematical programming (MP) based approach to determine the relative weights of individual one-point experiments (most weights will be zero). Mathematical programming approaches have been successfully applied to solve many design problems. Some examples are Linear Programming [29,30], Second Order Conic Programming [31,32], Semidefinite Programming (SDP) [33,34,35], Semi-Infinite Programming (SIP) [36,37], Nonlinear Programming (NLP) [38,39], NLP combined with stochastic procedures such as genetic algorithms [40,41], and global optimization techniques [42,43]. They are particularly efficient for approximate (also called continuous) optimal designs, characterized by a continuous representation of the weights of each constituent one-point experiment in the plan; the design problem has convex properties and the quality of the optimum can be checked with a General Equivalence Theorem (GET). Moreover, the resulting optimization problem can be solved efficiently using state-of-the-art solvers that guarantee global optimality, especially when the optimization problem belongs to a convex programming class.
It is important to emphasize that most of the MP-based formulations listed above are for explicit models where by “explicit models” we refer to functional forms that explicitly relate the response variables and the control factors. Contrarily, in this study, we address implicit models which are functionals of nonlinear type relating the responses, control factors and parameters; a classic example is the LLE thermodynamic model [44]. As far as we know, the use of classic approaches to handle implicit models is not reported in the literature because of the additional complexity of computing the parametric sensitivity. In practice, MP-based approaches are more suitable for this class of models since the construction of the FIM can be embedded in the formulation for finding the optimal design. MP-based approaches are also advantageous in finding constrained optimal designs, where additional constraints on the distribution of weights are explicitly included in the formulation. Finally, the mathematical programming problems falling into convex formulations (as is the case of SDP formulations), guarantee the global optimality of the obtained design for the chosen discrete set of candidate points. The  classic algorithms lack such a proof.
A disadvantage of MP-based methods is that the size of the optimization problem increases almost exponentially with the number of candidate points. The solvers may take larger computational times than those required by classic algorithms and may even not converge for problems of large dimension. However, the current SDP solvers based on interior point methods can efficiently handle problems with 1 × 104 constraints  [35].

Paper Organization and Nomenclature

Section 2 presents the mathematical representation of the LLE model, the background required for applying the methods of optimal experimental design and the fundamentals of SDP. Section 3 formulates the optimal design of experiments for an SDP solution, covering the details of: (i) FIM construction; and (ii) computational implementation. Section 4 applies the proposed formulations to systems of Type 1, which are the most common in liquid–liquid extraction processes; for the classification of LLE systems topology, see Treybal [45]. Two different classes of ternary system of Type 1 are considered for demonstration purposes: (i) Class I, where Components 2 and 3 are completely miscible but Components 1 and 2 and 1 and 3 are both partially miscible; and (ii) Class II, where Components 1 and 3 and 2 and 3 are completely miscible but Components 1 and 2 are only partially miscible ([46], p. 8.163).
We use bold face lowercase letters to represent vectors, bold face capital letters for continuous domains, blackboard bold capital letters for discrete domains, and capital letters for matrices. Finite sets containing ι elements are compactly represented by ι = { 1 , , ι } . The symbol “ ” is used to indicate the vector/matrix transpose operation, tr ( ) stands for the trace of a matrix and card ( ) for the cardinality of a discrete set.

2. Background

This section provides the background material required by the mathematical formulation for obtaining approximate optimal designs for LLE characterization. First, we introduce the model used to describe the LLE for ternary systems (Section 2.1). Next, the basic theory of model-based optimal designs of experiments is provided (Section 2.2), and Section 2.3 introduces the fundamentals of SDP.

2.1. Liquid–Liquid Equilibria: Mathematical Representation

We first present the model relating the compositions of each component of a ternary mixture z in each phase. The LLE characterization studies are performed at constant pressure and temperature. Let  c = { 1 , 2 , 3 } be the set of components in the mixture, and  h = { 1 , 2 } the set of phases into which the original mixture separates. Consequently, z R 3 , such that j = 1 3 z j = 1 and 0 z j 1 , j c . The set of candidate experiments is formed by a finite number ( n e ) of vectors z , i.e.,  Z = { z i : i n e } . The compositions of the components in each phase are denoted by y p , j where the first subscript identifies the phase, p h , and the second is for the component, j c . Similarly, in each phase and ignoring any measurement errors, the component fractions sum to 1, i.e.,  j = 1 3 y p , j = 1 , p h and  0 y p , j 1 , p h , j c .
The LLE model includes mass balances and the equilibrium conditions relating the compositions of the components in both phases. Here, we assume the equilibrium of chemical potentials which translate into iso-fugacity relations. The activity coefficient, here denoted by γ ( y , θ ) , is determined using the Non-random Two-liquid (NRTL) model [47], but without loss of generalization other models such as the Universal Quasichemical (UNIQUAC) model [48] and the UNIQUAC Functional-group Activity Coefficients (UNIFAC) model [49] can be used. The vector of parameters in the model, θ , includes the dimensionless pair of interaction parameters, τ , in turn related to the interaction energy parameters, and the non-randomness parameters, α . Consequently, in this study, we have θ { τ , α } . For ternary systems, τ is a  3 × 3 matrix containing the interactions between all the pairs of components in the mixture. Since the interaction energy of each species with itself is null, corresponding to τ j , j = 0 for j c , the complete set of parameters to be estimated from the proposed experimental plan includes six pairs of interaction parameters, { τ i , j : i , j c , i j } . Similarly, α is a symmetric 3 × 3 matrix, where the elements on the diagonal are 0, and  α i , j = α j , i : i , j c . For ternary systems, the number of additional parameters α is three. However, in some of the experimental plans, we assume α fixed having a value in the interval [ 0.2 , 0.4 ] (see Rappel et al. [50] among others) and focus only on the determination of the parameters  τ . Summing up, if the experimental plan aims at finding only the pairs of interaction parameters, θ  includes six τ s, i.e.,  θ { τ i , j : i , j c , i j } . Otherwise, if the goal is estimation of the pair interaction parameters and the pairs of non-randomness parameters, θ includes six τ s and three α s, i.e., θ { τ i , j : i , j c , i j } { α i , j : i , j c , i < j } . The calculation of designs for both cases is demonstrated in Section 4.
The LLE model is compactly written in the form g ( y 1 , y 2 , z , θ ) where g ( ) is a set of nonlinear algebraic equations:
y 1 , i γ i ( y 1 , θ ) y 2 , i γ i ( y 2 , θ ) = 0 , i c ,
i = 1 3 y j , i = 1 , j h ,
z j z 1 y 2 , 1 y 1 , 1 y 2 , 1 y 1 , j 1 z 1 y 2 , 1 y 1 , 1 y 2 , 1 y 2 , j = 0 , j { 2 } .
Equation (1a) represents the iso-fugacity constraints for all the components, Equation (1b) is for the summation of the component fractions in each phase after separation, and  Equation (1c) is the mass balance for the second component obtained by applying the lever rule. Alternatively, Equation (1c) can be replaced by the mass balance for the third component, corresponding to j { 3 } . The activity coefficients γ i ( y p , θ ) , i c , are modeled by the NRTL model presented in Appendix A. For compactness, y p represents the vector of compositions of the components in the pth phase. Note that: (i) the link between the equilibrium composition and that of the initial mixture is Equation (1c) but the equilibrium can only be fully characterized after solving all equations in Equation (1) together; and (ii) by assumption i = 1 2 z i 1 .
Typically, an experiment to characterize LLE requires choosing an initial mixture, z , providing a separation in two phases is obtained, and measuring at least the composition of two components in each phase, e.g., Components 1 and 2. The measurement errors of all four variables are assumed independent and identically distributed with mean zero and constant variance.

2.2. Optimal Design of Experiments

The regression models addressed herein are described by g ( ) and have a four-variate response with two independent variables (covariates) used to set the initial mixture composition, e.g., z 1 and z 2 . The remaining fraction is dependent on the first two so that z 3 = 1 z 1 z 2 0 . The mean response at z is
E [ y 1 , y 2 | z , θ ] : = arg [ Model ( 1 ) ] ,
where y 1 and y 2 solve Equation (1) for a given z . Here, θ Θ R n θ where Θ is a compact domain containing the parameters values and E [ ] is the expectation operator with respect to the error distribution.
Because the model is nonlinear, the FIM is dependent on the values of θ to be estimated from the experimental design. Here, we address locally optimal designs [51] where the model parameters are taken from previous studies or experiments and they are then treated as known in the FIM construction. We note that this framework is particularly useful when a good estimate is available and the study serves to confirm parameter estimates or is a step of a sequential design (see Atkinson et al. [52]).
In this study, we focus on approximate designs, ξ , which allocate a weight w i [ 0 , 1 ] to ith candidate experiment z i Z , with  i = 1 n e w i = 1 . The weights w i can also be interpreted as the relative effort put on initial mixtures z i , and many of them may be null in the optimal design if the additional information provided by the corresponding mixtures is low (or none). In practice, approximate designs are implemented by taking roughly n × w i replicates at mixture z i , i k after rounding n × w i to an integer subject to the constraint n × w 1 + + n × w k = n where k stands for the number of support point of the optimal design. Advantages of working with continuous designs are many, and there is a unified framework for finding optimal continuous designs for M-bODE problems when the design criterion is a convex function on the set of all approximate designs [53]. In particular, the optimal design problem can be formulated into a mathematical optimization program with convex properties and equivalence theorems are available to check the optimality of a design. In many algorithms for finding optimal designs, the number k of support points has to be specified by the experimenter, with comparisons between designs for different k leading to the optimal value. However, in our formulation, the optimal value of k is part of the output of the algorithm and is found solving the problem using SDP. Rounding procedures for exact designs can be found in the work of Pukelsheim and Rieder [54].
We identify the set Ξ of all feasible approximate designs in Z as the n e 1 -dimensional simplex in the space of weights satisfying Ξ w [ 0 , 1 ] n e : w i 0 , i n e , i = 1 n e w i = 1 , Z . An optimal experimental design is represented by a  k-tuple
ξ = ( z 1 z k w 1 w k )
where k candidate one-point designs have w i > 0 . Here, the first three lines are for the compositions in the initial mixture and the fourth for the weight of this candidate experiment. The corresponding equilibrium compositions in both phases are stored in a  2 × k -tuple
Y = ( y 1 , 1 y 1 , k y 2 , 1 y 2 , k ) .
where each column contains the compositions in each phase after separation for each one-point design forming the optimal design. The second subscript of y is to identify the support point while the first is for phase.
The performance of the design ξ is measured by a convex functional of its FIM. The elements of the normalized FIM obtained after adjusting for the sample size are the negatives of the expectation of the second-order derivatives with respect to the parameters of the log-likelihood of Equation (2) for component j in phase p given the set of candidate treatments Z , denoted as L p , j ( ξ | Z , θ ) . This matrix is proportional to
M p , j ( ξ | Z , θ ) = i = 1 k w i M p , j ( z i | Z , θ ) = i = 1 k w i θ L p , j ( ξ | Z , θ ) θ ,
where M p , j ( ξ | Z , θ ) is the FIM from the design ξ considering a single measurement—the composition of jth component in phase p, i.e.,  y p , j , j 2 , p h . M p , j ( z i | Z , θ ) is the corresponding elemental FIM from point z i . The global FIM including all four responses is obtained summing up the FIMs M p , j ( ξ | Z , θ ) , j 2 , p h weighted by the variance–covariance matrix for the corresponding pair of observations [55,56], i.e.,
M ( ξ | Z , θ ) = p = 1 2 j = 1 2 σ p , j M p , j ( ξ | Z , θ ) .
Here, we assume that the measurements are independent, which implies that the variance–covariance matrix is diagonal. Additionally, by assumption, all the measurements have the same error variance, which yields an equally weighted sum in Equation (4), i.e., σ p , j = 1.0 , j 2 , p h .
When errors are normally and independently distributed, the volume of the confidence region for the model parameters θ is inversely proportional to det [ M 1 / 2 ( ξ | Z , θ ) ] . Consequently, maximizing the determinant of the FIM, by choice of a design leads to the most accurate estimates for the parameters. If the interest is in finding the approximate D-optimal design, the optimization problem is
ξ D = arg max ξ Ξ { det [ M ( ξ | Z , θ ) ] } 1 / n θ .
Other design criteria optimize the FIM in different ways. For example, if the formulation is used to find A-optimal designs, the optimization problem is
ξ A = arg min ξ Ξ tr [ M 1 ( ξ | Z , θ ) ] ,
and, if the objective is finding E-optimal designs, the optimization problem is
ξ E = arg max ξ Ξ λ min [ M ( ξ | Z , θ ) ] ,
where λ min [ ] stands for the minimum eigenvalue of •. To check the optimality of the designs found, we use the equivalence theorems [52,57]. The scaled sensitivity function, Ψ ( z , ξ ) , for a point z Z is given by   
Ψ ( z , ξ ) = 1 n θ y 1 ( z ) θ y 2 ( z ) θ M 1 ( ξ D | Z , θ ) y 1 ( z ) θ y 2 ( z ) θ 1 for D - optimality 1 tr [ M ( ξ A | Z , θ ) ] y 1 ( z ) θ y 2 ( z ) θ M 2 ( ξ A | Z , θ ) y 1 ( z ) θ y 2 ( z ) θ 1 for A - optimality 1 λ min [ M ( ξ E | Z , θ ) ] y 1 ( z ) θ y 2 ( z ) θ ϱ ϱ y 1 ( z ) θ y 2 ( z ) θ 1 for E - optimality
where y 1 ( z ) θ y 2 ( z ) θ is the vector of sensitivities at z , and ϱ is the eigenvector of M ( ξ E | Z , θ ) corresponding to λ min . The sensitivity function in Equation (8) is limited from above by 0, and achieves the maximum at the support points.

2.3. Semidefinite Programming

In this paper, semidefinite programming is employed to solve the optimal design problems for D-, A- and E-optimality criteria over a given discrete domain Z . In this section, we introduce the fundamentals of this class of mathematical programs.
Let S + n θ be the space of n θ × n θ symmetric positive semidefinite matrices, and  S n θ the space of n θ × n θ symmetric matrices. A convex set S R n is semidefinite representable (SDr) if proj S exp ( ζ ) , ζ S , interpreted as the projection ζ on to a higher dimensional set S exp , can be described by Linear Matrix Inequalities (LMIs). That is, S is SDr if and only if there exists some symmetric matrices M 0 , , M m 1 , , M m 1 + m 2 S n θ such that [58]
ζ S v R m 2 : M 0 + i = 1 m 1 ζ i M i + j = 1 m 2 v j M m 1 + j 0 .
Here, ⪰ is the semidefinite operator, i.e.,  A 0 A , Ω 0 , Ω S + n θ , where . , . is the Frobenius inner product operator, ζ R n is a point of the original set S , and  v is a point of the incremental dimensional subspace.
In turn, a convex (or concave) function φ : R m 1 R is SDr if and only if the epigraph of φ , { ( t , ζ ) : φ ( ζ ) t } , or the hypograph, { ( t , ζ ) : φ ( ζ ) t } , respectively, are SDr and can be casted by LMIs [59,60]. The optimal values, ζ , of SDr functions are then formulated as semidefinite programs of the form
max ζ d ζ , i = 1 m 1 ζ i M i M 0 0 .
In our design context, d is a vector of known constants that depends on the design problem, and matrices M i , i { 0 , , m 1 } contain local FIMs and other matrices produced by the reformulation of the functions φ ( ζ ) into LMIs. The decision variables in vector ζ are the weights w i , i n e , of the optimal design and other auxiliary variables required. The problem of calculating a design for a pre-specified set of candidate experiments Z of points z i , i n e , is solved with the formulation in Equation (10) complemented by the linear constraints on w : (i) w 0 ; and (ii) 1 n e w = 1 , where 1 n e is a unitary column vector with n e lines. The problem in Equation (10) is the classic SDP problem which includes LMIs representing conic constraints.
Ben-Tal and Nemirovski [59] provided a list of SDr functions useful for solving continuous optimal design problems with SDP formulations (see Boyd and Vandenberghe [60], Section 7.3). Recently, Sagnol [61] showed that each criterion in the Kiefer class of optimality criteria defined by
Φ δ [ M ( ξ | Z , θ ) ] = 1 n θ tr ( M ( ξ | Z , θ ) δ ) 1 / δ
is SDr for all rational values of δ ( , 1 ] and general SDP formulations exist for them. This result also applies to the case when δ 0 ; the problems in Equations (5)–(7) fall into this class. Practically, the problem of finding optimal approximate plans of experiments for the most common (convex) criteria can be formulated as a semidefinite programming problem falling into the general representation (9) (see Vandenberghe and Boyd [33] and Duarte and Wong [35] among others).

3. SDP Formulations for Finding Optimal Design of Experiments

This section presents the formulations for finding approximate optimal designs via SDP. First, we introduce the approach used for calculating the FIM (see Section 3.1). In Section 3.2, we introduce the SDP formulations for finding approximate D-, A- and E-optimal designs for the LLE model, and  Section 3.3 covers implementation aspects.

3.1. FIM Construction

This section describes the procedure used to construct the global FIM for a candidate initial mixture  z Z . We recall the model in Equation (1) is formed by six algebraic equations with general form g m ( y 1 , y 2 , z , θ ) where m 6 . The model derivation with respect to the parameters leads to
p = 1 2 j = 1 3 g m ( y 1 , y 2 , z , θ ) y p , j s p , j , i + g m ( y 1 , y 2 , z , θ ) θ i = 0 , m 6 , i card ( θ ) ,
where s p , j , i denotes the sensitivity of the measurement y p , j to parameter θ i θ with θ defined in Section 2.1. The resulting set of linear algebraic Equation (12) can be cast as
F s p , j , i = b , p h , j c , i card ( θ ) ,
where F is the 6 n θ × 6 n θ matrix formed by terms g m ( y 1 , y 2 , z , θ ) y p , j , and b is a column vector of size 6 n θ containing the terms g m ( y 1 , y 2 , z , θ ) θ i , i card ( θ ) . Consequently, the solution of Equation (13) is obtained inverting F, i.e.,
s p , j , i = F 1 b , p h , j c , i card ( θ ) .
Now, the elemental FIM’s can be generated using the relation
M p , j ( z | Z , θ ) = s p , j , i s p , j , i , z Z , p h , j 2 ,
which enables the computation of the matrices M p , j ( ξ | Z , θ ) and M ( ξ | Z , θ ) using Equations (3) and (4), respectively. In case the matrix M ( ξ | Z , θ ) is singular, the strategies proposed by Shahmohammadi and McAuley [62] can be employed.

3.2. SDP Formulations for D-, A- and E-Optimal Designs

This section presents the formulations for finding optimal designs via SDP. The D-optimality criterion is addressed first. Next, the formulations for A- and E-optimal design problems are introduced.
Here, we recall the approximate D-optimal design problem in Equation (5), and use the formulation proposed by Vandenberghe and Boyd [33] to handle the problem. That is, given the LLE model, the set of local parameters, the set of candidate experiments Z (already generated), the elemental FIMs for each candidate experiment and the total number of experiments n e , the formulation for finding approximate D-optimal designs on Ξ may be formulated as follows:
max w [ 0 , 1 ] n e det M ( ξ | Z , θ ) 1 / n θ
s . t . M ( ξ | Z , θ ) 0 n θ × n θ
M ( ξ | Z , θ ) = h = 1 2 j = 1 2 M p , j ( ξ | Z , θ )
M p , j ( ξ | Z , θ ) = i = 1 n e w i M p , j ( z i | Z , θ ) , p h , j 2
M p , j ( ξ | Z , θ ) S + n θ , p h , j 2
1 n e w = 1
where Equation (16a) is the objective function, Equation (16b) assures the semidefinite positiveness of the global FIM, Equation (16c) is for the global FIM equality, Equation (16d) generates the FIMs for each measurement, Equation (16e) represents the semidefinite positiveness of M h , j ( ξ | Z , θ ) , and Equation (16f) guarantees that the weights sum to 1. Here, 0 n θ × n θ is a  n θ × n θ matrix of zeros. Similarly, the formulation for the A-optimality criterion introduced in Equation (6), is as follows:
min w [ 0 , 1 ] n e tr M 1 ( ξ | Z , θ )
s . t . Equations ( 16 b ) ( 16 f ) ,
where the difference is solely in the objective function. Finally, for the E-optimal design problem presented in Equation (7), the formulation is
max w [ 0 , 1 ] n e λ min M ( ξ | Z , θ )
s . t . Equations ( 16 b ) ( 16 f ) .

3.3. Implementation Aspects

In this section, we detail the computational implementation of the procedure for finding optimal designs for LLE experiments.
The nonlinear algebraic equations in Equation (1) are solved for each candidate mixture z using a Newton–Krylov–Armijo algorithm. The solver uses the Krylov method for computing Newton inexact directions and the Armijo rule to globalize the Newton method (see Kelley [63]). The relative and absolute tolerances used in the solver are 1 × 10−7 and 1 × 10−8, respectively.
The set of candidate experiments is constructed using an equidistributed grid in both coordinates z 1 and z 2 . An equal step Δ z is used for both and all the points satisfy the condition z 1 + z 2 1 . Finally,  z 3  is computed as 1 z 1 z 2 .
Both derivatives in Equation (12) ( g m ( y 1 , y 2 , z , θ ) y p , j and g m ( y 1 , y 2 , z , θ ) θ i ) are determined via automatic differentiation using ADiMat code [64].
To handle the SDP problems in Equations (16)–(18), there are user-friendly interfaces, such as cvx [65] or Picos [66], that automatically transform the semidefinite constraints in Equations (16b) and (16e) and the objective functions into a series of LMIs before passing them to SDP solvers such as SeDuMi [67] or Mosek [68]. This is possible when Φ δ is SDr, which is true for our design criteria. In our work, we solved all SDP problems using the cvx environment combined with the solver Mosek that uses an efficient interior point algorithm [69].
To check the optimality of the designs found, we compute the scaled sensitivity function for every initial mixture using Equation (8). From this set of values, we extract the maximum, that, practically, corresponds to a support point. The values of the scaled sensitivity function of the other support points, here denoted as z k opt , k , are also used for comparison and we require that min z k opt Ψ ( z k opt , ξ ) ε , k , where ε is the minimum threshold allowed and  ε was set to −0.02. The rational for choosing the value of ε is that we require approximate designs with at least 98% efficiency.
All computations in this paper were carried using on an Intel Core i7 machine (Intel Corporation, Santa Clara, CA, USA) running 64 bits Windows 10 operating system with 2.80 G Hz . The relative and absolute tolerances used to solve the SDP problems were set to 1 × 10−5 in all problems.

4. Results

In this section, we apply the formulations of Section 3.2 to find locally D-, A- and E-optimal designs for LLE characterization. We consider three examples to demonstrate the approach: In the first two examples, the set of parameters to be estimated from the experiments is only the vector of dimensionless binary interaction parameters, i.e., θ { τ i , j : i , j c , i j } , which is six parameters. We consider two ternary systems to demonstrate that the strategy can be applied to Type 1 systems of Class I (Section 4.1) and Class II (Section 4.2). Finally, we present an example where the non-random binary interaction parameters are also included in the set of parameters to be estimated, i.e., θ { τ i , j : i , j c , i j } { α i , j : i , j c , i < j } , thus there are nine parameters (see Section 4.3).
A generalization of the method for systems of Type 2 (with delamination in two binary subsystems) and of Type 3 (with delamination in all three binary subsystems) requires the adaptation of the strategy to the specificities of the system. However, the strategy to follow is similar and involves: (i) experiments of mutual solubility of immiscible pairs of components; (ii) determination of binodal points by successively adding the third component; and (iii) the determination of the tie-lines where the LLE is characterized at different compositions of the ternary mixture. The proposed method can be similarly used to assist the laboratory work in Step (iii) with Steps (i) and (ii) used to generate prior knowledge.
In all examples, the initial grid of mixtures result from equistributing the points with a constant interval in the composition domain [ 0 , 1 ] . In all examples, we use Δ z = 0 . 10 ; the complete set of candidate experiments is populated with 66 mixtures. Naturally, other step sizes can be used. Typically, the CPU time required by the SDP solver increases as the grid becomes finer, but the solver can efficiently tackle problems including thousands of LMIs.
All designs presented in following paragraphs were submitted to optimality checking employing the approach described in Section 3.3. All of them guarantee scaled sensitivity values at the support points below 2%, which allows inferring they are nearly optimal for practical implementation.

4.1. Example 1—Ternary System of Type 1, Class I; θ Including Dimensionless BIPs

First, let us consider that θ only includes the dimensionless binary interaction parameters and the ternary system is of Type 1 (Class I). Here, we consider the system formed by water (1), toluene (2) and aniline (3), compactly represented by WAT-TOL-ANL [46]. The matrices τ and α are, respectively,
τ = 0.00000 7.77063 4.81462 4.93035 0.00000 1.59806 1.27932 0.03509 0.00000 and α = 0.0000 0.2485 0.3412 0.2485 0.0000 0.3000 0.3412 0.3000 0.0000 .
Our goal is to find the optimal design to estimate the six dimensionless binary interaction parameters in τ considering α fixed. The optimal designs are listed in Table 1, while Table 2 lists the expected compositions in both equilibrium phases. In Table 1, the columns of the matrices forming the optimal design ( ξ ) correspond to support points (which have w i > 0 ); the first three lines in each column specify the composition of the initial mixture ( z j , j c ) and the fourth is for the weights. In Table 2, each column reports the composition in separating phases after reaching the equilibrium. The first three lines of each column are for the composition of Phase 1, and the lines below are for the composition of Phase 2.
The results show that: (i) the D-optimal design has five support points and the other criteria require designs with six support points; and (ii) the CPU time required by all problems is relatively low. We note that each support point provides four measurements, thus, for D-optimal design, we end with a set of 20 measurements to fit a model with six parameters.
Figure 1 shows sets of tie lines connecting the two phases in equilibrium for any mixture on the line, the amounts of the equilibrium phases depending on distances along the line. The complete set of tie-lines connecting the phases in equilibrium that contain the initial 66 mixtures is displayed in Figure 1a. Figure 1b shows the tie-lines for the locally D-optimal design presented in Table 1. We note that the mixtures for this design are located at extreme conditions, some with little ANL and near the critical point where the mixtures are homogeneous. The support points of the designs found for all the optimality criteria are similar; contrarily, the weights are significantly different, which is a trend observed for all the other examples.

4.2. Example 2—Ternary System of Type 1, Class II; θ Including Dimensionless BIPs

Now, we consider ternary systems of Type 1 (Class II) and θ only includes the dimensionless binary interaction parameters. The ternary system is formed by water (1), trichloroethylene (2) and acetone (3), represented by WAT-TCE–KET [46]. The matrices τ and α are:
τ = 0.00000 5.98775 1.38800 3.60977 0.00000 0.19920 0.75701 0.20102 0.00000 and α = 0.0000 0.2485 0.3000 0.2485 0.0000 0.3000 0.3000 0.3000 0.0000 .
The optimal designs obtained are listed in Table 3 and the corresponding composition of the phases in equilibrium is in Table 4. Here, we note the D-optimal design has seven support points, the A-optimality criterion produces a design with four points and the E-optimal design is formed by nine support points. Figure 2a presents the initial set of candidate mixtures and corresponding tie-lines while Figure 2b shows the tie-lines for the set of mixtures forming the D-optimal design (Table 3).

4.3. Example 3—Ternary System of Type 1, Class I; θ Including Dimensionless and Non-Random BIPs

Here, we assume a ternary system of Type 1 (Class I) and θ includes the dimensionless binary interaction parameters and the binary interaction non-randomness parameters. The support points of the optimal designs for this problem are shown in Table 5; we note that the D-optimality criterion requires a design with 7 initial mixtures, A-optimality requires 9 and E-optimality needs 17, some of them having reduced relative importance (presented in two lines). Typically, all the designs have more support points than those in Table 1, determined for the same system, when the aim was to estimate only the elements of matrix τ . That the number of initial mixtures would increase was expected since the number of free parameters is larger. However, the computation time is similar to that of Example 1 as the number of candidate initial mixtures used is the same, which, in turn, yield SDP problems of equal size. The main difference is that the dimension of the FIM increases from six to nine. Table 6 shows the equilibrium compositions for designs in Table 5.

5. Conclusions

We present SDP formulations for finding approximate D-, A- and E-optimal experimental designs for characterizing LLE in ternary systems. We consider the NRTL for describing the activity coefficient and derive M-bODE for estimating the dimensionless pairs of interaction parameters (six parameters) as well as the non-randomness pairs (three parameters). The strategy requires discretizing the space of initial mixtures and generating a finite set of candidate experiments. Then, the composition of the separating phases is determined for each candidate point, and the sensitivity of the measurements (the composition of two of the components of the mixture in each phase) obtained after applying automatic differentiation. Afterwards, the sensitivity is used to construct local FIMs and the SDP problem representing the approximate optimal design is solved using an interior point-based solver. The solution is described by the set of relative experimental effort at each initial candidate point—many initial points are not included in the optimal experiments. We apply the strategy to systems of Type 1 falling in Classes I and II, and consider different cases where: (i) only the interaction parameters are to be estimated; and (ii) the interaction and the non-randomness parameters are both to be estimated. One development which, in the interests of brevity, we do not discuss here, is that the approximate designs can be used to compute related exact optimal designs for a previously given number of experiments using classic rounding algorithms. The SDP formulation used here can be extended to other thermodynamic models. Our problems using an initial set of 66 candidate mixtures require mild computational times.

Author Contributions

Conceptualization, B.P.M.D. and J.F.O.G.; methodology, B.P.M.D. and A.C.A.; software, B.P.M.D.; validation, B.P.M.D., A.C.A., J.F.O.G. and N.M.C.O.; formal analysis, B.P.M.D., A.C.A., J.F.O.G. and N.M.C.O.; investigation, B.P.M.D., A.C.A., J.F.O.G. and N.M.C.O.; resources, B.P.M.D.; data curation, B.P.M.D.; writing–original draft preparation, B.P.M.D.; writing–review and editing, B.P.M.D., A.C.A., J.F.O.G. and N.M.C.O.; visualization, B.P.M.D., A.C.A., J.F.O.G. and N.M.C.O.; supervision, B.P.M.D., A.C.A., J.F.O.G. and N.M.C.O.; project administration, B.P.M.D., A.C.A., J.F.O.G. and N.M.C.O.; funding acquisition, B.P.M.D., and N.M.C.O.

Funding

This research received no external funding. The article processing charges were funded by Centro de Investigação em Engenharia dos Processos Químicos e dos Produtos da Floresta (CIEPQPF).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. NRTL Model

In this section, we present the NRTL model for activity coefficients [47]. The activity model is
γ i = exp j = 1 n y j τ j , i G j , i k = 1 n y k G k , i + j = 1 n y j G i , j k = 1 n y k G k , j τ i , j m = 1 n y m τ m , j G m , j k = 1 n y k G k , j
G i , j = exp α i , j τ i , j

References

  1. Aspentech, Inc. Aspen Physical Property System; V9; Aspentech, Inc.: Bedford, MA, USA, 2016. [Google Scholar]
  2. Muhlbauer, A. Phase Equilibria: Measurement & Computation; Series in Chemical and Mechanical Engineering; Taylor & Francis: Abingdon, UK, 1997. [Google Scholar]
  3. Fedorov, V.V.; Leonov, S.L. Optimal Design for Nonlinear Response Models; Chapman and Hall/CRC Press: Boca Raton, FL, USA, 2014. [Google Scholar]
  4. Dechambre, D.; Wolff, L.; Pauls, C.; Bardow, A. Optimal experimental design for the characterization of liquid-liquid equilibria. Ind. Eng. Chem. Res. 2014, 53, 19620–19627. [Google Scholar] [CrossRef]
  5. Fedorov, V.V. Theory of Optimal Experiments; Academic Press: Cambridge, MA, USA, 1972. [Google Scholar]
  6. Mitchell, T.J. An algorithm for the construction of D-optimal experimental designs. Technometrics 1974, 16, 203–210. [Google Scholar]
  7. Wynn, H.P. Results in the theory and construction of D-optimum experimental designs. J. R. Stat. Soc.-Ser. B 1972, 34, 133–147. [Google Scholar] [CrossRef]
  8. Mitchell, T.J.; Miller, F.L., Jr. Use of Design Repair to Construct Designs for Special Linear Models; Technical Report 130-131; Oak Ridge National Laboratory: Oak Ridge, TN, USA, 1970. [Google Scholar]
  9. Wynn, H.P. The sequential generation of D-optimum experimental designs. Ann. Math. Statist. 1970, 41, 1655–1664. [Google Scholar] [CrossRef]
  10. Wu, C.F. Some algorithmic aspects of the theory of optimal designs. Ann. Statist. 1978, 6, 1286–1301. [Google Scholar] [CrossRef]
  11. Wu, C.F.; Wynn, H.P. The convergence of general step-length algorithms for regular optimum design criteria. Ann. Statist. 1978, 6, 1273–1285. [Google Scholar] [CrossRef]
  12. Pronzato, L. Removing non-optimal support points in D-optimum design algorithms. Stat. Probab. Lett. 2003, 63, 223–228. [Google Scholar] [CrossRef]
  13. Harman, R.; Pronzato, L. Improvements on removing non-optimal support points in D-optimum design algorithms. Stat. Probab. Lett. 2007, 77, 90–94. [Google Scholar] [CrossRef]
  14. Meyer, R.K.; Nachtsheim, C.J. The coordinate-exchange algorithm for constructing exact optimal experimental designs. Technometrics 1995, 37, 60–69. [Google Scholar] [CrossRef]
  15. Pronzato, L. Optimal experimental design and some related control problems. Automatica 2008, 44, 303–325. [Google Scholar] [CrossRef] [Green Version]
  16. Mandal, A.; Wong, W.K.; Yu, Y. Algorithmic searches for optimal designs. In Handbook of Design and Analysis of Experiments; CRC Press: New York, NY, USA, 2015; Chapter 21; pp. 755–786. [Google Scholar]
  17. Titterington, D.M. Algorithms for computing D-optimal design on finite design spaces. Proc. Conf. Inf. Sci. Syst. 1976, 3, 213–216. [Google Scholar]
  18. Pázman, A. Foundations of Optimum Experimental Design; Mathematics and Its Applications; Springer: Dordrecht, The Netherlands, 1986. [Google Scholar]
  19. Fellman, J. An empirical study of a class of iterative searches for optimal designs. J. Statist. Plann. Inference 1989, 21, 85–92. [Google Scholar] [CrossRef]
  20. Pukelsheim, F.; Torsney, B. Optimal weights for experimental designs on linearly independent support points. Ann. Statist. 1991, 19, 1614–1625. [Google Scholar] [CrossRef]
  21. Torsney, B.; Mandal, S. Two classes of multiplicative algorithms for constructing optimizing distributions. Comput. Stat. Data Anal. 2006, 51, 1591–1601. [Google Scholar] [CrossRef]
  22. Mandal, S.; Torsney, B. Construction of optimal designs using a clustering approach. J. Stat. Plan. Inference 2006, 136, 1120–1134. [Google Scholar] [CrossRef]
  23. Dette, H.; Pepelyshev, A.; Zhigljavsky, A.A. Improving updating rules in multiplicative algorithms for computing D-optimal designs. Comput. Stat. Data Anal. 2008, 53, 312–320. [Google Scholar] [CrossRef]
  24. Torsney, B.; Martín-Martín, R. Multiplicative algorithms for computing optimum designs. J. Stat. Plan. Inference 2009, 139, 3947–3961. [Google Scholar] [CrossRef]
  25. Yu, Y. Strict monotonicity and convergence rate of Titterington’s algorithm for computing D-optimal designs. Comput. Stat. Data Anal. 2010, 54, 1419–1425. [Google Scholar] [CrossRef]
  26. Yu, Y. Monotonic convergence of a general algorithm for computing optimal designs. Ann. Statist. 2010, 38, 1593–1606. [Google Scholar] [CrossRef]
  27. Yu, Y. D-optimal designs via a cocktail algorithm. Stat. Comput. 2010, 21, 475–481. [Google Scholar] [CrossRef]
  28. Yang, M.; Biedermann, S.; Tang, E. On optimal designs for nonlinear models: A general and efficient algorithm. J. Am. Stat. Assoc. 2013, 108, 1411–1420. [Google Scholar] [CrossRef]
  29. Gaivoronski, A. Linearization methods for optimization of functionals which depend on probability measures. In Stochastic Programming 84 Part II; Prékopa, A., Wets, R.J.B., Eds.; Mathematical Programming Studies Book Series; Springer: Berlin/Heidelberg, Germany, 1986; Volume 28, pp. 157–181. [Google Scholar]
  30. Harman, R.; Jurík, T. Computing c-optimal experimental designs using the Simplex method of linear programming. Comput. Stat. Data Anal. 2008, 53, 247–254. [Google Scholar] [CrossRef]
  31. Sagnol, G. Computing optimal designs of multiresponse experiments reduces to second-order cone programming. J. Stat. Plan. Inference 2011, 141, 1684–1708. [Google Scholar] [CrossRef] [Green Version]
  32. Sagnol, G.; Harman, R. Computing exact D-optimal designs by mixed integer second order cone programming. Ann. Stat. 2015, 43, 2198–2224. [Google Scholar] [CrossRef]
  33. Vandenberghe, L.; Boyd, S. Applications of semidefinite programming. Appl. Numer. Math. 1999, 29, 283–299. [Google Scholar] [CrossRef]
  34. Papp, D. Optimal designs for rational function regression. J. Am. Stat. Assoc. 2012, 107, 400–411. [Google Scholar] [CrossRef]
  35. Duarte, B.P.M.; Wong, W.K. Finding Bayesian optimal designs for nonlinear models: A semidefinite programming-based approach. Int. Stat. Rev. 2015, 83, 239–262. [Google Scholar] [CrossRef] [PubMed]
  36. Duarte, B.P.M.; Wong, W.K. A semi-infinite programming based algorithm for finding minimax optimal designs for nonlinear models. Stat. Comput. 2014, 24, 1063–1080. [Google Scholar] [CrossRef]
  37. Duarte, B.P.M.; Wong, W.K.; Atkinson, A.C. A Semi-Infinite Programming based algorithm for determining T-optimum designs for model discrimination. J. Multivar. Anal. 2015, 135, 11–24. [Google Scholar] [CrossRef] [PubMed]
  38. Chaloner, K.; Larntz, K. Optimal Bayesian design applied to logistic regression experiments. J. Stat. Plan. Inference 1989, 59, 191–208. [Google Scholar] [CrossRef]
  39. Molchanov, I.; Zuyev, S. Steepest descent algorithm in a space of measures. Stat. Comput. 2002, 12, 115–123. [Google Scholar] [CrossRef]
  40. Heredia-Langner, A.; Montgomery, D.C.; Carlyle, W.M.; Borror, C.M. Model-robust optimal designs: A Genetic Algorithm approach. J. Qual. Technol. 2004, 36, 263–279. [Google Scholar] [CrossRef]
  41. Zhang, Y. Bayesian D-Optimal Design for Generalized Linear Models. Ph.D. Thesis, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA, 2006. [Google Scholar]
  42. Boer, E.; Hendrix, E. Global optimization problems in optimal design of experiments in regression models. J. Glob. Optim. 2000, 18, 385–398. [Google Scholar] [CrossRef]
  43. Duarte, B.P.M.; Wong, W.K.; Oliveira, N.M.C. Model-based optimal design of experiments—Semidefinite and nonlinear programming formulations. Chemom. Intell. Lab. Syst. 2016, 151, 153–163. [Google Scholar] [CrossRef] [PubMed]
  44. Englezos, P.; Kalogerakis, N.; Bishnoi, P. Simultaneous regression of binary VLE and VLLE data. Fluid Phase Equilibria 1990, 61, 1–15. [Google Scholar] [CrossRef]
  45. Treybal, R.E. Liquid Extraction, 2nd ed.; MacGraw-Hill: New York, NY, USA, 1963. [Google Scholar]
  46. Poling, B.E.; Prausnitz, J.M.; O’Connell, J.P. The Properties of Gases and Liquids; McGraw-Hill: New York, NY, USA, 2001. [Google Scholar]
  47. Renon, H.; Prausnitz, J.M. Local compositions in thermodynamic excess functions for liquid mixtures. AIChE J. 1968, 14, 135–144. [Google Scholar] [CrossRef]
  48. Abrams, D.S.; Prausnitz, J.M. Statistical thermodynamics of liquid mixtures: A new expression for the excess Gibbs energy of partly or completely miscible systems. AIChE J. 1975, 21, 116–128. [Google Scholar] [CrossRef]
  49. Fredenslund, A.; Jones, R.L.; Prausnitz, J.M. Group-contribution estimation of activity coefficients in nonideal liquid mixtures. AIChE J. 1975, 21, 1086–1099. [Google Scholar] [CrossRef]
  50. Rappel, R.; Góis, L.M.N.; Mattedi, S. Liquid–liquid equilibria data for systems containing aromatic + nonaromatic + sulfolane at 308.15 and 323.15 K. Fluid Phase Equilibria 2002, 202, 263–276. [Google Scholar] [CrossRef]
  51. Chernoff, H. Locally optimal designs for estimating parameters. Ann. Math. Stat. 1953, 24, 586–602. [Google Scholar] [CrossRef]
  52. Atkinson, A.C.; Donev, A.N.; Tobias, R.D. Optimum Experimental Designs, with SAS; Oxford University Press: Oxford, UK, 2007. [Google Scholar]
  53. Fedorov, V.V. Convex design theory. Math. Operationsforsch. Statist. Ser. Statist. 1980, 11, 403–413. [Google Scholar]
  54. Pukelsheim, F.; Rieder, S. Efficient rounding of approximate designs. Biometrika 1992, 79, 763–770. [Google Scholar] [CrossRef]
  55. Draper, N.R.; Hunter, W.G. Design of experiments for parameter estimation in multiresponse situations. Biometrika 1966, 53, 525–533. [Google Scholar] [CrossRef]
  56. Wong, W.K.; Yin, Y.; Zhou, J. Optimal designs for multi-response nonlinear regression models with several factors via Semidefinite Programming. J. Comput. Graph. Stat. 2019, 28, 61–73. [Google Scholar] [CrossRef] [PubMed]
  57. Kiefer, J.; Wolfowitz, J. The equivalence of two extremum problem. Can. J. Math. 1960, 12, 363–366. [Google Scholar] [CrossRef]
  58. Helton, J.W.; Vinnikov, V. Linear matrix inequality representation of sets. Comm. Pure Appl. Math. 2007, 60, 654–674. [Google Scholar] [CrossRef]
  59. Ben-Tal, A.; Nemirovski, A.S. Lectures on Modern Convex Optimization: Analysis, Algorithms, and Engineering Applications; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2001. [Google Scholar]
  60. Boyd, S.; Vandenberghe, L. Convex Optimization; University Press: Cambridge, UK, 2004. [Google Scholar]
  61. Sagnol, G. On the semidefinite representation of real functions applied to symmetric matrices. Linear Algebra Appl. 2013, 439, 2829–2843. [Google Scholar] [CrossRef]
  62. Shahmohammadi, A.; McAuley, K.B. Sequential model-based A-optimal design of experiments when the Fisher Information Matrix is noninvertible. Ind. Eng. Chem. Res. 2019, 58, 1244–1261. [Google Scholar] [CrossRef]
  63. Kelley, C.T. Iterative Methods for Linear and Nonlinear Equations; Frontiers in Applied Mathematics series; Number 16; SIAM: Philadelphia, PA, USA, 1995. [Google Scholar]
  64. Bischof, C.H.; Bücker, H.M.; Lang, B.; Rasch, A.; Vehreschild, A. Combining Source Transformation and Operator Overloading Techniques to Compute Derivatives for Matlab Programs. In Proceedings of the Second IEEE International Workshop on Source Code Analysis and Manipulation (SCAM 2002), Montreal, QC, Canada, 1 October 2002; IEEE Computer Society: Los Alamitos, CA, USA, 2002; pp. 65–72. [Google Scholar]
  65. Grant, M.; Boyd, S.; Ye, Y. cvx Users Guide for cvx, version 1.22; CVX Research, Inc.: Austin, TX, USA, 2012. [Google Scholar]
  66. Sagnol, G. PICOS, a Python Interface to Conic Optimization Solvers; Technical Report 12-48; ZIB: Berlin, Germany, 2012. [Google Scholar]
  67. Sturm, J. Using SeDuMi 1.02, a Matlab toolbox for optimization oversymmetric cones. Optim. Methods Softw. 1999, 11, 625–653. [Google Scholar] [CrossRef]
  68. Andersen, E.; Jensen, B.; Jensen, J.; Sandvik, R.; Worsøe, U. MOSEK, version 6; Technical Report TR–2009–3; MOSEK: Copenhagen, Denmark, 2009. [Google Scholar]
  69. Ye, Y. Interior Point Algorithms: Theory and Analysis; John Wiley & Sons: New York, NY, USA, 1997. [Google Scholar]
Figure 1. D-optimal design of experiments for system WAT-TOL-ANL: (a) candidate initial mixtures and corresponding tie-lines; and (b) tie-lines corresponding to D-optimal design (symbols: o, Phase 1 composition; *, Phase 2 composition; ∇, initial mixture composition; tie lines in red).
Figure 1. D-optimal design of experiments for system WAT-TOL-ANL: (a) candidate initial mixtures and corresponding tie-lines; and (b) tie-lines corresponding to D-optimal design (symbols: o, Phase 1 composition; *, Phase 2 composition; ∇, initial mixture composition; tie lines in red).
Processes 07 00834 g001
Figure 2. D-optimal designs for system WAT-TCE–KET: (a) candidate initial mixtures and corresponding tie-lines; and (b) tie-lines corresponding to D-optimal design (symbols: o, Phase 1 composition; *, Phase 2 composition; ∇, initial mixture composition; tie lines in red).
Figure 2. D-optimal designs for system WAT-TCE–KET: (a) candidate initial mixtures and corresponding tie-lines; and (b) tie-lines corresponding to D-optimal design (symbols: o, Phase 1 composition; *, Phase 2 composition; ∇, initial mixture composition; tie lines in red).
Processes 07 00834 g002
Table 1. Optimal experimental designs for ternary system WAT-TOL-ANL (initial mixtures).
Table 1. Optimal experimental designs for ternary system WAT-TOL-ANL (initial mixtures).
CriterionOptimal Design (Initial Composition)OptimumCPU (s)
D-optimal ( 0.4500 0.5500 0.7500 0.8500 0.9500 0.0500 0.0500 0.0500 0.0500 0.0500 0.5000 0.4000 0.2000 0.1000 0.0000 0.1895 0.1858 0.2187 0.2410 0.1650 ) 2.30 × 10 3 32.23
A-optimal ( 0.4500 0.5500 0.7500 0.7500 0.8500 0.9500 0.0500 0.0500 0.0500 0.1500 0.0500 0.0500 0.5000 0.4000 0.2000 0.1000 0.1000 0.0000 0.0871 0.1993 0.3798 0.1964 0.0678 0.0697 ) 1.675 × 10 5 32.69
E-optimal ( 0.4500 0.5500 0.7500 0.7500 0.8500 0.9500 0.0500 0.0500 0.0500 0.1500 0.0500 0.0500 0.5000 0.4000 0.2000 0.1000 0.1000 0.0000 0.0827 0.2471 0.4285 0.1858 0.0328 0.0231 ) 1.1 × 10 5 32.06
Table 2. Optimal experimental designs for ternary system WAT-TOL-ANL (compositions in equilibrium phases).
Table 2. Optimal experimental designs for ternary system WAT-TOL-ANL (compositions in equilibrium phases).
CriterionEquilibrium Compositions in Separating Phases
D-optimal ( 0.6640 0.7427 0.8686 0.9325 0.9994 0.0148 0.0096 0.0039 0.0019 0.0006 0.3212 0.2477 0.1275 0.0656 0.0000 0.4448 0.3482 0.1707 0.0782 0.0081 0.0508 0.0923 0.2752 0.5003 0.9919 0.5044 0.5595 0.5541 0.4215 0.0000 )
A-optimal ( 0.6640 0.7427 0.8686 0.9596 0.9325 0.9994 0.0148 0.0096 0.0039 0.0013 0.0019 0.0006 0.3212 0.2477 0.1275 0.0392 0.0656 0.0000 0.4448 0.3482 0.1707 0.0439 0.0782 0.0081 0.0508 0.0923 0.2752 0.6511 0.5003 0.9919 0.5044 0.5595 0.5541 0.3050 0.4215 0.0000 )
E-optimal ( 0.6640 0.7427 0.8686 0.9596 0.9325 0.9994 0.0148 0.0096 0.0039 0.0013 0.0019 0.0006 0.3212 0.2477 0.1275 0.0392 0.0656 0.0000 0.4448 0.3482 0.1707 0.0439 0.0782 0.0081 0.0508 0.0923 0.2752 0.6511 0.5003 0.9919 0.5044 0.5595 0.5541 0.3050 0.4215 0.0000 )
Table 3. Optimal experimental designs for ternary system WAT-TCE–KET (initial mixtures).
Table 3. Optimal experimental designs for ternary system WAT-TCE–KET (initial mixtures).
CriterionOptimal Design (Initial Composition)OptimumCPU ( s )
D-optimal ( 0.1500 0.4500 0.5500 0.7500 0.7500 0.8500 0.9500 0.0500 0.4500 0.3500 0.0500 0.1500 0.0500 0.0500 0.8000 0.1000 0.1000 0.2000 0.1000 0.1000 0.0000 0.1164 0.0078 0.1014 0.2194 0.1747 0.2134 0.1669 ) 1.75 × 10 4 15.45
A-optimal ( 0.1500 0.7500 0.7500 0.9500 0.0500 0.0500 0.1500 0.0500 0.8000 0.2000 0.1000 0.0000 0.1886 0.1834 0.3946 0.2333 ) 1.16 × 10 7 17.58
E-optimal ( 0.0500 0.1500 0.1500 0.2500 0.3500 0.4500 0.7500 0.7500 0.9500 0.8500 0.0500 0.7500 0.6500 0.5500 0.4500 0.0500 0.1500 0.0500 0.1000 0.8000 0.1000 0.1000 0.1000 0.1000 0.2000 0.1000 0.0000 0.0136 0.0726 0.0301 0.1355 0.3421 0.0055 0.0109 0.2791 0.1105 ) 9.4 × 10 7 14.63
Table 4. Optimal experimental designs for ternary system WAT-TCE–KET (compositions in equilibrium phases).
Table 4. Optimal experimental designs for ternary system WAT-TCE–KET (compositions in equilibrium phases).
CriterionEquilibrium Compositions in Separating Phases
D-optimal ( 0.9935 0.9970 0.9967 0.9944 0.9960 0.9951 0.9999 0.0000 0.0001 0.0001 0.0001 0.0001 0.0001 0.0001 0.0065 0.0029 0.0032 0.0055 0.0039 0.0048 0.0000 0.1348 0.0095 0.0119 0.0989 0.0277 0.0691 0.0025 0.0509 0.8123 0.7715 0.1830 0.5901 0.3187 0.9975 0.8143 0.1782 0.2166 0.7181 0.3822 0.6122 0.0000 )
A-optimal ( 0.9935 0.9944 0.9960 0.9999 0.0000 0.0001 0.0001 0.0001 0.0065 0.0055 0.0039 0.0000 0.1348 0.0989 0.0277 0.0025 0.0509 0.1830 0.5901 0.9975 0.8143 0.7181 0.3822 0.0000 )
E-optimal ( 0.9977 0.9935 0.9975 0.9974 0.9972 0.9970 0.9944 0.9960 0.9999 0.0001 0.0000 0.0001 0.0001 0.0001 0.0001 0.0001 0.0001 0.0001 0.0022 0.0065 0.0024 0.0025 0.0027 0.0029 0.0055 0.0039 0.0000 0.0058 0.1348 0.0063 0.0070 0.0080 0.0095 0.0989 0.0277 0.0025 0.8896 0.0509 0.8771 0.8613 0.8406 0.8123 0.1830 0.5901 0.9975 0.1046 0.8143 0.1166 0.1317 0.1514 0.1782 0.7181 0.3822 0.0000 )
Table 5. Optimal designs for ternary system WAT-TOL-ANL and θ including nine parameters (initial mixtures).
Table 5. Optimal designs for ternary system WAT-TOL-ANL and θ including nine parameters (initial mixtures).
CriterionOptimal Design (Initial Composition)OptimumCPU ( s )
D-optimal ( 0.4500 0.5500 0.6500 0.7500 0.7500 0.8500 0.9500 0.0500 0.0500 0.0500 0.0500 0.1500 0.0500 0.0500 0.5000 0.4000 0.3000 0.2000 0.1000 0.1000 0.0000 0.1734 0.1833 0.0068 0.2091 0.1014 0.1922 0.1338 ) 1.02 × 10 3 32.50
A-optimal ( 0.3500 0.4500 0.4500 0.5500 0.6500 0.7500 0.7500 0.8500 0.9500 0.1500 0.0500 0.1500 0.0500 0.0500 0.0500 0.1500 0.0500 0.0500 0.5000 0.5000 0.4000 0.4000 0.3000 0.2000 0.1000 0.1000 0.0000 0.0527 0.1684 0.0447 0.1575 0.0753 0.1816 0.0819 0.1320 0.1059 ) 5.837 × 10 5 33.38
E-optimal Processes 07 00834 i001 2.5 × 10 4 33.59
Table 6. Optimal designs for ternary system WAT-TOL-ANL and θ including nine parameters (compositions in equilibrium phases).
Table 6. Optimal designs for ternary system WAT-TOL-ANL and θ including nine parameters (compositions in equilibrium phases).
CriterionEquilibrium Compositions in Separating Phases
D-optimal ( 0.6640 0.7427 0.8067 0.8686 0.9596 0.9325 0.9994 0.0148 0.0096 0.0064 0.0039 0.0013 0.0019 0.0006 0.3212 0.2477 0.1869 0.1275 0.0392 0.0656 0.0000 0.4448 0.3482 0.2610 0.1707 0.0439 0.0782 0.0081 0.0509 0.0923 0.1582 0.2752 0.6511 0.5003 0.9919 0.5044 0.5595 0.5808 0.5541 0.3050 0.4215 0.0000 )
A-optimal ( 0.8244 0.6640 0.8550 0.7427 0.8067 0.8686 0.9596 0.9325 0.9994 0.0056 0.0148 0.0044 0.0096 0.0064 0.0039 0.0013 0.0019 0.0006 0.1699 0.3212 0.1406 0.2477 0.1869 0.1275 0.0392 0.0656 0.0000 0.2356 0.4448 0.1908 0.3482 0.2610 0.1707 0.0439 0.0782 0.0081 0.1848 0.0509 0.2432 0.0923 0.1582 0.2752 0.6511 0.5003 0.9919 0.5796 0.5044 0.5660 0.5595 0.5808 0.5541 0.3050 0.4215 0.0000 )
E-optimal Processes 07 00834 i002

Share and Cite

MDPI and ACS Style

Duarte, B.P.M.; Atkinson, A.C.; Granjo, J.F.O.; Oliveira, N.M.C. Optimal Design of Experiments for Liquid–Liquid Equilibria Characterization via Semidefinite Programming. Processes 2019, 7, 834. https://doi.org/10.3390/pr7110834

AMA Style

Duarte BPM, Atkinson AC, Granjo JFO, Oliveira NMC. Optimal Design of Experiments for Liquid–Liquid Equilibria Characterization via Semidefinite Programming. Processes. 2019; 7(11):834. https://doi.org/10.3390/pr7110834

Chicago/Turabian Style

Duarte, Belmiro P.M., Anthony C. Atkinson, José F.O. Granjo, and Nuno M.C. Oliveira. 2019. "Optimal Design of Experiments for Liquid–Liquid Equilibria Characterization via Semidefinite Programming" Processes 7, no. 11: 834. https://doi.org/10.3390/pr7110834

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop