Next Article in Journal
Advances in Biosensors Technology for Detection and Characterization of Extracellular Vesicles
Next Article in Special Issue
Recreating the Motion Trajectory of a System of Articulated Rigid Bodies on the Basis of Incomplete Measurement Information and Unsupervised Learning
Previous Article in Journal
A Wearable Soft Fabric Sleeve for Upper Limb Augmentation
Previous Article in Special Issue
COVID-19 Case Recognition from Chest CT Images by Deep Learning, Entropy-Controlled Firefly Optimization, and Parallel Feature Fusion
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Study on Multi-Model Soft Sensor Modeling Method and Its Model Optimization for the Fermentation Process of Pichia pastoris

Key Laboratory of Agricultural Measurement and Control Technology and Equipment for Mechanical Industrial Facilities, School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, China
*
Author to whom correspondence should be addressed.
Sensors 2021, 21(22), 7635; https://doi.org/10.3390/s21227635
Submission received: 28 September 2021 / Revised: 10 November 2021 / Accepted: 11 November 2021 / Published: 17 November 2021
(This article belongs to the Collection Artificial Intelligence in Sensors Technology)

Abstract

:
The problems that the key biomass variables in Pichia pastoris fermentation process are difficult measure in real time; this paper mainly proposes a multi-model soft sensor modeling method based on the piecewise affine (PWA) modeling method, which is optimized by particle swarm optimization (PSO) with an improved compression factor (ICF). Firstly, the false nearest neighbor method was used to determine the order of the PWA model. Secondly, the ICF-PSO algorithm was proposed to cooperatively optimize the number of PWA models and the parameters of each local model. Finally, a least squares support vector machine was adopted to determine the scope of action of each local model. Simulation results show that the proposed ICF-PSO-PWA multi-model soft sensor modeling method accurately approximated the nonlinear features of Pichia pastoris fermentation, and the model prediction accuracy is improved by 4.4884% compared with the weighted least squares vector regression model optimized by PSO.

1. Introduction

An expression system is a molecular biology technique that uses model organisms such as bacteria, yeast, animal cells or plant cells to express exogenous gene proteins. In short, the expression process is the synthesis of proteins under the guidance of genes. The Pichia pastoris expression system is a technically mature eukaryotic expression system, and one of the most successful foreign protein expression systems [1]. The expression system of Pichia pastoris has been developed for more than 30 years, and so far, nearly 1000 heterologous proteins have been successfully expressed by this system, which has a promising development [2]. In addition, the low cost but high yield of the Pichia pastoris expression system makes it economically valuable. Pichia pastoris is also a facultative anaerobe, which is very easy to carry out genetic manipulation and cultivation on [3]. The Pichia pastoris expression system has obvious advantages in the processing, external separation, post-translational modification, and glycosylation modification of expression products [4]. Nowadays, many regions are being plagued by the Coronavirus (COVID-19). In order to fight COVID-19, the recombinant expression of proteinase K in Pichia pastoris has attracted widespread attention [5,6]. Protease K has drawn attention for its following capabilities: cracking the COVID-19 virus to release nucleic acid, eliminating ribonuclease (RNase) to prevent ribonucleic acid (RNA) degradation, and inactivating the COVID-19 virus to denature the virus protein [7]. Therefore, protease K becomes an important component of nucleic acid detection kit for sample pretreatment, and plays a significant role in COVID-19 detection [8]. However, the fermentation process of recombinant expression of protease K by Pichia pastoris is nonlinear, coupled, uncertain, and time-varying [9]. There are few online detection instruments for the key biomass variables of the Pichia pastoris fermentation process (such as cell concentration, protease K concentration, etc.) [10] As for the offline assay and analysis methods, the long interval of data acquisition drags down the real-time performance, and adds to the risk of flora pollution. Moreover, the assay results are often inaccurate, due to the large errors of human operation [11]. With the development of the times, biosensors that can detect biomass on-line in real time emerge as the times require. Biosensors have the advantages of high sensitivity and a fast analysis speed; however, a highly automated, miniaturized and integrated biosensor is often very expensive [12].
The above problems give rise to the soft sensor technology. The basic idea of this technology is to model the relationship between target and auxiliary variables, and thus estimate the target variables indirectly, without causing any pollution to flora. The Soft sensor technology is powerful and cost-effective, and it can work in harmony with other software and hardware, which means that it is highly compatible [13]. However, in most cases, it is difficult for a single global soft sensor model to adequately and quickly characterize the process of a complex object. Furthermore, a single global soft sensor model is usually complex in structure and requires a lot of time to solve it. Wang and Han [14] proposed a single soft sensor model based on a recurrent wavelet neural network and Gaussian process regression methods for online prediction of whether Chlortetracycline fermentation broth is contaminated by non-target bacteria. Although the experimental results show that the proposed soft sensor model can be used to predict the occurrence of pollution during the Chlortetracycline fermentation process, and the effectiveness of the modeling method is verified based on field data, the Chlortetracycline fermentation process is a very complex and non-linear process, and most microbial fermentation processes including Chlortetracycline can be divided into four stages: adjustment, logarithmic growth, stabilization and decay phases. During these four phases, the process characteristics of Chlortetracycline are completely different, especially during the exponential growth phase, when the fermentation process is most intense. Therefore, it is impractical to try to describe the complete fermentation process by a single soft sensor model. Although the soft sensor model proposed in this paper showed good prediction results for the Chlortetracycline fermentation process, the model has limitations and cannot be extended to soft sensor modelling of other microbial fermentation processes, i.e., it is not generalizable.
Therefore, in order to address the drawbacks of a single soft sensor model, multi-model soft sensor modeling strategies have attracted a great deal of scholarly attention [15,16,17]. Scholars have found that using the divide and conquer modeling strategy, building a multi-local soft sensor model can greatly simplify the model structure, save calculation time, and fully mine the information of each sample, hence multi-local soft sensor modeling methods are becoming an inevitable trend in the development of soft sensor technology in the future [18]. As a result, it is highly practical to build a multi-local soft sensor model that supports the real-time online measurement of key biomass variables in Pichia pastoris fermentation, as well as the efficient (protease K yield) and high-quality generation of products (protease K quality).
Following the above ideas, Zhang and Cai [19] presented a multi-model soft sensor modeling method based on the improved locally weighted partial least squares (LWPLS), and adopted the soft sensor model to predict the key biomass variables in the fermentation process of alkaline protease. Their approach copes well with the multi-stage and large lag features of microbial fermentation, and realizes good prediction accuracy. However, it is difficult to identify the structural and local parameters of the extremely complex model, and it is time-consuming to complete the heavy calculations. Hence, their approach is not applicable to a wide range of fermentation processes.
In order to avoid the problems mentioned above, this paper will apply a multi-model modeling method called piecewise affine (PWA), which is also called piecewise linear (PWL). Based on the idea of multi-model “decomposition-synthesis”, the PWA/PWL multi-model divides the sample space of the system into a finite number of sub-intervals, and each sub-interval is described by a PWA/PWL local model, which transforms the problem of solving a complex nonlinear system into the problem of solving several linear systems [20]. The advantages of the PWA multi-model are that the model structure is very simple; after all, the structure of linear systems is far less complex than that of nonlinear systems, and PWA multi-model is less time consuming than other multi-models because the time to solve a linear system is much less than that to solve a nonlinear system [21,22]. These advantages have made PWA systems a hot research topic in the field of modeling, and they are widely used in many engineering practices. Sun and Wu [23] applied a PWA model to speed regulation in longitudinal dynamics of intelligent vehicles. Muhammad and Michael [24] applied a PWA modeling method to optimal planning of thermal energy systems for microgrids. Sindareh-Esfahani [25] identified a discrete-time PWA model of a wind turbine during Maximum Power Point Tracking region. Mattsson and Zachariah [26] applied the PWA modeling method to the identification of discrete-time nonlinear dynamic models of cascade tanks. All these cases proved the effectiveness and feasibility of the PWA modeling method. Therefore, in this paper, a PWA modeling method was applied to soft sensor modeling of key variables in the fermentation process of Pichia pastoris.
In order to improve the prediction accuracy of the PWA multi-model, two model parameters must be optimized, i.e., the number of local models and the parameters of each local model. Too many local models will lead to complex models with a large computational effort and huge dimensionality of local model parameters while too few local models will lead to too simple models with poor prediction performance. In recent years, the research on intelligent optimization algorithms and its applications in model parameter optimization has been very active and has achieved encouraging results [27,28,29]. Inspired by the foraging behavior of birds, a PSO algorithm as an intelligent swarm optimization algorithm has been successfully applied to the optimization of model parameters [30,31,32]. The PSO algorithm has no crossover and variational operations, relies on particle velocity to complete the search, and only the optimal particle passes the information to other particles in the iterative evolution, which makes the search speed very fast. The PSO algorithm requires fewer parameters to be adjusted and has a simple structure, which is an effective method to solve practical engineering problems. Many numerical examples show that PSO is better than Differential Evolution (DE), Genetic Algorithm (GA) and Ant Colony Algorithm (ACO) in terms of search efficiency and convergence speed of the global optimal solution [33,34,35]. This is why this algorithm is used in this paper to optimize the number of local models and the parameters of each local model. However, the PSO algorithm also has some drawbacks, such as the lack of dynamic adjustment of the speed, which is easy to fall into local optimum and requires the selection of appropriate parameters for different problems to achieve the optimal results. Therefore, it is essential to improve the PSO algorithm before the PSO algorithm is used to optimize the number of local models and the parameters of each local model. The reason why the PSO algorithm easily falls into local optimum is that the particle velocity cannot be dynamically adjusted, so the particle velocity must be constrained. The most effective way to constrain the particle velocity is to introduce a compression factor into the particle velocity update formula [36]. However, the traditional compression factor has a fixed form and poor self-adaptability, which makes it difficult to achieve the desired constraint effect [37]. Therefore, this paper will study an improved compression factor (ICF) that can make the compression factor change dynamically as the number of PSO iterations increases [38]. Hence, this study eventually uses this ICF-PSO algorithm to optimize the number of local models and the parameters of each local model. The selection of specific parameters and performance analysis of the ICF-PSO algorithm will be analyzed in detail in the following. The complete PWA modeling process requires not only the optimization of the parameters of the PWA multi-model, but also the determination of the order of the PWA multi-model and the determination of dividing surfaces where each sub-model plays a role, all of which will be explained in detail in the following. To verify the feasibility of the ICF-PSO-PWA multi-model, this study obtained real and accurate original sample data through real fermentation experiments. Supported by these sample data, this study demonstrated the feasibility of applying the PWA multi-model to the Pichia pastoris fermentation process through MATLAB simulations, and compared the ICF-PSO-PWA multi-model with the weighted least squares support vector regression model optimized by standard PSO algorithm (PSO-WLSSVR). The simulation results showed that the ICF-PSO-PWA multi-model can effectively predict the key biological variables of the Pichia pastoris fermentation process online, and it outperformed the PSO-WLSSVR model.
In summary, the purpose of this study can be summarized into two aspects. The primary study focus is to construct and solve a PWA soft sensor multi-model based on the fermentation process of Pichia pastoris, and to demonstrate the effectiveness of the PWA soft sensor multi-model through simulations. The secondary study focus is to propose a novel ICF-PSO optimization algorithm to optimize the PWA soft sensor multi-model, which not only serves the primary study focus but also is an innovative point of this paper.
The rest of the paper is structured as follows: Section 2 consists of materials and methods, which explains PWA model basics, determination of the PWA multi-model order, proposed ICF-PSO algorithm, determination of dividing surfaces and experimental work. Section 3 includes results and discussion. The paper is concluded in Section 4.

2. Materials and Methods

2.1. Structure Diagram of Overall Modeling Algorithm

In order to clarify the components of the prediction algorithm used in this study and their respective functions, this paper first constructs a diagram of the overall modeling algorithm structure here. The functions of each component are then explained in detail in the following sections of this paper. The overall modeling algorithm structure is shown in Figure 1.

2.2. Description of PWA Modeling Algorithm

The existing sample data evidently testifies the nonlinearity of Pichia pastoris fermentation. To facilitate the PWA modeling, the Pichia pastoris fermentation (Multi-input Multi-output Nonlinear System) was decomposed into several multiple input single output (MISO) discrete time systems, each of which can be described as a piecewise auto-regressive exogenous (PWARX) model [39]:
y ( k ) = f ( X ( k ) ) + ε ( k )
where, y ( k ) R is the system output; ε ( k ) R is the error term.
The PWA mapping f ( ) can be defined as:
f ( X ( k ) ) = { [ X T ( K ) , 1 ] θ 1 X Ω 1 [ X T ( K ) , 1 ] θ 2 X Ω 2 [ X T ( K ) , 1 ] θ s X Ω s
where, X(k)∈Rn is the input vector, which consists of the output and input of the system in the past moment:
X ( k ) = [ y ( k 1 ) , , y ( k n y ) , u   1 ( k p 1 ) , , u   r ( k p 1 ) , , u   1 ( k p n   u ) , , u   r ( k p n   u ) ]   T
where, k is a time series; y R is the system output; u R r is the system input; p is the system delay; ny and nu are the orders of the model (n = ny + nu); S is the number of models; θ = [ θ 1 , θ 2 , , θ S ] T R ( n y + n u r + 1 ) × S is the parameter vector of each linear local model of the PWA system.
The bounded input space ΩN can be decomposed into S closed polyhedral regions { Ω i } i = 1 S . Assuming that these closed polyhedral regions have no overlapping parts except the common boundary:
{ i S Ω i = Ω N R n Ω i Ω j =       i j
where, Ωi is a closed polyhedron: Ω i = { X R n | H i [ X T , 1 ] T 0 } , with H i R q i which determines the boundary of a closed polyhedron Ωi, i.e., the dividing surface dividing Ω i from other polyhedrons (qi is the number of linear inequalities defining the i-th polyhedron region), and being an inequality:
If   H i = [ h i 1 h i 2 ] h i 1 R n + 1     and   h i 2 R n + 1
Then   H i [ X T , 1 ] T 0 { h i , 1 [ X T , 1 ] T 0 h i , 2 [ X T , 1 ] T 0
Suppose the PWA model is unknown, and the modeling space consists of N datasets on the fermentation process:
Z N = { z ( k ) } k = 1 N     and   z ( k ) = [ X T ( k ) , y ( k ) ] T
where, ZNR(n + 1)N. Then, the identification problem of the PWA aims to reconstruct the PWA mapping to characterize all the features of the original system, that is, to solve the following optimization problem:
min J 1 = min k [ y ( k ) y ( k ) ] 2 = min z ( k ) Z N [ y ( k ) f ( X ( k ) , θ i ) ] 2
where, y ( k ) R is the output predicted based on the input vector X(k) at time k; f ( , ) R is the local PWA model of the current work space:
Let   δ i = { 1 X ( k ) Ω i 0 X ( k ) Ω i   Ω i Ω N
Then, Equation (2) can be represented as:
f ( X ( k ) , θ i ) = i = 1 S [ X T ( k ) , 1 ] T θ i δ i   and   X ( k ) Ω N
Therefore, the solution problem of MISO nonlinear system in each fermentation process can be transformed into a mixed integer quadratic programming (MIQP) problem [40]:
min n , θ , S J 1 = min n , θ , S i = 1 N [ y ( k ) f ( X ( k ) , θ i ) ] 2 s . t .   f ( X ( k ) , θ i ) = i = 1 S [ X T ( k ) , 1 ] T θ i δ i
where, δ i = { 1 X ( k ) Ω   i 0 X ( k ) Ω   i   , y ( k ) = f ( X ( k ) ) , X ( k ) Ω N and [ X T ( k ) , y ( k ) ] T Z N . It is not hard to see from the objective function J1 that MIQP only identifies θi. However, in order to identify θi, several prerequisites are needed, namely, solving for the order n of the input vector X(k) and determining the number of local models S. The problem of identifying n and S is a problem of identifying two hyperparameters, and in this study the optimal n and S will be solved by different algorithms respectively. When S and n are known, the objective function J1 becomes an easier-to-solve MIQP problem containing only one decision variable θi. It is only necessary to find the optimal θi by the corresponding optimization algorithm for J1. After all the above parameters are determined, the range Ωi in which the local model works can be easily determined by just the corresponding classification algorithm.
Without sufficient prior information, however, it is usually difficult to determine the number of models or the parameters of each local model. Therefore, this paper modifies the goal of optimization modeling from maximizing model accuracy by single optimization to maximizing the approximation degree of the model while minimizing the number of local models. That is, the single-objective Problem (6) was transformed into a multi-objective optimization problem:
min { J 1 , J 2 }
where, J 2 = S is an integer representing the number of local models; J 1 is given by Equation (8), reflecting the approximation of the model. The value of J 1 is negatively correlated with the approximation degree of the model. Thus, the goal of maximizing the approximation degree of the model is equivalent to minimizing the objective function J 1 .

2.3. Determining Model Order

In the absence of sufficient prior information, it is a complex issue to determine the appropriate order of the PWA model. The topological delay embedding theorem [41] suggests that a nonlinear system cannot be completely modeled if the final model has a too-small order; if the model order is too large, the modeling can be complete in theory, but the actual modeling will require heavy computations, and face lots of system noises and a large rounding error [42].
The false nearest neighbor (FNN) method is suitable for determining the embedding dimension in the state space reconstruction of nonlinear time series. The core idea of this method is to iteratively determine the false and real neighboring points of a certain trajectory xn, while increasing the embedding dimension in each iteration. The iterative process terminates when the number of false neighbors is zero, and outputs the minimum required embedding dimension. From the geometric perspective, the FNN is a very simple and easy approach to determine the embedding dimension [43]. Hence, this paper adopts this method to calculate the order of the PWA multi-model.
Step 1. Determining the false nearest neighbor
Considering the input/output data on fermentation, construct the following n-dimensional regression vector:
X ( k ) = [ y ( k 1 ) , , y ( k n y ) , u   1 ( k p 1 ) , , u   r ( k p 1 ) , , u   1 ( k p n   u ) , , u   r ( k p n   u ) ]   T
Rewrite the regression vector X(n)(k) as:
x p = [ x p 1 , x p 2 , , x p n y , x p ( n y + 1 ) , , x p ( n y + n u ) , , x p ( n y + n u r ) ] T
p = N 0 , N 0 + 1 , , N
where, N 0 = n + 1 .
Define the distance from xi to xj in Rn by L norm:
x i x j = max | x i + 1 x j + 1 |
Let xp be the nearest point of xη(p):
x η ( p ) x p ( n ) = min j = N 0 , , N ; j n x p x j = min j = N 0 , , N ; j n max   0 l n 1 | x p + l x j + l |
Then, N0η(p) ≤ N, and η(p) is related to n, that is, η(p) is related to ny and nu.
As the embedding dimension changes from n to n + 1, the distance from xi to xj becomes: x η ( p ) x p ( n + 1 ) = min j = N 0 , , N ; j n x p x j = min j = N 0 , , N ; j n max   0 l n 1 | x p + l x j + l | . If x η ( p ) x p ( n ) x η ( p ) x p ( n + 1 ) , namely, two non-adjacent points become two adjacent points when projected on the low trajectory, such adjacent points are the false neighbors.
Specifically, xη(p) is considered as the false nearest neighbor of xp if it satisfies:
| x η ( p ) x p ( n + 1 ) x η ( p ) x p ( n ) x η ( p ) x p ( n + 1 ) | R T
where, RT is a threshold value, R T [ 10 , 50 ] .
With the increase of dimension n, the ratio of false neighbors δ decreases, where δ is the ratio of false neighbors to total neighbors. When δ is less than a certain threshold δ0 or δ no longer changes with the change of n, then the geometric structure of the state space has been completely opened, and the dimension n at this time is the embedding dimension.
Step 2. Calculating the minimum embedding dimension
Firstly, define the variable α(p,n):
α ( p , n ) = x η ( p ) x p ( n + 1 ) x η ( p ) x p ( n )
Then, define the mean value of α(p,n) with respect to n as:
E ( n ) = 1 N N 0 + 1 n = N 0 N α ( p , n )
To capture the change of embedding dimension from n to n + 1, redefine:
E 0 ( n ) = E ( n + 1 ) E ( n )
The minimum embedding dimension of reconstructed state space can be judged according to the change of E0(n). If it is found that when the embedding dimension increases to a certain extent, E0(n) will no longer change or change to a very small extent, then n + 1 is the required minimum embedding dimension. At this time, the corresponding ny and nu are the orders of the corresponding model [44]. After using the false nearest neighbor method, the optimal order of the input vector (nu) and output vector (ny) of the PWA model are 1 and 2 respectively.

2.4. Collaborative Optimization of the Number and Parameters of the Models

In order to improve the prediction performance of PWA multi-models, the number of sub-models and the parameters of each sub-model must be optimized. The PSO is widely adopted for fuzzy system control, neural network training, and function optimization, because it is simple to define and easy to implement, with fast computing speed [45]. Therefore, this paper applies the PSO to determine the number of models and to optimize the parameters of local models in the PWA multi-model, and introduces the ICF to improve the ability of the standard PSO to avoid the local optimum early and perform accurate local search thereafter.

2.4.1. PSO Algorithm

In the standard PSO algorithm, each particle in the swarm is of zero volume in the D-dimensional search space, and is represented by its current position. Each particle flies at a certain velocity in the search space, and keeps adjusting the velocity based on its experience and that of every other particle.
The current velocity and position of the i-th particle can be defined as:
V i = ( v i , 1 , v i , 2 , , v i , D )   and   P i = ( p i , 1 , p i , 2 , , p i , D )
Then, the individual best-known position of the i-th particle and the global best-known position of the swarm can be respectively expressed as:
P i b = ( P i , 1 b , P i , 2 b , , P i , D b )   and   P g b = ( P g , 1 b , P g , 2 b , , P g , D b )
For the generation t + 1, the velocity and position of the i-th particle in the j-th dimension (1 ≤ jD) can be updated by:
v i , j ( t + 1 ) = w v i , j ( t ) + c 1 r 1 , i ( p i , j b ( t ) p i , j ( t ) ) + c 2 r 2 , i ( p g , j b ( t ) p i , j ( t ) )
p i , j ( t + 1 ) = p i , j ( t ) + v i , j ( t + 1 )
where, w is inertia weight; c1 and c2 are acceleration coefficients, both of which are constants; r1,i and r2,i are two random functions that change in the range of [0, 1].
In the position update Equation (21) of the standard PSO, if p i , j = p i , j b = p g , j b , which means that the current position of particle i is the best position it has experienced and also the best position the whole particle population has experienced so far, then the change of v i , j ( t + 1 ) only depends on w v i , j ( t ) . This implies that when the swarm approaches a local optimal solution, its velocity will approach zero. This means that all particles tend to stop moving near the local optimal solution. Thus, the standard PSO would converge prematurely, and get stuck in the local optimal position. However, there are many local optimal solutions for non-convex problems, and the standard PSO will converge to a local optimum very easily. For these reasons, PSO must be improved.

2.4.2. ICF-PSO Algorithm

To overcome premature convergence, this paper improves the PSO into the ICF-PSO. The position update formula was not changed, while the velocity update formula was refined by introducing the ICF μ:
v i , j ( t + 1 ) = μ ( w v i , j ( t ) + c 1 r 1 , i ( p i , j b ( t ) p i , j ( t ) ) + c 2 r 2 , i ( p g , j b ( t ) p i , j ( t ) ) )
where, μ = δ σ G N + G 1 is the compression factor, which both restrains the velocity of particles and enhances the local search ability of the algorithm. Note that δ and σ are constants in μ, both of which are key to the control of the constraint effect of the compression factor; N is the total number of iterations; G is the current number of iterations. The rationale for the improved compression factor is as follows. In this paper, the characteristic curve of μ is made to be a monotonically decreasing curve in the first quadrant as the number of current iterations G increases by determining the appropriate values of δ, σ and N. This curve must contain the following characteristics, that is, the value of μ must vary between 1 and 0; the value of μ must decrease rapidly at the beginning of the iteration, and then the rate of decrease of the value of μ tends to smooth out as the iteration proceeds to the middle and later stages. The compression factor satisfying the above conditions is substituted into the velocity update formula of the standard PSO algorithm, which makes the particle velocity decrease continuously during the iteration process under the constraint of the compression factor. The particle velocity satisfies the following characteristics during the decreasing process, the particle can maintain a fast velocity at the beginning of the iteration, which can make the particle jump out of the local optimum effectively with the fastest speed, but the rate of particle velocity decreasing is fast during this time. As the iteration proceeds to the middle and later stages, the particle velocity becomes small and the particle velocity decreases very little and smoothly at this time. This kind of particle is able to search the global optimum precisely and smoothly, just like the carpet search of radar. The above is the rationale for the improved compression factor μ. This paper will describe in detail how to determine reasonable values of δ, σ and N, in the following subsections.

2.4.3. Determination of Parameter Values in ICF

This paper tries to maximize the efficiency and optimization effect of the compression factor by determining the value range of δ and σ. Firstly, the total number of iterations N was assumed to be 300. Secondly, both δ and σ must be positive numbers, according to mathematical knowledge (otherwise, μ will be meaningless). Finally, the value of the compression factor should decrease with the increase of G, judging by its functional features. Therefore, the value range of δ and σ could only be one of the two cases: (1) δ ( 0 , 1 ) and σ ( 1 , + ) ; (2) δ ( 1 , + ) and σ ( 0 , 1 ) . The variation curves of μ in these two cases are discussed in detail below:
(1)
δ ( 0 , 1 ) and σ ( 1 , + )
Figure 2a shows the curve of μ with the increase of σ at δ = 0.5, and Figure 2b shows the curve of μ with the increase of δ at σ = π. It can be seen that when the value of δ was constant, the constraint performance of ICF μ was better at a larger σ value; when the value of σ was fixed, the smaller the value of δ was, the better the constraint performance was. The values of σ and δ should be considered comprehensively to optimize the constraint performance of the ICF μ. Experimental results show that, in case (1), the constraint performance of μ is the best at δ = 0.1 and σ = 2π (Figure 2c).
(2)
δ ( 1 , + ) and σ ( 0 , 1 )
Figure 2d shows the curve of μ with the increase of σ at δ = 2π, and Figure 2e shows the curve of μ with the increase of δ at σ = 0.5. It can be seen that when the value of δ was unchanged, the constraint performance of ICF μ was better at a smaller σ value; when the value of σ was fixed, the larger the value of δ was, the better the constraint performance was. The values of σ and δ should be considered comprehensively to optimize the constraint performance of the ICF μ. Experimental results show that, in case (2), the constraint performance of μ is the best at δ = 24 and σ = 0.01 (Figure 2f).
To sum up, the compression factor μ has an optimal state in either case, depending on the values of δ and σ. Here, the δ and σ values in compression factor μ are set to 24 and 0.01, respectively.
In fact, the concept of compression factor φ was put forward by Clerc et al. in as early as 1999 [46]:
φ = 2 | 2 ρ ρ 2 4 ρ |     and   ρ = c 1 + c 2
where, ρ > 4 and φ is the compression factor; c1 and c2 are acceleration constants. The problem is that the compression factor is too rigid [47]. The value of φ only depends on the size of c1 and c2, failing to consider the iterative effect. With the elapse of time, such a compression factor is no longer suitable for optimizing complex nonlinear systems. Compared with the original compression factor φ, the improved compression factor μ proposed by the authors can change iteratively, and adapt its constraint effect to the specific optimization task, in the light of the combined effect of factors like δ, σ, N and G.
The details of the performance analysis of the ICF-PSO algorithm are shown in the Appendix A.

2.4.4. ICF-PSO Algorithm Applied to Optimize the Number and Parameters of PWA Local Models

The multi-objective optimization Problem (9) cannot be easily solved by the traditional phased strategy, because of the strong coupling between the two objective functions. Drawing on the idea of non-inferior optimal solution, this paper chooses to optimize objective function J1 by the step-by-step iterative method, in view of the fact that J1 depends heavily on J2 but J2 is independent of J1: First, select the non-optimal solution S that satisfies J2 and determine an optimal search threshold Yts for J1. Then, S is substituted into J1, and J1 is optimized to obtain the optimal solution of the corresponding local model parameter; if the optimized J1 cannot meet the preset threshold Yts, optimize S and re-optimize J1. Repeat these steps until the threshold Yts is satisfied.
To optimize objective function J2, it is critical to reasonably divide the input space ΩN into S subspaces that satisfy conditional Equation (25), so as to determine the number of local models:
i = 1 S M i = N
where, Mi is the number of input vectors contained in the i-th subspace; Ω i ( i = 1 , 2 , , S ) is the i-th input subspace.
After the input space dividing Ωi is obtained, the corresponding sample dataset can be determined by:
Z i = { z ( k ) | X ( k ) Ω i ,   y ( k ) = f ( X ( k ) ) }
where, i = 1 S X i = Z N ; Zi is the sample dataset corresponding to Ωi, which uniquely determines the modeling space of f (X(k)).
When the system model is unknown, y(k)∈R can be replaced by the actually observed output. The difficulty of J2 is that the division of input space cannot be separated from the parameter identification of the local model. The strong coupling between the two objective functions makes it necessary to determine the number of models and estimate the model parameters simultaneously. Therefore, this paper employs the ICF-PSO algorithm, which excels in parallel search and data clustering to optimize the number of models and parameters.
During the optimization, the modeling parameters reflecting each local model are combined with the corresponding cluster center into a particle:
P = { θ 1 , Z 1 , , θ i , Z i , , θ S , Z S }
where, Z i ( i = 1 , 2 , , S ) is the cluster center of the i-th cluster.
By the ICF-PSO algorithm, the study optimized the number of local models and the parameters of each local model at the same time [48]. The detailed steps are as follows:
(1)
Step 1. Initialization
Set the number of non-optimal models S = 1, swarm size NS, inertia weight w, acceleration coefficients c1 and c2, the threshold Yts of objective function J1, maximum allowable number of iterations Nit, initial position Pi and initial velocity of each particle Vi, etc.
(2)
Step 2. Data clustering
Because the PWA system is locally linear, the data close to each other are very likely to fall into the same class. Thus, each local subspace Zi can be divided by the distance between the data and the given cluster center. The specific method is to calculate the norm distance from each modeling space vector z(k) to a given cluster center Z i ; if the modeling space vector is the closest to a given cluster center, it will be assigned to that cluster:
z ( k ) Z i   and   I = arg min k z ( k ) Z i 2
where, i = 1 , 2 , , S .
(3)
Step 3. Initial fitness evaluation
Evaluate the initial fitness of each particle according to the objective function J1.
(4)
Step 4. Evolution of particle swarm
Update the position and velocity of particles according to Equations (22) and (23), respectively.
(5)
Step 5. Fitness update
Cluster the data by Step 2, and update the fitness of each particle by objective function J1.
(6)
Step 6. Termination conditions
(1)
If the maximum number of iterations is not reached and the threshold of the objective function is not met, return to Step 3.
(2)
If the maximum number of iterations is not reached, but the threshold of the objective function is met, terminate the algorithm.
(3)
If the maximum number of iterations is reached and the threshold of the objective function is met, terminate the algorithm.
(4)
If the maximum number of iterations is reached, but the threshold of the objective function is not met, go to Step 7.
(7)
Step 7. Make J1 = S + 1, return to Step 1, and restart the search
The above steps can be organized into the following flow chart in Figure 3.

2.5. Determination of Dividing Surfaces

So far, the sample space has been divided into S local spaces Ωi (i = 1, 2, ···, S). According to the definition of the PWA system in Equation (2), the convex polyhedrons with different scopes have no overlapping parts except the common boundary. Therefore, the problem of finding the optimal dividing surface is directly transformed into the linear division of cluster data. The least square support vector machine (LS-SVM) algorithm was selected to solve the optimal dividing surface between local PWA models. The selection was made because the LS-SVM is a classical classifier to handle small samples, and high-dimensional nonlinear data. The training speed of the model is accelerated, owing to the combination between the standard SVM and the least squares method.

2.5.1. Least Square Support Vector Machine

The core idea of LS-SVM is that in a given sample data set { x k , y k } k = 1 N , in which xkRn and yk∈{−1, 1}. To find the optimal classification hyperplane w T x + b = 0 , that is, to solve the following optimization problems [49], namely
min J ( w , e ) = min w , b , e 1 2 w T w + μ k = 1 N e k 2 s . t . y k [ w T φ ( x k ) + b ] = 1 e k       k = 1 , 2 , , N
where, φ ( g ) : R n R n H is a function that maps input data to a high-dimensional feature space; weight vector w R n H ; error variables and offset values satisfy e k R and b R ; μ > 0 is the weight coefficient, which can make LS-SVM find the optimal hyperplane and ensure the minimum deviation [50].
Introducing Lagrange function into Equation (29), then
L ( w , b , e ; α ) = J ( w , e ) k = 1 N α k { [ y k ( w T φ ( x k ) + b ) 1 + e k ] }
where, α k 0 is Lagrange multiplier; xk is the support vector.
By deriving w, b, ek, αk in the above formula respectively, the following linear equation system about α and b can be obtained:
[ 0 y T y Ω + I / μ ] [ b α ] = [ 0 1 v ]
where, y = [y1, ,yN]; α = [α1, , αN]; 1v = [1, ,1]; Ω is called a kernel matrix, and its expression is
Ω i , j = y i y j φ ( x i ) T φ ( x j ) = y i y j K ( x i , x j )
where, i,j = 1, ,N. A set of α and b can be obtained by solving Equation (31). Finally, the classification hyperplane obtained by LS-SVM is shown below:
f ( x ) = s g n [ k = 1 N α K y k K ( x , x k ) + b ]
where, the kernel function must satisfy Mercer condition. The kernel function of this paper uses RBF kernel function. The expression of the RBF kernel function is shown below.
K ( x i , x j ) = exp ( x i x j 2 σ 2 )
where, σ is the width parameter of the function, which controls the radial range of action of the function.

2.5.2. LS-SVM Applied to Determine the Parameters of Dividing Surfaces

To begin with, a sample space is defined:
  Z i = { z ( k ) | X ( k ) Ω   i ,   y ( k ) = f ( x ( k ) ) } k = 1 M i
where, Z N = i = 1 S Z i .
The LS-SVM determines the dividing surface in two steps: First, find out the adjacent subspaces; Then, determine the parameters of dividing surfaces between adjacent subspaces. The specific steps are as follows:
Step 1. Determine the modeling subspaces adjacent to each other by the nearest neighbor rule of cluster center, namely:
{ Z i , Z j } = min 2 i , j s , i j { Z i Z j 2 }
Step 2. Determine the parameters of the dividing surface between two adjacent subspaces.
According to Equation (4), the dividing surface of two adjacent subspaces can be described as: h i , j = { w i , j T X + v i , j = 0 } . Suppose y(i) = 1 for the sample belonging to Ωi subspace, and y(j) = −1 for the samples belonging to Ωj subspace. Then, kernel matrix Ω in Equation (31) can be transformed by LS-SVM into:
Ω i , j = y i y j φ ( X i ) T φ ( X j ) = y i y j K ( X i , X j )
Without changing other parameters in Equation (31), N groups of α and b can be obtained by solving the new linear equation set. Finally, the dividing surface can be obtained by LS-SVM as:
f ( x ) = s g n [ i = 1 N α i y i K ( X , X i ) + b ]
where, N = Mi + Mj is the total number of vectors in subspace Ωi and subspace Ωj.
Accordingly, the parameters of dividing surfaces can be obtained by:
w i , j = i = 1 N α i y i X i     and   v i , j = y i w i , j T X i
Therefore, the parameters of dividing surfaces can be expressed as h i , j = [ w i , j T , v i , j ] .

2.6. Modeling Steps of the PWA Soft Sensor Model

Let D be the sample dataset of the fermentation process. The sample dataset is a kind of MISO Problem (1). Then, the model identification problem is transformed into an MIQP optimization Problem (8). The optimal solution of the MIQP problem can be obtained by the following steps:
(1)
Step 1. Determining model order
The FNN is introduced to determine the embedding dimension in the state space reconstruction of nonlinear time series in the fermentation process. The false nearest neighbor can be identified by Equation (15). When the dimension n increases continuously, the ratio δ of the false nearest neighbor will approach a threshold. The dimension at this very moment is taken as the embedding dimension. The change of embedding dimension is evaluated by E0(n) in Equation (18). When the embedding dimension n = ny + nu increases to a certain extent, E0(n) tends to be stable. At this moment, the ny and nu are the order of the corresponding model.
(2)
Step 2. Collaborative optimization of the number and parameters of models
Because of their strong coupling, J1 and J2 must be determined synchronously. Therefore, the powerful parallel search ability of ICF-PSO algorithm is utilized to optimize the multi-objective optimization Problem (9). By the ICF-PSO, Equations (22) and (23) are updated, such that the particles can avoid the local optimum.
(3)
Step 3. Determining dividing surfaces
The adjacent modeling subspaces {Zi, Zj} are determined by the nearest neighbor rule of cluster center, and the dividing surfaces of adjacent subspaces are determined by h i , j = { w i , j T X + v i , j = 0 } . By solving the new linear equation set, N groups of α and b are obtained. Then, the parameters w i , j T and v i , j of dividing surfaces can be solved by Equation (38), namely, h i , j = [ w i , j T , v i , j ] .
The modeling process of the PWA algorithm is illustrated in Figure 4.

2.7. Introduction of Experimental Work

The online control of Pichia pastoris fermentation is a difficult problem, due to the complexity of the fermentation process, and the cost of online detection instruments for key biomass variables being very high. To provide information for the online control and to optimize the fermentation process, it is significant to establish real-time soft sensor models for cell concentration X and protease K concentration P.
The sample data used in this paper all came from the microbial fermentation lab of Zhenjiang Yangzhong Weikete Bioengineering Equipment Co., Ltd. (Zhenjiang, China), and the fermentation experiment equipment adopted RTY-C−100 L model. Taking Pichia pastoris fermentation as the object, the Pichia pastoris KM71, MutS constructed by the Key Laboratory of Animal Husbandry and Veterinary Institute of Shanghai Academy of Agricultural Sciences, was selected as the strain, and the expression vector and foreign genes were pPICZαA and IFNαcDNA [51]. The experiment was carried out according to the schematic diagram of the Pichia pastoris fermentation process (Figure 5). After deeply analyzing the fermentation process, the temperature of fermentation broth T, dissolved oxygen concentration DO, pH of fermentation broth, air flow q, stirring speed v, and the pressure of fermentation tank p were selected as auxiliary variables. In order to clarify the auxiliary and key biomass variables of the ICF-PSO-PWA model at a glance, a model expression with a generalized form for the input and output variables is given in Equation (39).
φ ( X , P ) = f ( T , D o , pH , q , v , p )
where, f ( g ) is a generalized form of the complex nonlinear relationship between the auxiliary variables in the ICF-PSO-PWA model; φ ( X , P ) is any one of X and P. That is, the auxiliary variables (T, Do, pH, q, v, p) are the input variables of the model we use, and the key biomass variables (X, P) are the output variables we need.
The initial values of the input variables in the fermentation process were set as follows: the pressure of the fermentation tank was controlled at 0.04 Mpa, the stirring speed of the motor was about 250 rpm, the fermentation temperature was controlled at about 28 ± 0.5 °C, and the air flow was controlled at the range of 1000–1200 L/h. The oxygen content was maintained between 35% and 45%, and the pH was 7.3.
Under normal fermentation conditions, the environmental variables of the fermentation processes were monitored by sensors every 5 min and uploaded to the host computer. The cell concentrations and protease K concentrations were measured offline every 1 h. The cell concentration was measured by the American Lehman Lamotte 1200 photoelectric colorimeter, and the concentration of proteinase k was measured by the UV−260 model automatic spectrophotometer of Shimadzu Corporation, Japan. The adjustment, logarithmic growth and stabilization phases in the fermentation processes were selected as the data collection time period, which was approximately the first 80 h of the fermentation process. Finally, 10 batches of sample data were extracted to apply our algorithm. Subsequently, 6 out of these 10 batches were used for training, the 7th and 8th batch for online correction of the initial PWA model, and the 9th and 10th batch for verifying the effectiveness and prediction accuracy of the PWA model. As the output variables are measured offline once every 1 h and the input variables once every 5 min, there is no one-to-one correspondence between the two, so the interpolation methods are used to convert the output variables offline every 1 h to the input variables every 5 min, forming a one-to-one correspondence. The data are then pre-processed by a coordinate transformation, which normalizes all sample data to between −1 and 1, making the calculation easier and faster. Through coordinate transformation and interpolation operation on 10 batches of sample data, a total of 960 sets of sample data were obtained at last, including 576 sets of sample data in training set, 204 sets of sample data in verification set and 180 sets of sample data in test set.

3. Results and Discussion

After determining that the order of input vector (nu) is 1 and the order of output vector (ny) is 2, XT(k) = [uT(k), yT(k), yT(k + 1)] is selected to form the input vector and the local model is chosen to be of the form y(k) = [yT(k − 1), uT(k − 1), 1]θ. The dividing surface equation is hi,j[yT(k − 1), uT(k − 1), 1]T = 0, which separates the i-th local model from the region where the j-th local model is located. After using the ICF-PSO algorithm to optimize the PWA model, the number of local models is S = 3, which exactly corresponds to the three phases of the sample data in this paper: adjustment, logarithmic growth and stable phase. The local model parameters θi (i = 1, 2, 3) and the parameters of dividing surfaces hi,j are shown in the Table 1.
To verify its performance, the ICF-PSO-PWA model was compared with the PSO-WLSSVR model based on global modeling, which used Gaussian radial basis kernel (RBF) function k ( x 1 , x 2 ) = exp ( x 1 x x 2 / σ 2 ) , with σ = 0.2, and penalty coefficient C = 10. The WLS-SVR model introduces the least squares algorithm and weights on the basis of the SVR model, which greatly enhances the prediction accuracy and generalization ability of the SVR model. It is a very good comparator. All simulation results were obtained based on MATLAB R2019b software.
Figure 6a,b present the prediction results of soft sensor models based on ICF-PSO-PWA and PSO-WLSSVR, respectively. Obviously, the fitting effect of ICF-PSO-PWA soft sensor model is better than that of the PSO-WLSSVR model, regardless of the bacterial concentration or the concentration of protease K. In particular, the fitting effect of the PSO-WLSSVR model is clearly inferior to the ICF-PSO-PWA model in terms of the cell concentration in the later stage.
Figure 7a,b compare the relative error of cell concentration and protease K concentration based on ICF-PSO-PWA and PSO-WLSSVR, respectively. For both ICF-PSO-PWA and PSO-WLSSVR soft sensor models, the prediction errors gradually decrease with time and are eventually stabilized. However, the ICF-PSO-PWA soft sensor model has a much smaller error range, a relatively high accuracy in the whole sample interval, and an obviously smaller final error than the PSO-WLSSVR model.
Table 2 shows the mean absolute percentage error (MAPE) of cell concentration and proteinase K concentration based on ICF-PSO-PWA and PSO-WLSSVR. It can be seen from Table 2 that compared with the PSO-WLSSVR soft sensor model, the ICF-PSO-PWA soft sensor model has an improved accuracy by 2.4642% when predicting the cell concentration, and an improved accuracy by 6.5127% when predicting the proteinase K concentration. The mean prediction accuracy of ICF-PSO-PWA soft sensor model has been improved by 4.4884%.
Furthermore, the root mean square errors (RMSEs) of the ICF-PSO-PWA and PSO-WLSSVR soft sensor models were tested on different scale datasets. The test results in Table 3 show that, with the growing number of test samples, the RMSE of the PSO-WLSSVR soft sensor model increases, while that of the ICF-PSO-PWA model does not change significantly. This is because the latter model divides the working interval of the system into several subintervals, and models each subinterval separately; besides, multiple local models are used in place of the global single model, making full use of the information of each sample. As time goes by, the ICF-PSO-PWA model can adapt to the changing working conditions, and achieve excellent generalization performance and self-adaptability.

4. Conclusions

In order to solve the problem that the key biological variables in the fermentation process of Pichia pastoris are difficult to directly measure online, a PWA multi-model soft sensor modeling method based on the idea of “decomposition-synthesis” was proposed to measure the key variables in the fermentation process of Pichia pastoris online. In order to identify the PWA multi-model, the FNN was used to solve the order of the input and output vectors of the model. Then, a novel ICF-PSO algorithm was proposed for collaborative optimization of the number of local models and the parameters of each local model. Finally, the LS-SVM was used for the determination of the parameters of the dividing surfaces in the modeling process, so as to delineate the regions where each local model plays a role. MATLAB simulation results show that the prediction accuracy of the ICF-PSO-PWA model for X and P reach 1.8932% and 2.5974%, respectively, compared with 4.3573% and 9.1101% of the prediction accuracy of the PSO-WLSSVR model for X and P, the prediction accuracy of the ICF-PSO-PWA model improved by 2.4642% and 6.5127%. The RMSEs of the ICF-PSO-PWA model for X and P prediction results remain at 0.015402352 and 0.711287476, respectively, which are smaller than 0.033457354 and 2.278762339 of the PSO-WLSSVR model, indicating that the prediction results of the ICF-PSO-PWA model are more stable. The simulation results above demonstrate the effectiveness of the ICF-PSO-PWA soft sensor method in predicting the key biological variables in the fermentation process of Pichia pastoris. The proposed model provides a feasible theoretical approach to solve the soft sensing of key biological variables in the Pichia pastoris fermentation process.
The PWA modeling method, as a special form of modeling method, partitions the nonlinear complex system into several linear systems, so that the system can be solved using the existing mature linear theory, breaking through the bottleneck that the traditional single model is extremely complex and cannot meet the needs of actual engineering when the global description of increasingly complex controlled objects is performed. The strong nonlinear approximation capability and simple model structure of PWA are not only loved by scholars in the field of modeling, but also a hot research topic in the control community. In the future, we are interested in studying the optimal control of PWA models and the design of online monitoring systems for the whole fermentation process.

Author Contributions

Conceptualization, X.W.; methodology, X.W.; software, X.W.; validation, X.W. and B.W.; formal analysis, X.W.; investigation, X.W. and M.H.; resources, B.W.; data curation, B.W. and X.W.; writing—original draft preparation, X.W.; writing—review and editing, B.W. and M.H.; visualization, X.W. and B.W.; supervision, B.W. and X.Z.; project administration, B.W.; funding acquisition, B.W. and X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was carried out by the National Natural Science Foundation of China (Grant No 41376175), The Innovative Research Group Project of the National Natural Science Foundation of China (CN) (Grant No BK20140568), and The key R & D project of Zhenjiang City (Grant No (SH2020005)).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors are thankful to the Key Laboratory of Agricultural Measurement and Control Technology and Equipment for Mechanical Industry Facilities, School of Electrical and Information Engineering, Jiangsu University, for their technical support of our experiment. The authors are also thankful to the anonymous reviewers whose comments helped improve and clarify.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Performance Analysis of ICF-PSO Algorithm

In order to study the influence of the algorithm improvement scheme on the algorithm performance, this paper selects six well-known classic test functions for verification. Since the ICF-PSO algorithm is used for the collaborative optimization of the number of PWA models and the parameters of PWA sub-models, three classic test functions for multi-objective optimization problems similar to the PWA model proposed in this paper are selected in the test function. The expressions of the functions are shown in Table A1, where f2(x (De Jong’s f2), f4(x) (Shaffer’s f6) and f5(x) (Levy N.13) are test functions for multi-objective optimization problems. This is because both values of x1 and x2 must be searched for simultaneously in order to determine the final minimum values of f2(x), f4(x) and f5(x). The parameter settings of the 6 classic test functions are shown in Table A2.
Table A1. Functions used to test the performance of different optimization algorithms.
Table A1. Functions used to test the performance of different optimization algorithms.
Function NameExpression
Ackley f 1 ( x ) = 20 exp ( 0.2 1 D d = 1 D x d 2 ) exp ( 1 D d = 1 D cos ( 2 π x d ) ) + 20 + e
De Jong’s f2 f 2 ( x ) = 100 ( x 1 2 x 2 ) 2 + ( 1 x 1 ) 2
Griewank f 3 ( x ) = 1 4000 d = 1 D x d 2 d = 1 D cos ( x d d ) + 1
Shaffer’s f6 f 4 ( x ) = 0.5 + sin x 1 2 + x 2 2 0.5 ( 1.0 + 0.001 ( x 1 2 + x 2 2 ) ) 2
Levy N.13 f 5 ( x ) = sin 2 ( 3 π x 1 ) + ( x 1 1 ) 2 [ 1 + sin 2 ( 3 π x 2 ) ] + ( x 2 1 ) 2 [ 1 + sin 2 ( 2 π x 2 ) ]
Rosenbrock f 6 ( x ) = d = 1 D ( 100 ( x d + 1 x d 2 ) 2 + ( x d 1 ) 2 )
Table A2. Function parameters for the test problems.
Table A2. Function parameters for the test problems.
FunctionDimensionInitial Range of xiGlobal Minimum
Ackley10±32.7680
De Jong’s f22±500
Griewank10±3000
Shaffer’s f62±1000
Levy N.132±100
Rosenbrock10±150
In order to further verify the performance of the ICF-PSO algorithm, compare it with the standard PSO, Bat Algorithm (BA) and Firefly Algorithm (FA) [52,53]. To facilitate quick comparison of performance, setting the maximum number of iterations to 100 is enough, and calculate the average (AVE) and standard deviation (SD) of the calculation results. The comparison results are shown in Table A3, and the convergence curve of each algorithm test function is shown in Figure A1.
Table A3. Function test results.
Table A3. Function test results.
FunctionICF-PSOPSOBAFA
AVESDAVESDAVESDAVESD
f1(x)0.356331.703260.605772.453820.704292.004451.243172.57169
f2(x)0.218040.790720.464141.047820.968091.820611.081552.41802
f3(x)0.086030.185510.159120.375730.110130.223980.146480.24683
f4(x)0.043140.121910.084330.096640.098120.092550.060480.08848
f5(x)0.837622.868362.742587.559323.105938.564971.973375.36548
f6(x)0.327713.194350.733934.535031.288826.831311.950527.77723
Figure A1. The fitness curves of different optimization algorithms on different test functions (a) Ackley; (b) De Jong’s f2; (c) Griewank; (d) Schaffer’s f6; (e) Levy N.13; (f) Rosenbrock.
Figure A1. The fitness curves of different optimization algorithms on different test functions (a) Ackley; (b) De Jong’s f2; (c) Griewank; (d) Schaffer’s f6; (e) Levy N.13; (f) Rosenbrock.
Sensors 21 07635 g0a1
It can be seen from the test results in Table A3 that the AVE and SD of ICF-PSO are significantly smaller than the standard PSO, BA and FA, indicating that ICF-PSO can find the minimum zero stably. The optimization accuracy of ICF-PSO is also significantly higher than the other three algorithms. Summarized from the six curves in Figure A1, the convergence curve of ICF-PSO is obviously below the other curves, indicating that ICF-PSO has stronger optimization ability and the convergence speed is significantly higher than the other three algorithms, especially in the Figure A1e, ICF-PSO quickly converges to the minimum in less than ten steps, and the convergence speed is at least twice that of the other three algorithms, and the stability is stronger. It can also be seen in Figure A1b,d that the convergence speed of ICF-PSO is at least ten steps ahead of other algorithms, and the stability is stronger.
Besides, as can be seen from Figure A1 and Table A3, the PSO algorithm performs second only to ICF-PSO in terms of the outperformance of these six test functions. Therefore, in order to assert that the ICF-PSO algorithm performs better, converges faster, and is more robust, the PSO algorithm is again compared with the ICF-PSO algorithm. De Jong’s f2 and Shaffer’s f6 were chosen as the test functions and three different initial populations were chosen for each function, i.e., the optimization performance of the two optimization algorithms was compared separately for different initial populations of each test function. The comparison results are shown in Table A4, and the convergence curve of each algorithm test function is shown in Figure A2.
Table A4. Function test results.
Table A4. Function test results.
FunctionInitial PopulationsICF-PSOPSO
AVESDAVESD
De Jong’s f2Type I0.333800.877041.107742.13176
Type II1.586822.724502.259543.64831
Type III0.212510.790771.169162.20947
Shaffer’s f6Type I0.027000.083910.116210.09779
Type II0.024090.039070.067150.12842
Type III0.039740.099220.060170.11168
Figure A2. The fitness curves optimized for De Jong’s f2 under three different initial populations (a) Type I; (b) Type II; (c) Type III; The fitness curves optimized for Schaffer’s f6 under three different initial populations (d) Type I; (e) Type II; (f) Type III.
Figure A2. The fitness curves optimized for De Jong’s f2 under three different initial populations (a) Type I; (b) Type II; (c) Type III; The fitness curves optimized for Schaffer’s f6 under three different initial populations (d) Type I; (e) Type II; (f) Type III.
Sensors 21 07635 g0a2aSensors 21 07635 g0a2b
It can be seen from the test results in Table A4 that the AVE and SD of ICF-PSO are significantly smaller than the standard PSO, indicating that ICF-PSO can find the minimum stably. The optimization accuracy of ICF-PSO is also significantly higher than the PSO algorithm. Summarized from the six curves in Figure A2, the convergence curve of ICF-PSO is obviously below the curves of PSO, indicating that ICF-PSO has stronger optimization ability and the convergence speed is significantly higher than the PSO algorithm.
Therefore, considering the above performance comparison results, the ICF-PSO algorithm with better convergence speed, convergence accuracy and convergence stability was chosen to optimize the PWA model in this paper.
The ICF-PSO algorithm can be implemented in the following steps:
(1)
Step 1. Initialize the particle swarm.
(2)
Step 2. Calculate the initial fitness of each particle by the objective function F and evaluate particle fitness.
(3)
Step 3. Update the position and velocity of each particle in the swarm by Equations (22) and (23), respectively.
(4)
Step 4. Continue to evaluate the initial fitness of each particle according to the objective function F.
(5)
Step 5. Terminate the algorithm if the termination condition is met; otherwise, jump back to Step 3.

References

  1. Hori, K.; Okano, S.; Sato, F. Efficient microbial production of stylopine using a Pichia pastoris expression system. Sci. Rep. 2016, 6, 22201. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Jie, W.; Zhang, X.X.; Yu, H.S.; Li, W.; Jia, Y.P.; Guo, J.Y.; Zhang, L.J.; Song, X.B. Research progress of high density fermentation process of pichia pastoris. China Biotech. 2016, 36, 108–114. [Google Scholar]
  3. Kalinina, A.N.; Gordeeva, T.L.; Sineoky, S.P. Expression of the xylanase gene from paenibacillus brasilensis X1 in pichia pastoris and characteristics of the recombinant enzyme. Appl. Biochem. Microbiol. 2019, 55, 797–804. [Google Scholar] [CrossRef]
  4. Wang, L.; Hu, T.; Jiang, Z.; Yan, Q.; Yang, S. Efficient production of a novel alkaline cold-active phospholipase C from Aspergillus oryzae by molecular chaperon co-expression for crude oil degumming. Food Chem. 2021, 350, 129212. [Google Scholar] [CrossRef] [PubMed]
  5. Chiniadis, L.; Bratsos, I.; Bethanis, K.; Karpusas, M.; Giastas, P.; Papakyriakou, A. High-resolution crystal structures of a “half sandwich”-type Ru(II) coordination compound bound to hen egg-white lysozyme and proteinase K. JBIC J. Biol. Inorg. Chem. 2020, 25, 635–645. [Google Scholar] [CrossRef] [PubMed]
  6. Han, M.; Wang, W.; Zhou, J.; Gong, X.; Xu, C.; Li, Y.; Li, Q. Activation of the unfolded protein response via co-expression of the HAC1i gene enhances expression of recombinant elastase in pichia pastoris. Biotechnol. Bioprocess. Eng. 2020, 25, 302–307. [Google Scholar] [CrossRef]
  7. Yao, Z.; Wang, J.H.; Liu, X.Q. Selective quenching detection of proteinase K by croconaine based organic sensor. World Sci. Res. J. 2021, 7, 472–479. [Google Scholar]
  8. Martens, T. Urgent cardiac surgery and COVID-19 infection: Uncharted territory: Reply. Ann. Thorac. Surg. 2021, 111, 1735. [Google Scholar] [CrossRef]
  9. Babbal, A.; Mohanty, S.; Khasa, Y.P. Nitrogen supplementation ameliorates product quality and quantity during high cell density bioreactor studies of Pichia pastoris: A case study with proteolysis prone streptokinase. Int. J. Biol. Macromol. 2021, 180, 760–770. [Google Scholar] [CrossRef]
  10. Abdulrachman, D.; Thongkred, P.; Kocharin, K.; Nakpathom, M.; Somboon, B.; Narumol, N.; Champreda, V.; Eurwilaichitr, L.; Suwanto, A.; Nimchua, T.; et al. Heterologous expression of Aspergillus aculeatus endo-polygalacturonase in Pichia pastoris by high cell density fermentation and its application in textile scouring. BMC Biotechnol. 2017, 17, 15. [Google Scholar] [CrossRef] [Green Version]
  11. Hu, X.Q.; Chu, J.; Zhang, S.L.; Zhuang, Y.P.; Wu, X.; Chen, H.X.; Lv, Z.Y.; Yuan, Z.Y. An alkaline pH control strategy for methionine adenosyltransferase production in Pichia pastoris fermentation. Biotech. Bioprocess. Eng. 2014, 19, 900–907. [Google Scholar] [CrossRef]
  12. Adem, S.; Jain, S.; Sveiven, M.; Zhou, X.H.; O’Donoghue, A.J.; Hall, D.A. Giant magnetoresistive biosensors for real-time quantitative detection of protease activity. Sci. Rep. 2020, 10, 7941. [Google Scholar] [CrossRef]
  13. Alakent, B. Soft-sensor design via task transferred just-in-time-learning coupled transductive moving window learner. J. Process. Control 2021, 101, 52–67. [Google Scholar] [CrossRef]
  14. Wang, M.C.; Han, X.; Sun, Y.M.; Sun, Q.Y.; Chen, X.G. Study on soft sensor modeling method for sign of contaminated fermentation broth in Chlortetracycline fermentation process. Prep. Biochem. Biotechnol. 2021, 51, 76–85. [Google Scholar] [CrossRef]
  15. López-Martínez, J.; Arzola-Sotelo, E.A.; Nevárez-Martínez, M.O.; Álvarez-Tello, F.J.; Morales-Bojórquez, E. Modeling growth on the cannonball jellyfish Stomolophus meleagris based on a multi-model inference approach. Hydrobiologia 2020, 847, 1399–1422. [Google Scholar] [CrossRef]
  16. Chakraborty, S.; Jain, S.; Nandy, A.; Venture, G. Pathological gait detection based on multiple regression models using unobtrusive sensing technology. J. Signal Process. Syst. 2021, 93, 1–10. [Google Scholar] [CrossRef]
  17. Wang, B.; Yu, M.F.; Zhu, X.L.; Zhu, L. Soft-sensing modeling based on ABC-MLSSVM inversion for marine low-temperature alkaline protease MP fermentation process. BMC Biotech. 2020, 20, 343–354. [Google Scholar] [CrossRef] [Green Version]
  18. Lu, B.; John, S.; Thomas, E.F. Data-driven adaptive multiple model system utilizing growing self-organizing maps. J. Process. Control 2018, 67, 56–68. [Google Scholar] [CrossRef]
  19. Zhang, W.G.; Cai, K. Soft sensor modeling of alkaline protease fermentation process based on improved LWPLS. Transducer Microsyst. Technol. 2020, 39, 108–114. [Google Scholar]
  20. Zhu, X.J.; Song, C.Y.; Zhao, J.; Xu, Z.H. A Semi-continuous PWA model based optimal control method for nonlinear systems. Proceses 2020, 8, 170. [Google Scholar] [CrossRef] [Green Version]
  21. Sun, X.Q.; Hu, W.W.; Wu, P.C.; Wong, P.K.; Chen, L. Modeling of tire longitudinal-slip mechanical characterisics based on piecewise affine identification method. J. Xi’an Jiaotong Univ. 2021, 55, 52–60. [Google Scholar]
  22. Wang, J.-H. Zonotope parameter identification for piecewise affine system. WSEAS Trans. Power Syst. 2020, 19, 36–46. [Google Scholar] [CrossRef]
  23. Sun, X.; Wu, P.; Cai, Y.; Wang, S.; Chen, L. Piecewise affine modeling and hybrid optimal control of intelligent vehicle longitudinal dynamics for velocity regulation. Mech. Syst. Signal. Process. 2021, 162, 108089. [Google Scholar] [CrossRef]
  24. Muhammad, M.; Michael, S.; Michael, Z.; Klaus, L.; Hans, A.; Armin, C. Optimal planning of thermal energy systems in a microgrid with seasonal storage and piecewise affine cost function. Energy 2021, 215, 119095. [Google Scholar]
  25. Sindareh-Esfahani, P.; Pieper, J. Machine learning–based piecewise affine model of wind turbines during maximum power point tracking. Wind. Energy 2019, 23, 404–422. [Google Scholar] [CrossRef]
  26. Mattsson, P.; Zachariah, D.; Stoica, P. Identification of cascade water tanks using a PWARX model. Mech. Syst. Signal Process. 2018, 106, 40–48. [Google Scholar] [CrossRef]
  27. Muhammad, A.R.; Reham, R.M.; Ozgur, K.; Mundher, Y.Z.; Shamsuddin, S.; Mohammad, Z.K. Improving streamflow prediction using a new hybrid ELM model combined with hybrid particle swarm optimization and grey wolf optimization. Knowl. Based Syst. 2021, 230, 107379. [Google Scholar]
  28. Han, K.; Huang, T.; Yin, L. Quantum parallel multi-layer Monte Carlo optimization algorithm for controller parameters optimization of doubly-fed induction generator-based wind turbines. Appl. Soft Comput. 2021, 112, 107813. [Google Scholar] [CrossRef]
  29. Yuan, J.; Xie, J.; Huang, M.; Fan, H.; Feng, E.; Xiu, Z. Robust optimal control problem with multiple characteristic time points in the objective for a batch nonlinear time-varying process using parallel global optimization. Optim. Eng. 2020, 21, 905–937. [Google Scholar] [CrossRef]
  30. Jiang, X.; Zhang, X.; Zhang, Y. Establishment and optimization of sensor fault identification model based on classification and regression tree and particle swarm optimization. Mater. Res. Express 2021, 8, 085703. [Google Scholar] [CrossRef]
  31. Hu, X.; Chen, F.; Zhang, Z. Model construction and optical properties investigation for multi-sectioned compound parabolic concentrator with particle swarm optimization. Renew. Energy 2021, 179, 379–394. [Google Scholar] [CrossRef]
  32. Ren, M.; Song, Y.; Chu, W. An improved locally weighted pls based on particle swarm optimization for industrial soft sensor modeling. Sensors 2019, 19, 4099. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Javed, M.S.; Ma, T.; Jurasz, J.; Ahmed, S.; Mikulik, J. Performance comparison of heuristic algrithms for optimization of hybrid offgrid renewable energy systems. Energy 2020, 210, 118599. [Google Scholar] [CrossRef]
  34. Wihartiko, F.D.; Wijayanti, H.; Virgantari, F. Performance comparison of genetic algorithms and particle swarm optimization for model integer programming bus timetabling problem. IOP Conf. Ser. Mater. Sci. Eng. 2018, 332, 012020. [Google Scholar] [CrossRef] [Green Version]
  35. Pandey, H.M. Performance review of harmony search, differential evolution and particle swarm optimization. IOP Conf. Ser. Mater. Sci. Eng. 2017, 225, 012221. [Google Scholar] [CrossRef]
  36. Liu, Y.D.; Zhou, M.R.; Xie, J.Y.; Tong, Y.L. Research on simulation model for data assimilation in solar radiation prediction based on particle swarm optimizer with time varying constrict factor. Acta Ener. Sol. Sin. 2021, 42, 181–185. [Google Scholar]
  37. Zhang, C.X. Comprehensive informed particle swarm optimizer based on constrict factor. J. Front. Comput. Sci. Technol. 2014, 8, 506–512. [Google Scholar]
  38. Wu, Z.; Zhang, S.; Wang, T. A cooperative particle swarm optimization with constriction factor based on simulated annealing. Computing 2018, 100, 861–880. [Google Scholar] [CrossRef] [Green Version]
  39. Olfa, Y.; Zeineb, L.; Kamel, A. Predictive control based on fuzzy supervisor for PWARX hybrid model. Int. J. Autom. Comput. 2019, 16, 683–695. [Google Scholar]
  40. Maximo, M.R.O.A.; Afonso, R.J.M. Mixed-integer quadratic programming for automatic walking footstep placement, duration, and rotation. Optim. Control Appl. Methods. 2020, 41, 1928–1963. [Google Scholar] [CrossRef]
  41. Han, M.; Ren, W.J.; Li, B.S.; Feng, S.B. Review of chaotic time series analysis and prediction. Inf. Control 2020, 49, 24–35. [Google Scholar]
  42. Vignesh, R.; Jothiprakash, V.; Sivakumar, B. Streamflow variability and classification using false nearest neighbor method. J. Hydrol. 2015, 531, 706–715. [Google Scholar] [CrossRef]
  43. Zhao, L.; Fang, J.; Wang, Y.; Huang, Z.M. Output state analysis and classification of semiconductor fiber ring lasers. Acta Photonica Sin. 2009, 38, 2449–2457. [Google Scholar]
  44. Jia, Q.L. Time series prediction model of power load based on probability statistics. Mod. Electron. Technol. 2020, 43, 179–186. [Google Scholar]
  45. Xia, B.; Liu, T.; Ding, T.; Wang, Z. An improved PSO localization algorithm for UWB sensor networks. Wirel. Pers. Commun. 2020, 117, 2207–2223. [Google Scholar] [CrossRef]
  46. Huang, G.; Cai, Y.; Cai, H. A time varying constrict factor PSO algorithm research. J. Comput. Methods Sci. Eng. 2018, 18, 725–735. [Google Scholar] [CrossRef]
  47. Zhang, Z.; Wang, Y.P. Particle swarm optimization algorithm with compression factor for complex shading MPPT problem. Mod. Electron. Tech. 2021, 44, 102–105. [Google Scholar]
  48. Zhang, H.; Jin, Y. Particle swarm cooperative optimization algorithm based on geometric algebra. J. Discret. Math. Sci. Cryptogr. 2017, 20, 913–930. [Google Scholar] [CrossRef]
  49. Zhu, X.L.; Chen, W.; Ding, Y.H.; Wang, B.; Zhu, L.; Jiang, Z.Y. Soft sensing of photosynthetic bacteria fermentation based on IBA-LSSVM. Comput. Meas. Control. 2019, 27, 41–54. [Google Scholar]
  50. Zhu, X.L.; Hua, T.Z. Soft sensor modeling of straw fermentation process based on chaos drosophila optimized least squares support vector machine. J. Beijing Univ. Technol. 2016, 42, 1468–1474. [Google Scholar]
  51. Gao, M.J.; Li, Z.; Yu, R.S. Methanol/sorbitol co-feeding induction enhanced porcine interferon-α production by P. pastoris asso-ciated with energy metabolism shift. J. Bioprocess. Biosyst. Eng. 2020, 35, 1125–1136. [Google Scholar] [CrossRef] [PubMed]
  52. Cui, Y.B.; Huang, C.C.; Cui, Y.P. A novel compound wind speed forecasting model based on the back propagation neural network optimized by bat algorithm. Environ. Sci. Pollut. Res. 2020, 27, 7353–7365. [Google Scholar] [CrossRef] [PubMed]
  53. Jalili, A.; Taheri, B. Optimal sizing and sitting of distributed generations in power distribution networks using firefly algorithm. Technol. Econ. Smart Grids Sustain. Energy 2020, 5, 33–44. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Overall structure diagram of modeling algorithm.
Figure 1. Overall structure diagram of modeling algorithm.
Sensors 21 07635 g001
Figure 2. The characteristic curve of μ when the values of δ and σ change, including (a) δ = 0.5 and σ was increasing between (1,+∞), (b) σ = π and δ was increasing between (0.1), (c) one of the best constraint performance of μ was at δ = 0.1 and σ = 2π, (d) δ = 2π and σ was increasing between, (e) σ = 0.5 and μ was increasing between (1,+∞), (f) one of the best constraint performance of μ was at δ = 24 and σ = 0.01.
Figure 2. The characteristic curve of μ when the values of δ and σ change, including (a) δ = 0.5 and σ was increasing between (1,+∞), (b) σ = π and δ was increasing between (0.1), (c) one of the best constraint performance of μ was at δ = 0.1 and σ = 2π, (d) δ = 2π and σ was increasing between, (e) σ = 0.5 and μ was increasing between (1,+∞), (f) one of the best constraint performance of μ was at δ = 24 and σ = 0.01.
Sensors 21 07635 g002aSensors 21 07635 g002b
Figure 3. Flow chart of ICF-PSO for optimizing the number and parameters of PWA local models.
Figure 3. Flow chart of ICF-PSO for optimizing the number and parameters of PWA local models.
Sensors 21 07635 g003
Figure 4. Flow chart of ICF-PSO-PWA soft sensor modeling algorithm.
Figure 4. Flow chart of ICF-PSO-PWA soft sensor modeling algorithm.
Sensors 21 07635 g004
Figure 5. Schematic diagram of Pichia pastoris fermentation process.
Figure 5. Schematic diagram of Pichia pastoris fermentation process.
Sensors 21 07635 g005
Figure 6. Comparison curve between the soft sensor value and the actual value of (a) cell concentration and (b) protease K concentration.
Figure 6. Comparison curve between the soft sensor value and the actual value of (a) cell concentration and (b) protease K concentration.
Sensors 21 07635 g006
Figure 7. Comparison curve of relative error of (a) cell concentration and (b) protease K concentration.
Figure 7. Comparison curve of relative error of (a) cell concentration and (b) protease K concentration.
Sensors 21 07635 g007
Table 1. Local model modeling parameters.
Table 1. Local model modeling parameters.
The Local Model ParametersThe Parameters of Dividing Surfaces
Local model Iθ1 = [0.5957, −0.1847, −0.0964, 0.4447, 0.3635, 0.0464, 0.5567, −0.2888] Th13 = [1, 1.0445, −0.0045, 0.9854, 0.5421, −0.2347, −1.0786, 0.0864]
Local model IIθ2 = [0.0962, 0.9649, 0.0434, 0.0762, 0.9564, 0.0615, −0.6332, 1] T
Local model IIIθ3 = [0.1629, 0.9885, −0.0003, 1, 0.1005, −0.6575, 0.3209, 0.6887] Th23 = [1, −1.8753, 0.9753, −0.8656, 1.9531, 0.6542, −0.5423, 1.0112]
Table 2. Comparison of prediction accuracy of soft sensor models.
Table 2. Comparison of prediction accuracy of soft sensor models.
PSO-WLSSVRICF-PSO-PWAAccuracy Improvement
MAPE of cell concentration0.0435734880.0189319132.4642%
MAPE of protease K concentration0.0911008160.0259740786.5127%
Mean accuracy improvement4.4884%
Table 3. RMSE of different test sample sets.
Table 3. RMSE of different test sample sets.
NumberPSO-WLSSVRICF-PSO-PWA
XPXP
200.0347533780.2424212280.0138632650.057610602
400.0412274121.7261081150.0171898750.537857149
600.0436464582.0861903150.0213521180.598752504
800.0411678942.2724309420.0195738280.635771991
1000.0393578342.4026387460.0184636270.710257895
1200.0366791432.357145440.0173601730.70555966
1400.0355637142.386777460.0168150660.745174526
1600.0350080222.3729304880.0161380980.716623673
1800.0334573542.2787623390.0154023520.711287476
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wang, B.; Wang, X.; He, M.; Zhu, X. Study on Multi-Model Soft Sensor Modeling Method and Its Model Optimization for the Fermentation Process of Pichia pastoris. Sensors 2021, 21, 7635. https://doi.org/10.3390/s21227635

AMA Style

Wang B, Wang X, He M, Zhu X. Study on Multi-Model Soft Sensor Modeling Method and Its Model Optimization for the Fermentation Process of Pichia pastoris. Sensors. 2021; 21(22):7635. https://doi.org/10.3390/s21227635

Chicago/Turabian Style

Wang, Bo, Xingyu Wang, Mengyi He, and Xianglin Zhu. 2021. "Study on Multi-Model Soft Sensor Modeling Method and Its Model Optimization for the Fermentation Process of Pichia pastoris" Sensors 21, no. 22: 7635. https://doi.org/10.3390/s21227635

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop