Detection and Location of Model-Plant Mismatch in Multiple Input Multiple Output Systems under Model Predictive Controller Using Granger Causality Method

Abstract: In closed-loop control systems, model accuracy strongly influences the controllability, stability and quality of the whole process. Among all the faults that can affect system performance, Model-Plant Mismatch (MPM) not only directly threatens system stability but also deteriorates controller performance. MPM also has a major influence on the output quality of industrial products. In this work, a new method based on Granger causality is proposed to detect and locate MPM in multiple input multiple output systems. Causality can reflect the relation between the mismatch fault and its negative effects on model predictive control (MPC) systems. With the assistance of disturbance transfer function models, the causality method can further be used to locate the mismatch positions and identify the correct channel of each mismatch. The proposed method was examined and validated on the Wood-Berry process under a model predictive controller, in comparison with the decussation location method.


Introduction
Model Predictive Control (MPC) occupies a dominant position among advanced control technologies in industrial production processes [1] owing to its maturity, its ability to handle complex constraints on closed-loop systems [2], and its favorable control and optimization results [3]. MPC has therefore been widely adopted as an advanced controller in industrial processes [4]. However, the advantage of MPC over typical controllers such as Proportional-Integral-Derivative (PID) control stems from its model-based structure and model-oriented optimization design [5]. Hence, model accuracy has considerable influence on the control effect and the output product quality [6], and model error can make nominal stability guarantees unreliable in engineering practice [7].
Fault diagnosis has attracted increasing interest in the process control community in recent decades. Model-based diagnosis methods rely on a model that defines the nominal behavior of a dynamic system to detect abnormal behavior and isolate faults. Data-driven diagnosis algorithms, on the other hand, detect and isolate faults by operating exclusively on system measurements, using very little knowledge about the system [8]. Most data-driven diagnosis techniques can reflect the control effect but provide little information on variations of the controlled model, so MPC, whose performance relies largely on the process model, gets limited help from purely data-driven methods. By comparison, Control Performance Assessment (CPA) is tied to the process model to a certain extent and embodies some statistical information about it. For this reason, CPA technologies were developed as a family of controller monitoring and performance assessment methods.
Harris first introduced the Minimum Variance Benchmark (MVB) into control systems as an assessment method for monitoring closed loops [9], opening a new field for assessing, analyzing, monitoring and managing control performance. When evaluating the MVB index of Multiple Input Multiple Output (MIMO) systems, the difficulty is that the nilpotent interactor matrix [10] is essential but hard to compute. To avoid this, Biao Huang improved the nilpotent algorithm and estimated a suboptimal benchmark that requires less prior knowledge of the interactor matrix [11]. Jie Yu and S. Joe Qin introduced left and right diagonal interactors, instead of using the nilpotent interactor matrix directly, as an auxiliary calculation method [12] for performance computation. However, the presence of Model-Plant Mismatch (MPM) affects control performance evaluation [13] and even weakens the reliability of the benchmark. In addition, time delay mismatch affects the MVB and changes both the system output variances and the minimum variance index [14].
Several studies focus on quantifying the impact of model mismatch [15] or on designing a suitable index to evaluate MPM [16]. Other scholars, such as Yucai Zhu, used small sinusoidal test signals to detect MPC model error [17] or identified the k-step-ahead prediction error in MPC to distinguish the nominal model from the real plant [18]. However, system identification is too costly for industrial production, especially in closed loop [19]. Besides, most mismatch faults in chemical production have a small effect on stability but a large effect on performance. It is therefore more appropriate for the manufacturing industry to choose MPM detection as the first option.
Among detection methods, correlation analysis is the most common technique for model-plant mismatch detection [20]. Abhijit S. Badwe used partial correlation analysis in MPC systems [21] to estimate the statistical characteristics, while Siyun Wang estimated MPC model mismatch by means of auto-covariance analysis [22]. Gui Chen used mutual information to detect model-plant mismatch in nonlinear multivariate systems. Viviane Botelho proposed a method for evaluating model quality based on closed-loop data and the nominal output sensitivity function [23]. In practice, correlation estimates between variables are often less accurate than expected. The inherent relationship between any two data series, together with the complicated conditions in a MIMO control system after mismatch, increases the imprecision of correlation analysis and reduces the reliability of such algorithms.
Some detection technologies use an orthogonal projection method [24] to analyze the extent of MPM and calculate an MPM index [25]. However, detecting MPM is one problem and locating its position is another. Lijuan Li defined the correlation analysis between the input and the disturbance (CAID) index for assessing MPM and presented the decussation approach, which combines the CAID method with the model quality index (MQI) to obtain the specific location of the mismatch [26]. During estimation by orthogonal projection, some loss of data information is unavoidable and degrades the accuracy of the method. Any further correlation analysis that depends on the orthogonal projection inherits this error to some degree and can mislead the location judgment.
In this paper, we introduce Granger Causality Analysis (GCA) for Model-Plant Mismatch detection in MIMO systems. Granger Causality Analysis [27] expresses the direct relation between two variables with less dependence on statistical correlation and, using closed-loop data, returns the cause and effect of MPM in MIMO systems. Because GCA is inevitably influenced by loop interconnection and the control strategy, we further combine the noise models with GCA to locate the specific channel of the MPM. The method is then tested on the Wood-Berry distillation system.

Model-Plant Mismatch in Closed-Loop Systems
To directly distinguish the nominal controlled model from the real process model, Internal Model Control (IMC) has been widely adopted in control systems. An advantage of the IMC structure is that its feedback path promptly returns the error between the two models, the nominal model $G_M$ and the practical model $G_P$. Hence, data from the control loop can conveniently be used to address MPM problems.
The block diagram of the IMC structure of a closed-loop system is depicted in Figure 1, and the overall transfer function of the system output y(k) is given in Equation (1). In Equation (1) and Figure 1, r(k) and a(k) represent the setpoint and the white noise disturbance, respectively. u(k) is the controller output sequence, namely the manipulated variable, which is generated by the process controller $G_C$. The sequence d(k) is the disturbance that enters the control loop; it is produced by passing the white noise a(k) through the disturbance transfer function N. The direct error feedback type of Internal Model Control changes the denominator of the overall transfer function of the output y(k) and therefore affects control stability.
In the conventional control structure, the controller $G_C$ acts directly on the process $G_P$, and the closed-loop transfer function without IMC is given in Equation (2). Comparing Equation (1) with Equation (2) shows that the denominators of the two transfer functions differ.
The conventional controller is designed for the nominal model, so $G_C$ is tuned for $G_M$. If there is no mismatch, $G_P = G_M$ implies ∆G = 0, and the designed controller acts directly on the intended process. Once MPM occurs, the controller is applied to a new model $G_P$; this is workable because any well-designed controller has some robustness and can tolerate a certain degree of model error.
However, in Equation (1), the controller has to deal not merely with the two different models but with the deviation from the controlled process, which means that the controller is effectively designed for ∆G rather than for $G_P$ or $G_M$. ∆G can alter the model structure, the parameters and even the poles and zeros, so the designed controller no longer knows the model it acts on. A change in the poles and zeros can worsen the stability of the closed-loop system and makes it considerably harder for the engineer to design a proper controller.
To overcome these shortcomings, it is advisable to put the nominal model $G_M$ into the feedback path. The proof that this IMC structure changes nothing in comparison with Equation (2) is given below.
Theorem 1. The structure of the output transfer function in the Internal Model Control scheme of Figure 2 is equivalent to that in Equation (2) and adds no burden to the initial controller design.
Proof of Theorem 1. The proof starts from the relationships among the loop variables and splits the MPM problem into two cases: no mismatch in the system, and the occurrence of model-plant mismatch. When there is no MPM in the system, ∆G = 0, which determines the controller output sequence u(k) and the real loop output sequence y(k). Since no mismatch occurs and $G_P = G_M$, Equation (7) is the same as Equation (2).
When model mismatch happens, the relationships among the variables change accordingly. The control law u(k) and the output y(k) then take a form that is again the same as Equation (2).
Whether or not there is model-plant mismatch in the loop, the structure of the control system transfer functions is invariant. Only the mismatch itself, the change from the nominal model $G_M$ to the mismatched plant $G_P$, changes the parameters, which is reasonable for systems in daily operation.
Theorem 1 is thus proved.

Problem Description
The most common condition of closed loops in industrial production is steady operation with unsatisfactory performance. Because almost all controlled processes are linearized before implementation, the real plant is naturally somewhat different from the nominal model for which the controller was designed. In theory, the nominal linear model is adequate for the control system. However, manufacturing is always accompanied by unexpected conditions that change the process technology or alter the model parameters. Therefore, the occurrence of Model-Plant Mismatch (MPM) can hardly be avoided.
Equation (4) shows that the feedback variable d(k) consists of two portions: the mismatch term, in which the model difference $\Delta G = G_P - G_M$ acts on the control law u(k), and the pure disturbance N a(k) that enters the closed loop.
The decussation location method [26] relies on the statistical correlation between u(k) and d(k). A correlation between these two sequences clearly exists, but its statistical significance cannot be guaranteed. Besides, the sequence d(k) is estimated from loop data and can differ considerably from the real noise signal.
The variable d(k) is generated by two parts, one of which reflects the model discrepancy caused by MPM, and the formula can be rewritten to make this explicit. In this work, we bring d(k), u(k) and Granger Causality Analysis together for MPM detection. One virtue of this formulation is that it indicates whether model-plant mismatch exists in the closed-loop system. Another is that it makes signal sequences such as d(k) or a(k) easier to estimate and compute, which reduces the data loss incurred by orthogonal projection or system identification.

Introduction of Decussation Location
According to Figure 2, the correlation analysis between the input and the disturbance (CAID) focuses on the relationship between the controller output u(k) and the estimated disturbance d(k). Because d(k) cannot be obtained directly from the closed loop, it must be estimated before further calculation. The CAID result is the correlation r(u, d).
The projection onto the orthogonal complement of Z(k) is used for this task, where Z(k) consists of two stacked variables.
The orthogonal projection yields the estimated sequences $\hat{a}(k)$ and $\hat{d}(k)$.
The model quality index (MQI) is a simple control performance assessment index for evaluating the system. Here a(k) is the real noise sequence while $\hat{a}(k)$ is the estimated one. The decussation location method combines the two techniques: CAID is designed to locate the row channel of the mismatch, while MQI detects the line channel.
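The CAID quantity r(u, d) can be illustrated with a short sketch. The exact definition used in [26] is not reproduced in this section, so the code below assumes the ordinary sample (Pearson) correlation coefficient; the function name and the data are hypothetical and serve only to show the computation.

```python
import math

def pearson_correlation(u, d):
    """Sample Pearson correlation coefficient between two equal-length sequences."""
    if len(u) != len(d) or len(u) < 2:
        raise ValueError("sequences must have equal length of at least 2")
    mean_u = sum(u) / len(u)
    mean_d = sum(d) / len(d)
    cov = sum((a - mean_u) * (b - mean_d) for a, b in zip(u, d))
    var_u = sum((a - mean_u) ** 2 for a in u)
    var_d = sum((b - mean_d) ** 2 for b in d)
    if var_u == 0.0 or var_d == 0.0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_u * var_d)

# Hypothetical data: the estimated disturbance moves opposite to u,
# so r(u, d) comes out strongly negative.
u = [0.0, 1.0, 2.0, 3.0, 4.0]
d_hat = [1.0, 0.4, -0.1, -0.5, -1.1]
r_ud = pearson_correlation(u, d_hat)
```

A value of r(u, d) near zero would be read as little linear dependence between the controller output and the estimated disturbance; how far from zero counts as mismatch is a threshold choice that this sketch does not make.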

Causality
It was Granger who first introduced causality and its practical implementation into economic data analysis [28]. Causality analysis is a data processing method in the context of linear regression.
Assume that there are two stochastic processes $X_t$ and $Y_t$, each of which can be modeled as a linear autoregressive process. Then $X_t$ can be written as the sum of an autoregressive part with coefficients $b_{1j}$ and a remaining noise term $a_{1t}$:
$$X_t = \sum_{j=1}^{n} b_{1j} X_{t-j} + a_{1t}.$$
Under normal conditions the remaining part is white noise with zero mean and variance $\Sigma_1$:
$$E(a_{1t}) = 0, \qquad D(a_{1t}) = \Sigma_1,$$
where "E()" denotes the expectation and "D()" the statistical variance.
Since the sequence is a time series, $X_t$ can be decomposed in terms of its own past, and the autoregression reflects the relationship between the data and its past to a certain degree. However, $X_t$ may also be driven by other causes, so autoregression plus a noise term cannot characterize all the causation at once. If an engineer judges that the process $Y_t$ could cause changes in $X_t$, a prediction that includes the past of $Y_t$ is made to test the causality:
$$X_t = \sum_{j=1}^{n} b_{2j} X_{t-j} + \sum_{j=1}^{n} c_{2j} Y_{t-j} + a_{2t}.$$
The mean of the white noise sequence $a_{2t}$ is also zero, and its variance $D(a_{2t})$ becomes $\Sigma_2$. If the predictability of $X_t$ is enhanced by the presence of $Y_t$, that is, if $\Sigma_2$ is smaller than $\Sigma_1$, there is causality from $Y_t$ to $X_t$, measured by the index
$$F_{Y \to X} = \ln \frac{\Sigma_1}{\Sigma_2}.$$
If $Y_t$ and $X_t$ are independent, $Y_t$ is of no use for predicting $X_t$, so $\Sigma_1 = \Sigma_2$ and the index $F_{Y \to X}$ is zero, which shows that there is no causality from $Y_t$ to $X_t$.
The total index $F_{X,Y}$ is composed of three portions: the causality from $Y_t$ to $X_t$, the causality from $X_t$ to $Y_t$, and the instantaneous causality between $Y_t$ and $X_t$.
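The comparison of the two residual variances can be sketched numerically. The code below is a minimal illustration, not the authors' implementation: it assumes the usual log-ratio form of the index, ln(Σ1/Σ2), fits both regressions by ordinary least squares without an intercept (the data are taken as zero-mean), and uses the same lag order for both sequences. All function names and the synthetic data are hypothetical.

```python
import numpy as np

def lagged_matrix(series, order):
    """Columns are series[t-1], ..., series[t-order] for t = order .. len-1."""
    n = len(series)
    return np.column_stack([series[order - j : n - j] for j in range(1, order + 1)])

def residual_variance(target, regressors):
    """Mean squared residual of the least-squares fit target ~ regressors."""
    coef, *_ = np.linalg.lstsq(regressors, target, rcond=None)
    residual = target - regressors @ coef
    return float(np.mean(residual ** 2))

def granger_index(x, y, order):
    """Index for causality from y to x, assumed here to be ln(Sigma_1 / Sigma_2)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    target = x[order:]
    own_lags = lagged_matrix(x, order)
    both_lags = np.hstack([own_lags, lagged_matrix(y, order)])
    sigma_1 = residual_variance(target, own_lags)    # autoregression only
    sigma_2 = residual_variance(target, both_lags)   # autoregression plus lags of y
    return float(np.log(sigma_1 / sigma_2))

# Synthetic example (hypothetical data): y drives x with a one-step delay.
rng = np.random.default_rng(0)
y = rng.standard_normal(2000)
x = np.zeros(2000)
for t in range(1, 2000):
    x[t] = 0.5 * x[t - 1] + 0.8 * y[t - 1] + 0.1 * rng.standard_normal()

f_y_to_x = granger_index(x, y, order=2)  # clearly positive: y helps predict x
f_x_to_y = granger_index(y, x, order=2)  # close to zero: x does not help predict y
```

Because the second regression contains the first as a special case, the in-sample index is never negative; in practice a threshold or a statistical test is needed to decide when a small positive value is only sampling noise.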
Equations (9) and (11) describe how d(k) changes with the mismatch part and the disturbance part. An analysis from the cause to the affected variable is enough to judge the causality. Therefore, using the index $F_{Y \to X}$ to test the causality from the mismatch ∆G to the signal d(k) can help to detect the fault.

Causality for Detecting the Mismatch
The best way to analyze model-plant mismatch is to detect or assess the fault rather than to identify the coefficients in closed loop. System identification is a sound means of monitoring and diagnosing the working condition, but it often encounters practical obstacles, such as limited dependability of detection and identification, limited precision of the estimated model parameters, and the high cost of implementation in closed loop. The internal model control structure, in contrast, embodies both the nominal model and the practical one in a single closed loop and gives direct access to the model-plant difference. Lijuan Li [26] estimated the white noise and the input-type disturbance and combined these with the controller output using statistical correlation. Because many statistical approaches suffer interference from data and noise, the estimated sequences share this weakness. Granger causality analysis avoids this estimation problem, reduces the loss of information in the calculation, and yields the causality between two or more sequences.
Whether or not there is model mismatch in the system, the sequence d(k) always contains the noise. In this section, $D_t$ denotes the data sequence of the variable d(k) and $U_t$ that of u(k); likewise, $d_t$ stands for d(k) and $a_t$ for a(k). The autoregression of $D_t$ is written as
$$D_t = \sum_{j=1}^{n} b_{1j} D_{t-j} + \varepsilon_{1t},$$
where each $b_{1j}$ is a coefficient of the autoregression of $D_t$, and $\varepsilon_{1t}$ is the remaining deviation, which can hardly be avoided.

Theorem 2. The existence of Causality
(i) $\varepsilon_{1t}$ is in fact $a_t$, and it is also a white noise sequence.
(ii) Causality between $D_t$ and $U_t$ exists: $U_t$ can be used to predict $D_t$.

Proof of Theorem 2. The existence of Causality
For the models in Figure 2, the transfer function $G_P$ can be expanded into a polynomial in $q^{-1}$, where $q^{-1}$ denotes the backward shift operator.
Every discrete transfer function model can be expanded in this way, and the expansion is in general infinitely long, so the number of terms n tends to infinity. When n is large enough, the remaining coefficients are zero in theory.
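The expansion of a rational transfer function into a series in the backward shift operator can be sketched by polynomial long division. The helper below is a hypothetical illustration; the first-order model used in the example is an assumption chosen for readability and is not taken from the paper.

```python
def impulse_response(num, den, n_terms):
    """First n_terms coefficients g_0, g_1, ... of num(q^-1) / den(q^-1).

    num and den list polynomial coefficients in ascending powers of q^-1,
    and den[0] must be non-zero.
    """
    if den[0] == 0:
        raise ValueError("den[0] must be non-zero")
    g = []
    for k in range(n_terms):
        b_k = num[k] if k < len(num) else 0.0
        # Contribution of the already computed coefficients through den.
        acc = sum(den[j] * g[k - j] for j in range(1, min(k, len(den) - 1) + 1))
        g.append((b_k - acc) / den[0])
    return g

# Hypothetical first-order model: G(q) = 0.5 q^-1 / (1 - 0.8 q^-1).
# The coefficients are approximately 0, 0.5, 0.4, 0.32, 0.256, 0.2048.
coeffs = impulse_response([0.0, 0.5], [1.0, -0.8], n_terms=6)
```

For a stable model the coefficients decay geometrically, which is why truncating the expansion at a sufficiently large n loses little information.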
The model ∆G is the conceptual deviation between the two models, which can hardly be obtained or calculated in real industrial processes.
(i) If no mismatch occurs in the loop, ∆G = 0 and $D_t$ contains only the disturbance. Expanding $N^{-1}$ into a polynomial in $q^{-1}$ and rearranging Equation (27) gives an autoregressive expression for $D_t$. Comparing Equation (22) with Equation (28) identifies the coefficients $b_{1j}$ and shows that the remaining noise term $\varepsilon_{1t}$ is $a_t$. According to the structure in Figure 2, the noise $a_t$ is white with zero mean. Hence, part (i) is proved.
(ii) Suppose that model-plant mismatch occurs in the process. Neither causality analysis nor correlation analysis can remove the noise, and even system identification ultimately leaves the noise part as a residual. Therefore, white noise is the only information left in the end.
Suppose that $N^{-1} \Delta G$ is expanded into a polynomial in $q^{-1}$. The prediction of $D_t$ then includes lagged terms of $U_t$ as well as of $D_t$. Since this regression is a new fit, the coefficients change from $b_{1j}$ to $b_{2j}$ and from $c_{1j}$ to $c_{2j}$, and the residual $\varepsilon_{2t}$ likewise differs from $\varepsilon_{1t}$. Equation (34) is the causality regression of Equation (19), with the sequence $U_t$ from the control loop used to predict $D_t$.
The white noise $a_t$ is regarded as independent of the other sequences, and the relation between $a_t$ and $U_t$ is almost zero because of the time delay in the system. The transfer function between $a_t$ and $U_t$ stands for any path from the source $a_t$ to the end point $U_t$. Because the time delay $q^{-d}$ acts on the control system, its effect is equivalent to lagging the noise from $a_t$ to $a_{t-d}$.
For white noise $a_t$, the expectation of the product of two samples at different times is zero.
$\Psi_D$ denotes the information about $D_t$ that is left after the Granger regression. The variance of $D_t$ after regression, namely the variance of $\Psi_D$, consists of two parts: the first comes from the noise $a_t$ and the second from $U_t$. Since the autoregression of $D_t$ can extract only a little information about $U_t$, the variance of $U_t$ always remains in the statistical characteristics of $D_t$.
$K_U$ is the aggregate coefficient mainly caused by $N^{-1} \Delta G$. For the Granger causality index of Equation (20), it is clear that the data of $U_t$ reduce the residual variance of $D_t$. Therefore, causality between these two variables necessarily exists, and part (ii) is proved.
Theorem 3. Changes of the MPM in parameters or model structure, as opposed to changes of the mismatch position, do not affect the causality judgment.
As long as model mismatch exists in the closed-loop system, an increase or decrease of the practical model relative to the nominal model, a change of its type, or any other change of the plant model has no influence on the causality judgment.
Proof of Theorem 3. When the real industrial model changes, the time sequence $D_t$ moves to a new level.
A unified function $\Delta G_2$ here represents all kinds of model-plant mismatch, and any change in $G_P$ is included.
Even if the mismatch differs from the former one, the regression variance information $\Psi_D$ simply changes to a new value. Raising the regression order for $u_t$ and the autoregression order for $D_t$ decreases the regression variance of $D_t$. The causality analysis can be regarded as complete only once the variance of $D_t$ has dropped to its minimum value.
The change in variance mainly depends on $D(U_t)$ and its coefficient $K_{U2}$. Since the variance $D(U_t)$ is greater than or equal to zero, Equation (51) indicates that the existence of model mismatch produces causality among the corresponding sequences. The Granger index $F_{U \to D} > 0$ shows the invariance of causality: a change of the mismatch affects the values of the variances and of the Granger index but not the causality judgment. By Theorem 3, the judgment of causality is thus immune to model changes. The prediction of the variable $D_t$, however, is affected by the data and by the regression order.
Theorem 4. Raising the model orders leads to a monotone increase of the Granger index.
(i) Increasing the autoregressive model order reduces the autoregression variance toward a lower bound, and the minimum variance is that of the white noise $a_t$, with value $\sigma_a^2$.
(ii) The variance of $D_t$ is reduced by the prediction from $U_t$, and its minimum value is the same as before.
(iii) The Granger index increases with the model orders.
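The order dependence stated above can be checked on synthetic data. The sketch below is an illustration under stated assumptions, not a reproduction of the paper's experiment: the mismatch and noise models are hypothetical first-order filters, $U_t$ is replaced by an open-loop white sequence rather than a closed-loop MPC output, the index is taken as ln(Σ1/Σ2), and all regressions are fitted in-sample on the same targets so that the nested models are directly comparable.

```python
import numpy as np

def regression_variance(d, u, n_d, n_u, max_lag):
    """Residual variance of D_t regressed on n_d own lags and n_u lags of U_t.

    The first max_lag samples are always skipped so that every order is
    fitted on the same targets (a choice made for this sketch).
    """
    cols = [d[max_lag - j : len(d) - j] for j in range(1, n_d + 1)]
    cols += [u[max_lag - j : len(u) - j] for j in range(1, n_u + 1)]
    target = d[max_lag:]
    if not cols:
        return float(np.mean(target ** 2))
    X = np.column_stack(cols)
    coef, *_ = np.linalg.lstsq(X, target, rcond=None)
    return float(np.mean((target - X @ coef) ** 2))

# Hypothetical data: D_t = (mismatch acting on U_t) + (colored noise).
rng = np.random.default_rng(1)
n = 3000
a = rng.standard_normal(n)        # white noise a_t
u = rng.standard_normal(n)        # stand-in for the controller output U_t
mismatch = np.zeros(n)
noise = np.zeros(n)
for t in range(1, n):
    mismatch[t] = 0.6 * mismatch[t - 1] + 0.3 * u[t - 1]
    noise[t] = 0.5 * noise[t - 1] + a[t]
d = mismatch + noise

max_lag = 6
sigma_1 = regression_variance(d, u, n_d=max_lag, n_u=0, max_lag=max_lag)
sigma_2 = [regression_variance(d, u, n_d=max_lag, n_u=k, max_lag=max_lag)
           for k in range(1, max_lag + 1)]
granger = [float(np.log(sigma_1 / s)) for s in sigma_2]  # index per order of U_t
```

With in-sample least squares the residual variance of a larger nested model can never exceed that of a smaller one, so the list `granger` is non-decreasing by construction; the increments shrink as the extra lags carry less information, which is the practical reason for not choosing the order too high.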
Proof of Theorem 4. The summation in Equation (34) contains infinitely many terms, that is, n → ∞. In practice, no regression can be carried out over infinitely many terms. Moreover, when n is large enough, the remaining terms tend to zero, so there is no need to compute them. Assume that the selected model orders are $n_1$ for $D_{t-j}$ and $n_2$ for $U_{t-j}$, respectively.
Since regression analysis removes the regressed part, the information that is left is described by Ψ. The remaining sequences reflect the data that cannot be used or regressed any further.
Now let the model order increase from $n_1$ to $n_1 + \Delta n_1$. In steady systems, the data $U_t$ contain the information of $u_t$ and of the mismatch, and combining this with Equation (37) gives the regression variance D(Ψ) of Ψ. Since the autoregression captures little information about the mismatch transfer function and seldom includes the controller output sequence $u_t$, the variance D(Ψ) is composed of two parts: $\sigma_a^2$, generated by the white noise of the system, and the variance of the manipulated variable $\sigma_{U_t}^2$, which comes from the MPC in practice.
The coefficient of $\sigma_{U_t}^2$ is decomposed into three parts. When the regression length of $U_t$ is $n_1$, a certain variance is extracted from D(Ψ) with weighting factor $K_1$.
The regression from $u_t$ to Ψ thus extracts a certain amount of information from the sequence.
Increasing the model order means that the information from $n_1$ to $n_2$ is also predicted, and the variance changes accordingly. The initial autoregression is only a rough decomposition, so D(Ψ) still includes much of the influence of $U_t$.
Comparing the regression variance when the model order of $U_t$ is $n_1$ with the variance after the order is increased, and comparing the corresponding Granger indices, shows that for any model order $n_2 > n_1$ the regression variance decreases and the Granger index rises from $F_{n_1}$ to $F_{n_2}$. A monotone increase of the model order from $n_1$ to $n_2$ therefore results in a monotone increase of the Granger causality index.
(iii) Combining the two proofs, it is clear that raising the model orders reduces the variance, which reflects the depth of the prediction. The decrease of the variance raises the Granger index in both the regression and the autoregression calculations.

Further Location of MPM with Assistance of Noise Model Correction
The coupling of variables in closed loops adds a great deal of correlation to the calculations. Purely statistical algorithms can detect model mismatch to some extent but can hardly determine its position. Granger causality is similarly influenced by the data interconnections in the system. A simple mismatch causes no trouble for MPM location, but complex conditions reduce the reliability of the location judgment. Here we make full use of the model information about the loop to assist in locating the MIMO channel.
A reasonable regression needs a proper prediction of the variable $D_t$ from $u_t$, and the assumption ∆G → 0 is reasonable. The residual error of the model is Ψ, and the regression yields the coefficient sequence $-g_1 q^{-1}, \cdots, -g_n q^{-n}$. This regression sequence is compared with the true one in Equation (72), and an index is introduced to measure the coefficient deviation. The discrepancy is usually less than 10% under normal conditions, and the parameter λ is the discrepancy that the user can tolerate.
The parameter γ can be used to monitor the quality of the regression.
In view of Equation (79), a reasonable choice of λ limits the parameter discrepancy in the computation, and a value of 10% is sufficient for λ.
For MIMO systems, the detection variable $D_t$ is affected by several inputs. Assume that there is no MPM in the channel from $u_{1t}$ to this loop; then $\Delta G_{u1} = 0$ and $D_t$ is made up of the other variables.
In that case $u_{1t}$ has no Granger causality with $D_t$, and the prediction variable should be $u_{2t}$.
The relative size of these variances reflects the causality of each variable.
If both variables contribute to the fluctuation of $D_t$, the comparison with the noise model allows the causality method to locate the specific channel.

Wood and Berry Distillation
The Wood and Berry distillation column is a typical binary column model for a methanol-water mixture and forms a MIMO system. It has been widely used in the literature as a benchmark for MIMO control schemes, for in-depth study, comparison and engineering practice. The closed loop contains four groups of elements: the two manipulated variables (MV) $u_1$ and $u_2$, the two controlled variables (CV) $y_1$ and $y_2$, the 2 × 2 transfer function G(s), and the disturbance noise signals $d_1$ and $d_2$. These symbols are related through the process model equation.
The variables and their meanings are listed in Table 1. In practice, the continuous transfer function is discretized according to the sampling period. Considering the physical units of the two manipulated variables (MV), the zero-order hold (ZOH) sampling time is 1 min.
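The zero-order-hold discretization of a first-order-plus-dead-time element can be sketched as follows. The numerical values are the commonly quoted Wood-Berry parameters from the literature and are an assumption here, since the process model equation itself is not reproduced in this section; the helper name is hypothetical, and the dead time is assumed to be an integer multiple of the sampling time.

```python
import math

def zoh_first_order(gain, time_constant, dead_time, sample_time=1.0):
    """Zero-order-hold equivalent of gain * exp(-dead_time*s) / (time_constant*s + 1).

    Returns (b, a, delay) for the discrete model
        G(q) = b * q^-(delay + 1) / (1 - a * q^-1),
    assuming dead_time is an integer multiple of sample_time.
    """
    delay = round(dead_time / sample_time)
    if abs(delay * sample_time - dead_time) > 1e-9:
        raise ValueError("dead_time must be a multiple of sample_time")
    a = math.exp(-sample_time / time_constant)
    b = gain * (1.0 - a)
    return b, a, delay

# Commonly quoted Wood-Berry parameters (assumed, not taken from this text):
# (gain, time constant in min, dead time in min).
WOOD_BERRY = {
    "G11": (12.8, 16.7, 1.0),
    "G12": (-18.9, 21.0, 3.0),
    "G21": (6.6, 10.9, 7.0),
    "G22": (-19.4, 14.4, 3.0),
}
discrete = {name: zoh_first_order(*p) for name, p in WOOD_BERRY.items()}
```

Each discrete element keeps the steady-state gain of its continuous counterpart, since b / (1 - a) equals the continuous gain; the extra one-sample delay in the numerator comes from the hold itself.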
The discretized transfer function of disturbance noise is N(q).
The MPC controller was selected as the core of this MIMO closed-loop system and implemented in the process. The prediction horizon of the MPC is 100 and the control horizon is 10, the same as in [26] for comparison. The weighting factors of the controlled variables (CV), the weighting factors of the manipulated variables (MV), and the setpoints of the two outputs are (1, 50), (1, 2) and (90, 5), respectively, as listed in Table 2. The variances of the two white noises a(t) are equal to 1.0, and the discretized disturbance is d(t) = N(q)a(t). In this section, the Granger causality method is carried out under the condition that there is no model-plant mismatch in the system. Data from this well-behaved operating mode can be gathered as a benchmark database for later fault diagnosis.
Following Equation (17), the Granger causality analysis starts from the autoregression of the variables. d(k) in each channel is regressed so that a baseline for the system data can be obtained and compared later.
The label "R(u1,0),AR(d1,1-20)" in Figure 3 means that the analysis involves only the autoregression of $d_1(k)$ in the first closed loop, with no other factor entering the analysis. The change from "R(u1,0)" to "R(u1,1)" means that a one-step Granger regression is implemented to detect the causality from the manipulated variable $u_1(k)$ to the detected data sequence $d_1(k)$ in the first channel. The red line corresponds to the variances of the autoregression mode alone. The blue curve describes the one-step causality regression on $u_t$, and the green one describes the two-step causality regression.
Each of the four panels in Figure 3 contains three kinds of curves. The red-point curve shows the result of the causality analysis. The regression variance decreases as the autoregression length increases, which is equivalent to model order selection in system identification. A higher order can depict more details of the whole system but makes control more difficult. It is necessary and reasonable to choose a proper order, for example with the AIC criterion. Since the nominal model is known, an order higher than the one at which the variance converges is costly and unnecessary. The largest length in the experiment is 20, which is enough to reveal the convergence of the autoregression.
"Times" means the length of the regression or autoregression coefficients, namely the result of model order selection (see Table 3). The variances of the white noises $a_1$ and $a_2$ were set to 1, which serves as a simplified theoretical noise benchmark. In a real plant, however, the variance never exactly equals the set value and changes with data fluctuations: sometimes it is a little higher or lower than the designed one, and the fluctuating value converges to the theoretical variance.
Table 3. Explanations of labels in Figure 3.
From Theorem 4, the raise of model orders results in monotone increasing of Granger index. If curves do not coincide, such monotone increasing will cause regression curves to approach the benchmark line. Hence, model order should not be too high. Figure 3 displayed the problems that statistical characteristics of d 1 (k) and d 2 (k) were approximately equal to theoretical value 1. In consideration of that the concept of overfitting and statistical fluctuation are inevitable, the results of variance could be accepted.
The model orders were embodied in the lengths of the auto-regressions of d_1(k) and d_2(k). Too large an order leads to overfitting, and both kinds of variance declined gradually overall because of it. Unlike in system identification, the model order matters less here than the causality. To avoid statistical interference in the Granger analysis, upper and lower limits were adopted. If the causality regression results of u_1(k) or u_2(k) lie within the limit lines, the variance of d_1(k) is regarded as unchanged, with 0.975 ≤ Σ_1 ≤ 0.99. The expectation of Σ_2 is almost the same, Σ_2 ≈ Σ_1, so that l_{b,d_1} ≤ Σ_2 ≤ u_{b,d_1}.
Similarly, it can be obtained that

Causality Compared with Decussation in G_21 Mismatch
The decussation method consists of two techniques: the correlation analysis method between the input and the disturbance (CAID) and the model quality index (MQI). Used together, the two can help to locate MPM positions.
The process model varied from G_M(q) to G_P(q).
Obviously, MPM occurred in Equation (92), and G_21 was mismatched in the system. The correlation between u1, the manipulated variable u_1(k) of the first channel, and ds, the estimated noise in the CAID algorithm, was below zero, and its absolute value declined from 0.7071 to 0.4255 in Tables 4 and 5. However, the correlation between u2 and ds also decreased. Besides, MQI(2) was 0.9697, close to 1.0 and thus nearly as ideal as the theoretical value.

Mathematical Symbols    Values
CAID(u1,ds)             −0.4255
CAID(u2,ds)             0.1242
MQI(1)                  1.0
MQI(2)                  0.9697

In short, MQI lost its accuracy in locating the MPM because the model quality index showed no significant decrease. Meanwhile, CAID suggested that both loops were mismatched, which was not the case, since only G_21 was mismatched in Equation (92).
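As a rough illustration of the kind of quantity reported as CAID(u,ds), the sketch below computes a sample correlation coefficient between an input sequence and an estimated disturbance. It assumes that the CAID entries reduce to a Pearson correlation, which this section does not confirm; the signals, the gain of −0.5 and the function name are hypothetical, and the data are open-loop, so the cross-loop interference discussed above is not reproduced.

```python
import math
import random

def correlation(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

rng = random.Random(1)
n = 5000
u1 = [rng.gauss(0.0, 1.0) for _ in range(n)]
u2 = [rng.gauss(0.0, 1.0) for _ in range(n)]
noise = [rng.gauss(0.0, 1.0) for _ in range(n)]

# Hypothetical estimated disturbance: white noise plus a part driven by u1 only.
ds = [-0.5 * a + e for a, e in zip(u1, noise)]

caid_u1 = correlation(u1, ds)
caid_u2 = correlation(u2, ds)
print("corr(u1, ds):", caid_u1)
print("corr(u2, ds):", caid_u2)
```

A correlation of this kind has no notion of direction or lag, which is one reason it can be disturbed by mutual interference between loops.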
In Figure 4a,b the red and blue curves coincide, which implies that there may be no causality, because the regression on u_t contributes no decrease in variance. The variances also lie below 1.0, close to 0.97, as in the case of auto-regression overfitting. Even where the green curves show some causality, the variances stay close to the convergence value.
Figure 4a,b reveals that d_1(k) in the first channel had no mismatch, the same condition as in Figure 3. The bottom panels showed some differences and suggested a mismatch.
The sequence d_2(k) is still composed of two parts. The first is the disturbance noise a_t passed through its transfer function N(q). The other is the mismatched part, ∆G × u_t. Thus, the auto-regression variance of d_2(k) becomes larger than its theoretical value.
The variance of d_2(k) was above 1.0 at the beginning and declined remarkably after the Granger causality analysis.
The bottom two panels (c) and (d) of Figure 4 indicate that a model-plant mismatch exists in the second channel, but the user cannot yet judge where the fault is or whether two channels are mismatched together. To overcome this shortcoming, the assistant model of the noise d_2(k) is introduced here.
Equation (93) showed that the causality in the second loop was larger than zero, which confirmed the existence of a mismatch. Because of the model mismatch in G_21, the Granger index changed with this kind of interference. At this point, d_2(k) consisted of two parts: the mismatch part and the noise part.
If G_21 is mismatched, the Granger causality index of the remaining sequence decreases after the regression from u_1 to d_2. Before the regression, the sequence d_2(k) contains the mismatch information. After the regression, the mismatched signal has been extracted, so the remaining sequence d_2(k) contains nothing related to u_1 and G_21.
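The extraction argument can be sketched numerically. The code below compares the residual variance of an auto-regression of d_2 with that of a regression that also includes lagged inputs, and reports the log ratio as a Granger index. The log-ratio form of the index, the hypothetical mismatch term 0.5·u_1(k−1), the white-noise disturbance and the open-loop inputs are all assumptions of this sketch and are not taken from the paper's equations.

```python
import math
import random

def residual_variance(rows, targets):
    """Least-squares fit of targets on rows; returns the residual variance."""
    p = len(rows[0])
    # Augmented normal equations [X'X | X'y].
    aug = [[sum(r[i] * r[j] for r in rows) for j in range(p)]
           + [sum(r[i] * t for r, t in zip(rows, targets))]
           for i in range(p)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(p):
        piv = max(range(c, p), key=lambda row: abs(aug[row][c]))
        aug[c], aug[piv] = aug[piv], aug[c]
        for row in range(p):
            if row != c:
                f = aug[row][c] / aug[c][c]
                aug[row] = [vr - f * vc for vr, vc in zip(aug[row], aug[c])]
    beta = [aug[i][p] / aug[i][i] for i in range(p)]
    resid = [t - sum(b * v for b, v in zip(beta, r))
             for r, t in zip(rows, targets)]
    return sum(e * e for e in resid) / len(resid)

def granger_index(d, u, n_ar=2, n_u=2):
    """ln(var_AR / var_ARX) when n_u lags of u are added to an AR(n_ar) fit of d."""
    start = max(n_ar, n_u)
    ks = range(start, len(d))
    targets = [d[k] for k in ks]
    ar_rows = [[d[k - i] for i in range(1, n_ar + 1)] for k in ks]
    arx_rows = [row + [u[k - j] for j in range(1, n_u + 1)]
                for row, k in zip(ar_rows, ks)]
    return math.log(residual_variance(ar_rows, targets)
                    / residual_variance(arx_rows, targets))

def simulate(n=2000, delta_gain=0.5, seed=3):
    """Hypothetical data: d2(k) = a(k) + delta_gain * u1(k-1); u2 is unrelated."""
    rng = random.Random(seed)
    u1 = [rng.gauss(0.0, 1.0) for _ in range(n)]
    u2 = [rng.gauss(0.0, 1.0) for _ in range(n)]
    d2 = [rng.gauss(0.0, 1.0) + (delta_gain * u1[k - 1] if k > 0 else 0.0)
          for k in range(n)]
    return u1, u2, d2

u1, u2, d2 = simulate()
print("index u1 -> d2:", granger_index(d2, u1))
print("index u2 -> d2:", granger_index(d2, u2))
```

Under these assumptions the index from u_1 is clearly positive, while the index from u_2 is close to zero and only reflects the small variance reduction that any extra regressor gives.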
The upper panel of Figure 5 shows the better identification. The first coefficient of the noise model N(q) is 0.9; the maximum value in the upper panel is almost 0.9, and the remaining coefficients are close to zero. The lower panel shows a worse identification. Because neither u_2 nor G_22 contributed to the mismatch, the extraction produced the wrong shapes; including u_2 only gave the regression more room to fit, so overfitting occurred.
Figure 5a depicts a series of curves that fluctuate slightly and converge to the theoretical values of the noise transfer function. Its convergence is better than that of Figure 5b because the regression on u_t extracted most of the causality in the channel from u_1 to d_2. Figure 5b remains bounded, but a regression is correct only when the top line equals 0.9, as set in the noise model, and the other parameters stay at zero. Because the channel from u_2 to d_2 contributed no causality to the MIMO system, the regression in this channel carried no information about the mismatch. The causality algorithm forced it to overfit: the parameters that are not near zero were not essential, but once included they were over-fitted. The convergence of the coefficients indicates a correct regression from u_i to d_j and helps to locate the channel from i to j. Better convergence implies a more reasonable regression, and a smaller variance indicates a more significant Granger causality index. It is the truly mismatched channel that reflects the occurrence of MPM, so extracting the information of this channel by regression helps to obtain a stable causality calculation. The index γ(i) can quantify the fluctuations of the parameters in N(q).
With λ = 10%, the regression effects γ(i) are shown in Figures 6 and 7. All the parameters in Figure 6 were less than λ = 0.1, and their variances were less than λ² = 0.01 as well. This better regression revealed that the mismatch was from u_1 to y_2, which showed that G_21 had an MPM problem.
Among the 20 coefficients in Figure 7, eight were larger than λ² = 0.01 and exceeded the limit. This overfitting arose because the regression was applied to a channel without mismatch.
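The threshold check described for Figures 6 and 7 can be sketched as follows. The code takes the values γ(i) as given, since their defining equation is not reproduced here, and counts how many exceed λ². The rule that a channel is accepted as the mismatch location only when no value exceeds the limit is an assumption of this sketch, as are the function names and the example numbers, which are not the paper's data.

```python
def over_limit(gamma, lam=0.10):
    """1-based indices i whose gamma(i) exceeds the squared tolerance lam**2."""
    limit = lam ** 2
    return [i for i, g in enumerate(gamma, start=1) if g > limit]

def is_mismatch_channel(gamma, lam=0.10):
    """Assumed rule: accept the channel only if no gamma(i) is over the limit."""
    return len(over_limit(gamma, lam)) == 0

# Hypothetical gamma values for two candidate channels.
gamma_u1_d2 = [0.004, 0.001, 0.002, 0.0005, 0.003]
gamma_u2_d2 = [0.004, 0.020, 0.015, 0.0005, 0.030]

print(over_limit(gamma_u1_d2), is_mismatch_channel(gamma_u1_d2))  # [] True
print(over_limit(gamma_u2_d2), is_mismatch_channel(gamma_u2_d2))  # [2, 3, 5] False
```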
With the help of Equation (94), the location was completed, and the mismatch was found in G_21.

Causality Compared with Decussation in G_21 and G_12 Mismatches
In this section, two mismatches occurred in different channels, and they also affected different model parameters: G_21 was mismatched in its gain, while G_12 differed in its denominator coefficient.
The decussation method showed that the significant declines of MQI(1) and CAID(u2,ds) pointed to the mismatch in G_12, but the degraded CAID(u1,ds) together with the good MQI(2) could not locate the mismatch in G_21.
CAID shows a negative influence of the mismatches in both row channels. MQI(1) and CAID(u1,ds) locate an incorrect mismatch from u_1 to d_1 in Table 6. The reason is clear: data in a MIMO system interfere with each other. The information from the G_21 and G_12 mismatches propagated to the G_11 link and changed its statistical characteristics, which caused the erroneous judgment.

Mathematical Symbols    Values
CAID(u1,ds)             0.1074
CAID(u2,ds)             −0.1074
MQI(1)                  0.1178
MQI(2)                  0.8297

The G_21 and G_12 mismatches were not detected, while G_11 was wrongly judged to be mismatched. Besides, MQI(2) looked nearly normal but was slightly abnormal. If the judgment were taken as correct, the G_12 mismatch would be missed; if it were taken as incorrect, all four channels would appear mismatched, which did not happen.
These results suggest that decussation based on statistical correlation can be inaccurate, although it is an important means of analyzing control loops.
The causalities in Figure 8 reveal that all the channels showed notable decreases in variance. It was therefore possible that at least two mismatches existed in the closed-loop system.
The variance of the auto-regression of d_1(k) was 1.1 in Figure 8, suggesting that the mismatch made the Granger regression worse. The variance of d_2(k) also increased, from 0.9939 to 1.03. With the assistant model parameters, the locations of the mismatches, one from u_1 to y_2 and another from u_2 to y_1, were confirmed in view of Formula (98).
Figure 9b,c shows better convergence, which indicates two significant regression results, from u_1 to d_2 and from u_2 to d_1. The regression extracts almost all of the mismatch information, so the remaining parameters of the noise models are correct. The first coefficient of N(q) converges to 0.8 and the second to 0.9, which is consistent with Equation (88).

Quadruple-Tank Process
The quadruple-tank process, introduced comprehensively by Johansson, can be formed from two double-tank processes [29]. The process inputs are two voltages, v_1 and v_2, which drive the pumps. The outputs y_1 and y_2 are voltage values collected from the level measurements in the real system. Mass balances and Bernoulli's law must be obeyed to form the dynamic equations as follows.
Equations (98) contain four dynamic functions, one for the liquid level of each tank. Obviously, these dynamic functions are nonlinear and must be dealt with before a proper controller is implemented in the closed loops in Table 7. The variables h_1, h_2, h_3 and h_4 represent the levels of the four tanks, while the variables x_1 and x_2 describe the condition of the corresponding valves.
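For readers without the equations at hand, the sketch below implements the level dynamics in the standard two-pump form published by Johansson, built from mass balances and Bernoulli's law. It is an illustration under assumptions: x_1 and x_2 are treated as the flow-split fractions of the two valves, the parameter values are illustrative placeholders, and the exact equations and operating point used in this paper may differ.

```python
import math

G = 981.0  # gravitational acceleration in cm/s^2

def outflow(hole_area, level):
    """Bernoulli outflow a * sqrt(2 g h) from a tank with outlet area a."""
    return hole_area * math.sqrt(2.0 * G * max(level, 0.0))

def tank_derivatives(h, v, p):
    """Time derivatives of the four levels for pump voltages v = (v1, v2)."""
    h1, h2, h3, h4 = h
    v1, v2 = v
    dh1 = (-outflow(p["a1"], h1) + outflow(p["a3"], h3)
           + p["x1"] * p["k1"] * v1) / p["A1"]
    dh2 = (-outflow(p["a2"], h2) + outflow(p["a4"], h4)
           + p["x2"] * p["k2"] * v2) / p["A2"]
    dh3 = (-outflow(p["a3"], h3) + (1.0 - p["x2"]) * p["k2"] * v2) / p["A3"]
    dh4 = (-outflow(p["a4"], h4) + (1.0 - p["x1"]) * p["k1"] * v1) / p["A4"]
    return [dh1, dh2, dh3, dh4]

def simulate(h0, v, p, dt=0.1, steps=20000):
    """Forward-Euler integration of the level equations with constant inputs."""
    h = list(h0)
    for _ in range(steps):
        dh = tank_derivatives(h, v, p)
        h = [max(level + dt * rate, 0.0) for level, rate in zip(h, dh)]
    return h

# Illustrative parameters (cm^2 for areas, cm^3/(V s) for pump gains).
params = {"A1": 28.0, "A2": 32.0, "A3": 28.0, "A4": 32.0,
          "a1": 0.071, "a2": 0.057, "a3": 0.071, "a4": 0.057,
          "k1": 3.33, "k2": 3.35, "x1": 0.70, "x2": 0.60}

steady_levels = simulate([10.0, 10.0, 2.0, 2.0], [3.0, 3.0], params)
print("steady-state levels:", steady_levels)
```

The square-root outflow terms are the source of the nonlinearity; linearizing around a steady state such as the one computed here gives the kind of linear model that a linear MPC needs.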
Each output consists mainly of the response of the controlled plant model plus unmeasurable noise, so the output can be written in a unified form. Therefore, all the outputs can be written together in a single matrix format.
The quadruple-tank process in this section is a four-input and four-output nonlinear system. The process must be linearized before a suitable model predictive controller is implemented for this loop (Table 8). The fourth setpoint values are limited to a special zone so that its real input sequences stay between −9 and 9 in Figure 10.

No Model Mismatch under Complex Inputs
The MIMO system was controlled by a linear MPC implemented with the Model Predictive Control Toolbox in MATLAB under two complex inputs.
All the variances are less than those of the initial white noise sequences, which equal 1.000. Besides, the red-point line almost coincides with the blue and green curves in Figure 11. These two observations indicate slight overfitting of each white noise signal in the four channels, which implies that there is no model-plant mismatch in this system.

Higher Order Mismatch in G_23
The transfer function in channel G_23 is altered to a higher-order form. Figure 12e-h presents conditions different from those in Figure 11. The variances are greater than 1.000, which shows that a mismatch exists in at least one channel of the second loop.
The three curves in Figure 12e-h almost coincide, while Figure 12f reveals a little Granger causality.
The remaining panels of Figure 12 show that there is no mismatch in the first, third and fourth loops of the system. The variances in Figure 12 range between 0.97 and 0.93, while those in Figure 11 vary from 0.96 to 0.92.
To further locate the mismatched channel, the assistant disturbance transfer function models were used.
The disordered regressions in Figure 13b,d reveal that these two channels have no mismatch, so their regression results diverge. The fluctuations in Figure 13a,c are not as obvious, which implies that MPM might occur there.

Integral Mismatch in G_32
When the process undergoes an integral change in some channel, the model structure changes. Here the plant model G_32 had an integral mismatch. MPC is a controller that depends on the model structure, the parameters and the degree of model accuracy. When an integral mismatch arose in the G_32 channel, the plant had a different structure from the nominal model for which the initial model predictive controller had been designed and tuned. This alteration therefore introduced instability into the system and resulted in the divergence of all the outputs in Figure 14.
When any loop in the system is no longer stable in routine operation, the operating data of the process cannot be used to analyze the condition effectively.
(i) If the integral section does not exist in the nominal model and appears as a model-plant mismatch, the system is more likely to become unstable, and the instability will prevent engineers from analyzing the mismatch.
(ii) If the integral section is always present, mismatches in its parameters that do not affect stability can be treated by Granger causality analysis in the same way as the other mismatch conditions.

Zero Gain Mismatch in G_33
Some channels have a zero-gain module, which means that the manipulated variable has no influence on the corresponding loop. However, a zero gain may later disappear, so that the plant model in that channel is no longer zero. This kind of mismatch must also obey the rule that the MPM does not affect system stability.
The panels in the third row of Figure 15 and Figure 16c imply that there is a mismatch in the G_33 channel. The monitored variances in Figure 17a,b,d are larger than 0.1 in the third loop, which indicates poor regression results from the channels without mismatch. The variance in Figure 17c is less than 0.1 as expected, in accordance with the calculation in Equation (109). The location of this mismatch was completed with the assistant noise models.

Conclusions
Granger causality analysis can be used to detect mismatch in closed-loop systems and obtains better results and accuracy from operating data than the decussation location method using CAID and MQI. The decussation technique relies on statistical correlation methods and is easily affected by the mutual interference of control loops.
Granger causality analysis gives users explicit feedback on whether the channels are affected by MPM. A causality index greater than zero reflects a possible mismatch signal in the channel from the manipulated variables u_t to the system output variables y_t. However, once the mismatch conditions become complicated, Granger causality, like most MPM detection methods, cannot guarantee the location of the MPM.
To locate the model-plant mismatch further, the assistant model parameters of the white noise transfer function were introduced. When the mismatch occurred in a specific channel from u_i to y_j, the regression on u_i contributed to a decrease in variance and an increase in the Granger index. At the same time, the assistant model parameters were identified better, so that the coefficient deviations became very small. The variables γ can serve as a monitoring tool for location.
In theory, the remaining noise models contain only noise sequences when no MPM occurs. A mismatch, however, prevents the process model from extracting all of the information, so the remaining noise models consist of two parts: the noise information and the features of the mismatched model. This results in causality between d(k) and u(k), which is why Granger causality is adopted for detection, as mentioned before. By testing the remaining noise models against N(q), the assistant noise models can locate the real channels. When the regression coefficients are estimated accurately, d(k) contains some information about u(k), which reveals that this channel is mismatched. When the regression coefficients are estimated wrongly, the regression has overfitted u(k) to d(k), an irregular fit of variables without mismatch, which demonstrates that there is no mismatch in this channel although a mismatch exists in the system. This information directs the user to examine other channels to find the correct location.