Abstract
System identification is often limited to parameter identification, while model uncertainties are disregarded or accounted for by a fictitious process noise. However, modelling assumptions may have a large impact on system identification. For this reason, we propose to use an unscented Kalman filter (UKF) empowered by online Bayesian model evidence computation for the sake of system identification and model selection. This approach employs more than one model to track the state of the system and associates with each model a plausibility measure, updated whenever new measurements are available. The filter outcomes obtained for different models are then compared and a quantitative confidence value is associated with each of them. Only the system identification outcomes related to the model with the highest plausibility are considered. While the coupling of extended Kalman filters (EKFs) and Bayesian model evidence was already addressed, we modify the approach to exploit the most striking features of the UKF, namely, the ease of implementation and higher-order accuracy in the description of the evolution of the state mean and variance. A challenging identification problem related to structural dynamics is discussed to show the effectiveness of the proposed methodology.
1. Introduction
Kalman filters (KFs) are well-known tools for system identification. They work by applying a predictor phase, in which a suitable model is needed to predict the evolution of a dynamic system, and a correction phase, in which corrections to the prediction are applied by recursively processing system measurements [1].
In civil and mechanical engineering, different model classes, consisting of different parametrizations of the structure to be identified, can be formulated. They are built upon different levels of complexity in the description of the system mechanics and uncertainty in the formulation of the modelling assumptions. Emphasis is usually placed on improving the quality of the parameter estimate, especially whenever nonlinear dynamic systems are handled. With this goal, KF extensions such as the extended Kalman filter (EKF) or the unscented Kalman filter (UKF) have been introduced. By contrast, model uncertainties are often disregarded or accounted for by a fictitious process noise. In this work, we propose a way to tackle this aspect by calculating a quantitative estimate, referred to as model evidence, that measures how plausible the model employed by the KF is with respect to other possible parametrizations. While a similar estimate was discussed in [2] for the EKF, here we develop a model evidence formula suited for the UKF to exploit its ease of implementation and higher-order accuracy in the description of the evolution of the state mean and variance.
The remainder of the contribution is organized as follows. In Section 2, the governing equations of a mechanical elasto-dynamic system are discussed first; the algorithm applying the UKF to parameter estimation is then reported; finally, the equations allowing for recursive model evidence calculation are presented. In Section 3, a case study featuring a shear building excited by real ground acceleration is discussed, showing how parameter identification outcomes are affected by different structural parametrizations and how model evidence can be used for a posteriori model selection. Conclusions are drawn in Section 4.
2. Methodology
2.1. Elasto-Dynamic Problem
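We focus on situations where the system dynamics is described by the finite element (FE) discretised version of a general elasto-dynamic problem. At time $t$, it reads:
$$\mathbf{M}\,\ddot{\mathbf{u}}(t) + \mathbf{D}\,\dot{\mathbf{u}}(t) + \mathbf{K}\,\mathbf{u}(t) = \mathbf{F}(t) \qquad (1)$$
where $\mathbf{M}$, $\mathbf{D}$ and $\mathbf{K}$ are the mass, damping and stiffness matrices, respectively; $\mathbf{u}$, $\dot{\mathbf{u}}$ and $\ddot{\mathbf{u}}$ are the nodal displacements, velocities and accelerations, respectively; and $\mathbf{F}$ is the external force vector, assumed to be known.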
Equation (1) is integrated in time by using the $\alpha$-method [3], ruled by the parameters $\alpha$, $\beta$ and $\gamma$. At each time step, the displacement field is obtained by solving
The modified matrix and the right-hand side vector are computed by
where , , .
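The time integration described above can be illustrated with a minimal sketch of one step of the α-method of [3] for a linear system. This is not the displacement-based arrangement of Equations (2) and (3); as an assumption made for brevity, the step below solves for the accelerations, which yields the same update. The names `M`, `D`, `K`, `F_n`, `F_np1` and the default value of `alpha` are hypothetical; `beta` and `gamma` are tied to `alpha` through the usual relations of the method.

```python
import numpy as np

def hht_alpha_step(M, D, K, u, v, a, F_n, F_np1, dt, alpha=-0.05):
    """One step of the alpha-method for M*a + D*v + K*u = F (linear case).

    alpha is assumed to lie in [-1/3, 0]; alpha = 0 recovers the
    average-acceleration Newmark scheme.
    """
    beta = (1.0 - alpha) ** 2 / 4.0
    gamma = 0.5 - alpha

    # Newmark predictors (the part of u, v that does not depend on a_{n+1})
    u_pred = u + dt * v + dt**2 * (0.5 - beta) * a
    v_pred = v + dt * (1.0 - gamma) * a

    # Effective matrix and right-hand side for the unknown acceleration
    A = M + (1.0 + alpha) * (gamma * dt * D + beta * dt**2 * K)
    rhs = ((1.0 + alpha) * F_np1 - alpha * F_n
           - (1.0 + alpha) * (D @ v_pred + K @ u_pred)
           + alpha * (D @ v + K @ u))
    a_new = np.linalg.solve(A, rhs)

    # Newmark correctors
    u_new = u_pred + beta * dt**2 * a_new
    v_new = v_pred + gamma * dt * a_new
    return u_new, v_new, a_new
```

The initial acceleration can be obtained from Equation (1) itself, for instance `a0 = np.linalg.solve(M, F0 - D @ v0 - K @ u0)`.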
Moreover, the mechanical system is assumed to be only partially observed. Accordingly, a Boolean matrix establishes the connection between the observed quantities and the kinematic fields, as follows:
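As a small illustration of the partial observation described above, a Boolean selection matrix can be built from the indices of the monitored DOF. The function name `selection_matrix` and the zero-based index convention are assumptions made for this sketch.

```python
import numpy as np

def selection_matrix(n_dof, observed):
    """Boolean matrix with one row per observed DOF and one column per DOF."""
    H = np.zeros((len(observed), n_dof))
    H[np.arange(len(observed)), observed] = 1.0
    return H

# Hypothetical usage: observe DOF 1 and 3 of a 4-DOF kinematic field
H = selection_matrix(4, [1, 3])
field = np.array([10.0, 20.0, 30.0, 40.0])
observed_values = H @ field
```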
2.2. Unscented Kalman Filter for Parameter Estimation
In this study, the ultimate goal of filtering is to estimate the unknown parameters ruling the mechanical response of the structure to be identified, where typically and . In [1,4], Kalman filtering techniques were successfully applied, even in the presence of nonlinearities due to damage evolution in the observed system, by solving a dual estimation problem and thereby adopting as state variables the model displacements and the unknown parameters governing the response of the mechanical domain. However, treating FE solutions characterized by a large number n of degrees of freedom (DOF) may result in an excessive computational burden when dealing with dual estimation. A possible solution consists of obtaining a reduced order model (ROM) representation of the mechanical domain and adopting as state variables, instead of the nodal kinematics, the ROM DOF [5,6]. This strategy has been explored in [7]. Here, we consider only as a state variable to avoid the computational burden connected to the combined use of UKF and DOF tracking when large FE models are addressed, despite the enhanced performance usually guaranteed by state tracking [1,8]. The following state-space representation is used:
where the parameter vector is driven by a random walk ruled by , modelled as a white process noise , and the FE-predicted output is related to the actual response of the structure by adding a measurement noise , modelled as white . The matrices and are symmetric and positive definite. The time variation of , introduced by the random walk formulation, is fictitious.
KFs attempt to propagate the mean and the covariance of the state variable vector through the state-space and the measurement update equations. Instead of propagating the probability density functions associated with the state variables, it is indeed preferable to draw a set of samples, deterministically propagate them and finally compute the mean and covariance of the state vector; this is especially profitable when the state-space and/or the measurement update equations are nonlinear. The UKF is based on this idea. The propagated vector collects a set of so-called sigma points (SPs) , , distributed such that the mean and covariance of these points match those of the state variables. A scaled version of the UKF is used by setting the parameters , and , as detailed in [9], to avoid sampling nonlocal effects that would spoil the state variable mean and covariance reconstruction [10]. In the predictor phase, this vector is propagated from the k-th to the (k+1)-th time step through the state-space equations. In the corrector phase, the estimated output covariance and the estimated cross-covariance are used to compute the Kalman gain needed to correct the propagated mean and covariance on the basis of the collected measurements . The full expression of these quantities and the application of the UKF are detailed in Algorithm 1, adapted from [11].
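The predictor and corrector phases described above can be sketched as follows for a parameter-only state driven by a random walk. This is a minimal illustration following the standard scaled unscented transform of [9,10], not a transcription of Algorithm 1. The function names, the callable `h` (standing in for the FE model that maps parameters to predicted outputs) and the default values of `alpha`, `beta` and `kappa` are assumptions, not the values used in this study.

```python
import numpy as np

def sigma_points(mean, cov, alpha=1e-3, beta=2.0, kappa=0.0):
    """Scaled sigma points and their weights for the mean (wm) and covariance (wc)."""
    n = mean.size
    lam = alpha**2 * (n + kappa) - n
    root = np.linalg.cholesky((n + lam) * cov)  # lower-triangular square root
    # Rows: the mean, then the mean plus/minus each column of the square root
    pts = np.vstack([mean, mean + root.T, mean - root.T])
    wm = np.full(2 * n + 1, 0.5 / (n + lam))
    wc = wm.copy()
    wm[0] = lam / (n + lam)
    wc[0] = wm[0] + 1.0 - alpha**2 + beta
    return pts, wm, wc

def ukf_parameter_step(theta, P, y, h, Q, R):
    """One predictor-corrector step; h maps a parameter vector to a 1-D output."""
    # Predictor: a random walk leaves the mean unchanged and inflates the covariance
    theta_pred = theta.copy()
    P_pred = P + Q

    # Propagate the sigma points through the (possibly nonlinear) model
    pts, wm, wc = sigma_points(theta_pred, P_pred)
    Y = np.array([h(p) for p in pts])
    y_hat = wm @ Y

    # Output covariance and parameter-output cross-covariance
    dY = Y - y_hat
    dT = pts - theta_pred
    P_yy = dY.T @ (wc[:, None] * dY) + R
    P_ty = dT.T @ (wc[:, None] * dY)

    # Corrector: Kalman gain K = P_ty P_yy^{-1}, then mean and covariance update
    K = np.linalg.solve(P_yy, P_ty.T).T
    theta_new = theta_pred + K @ (y - y_hat)
    P_new = P_pred - K @ P_yy @ K.T
    return theta_new, P_new, y_hat, P_yy
```

For a linear output map the step reduces to the classical Kalman update, which offers a convenient sanity check before plugging in an FE model.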
2.3. Model Evidence Computation for Unscented Kalman Filter
System identification is usually limited to selecting a particular parametric model of the underlying structural system and estimating the corresponding unknown parameters . However, the use of either excessively simplified or too complex models may have a detrimental effect on the possibility to track the system state: oversimplified models may underestimate the effect of a physical process taking place; on the other hand, complex models may fit the data well but yield poor predictions. In the latter case, the model overfits the incoming data. In [2], an online model class selection strategy was proposed in the framework of EKF parameter estimation. Here, a similar approach is adopted for simultaneous parameter estimation and model class selection exploiting the UKF. Adopting a number of possible model classes, the model evidence (or plausibility), consisting of the probability , is computed for each model class , with , at each time step . The model evidences sum to unity. To derive the expression of , Bayes' theorem is first used, giving
where , called conditional evidence, represents the contribution of the measurement at to the plausibility of the m-th model class.
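The recursive use of Bayes' theorem can be sketched as a simple reweighting: each prior plausibility is multiplied by the conditional evidence of the corresponding model class and the result is normalised so that the plausibilities sum to unity. The function name and the numerical values below are hypothetical.

```python
import numpy as np

def update_model_evidence(prior, conditional_evidence):
    """Posterior plausibility of each model class after one new measurement.

    prior: plausibilities at the previous time step (sum to one).
    conditional_evidence: likelihood of the new measurement under each model class.
    """
    unnormalised = np.asarray(prior) * np.asarray(conditional_evidence)
    return unnormalised / unnormalised.sum()

# Hypothetical usage with three model classes and equal initial plausibility
prior = np.full(3, 1.0 / 3.0)
posterior = update_model_evidence(prior, [0.2, 0.5, 0.3])
```

In long records the conditional evidences can become very small, so accumulating their logarithms and normalising at the end of each step is a common safeguard against underflow.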
Second, we extend the procedure explained in [2] from the EKF to the UKF. As a result, at the end of the corrector phase (after Step 15 of Algorithm 1), the following expression for the conditional evidence applies:
where calculates the determinant of the input matrix. The reported expression approximates due to the use of Laplace’s asymptotic expansion [2].
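As an illustration only, the sketch below evaluates a conditional evidence under the assumption that it takes the form of a Gaussian density of the innovation (measurement minus predicted output) with the estimated output covariance, which is a common choice with Kalman-type filters. Whether this matches Equation (8) term by term is not confirmed here; the function and argument names are hypothetical.

```python
import numpy as np

def gaussian_conditional_evidence(innovation, P_yy):
    """Gaussian density of the innovation, with covariance P_yy.

    Assumed form: (2*pi)^(-n/2) * det(P_yy)^(-1/2) * exp(-0.5 * e^T P_yy^{-1} e).
    """
    n = innovation.size
    # slogdet avoids overflow or underflow in the determinant
    _, logdet = np.linalg.slogdet(P_yy)
    quad = innovation @ np.linalg.solve(P_yy, innovation)
    return np.exp(-0.5 * (n * np.log(2.0 * np.pi) + logdet + quad))
```

A model class whose predicted output covariance is unnecessarily large is penalised through the determinant term, while one that misses the measurement is penalised through the quadratic term.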
Algorithm 1. UKF for parameter estimation, linear elastic case.
3. Results and Discussion
As a numerical case study, we considered the identification of the interstorey stiffness and damping of the two-DOF shear building model reported in Figure 1. The mechanical properties of the building were nondimensionalised to ease the UKF tuning by setting the matrices in Equation (1) equal to
Figure 1.
Two DOF shear model. Acceleration monitoring.
The building was excited by the ground acceleration reported in Figure 2, lasting . The response of the building was monitored by recording the floor acceleration with a sampling frequency of for a total of samples. A white noise, featuring a standard deviation of , was added to and to to mimic the signal perturbation affecting micro-electro-mechanical accelerometers [12].
Figure 2.
Ground acceleration.
The acceleration recordings coming from this reference building were used as measurements in the corrector phase of the filtering procedure (Steps 9 and 10 of Algorithm 1). Three model classes, , and , featuring different structural parametrizations, were considered, as shown in the following:
Model class is governed by the parameter ruling the interstorey stiffness of both floors (for this reason, is factored out from ); is governed by ruling, respectively, the interstorey stiffness and damping of both floors; is governed by , where and rule the first and second floor interstorey stiffness and rules the damping associated with both floors. Comparing these parametrizations with the reference model, it is clear that is underparametrizing the mechanical system, not associating any parameter with the damping properties of the structures and suffering a model bias, being ; is overparametrizing the stiffness matrix and is performing a correct parametrization of the structural response, and it is therefore expected to allow for the best estimate of the system mechanical properties. For all model classes, the initial guesses of the relevant parameters underestimated by of the parameter values ruling the reference structure.
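The three parametrizations can be sketched as functions that assemble the structural matrices from a parameter vector. This is a hypothetical illustration: it assumes the standard tridiagonal interstorey pattern of a two-DOF shear building for the stiffness matrix, assumes that damping follows the same pattern, and assumes zero damping for the first model class. None of these choices is confirmed by the text above, and the integer labels 1, 2 and 3 are placeholders for the three model classes.

```python
import numpy as np

def interstorey_matrix(c1, c2):
    """Two-DOF shear-building pattern from first- and second-storey coefficients."""
    return np.array([[c1 + c2, -c2],
                     [-c2, c2]], dtype=float)

def model_matrices(model, theta):
    """Return (K, D) for one of the three hypothetical model classes."""
    if model == 1:
        # One stiffness parameter shared by both floors, damping not parametrized
        (k,) = theta
        return interstorey_matrix(k, k), np.zeros((2, 2))
    if model == 2:
        # Shared stiffness and shared damping
        k, c = theta
        return interstorey_matrix(k, k), interstorey_matrix(c, c)
    if model == 3:
        # Separate floor stiffnesses, shared damping
        k1, k2, c = theta
        return interstorey_matrix(k1, k2), interstorey_matrix(c, c)
    raise ValueError("model must be 1, 2 or 3")
```

Wrapping each parametrization behind the same interface lets one filter implementation be run once per model class, with only the parameter dimension changing.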
KF tuning is usually problem-dependent and is performed through a trial-and-error procedure. In this case, we have set the SP scaling parameters to , and ; the measurement noise covariance to , where is the identity matrix; the process noise covariance to , with ; and the initial parameter covariance to . The value of depends on the number of parameters employed by each model ( for , for and for ).
In Figure 3, the predicted output of , computed according to Step 8 of Algorithm 1, is reported against the floor acceleration measurements, showing the filter's capacity to track the shear building accelerations despite the presence of noise. A small discrepancy between the reference model and the predicted output is observable only when magnifying the curves. The predicted outputs of and , not reported for lack of space, exhibit even smaller discrepancies.
Figure 3.
predicted outputs (dot dashed blue line) are reported against the noise-corrupted reference model recordings (orange line). The left figure refers to the first floor and the right figure to the second floor. Black lines depict the reference model acceleration when not corrupted by noise.
The filter's ability to track the system output was expected to greatly help parameter identification. In Figure 4, Figure 5 and Figure 6, the time evolution of the parameters employed by , and is reported, respectively. Black is used for parameters involved in the expression of the structural stiffness; orange for those related to the structural damping. The plots report both the parameter posterior estimates and the confidence intervals of these estimates. Looking at the confidence intervals, stiffness-related parameters seem to assume negative values during the first part of the analyses. This is due to the initial choice of . However, positive values have always been associated with the interstorey stiffness due to the use of the scaled version of the UKF. Similar reasoning applies to damping-related parameters.
Figure 4.
Model class , time evolution of . The thicker dotted line reports the posterior estimate; the thinner dotted lines the confidence interval of the estimate, determined using the posterior covariance. The continuous line reports the parameter value assumed by the reference model.
Figure 5.
Model class , time evolution of . The thicker dotted line reports the posterior estimate; the thinner dotted line the confidence interval of the estimate, determined using the posterior covariance. The continuous line reports the parameter values assumed by the reference model.
Figure 6.
Model class , time evolution of . The thicker continuous line reports the posterior estimates; the thinner dotted lines the confidence interval of the estimate, determined using the posterior covariance. The continuous line reports the parameter values assumed by the reference model.
Looking at Figure 4, the UKF was unable to provide a correct estimate for , despite the uncertainty reduction linked to the narrowing of the confidence interval. Even the stiffness-related parameters and of , depicted in Figure 6, do not seem to converge to the desired values. By contrast, for , was correctly identified with small uncertainty, as shown in Figure 5. These results were somewhat expected, given the underparametrization of the mechanical system by and its overparametrization by , while embodied the correct description of the reference model.
Model class was unable to give any indication of the damping properties, ending up pushing to 0. Model class provided a better, though still quite poor, estimate, overestimating by the damping-related parameter . These difficulties were due to the relevance of damping in the identification of continuously excited structures, discussed in [4].
From the results reported above, seems to lead to the best system identification; however, we reached this conclusion by knowing the mechanical properties of the reference system. It would have been very hard, if not impossible, to judge model plausibility simply by looking at the predicted outputs. Indeed, as shown in Figure 3, the UKF was able to reproduce the monitoring system outcome even when is employed. For this reason, model evidence computation, whose outcome is reported in Figure 7, is extremely relevant to understand which model can be trusted the most.
Figure 7.
Model evidence evolution of each model.
At the beginning of the identification procedure, equal plausibility was associated with the three models. Their values were recursively updated as soon as new measurements became available using Equations (7) and (8). During the first part of the analysis, appeared to be the most plausible model class. This is in agreement with intuition: is the easiest to tune, employing just one parameter, and the bias in the modelling of damping has a marginal relevance when due to the strong ground motion undergone by the structure. In a second stage, emerged as the most plausible model class. This was due to the good estimate of both the stiffness-related parameters and the damping-related parameters in the central part of the analysis. Finally, the overcomplexity of led to a deterioration of the parameter identification, while the good convergence of the stiffness-related parameters and the reasonable damping estimate promoted as the most plausible model class.
This numerical example shows that model evidence evaluation can be successfully used for model selection. The reader should note that, due to the recursive nature of Equation (7), a certain time delay occurred between the improved identification capacity of the filter equipped with a certain model and the increase in plausibility of this model.
4. Conclusions
In this work, we have discussed an algorithm for simultaneous parameter estimation and model evidence calculation in dynamic linear elastic problems. Starting from the work of [2], a recursive expression for model evidence evaluation was derived for the case in which the unscented Kalman filter is used. Numerical results show that model evidence can guide system identification in the presence of model uncertainties by associating a plausibility measure with each of the employed models, which feature different possible parametrizations of the mechanical domain. Indeed, model evidence can be successfully used to select the most plausible structure parametrization as parameter identification is carried out.
Author Contributions
Conceptualization, L.R., S.E.A., A.C. and S.M.; methodology, L.R., S.E.A., A.C. and S.M.; software, L.R. and S.E.A.; validation, L.R., S.E.A., A.M., A.C. and S.M.; formal analysis, L.R., S.E.A. and S.M.; investigation, L.R. and S.E.A.; resources, S.E.A.; data curation, L.R. and S.E.A.; writing—original draft preparation, L.R.; writing—review and editing, S.E.A., A.M., A.C. and S.M.; visualization, L.R.; supervision, S.E.A. and A.C.; project administration, S.E.A. and A.C.; funding acquisition, S.E.A. and A.C. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
The authors are indebted to Rodrigo Astroza, Universidad de los Andes (Chile), for the valuable discussions on the topic of this contribution.
Conflicts of Interest
The authors declare no conflict of interest.
References
1. Mariani, S.; Ghisi, A. Unscented Kalman filtering for nonlinear structural dynamics. Nonlinear Dyn. 2007, 49, 131–150.
2. Yuen, K.V.; Mu, H.Q. Real-time system identification: an algorithm for simultaneous model class selection and parametric identification. Comput.-Aided Civ. Infrastruct. Eng. 2015, 30, 785–801.
3. Hilber, H.M.; Hughes, T.J.R.; Taylor, R.L. Improved numerical dissipation for time integration algorithms in structural dynamics. Earthq. Eng. Struct. Dyn. 1977, 5, 283–292.
4. Mariani, S.; Corigliano, A. Impact induced composite delamination: state and parameter identification via joint and dual extended Kalman filters. Comput. Methods Appl. Mech. Eng. 2005, 194, 5242–5272.
5. Eftekhar Azam, S.; Mariani, S.; Attari, N.K.A. Online damage detection via a synergy of proper orthogonal decomposition and recursive Bayesian filters. Nonlinear Dyn. 2017, 89, 1489–1511.
6. Eftekhar Azam, S.; Mariani, S. Online damage detection in structural systems via dynamic inverse analysis: A recursive Bayesian approach. Eng. Struct. 2018, 159, 28–45.
7. Gobat, G.; Azam, S.E.; Mariani, S. SHM and efficient strategies for reduced-order modeling. Eng. Proc. 2020, 2, 2098.
8. Kopp, R.E.; Orford, R.J. Linear regression applied to system identification for adaptive control systems. AIAA J. 1963, 1, 2300–2306.
9. Wan, E.; Van Der Merwe, R. The unscented Kalman filter for nonlinear estimation. In Proceedings of the IEEE 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium (Cat. No.00EX373), Lake Louise, AB, Canada, 1–4 October 2000; pp. 153–158.
10. Julier, S. The scaled unscented transformation. In Proceedings of the 2002 American Control Conference (IEEE Cat. No.CH37301), Anchorage, AK, USA, 8–10 May 2002; Volume 6, pp. 4555–4559.
11. Castiglione, J.; Astroza, R.; Eftekhar Azam, S.; Linzell, D. Auto-regressive model based input and parameter estimation for nonlinear finite element models. Mech. Syst. Signal Process. 2020, 143, 106779.
12. D’Alessandro, A.; Vitale, G.; Scudero, S.; D’Anna, R.; Costanza, A.; Fagiolini, A.; Greco, L. Characterization of MEMS accelerometer self-noise by means of PSD and Allan Variance analysis. In Proceedings of the 7th IEEE International Workshop on Advances in Sensors and Interfaces IWASI, Vieste, Italy, 15–17 June 2017; pp. 159–164.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).