1. Introduction
Civil structures and infrastructures are critical for the life of the world population and play a strategic role for the global economy [1]. Aging and ever-increasing extreme loading conditions threaten existing and new structural systems, stressing the need for real-time structural health monitoring (SHM) procedures to detect and identify any deviation from the damage-free baseline [2].
Vibration-based SHM techniques investigate the structural health by recording and analyzing the vibration response, e.g., acceleration or displacement multivariate time series, of the monitored structure. Two competing SHM approaches can be formally distinguished [3]: the model-based one, e.g., [4,5], and the data-based one, e.g., [6,7]. The former is usually implemented by updating a physics-based model on the basis of measured experimental data, in an attempt to estimate the location and extent of any structural changes that have occurred. The latter is based on a machine learning (ML) paradigm that, once trained, can be used as a black-box tool. ML systems automatically learn how the features extracted from the recorded data are statistically correlated with the sought damage patterns [8]. Since the advent of deep learning (DL), which can incorporate the selection and extraction of optimized features into the end-to-end learning process, the feature engineering stage has been progressively automated.
This work proposes an output-only approach to the damage localization problem (see, for instance, [9,10]), leveraging a synergistic combination of multi-fidelity (MF) data-driven meta-modeling and Bayesian parameter identification. The probability distribution of the unknown damage parameters is approximated through a Markov chain Monte Carlo (MCMC) sampling algorithm.
MCMC has been applied in Bayesian model updating and model class selection in structural mechanics as well as in SHM, see, e.g., [11,12]. In this work, MCMC is used to construct a Markov chain of the sought damage parameters, whose limit distribution is the target probability distribution. The probability distribution is sequentially updated by exploring the support of the damage parameters with a density of steps proportional to the unknown posterior distribution. Sample acceptance is governed by how well the current parameters explain the sparse dynamic response measurements provided by a sensor network, as quantified through a data-driven surrogate model.
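As an illustration of this sampling loop, the following minimal sketch implements a random-walk Metropolis chain over a normalized damage position in [0, 1]. The `surrogate` callable and the Gaussian likelihood width `sigma` are hypothetical stand-ins for the data-driven surrogate model and the noise model, not the paper's actual implementation:

```python
import numpy as np

def log_likelihood(theta, observations, surrogate, sigma=0.05):
    """Gaussian log-likelihood: how well the surrogate's predicted
    signal for damage position `theta` matches the measurements."""
    resid = observations - surrogate(theta)
    return -0.5 * np.sum(resid ** 2) / sigma ** 2

def metropolis(observations, surrogate, n_samples=5000, step=0.05, theta0=0.5):
    """Random-walk Metropolis over a damage position theta in [0, 1],
    with a uniform prior enforced by rejecting out-of-bound proposals."""
    rng = np.random.default_rng(0)
    chain = np.empty(n_samples)
    theta = theta0
    logp = log_likelihood(theta, observations, surrogate)
    for i in range(n_samples):
        prop = theta + step * rng.standard_normal()
        if 0.0 <= prop <= 1.0:  # uniform prior support
            logp_prop = log_likelihood(prop, observations, surrogate)
            # Metropolis acceptance rule
            if np.log(rng.uniform()) < logp_prop - logp:
                theta, logp = prop, logp_prop
        chain[i] = theta
    return chain
```

With a toy surrogate such as `lambda th: th * np.ones(10)` and observations generated at a true position of 0.3, the chain concentrates around 0.3.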
Because handling finite element (FE) simulations within an MCMC analysis is computationally impractical, a FE model capable of simulating the effect of damage on the structural response is adopted only to build labelled datasets of vibration recordings for known damage positions, see for instance [13]. A data-driven surrogate model is adopted in place of the FE model to map operational and damage parameters onto the associated vibration signals. Such a surrogate is based on a multi-fidelity deep neural network (MF-DNN) trained on synthetic data of multiple fidelities, a ML paradigm adopted and extended, for instance, in [14,15]. Specifically, a limited amount of high fidelity (HF) data and a large amount of cheaper low fidelity (LF) data are considered. This type of meta-modeling alleviates the demand for HF training data, which are potentially expensive to collect. Indeed, the LF data supply useful information on the trends of the HF data, allowing the MF-DNN to achieve a higher prediction accuracy than a single-fidelity method while leveraging only a few HF data [16].
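The multi-fidelity principle can be sketched independently of the DNN architecture: a cheap LF model captures the trend of the response, and a correction trained on only a few HF samples closes the gap. In the sketch below, the closed-form `lf_model`/`hf_model` pair and the polynomial correction are hypothetical stand-ins for the paper's reduced-order model, FE model, and correction network:

```python
import numpy as np

# Toy low- and high-fidelity models of the same underlying response.
def lf_model(x):
    return np.sin(2 * np.pi * x)             # cheap, biased trend

def hf_model(x):
    return np.sin(2 * np.pi * x) + 0.3 * x   # expensive ground truth

# Only a few HF samples are available (LF evaluations are plentiful).
x_hf = np.linspace(0.0, 1.0, 8)

# Multi-fidelity correction: learn the LF-to-HF discrepancy from the
# few HF samples (a polynomial fit stands in for the correction DNN).
disc = hf_model(x_hf) - lf_model(x_hf)
coeffs = np.polyfit(x_hf, disc, deg=2)

def mf_surrogate(x):
    """LF trend plus learned correction approximates the HF response."""
    return lf_model(x) + np.polyval(coeffs, x)
```

Because the discrepancy between fidelities is typically smoother than the HF response itself, it can be learned from far fewer HF samples than a direct single-fidelity fit would require.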
3. Virtual Experiment
The proposed method is validated on the digital twin shown in Figure 1. The HF model in Equation () is obtained from a FE discretization resulting in 4659 dofs and is integrated in time using the Newmark method. The structure is made of concrete, whose mechanical properties are: Young's modulus ; Poisson's ratio ; density . The structure is excited at the tip by a distributed load , acting on an area of , as depicted in Figure 1. The load varies in time according to , where and respectively denote the load amplitude and frequency, collected as . Damage is introduced by reducing the material stiffness by within the subdomain , which is a box as depicted in Figure 1. The target position of this reduction is given by the coordinates of its center and can be identified with a single abscissa running along the axis of the structure. Hence, the input parameters of the HF part are collected as . The Rayleigh damping matrix, which accounts for a damping ratio on the first 4 structural modes, is also affected by the damage through the stiffness matrix. Synthetic displacement recordings , with , are collected from dofs, mimicking a monitoring system arranged as depicted in Figure 1, for a time interval , providing data points.
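The Newmark time integration mentioned above can be sketched as follows (constant-average-acceleration variant, beta = 1/4, gamma = 1/2). The system matrices and load history are assumed to be given, e.g., assembled from the FE discretization; this is a generic textbook scheme, not the paper's code:

```python
import numpy as np

def newmark(M, C, K, F, u0, v0, dt, beta=0.25, gamma=0.5):
    """Implicit Newmark integration of M a + C v + K u = f(t).
    F has shape (n_steps, n_dofs); returns the displacement history."""
    n_steps, n = F.shape
    u, v = u0.copy(), v0.copy()
    a = np.linalg.solve(M, F[0] - C @ v - K @ u)  # consistent initial acceleration
    Keff = K + gamma / (beta * dt) * C + M / (beta * dt ** 2)
    U = np.empty((n_steps, n))
    U[0] = u
    for i in range(1, n_steps):
        # effective load: contributions of previous state through M and C
        rhs = (F[i]
               + M @ (u / (beta * dt ** 2) + v / (beta * dt)
                      + (0.5 / beta - 1.0) * a)
               + C @ (gamma / (beta * dt) * u
                      + (gamma / beta - 1.0) * v
                      + dt * (0.5 * gamma / beta - 1.0) * a))
        u_new = np.linalg.solve(Keff, rhs)
        a_new = ((u_new - u) / (beta * dt ** 2)
                 - v / (beta * dt) - (0.5 / beta - 1.0) * a)
        v_new = v + dt * ((1.0 - gamma) * a + gamma * a_new)
        u, v, a = u_new, v_new, a_new
        U[i] = u
    return U
```

The default parameters give the unconditionally stable average-acceleration method, which conserves energy for undamped linear systems.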
The reduced-order model in Equation (2), i.e., the LF model used to construct , has been built by performing a proper orthogonal decomposition (POD) on 40,000 snapshots in time, collected while exploring the parametric input space . 14 POD bases are selected and stored in matrix , in place of the original 4659 dofs, after having fixed a suitable tolerance on the energy norm of the reconstruction error (); for further details see, e.g., [9,13].
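A minimal POD construction of this kind, assuming a snapshot matrix is available, can be written via the SVD; the energy-based truncation below mirrors the tolerance criterion mentioned above, though the paper's exact selection rule may differ:

```python
import numpy as np

def pod_basis(snapshots, tol=1e-4):
    """Extract a POD basis from a snapshot matrix (n_dofs x n_snapshots),
    keeping the smallest number of modes such that the discarded energy
    (sum of squared singular values) stays below the tolerance."""
    U, s, _ = np.linalg.svd(snapshots, full_matrices=False)
    energy = np.cumsum(s ** 2) / np.sum(s ** 2)
    n_modes = int(np.searchsorted(energy, 1.0 - tol)) + 1
    return U[:, :n_modes]

# A trajectory is reduced by projection onto the basis V and expanded back:
#   q = V.T @ u_full        (reduced coordinates)
#   u_approx = V @ q        (reconstruction)
```

The reduced coordinates `q` play the role of the POD-basis coefficients that the LF model evolves in place of the full dof vector.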
For the training of the surrogate model in Equation (4), = 10,000 and = 1000 instances have been collected from the LF and HF models, respectively. Concerning the compression of the LF data for the sake of a prior dimensionality reduction, 104 POD bases have been selected () and stored in matrix , in place of the 1600 original data points.
The mean squared error and the mean absolute error have been used as loss functions for the training of and , respectively, together with the Adam optimization algorithm [21]. The implementation has been carried out through the Tensorflow-based Keras API [22], running on an Nvidia GeForce RTX 3080 GPU card.
An example of the reconstruction capabilities achieved by the surrogate model is shown in Figure 2 for the monitored dof , where the outcome of the regression over the POD-basis coefficients, ruled by the , and the corresponding expanded LF signal are reported together with the signal enrichment provided by the . To quantify the accuracy of the predicted signals, the Pearson correlation coefficient (PCC) between predicted and ground-truth HF signals is adopted as a measure of fitness. The PCC values are evaluated with respect to 40 testing instances generated with the HF model while exploring the parametric input space . The minimum PCC value over the 40 testing instances for each monitored channel is , which largely validates the performance of the surrogate model. Conversely, if the is employed without being coupled with the , the maximum PCC value drops to , showing the utility of the MF setting, which outperforms the single-fidelity method.
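For reference, the PCC used here as a fitness measure follows the standard definition (this helper is illustrative, not code from the paper):

```python
import numpy as np

def pcc(pred, truth):
    """Pearson correlation coefficient between a predicted and a
    ground-truth signal (1.0 means perfect linear agreement)."""
    p, t = pred - pred.mean(), truth - truth.mean()
    return float(p @ t / (np.linalg.norm(p) * np.linalg.norm(t)))
```

The worst-case figure quoted above then corresponds to taking, for each monitored channel, the minimum of `pcc` over all testing instances.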
In the absence of experimental data, the Bayesian estimation of the damage parameter is simulated by considering pseudo-experimental instances, generated with the HF model and corrupted by adding independent, identically distributed Gaussian noise with a signal-to-noise ratio of 80 to each vibration recording. Batches of observations relative to the same damage condition but different operational conditions are processed during the evaluation of the likelihood in Equation (6). The prior pdf is taken as uniform, while, to account for the bounded domain in which can fall, a truncated Gaussian centered on the last accepted state is considered for the proposal . The adaptive Metropolis algorithm [23] is adopted to ease the calibration of the proposal distribution, enabling its covariance to be tuned on the basis of past samples as the sampling evolves. The MCMC algorithm is run for 5000 samples, the first 500 of which are removed to get rid of the burn-in period. The obtained chain is ultimately thinned by discarding three out of every four samples to remove dependencies among consecutive samples.
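A minimal sketch of this sampling pipeline (adaptive proposal variance, burn-in removal, and one-in-four thinning) might read as follows. The scalar, history-based variance adaptation stands in for the covariance tuning of the adaptive Metropolis algorithm, and the bounded prior is assumed here to be enforced by `log_post` returning `-inf` outside the admissible domain, rather than by a truncated proposal as in the paper:

```python
import numpy as np

def adaptive_metropolis(log_post, theta0, n_samples=5000, burn_in=500,
                        thin=4, s0=0.1, eps=1e-6, adapt_start=100):
    """Adaptive Metropolis for a scalar parameter: the Gaussian proposal
    variance is tuned from the sample history (Haario-style scaling).
    Burn-in removal and thinning are applied before returning."""
    rng = np.random.default_rng(0)
    chain = np.empty(n_samples)
    theta, logp = theta0, log_post(theta0)
    var = s0 ** 2
    for i in range(n_samples):
        if i > adapt_start:
            # tune proposal variance from the past samples
            var = 2.4 ** 2 * (np.var(chain[:i]) + eps)
        prop = theta + np.sqrt(var) * rng.standard_normal()
        logp_prop = log_post(prop)
        if np.log(rng.uniform()) < logp_prop - logp:
            theta, logp = prop, logp_prop
        chain[i] = theta
    # discard burn-in, then keep one sample out of every `thin`
    return chain[burn_in:][::thin]
```

With the settings above, 5000 raw samples yield 1125 retained samples after burn-in removal and thinning, matching the one-in-four retention described in the text.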
Two examples of MCMC analyses are reported in Figure 3, showing the generated Markov chains alongside the estimated posterior mean and credible intervals. In both cases, the damage parameter , here normalized between 0 and 1, is properly identified. The larger uncertainty in the second case is to be expected: given the structural layout and the placement of the sensors, the sensitivity of the measurements to damage positions far from the clamped side is smaller.
4. Conclusions
This paper has presented a stochastic approach for SHM, here applied to the problem of damage localization in case of slow damage progression. The presence of damage has been postulated as already detected, e.g., by an early warning tool, and only the localization task has been analyzed. The Bayesian identification of damage parameters is achieved through an MCMC sampling algorithm, adopted to approximate their posterior distribution conditioned on a set of measurements. Few investigations in the literature involve the use of MCMC for the health monitoring of civil structures, and this is the first one considering a MF-DNN surrogate model to accelerate the computation of the conditional likelihood. The surrogate model learns from simulated data of multiple fidelities, i.e., few HF data and several inexpensive LF data, so as to alleviate the computational burden of the supervised training stage. The method has been assessed on a numerical case study, showing remarkable accuracy under the effect of measurement noise and varying operational conditions.
The method is suitable for structural typologies whose damage patterns can be represented by a stiffness reduction fixed within the time interval of interest. This is a standard assumption for most practical scenarios in SHM, since it enables a time-scale separation between damage growth and damage assessment. Such a description of damage is consistent with the adopted vibration-based SHM approach, and allows the structure to be modeled as a linear system both in the presence and in the absence of damage. Moreover, as shown in [9], even if the stiffness reduction takes place over domains of different size from the one adopted during the dataset construction, it is still possible to identify the correct position of the damage.
Considering data-driven algorithms, damage localization is often addressed by exploiting a DL feature extractor followed by a classification or regression module, e.g., as done in [9,10,13]. However, due to the need of training in a simulated environment, the risk of losing generalization capabilities on real monitoring data is high. The proposed procedure aims to overcome such generalization problems. Damage is located by seeking those parameters of the surrogate model that produce the output closest to the measured one, in terms of a suitable distance function measuring signal similarity. For this reason, and thanks to the fully stochastic framework considered here, which is suitable for dealing with noisy data and modeling inaccuracies, a better ability to generalize outside the training regime can reasonably be expected.
Besides the need of validating the proposed methodology within a suitable experimental setting, future studies will extend the Bayesian identification to the parameters controlling the operational conditions as well. Moreover, a usage monitoring tool powered by a suitable data-driven paradigm will be considered to provide useful prior knowledge, as opposed to an uninformative flat prior. The analysis of dynamic effects resulting from localized damage mechanisms is also left for future investigations.