Article

Cost–Benefit Optimization of Structural Health Monitoring Sensor Networks

by Giovanni Capellari 1,*,†, Eleni Chatzi 2,† and Stefano Mariani 1,†
1 Politecnico di Milano, Dipartimento di Ingegneria Civile e Ambientale, Piazza Leonardo da Vinci 32, 20133 Milano, Italy
2 ETH Zürich, Institut für Baustatik und Konstruktion, Stefano-Franscini-Platz 5, 8093 Zürich, Switzerland
* Author to whom correspondence should be addressed.
† These authors contributed equally to this work.
Sensors 2018, 18(7), 2174; https://doi.org/10.3390/s18072174
Submission received: 11 June 2018 / Revised: 4 July 2018 / Accepted: 4 July 2018 / Published: 6 July 2018

Abstract:
Structural health monitoring (SHM) allows the acquisition of information on the structural integrity of any mechanical system by processing data, measured through a set of sensors, in order to estimate relevant mechanical parameters and indicators of performance. Herein we present a method to perform the cost–benefit optimization of a sensor network by defining the density, type, and positioning of the sensors to be deployed. The effectiveness (benefit) of an SHM system may be quantified by means of information theory, namely through the expected Shannon information gain provided by the measured data, which allows the inherent uncertainties of the experimental process (i.e., those associated with the prediction error and the parameters to be estimated) to be accounted for. In order to evaluate the computationally expensive Monte Carlo estimator of the objective function, a framework comprising surrogate models (polynomial chaos expansion), model order reduction methods (principal component analysis), and stochastic optimization methods is introduced. Two optimization strategies are proposed: the maximization of the information provided by the measured data, given the technological, identifiability, and budgetary constraints; and the maximization of the information–cost ratio. The application of the framework to a large-scale structural problem, the Pirelli tower in Milan, is presented, and the two comprehensive optimization methods are compared.

1. Introduction

Structural health monitoring (SHM) allows the detection and estimation of variations in the behavior, and thereby the condition, of engineered systems [1], and therefore helps in making decisions about the actions needed to maintain or recover the overall structural safety [2]. Amongst the methods for SHM available in the literature (see [1]), the Bayesian framework allows both unknown system properties and their associated uncertainties to be estimated, as introduced in [3]. The effectiveness of any SHM system in detecting and estimating damage, here assumed as a variation of mechanical properties, depends on both the estimation method exploited to process the data and the SHM system itself. From a theoretical point of view, we can interpret SHM as an experimental procedure, where the measurements obtained through the SHM system are exploited to reach the goal of the experiment (i.e., the estimation of the parameters and their associated uncertainties). In this view, the experimental setup in SHM includes all the settings which can affect the measurements, namely the position of the sensors on the structure, the physical quantities to be measured, and the number and type of sensors.
In the present work, an SHM system cost–benefit optimization method is presented, which allows the most suitable experimental settings to be chosen so as to maximize the estimation potential while minimizing the cost of the SHM system. Two alternative approaches are proposed and discussed: (i) the system is optimized by maximizing its effectiveness while fulfilling the budgetary constraint; (ii) the system is optimized by maximizing the ratio between its effectiveness and its cost.
Two main advantages can be derived from optimizing the sensor network. Let us first assume that, by optimizing the sensor network in terms of position and type of sensors, their number can be decreased. This results in a cost reduction of the overall SHM system and a simplification of both the data acquisition system and the system assembly phase. An additional side benefit lies in the reduction of the amount of data to be processed. In other words, the resulting optimized SHM system is more “informative”, and thus fewer sensors are required to guarantee the same accuracy of the estimated quantities. Consequently, both the cost and the complexity of the data storage system and of the required computational resources can be significantly reduced. Furthermore, since the amount of data to be processed is lower, the applicability of real-time estimation methods is enhanced, and the required data storage is reduced for non-real-time applications. Conversely, if the number and type of sensors are held constant, the optimization of the sensor network guarantees an increase in the information provided by the monitoring system, and thus a consequent reduction of the estimation uncertainties.
Several methods have been presented in the literature to optimally design SHM systems. The vast majority of these prescribe the type and number of sensors as constant parameters in the optimization problem, allowing for the optimization of the spatial configuration alone. Among them, a stochastic approach to optimal sensor placement, based on the minimization of a Bayesian loss function, was introduced in [4]: the sensor locations were chosen such that the expectation of the squared error loss between the estimated and the target values of the quantities to be estimated was minimized. In [5], the evaluation of entropy and mutual information was proposed in order to quantify the amount of information that can be inferred experimentally (and therefore from an SHM system).
In [6,7], Papadimitriou et al. proposed the minimization of the information entropy as a rationale to optimize the spatial configuration of the sensor network. In order to numerically evaluate the associated objective function, the integral terms arising from an analytical manipulation of the information entropy were approximated through the Laplace method of asymptotic expansions, which allows an associated algebraic formulation to be obtained. Unlike in the present study, the objective function was locally approximated through a smooth replica centered on nominal parameter values, which have to be chosen a priori.
The optimization algorithm to obtain the optimal solution (i.e., maximization of the objective function) has mostly been treated as a discrete optimization problem in the existing literature, with genetic algorithms exploited in [8,9]. According to an alternative greedy approach proposed in [7,10], the optimal configuration can be obtained by splitting the optimization problem into a number of sub-problems, where only one sensor is added at each step, so that the increase in the objective function value is maximized.
In contrast to the existing methods, in the present work a method is introduced to comprehensively optimize the sensor network not only in terms of sensor placement, but also in terms of the type and number of sensors. The effectiveness of a sensor network is quantified through an index based on information theory, originally developed within the computer science research community for the quantification of uncertainty relating to random variables. The SHM sensor network is therefore optimized in terms of number, position, and type of sensors by maximizing the relevant expected Shannon information gain, which is a measure of the utility of the measurements with respect to the quantities to be estimated. It should be underlined that the proposed method was developed within a Bayesian framework and is therefore valid for SHM procedures aiming at Bayesian inference, such as Bayesian model updating or parameter characterization. It can be applied to processes that run both offline and in real-time (as in the reasoning of Kalman filtering-based estimation [11,12,13,14]). As in most stochastic approaches, large computational resources would be needed to take the uncertainties into account, preventing the applicability of the method to large structural models. The coupling with surrogate models (polynomial chaos expansion, [15]), which aims at replacing the original computationally expensive numerical model by reproducing the relation between inputs and outputs, and model order reduction strategies (principal component analysis) allow the latter problem to be overcome. The use of stochastic optimization methods (covariance matrix adaptation-evolutionary strategy, [16]) allows high-dimensional problems to be solved, which was not possible with the previously exploited methods.
The method developed herein is applicable to both static and dynamic monitoring applications of diverse sensing capabilities, where a Bayesian framework is intended to be implemented. The method is demonstrated herein for static measurements only. However, the same framework can be implemented for dynamic monitoring as well if the objective function is expressed in terms of frequency and mode-shape matching.
This paper is organized as follows: first, the theoretical framework is presented in Section 2 by introducing the Bayesian experimental design in Section 2.1 and defining the optimization statement in Section 2.2. Then, the approach for numerically evaluating the objective function is discussed in Section 3 and Section 4. In Section 5, the whole optimization procedure is summarized. Then, the application of the method to a tall building, namely the Pirelli tower in Milan, is presented in Section 6. Finally, some concluding remarks are gathered in Section 7.

2. Theoretical Basis

2.1. Bayesian Experimental Design

Let the goal of the SHM system be the estimation of a set of parameters (e.g., mechanical properties, geometrical properties, or damage indices) defined within an appropriate numerical model of a structure, used to predict its response to given loads. The following random vectors are defined: the parameter vector $\boldsymbol{\theta} = [\theta_1\ \theta_2\ \cdots\ \theta_{n_\theta}] \in \mathbb{R}^{n_\theta}$ to be estimated, and the data vector $\mathbf{y} \in \mathbb{R}^{n_y}$, whose measurements are assumed to be collected through a set of sensors. Here, $n_\theta$ is the number of parameters and $n_y$ is the number of measurements.
The prior probability density function (pdf) p ( θ ) represents the prior knowledge on θ , and it can be suitably chosen in order to take initial information into account, such as that provided by previous experiments or by the subjective belief of an expert. If no previous information is available on the values of parameters in θ , an uninformative distribution can be considered. The pdf may then be updated considering the data y , through Bayes’ theorem:
$$ p(\boldsymbol{\theta} \mid \mathbf{y}) = \frac{p(\mathbf{y} \mid \boldsymbol{\theta})\, p(\boldsymbol{\theta})}{p(\mathbf{y})}, \qquad (1) $$
where the expression $p(\cdot \mid \cdot)$ represents the conditional pdf of the first argument with respect to the second one. Thus, $p(\boldsymbol{\theta} \mid \mathbf{y})$ is the posterior pdf (i.e., the probability density function of $\boldsymbol{\theta}$ given $\mathbf{y}$); $p(\mathbf{y} \mid \boldsymbol{\theta})$ is the likelihood; and $p(\mathbf{y})$ is the evidence, that is, the distribution of the observed data, marginalized over $\boldsymbol{\theta}$.
Bayes’ theorem is particularly well suited to SHM problems: given a structure, a class of models can be defined a priori to describe the behavior of the system, and then updated as soon as the structural response is measured. Bayesian model updating was first introduced for structural applications in [3,17]; it allows the posterior probability density function $p(\boldsymbol{\theta} \mid \mathbf{y})$ to be obtained from the prior knowledge of $\boldsymbol{\theta}$ and the data, and hence the maximum a posteriori estimate $\hat{\boldsymbol{\theta}} = \arg\max_{\boldsymbol{\theta}} p(\boldsymbol{\theta} \mid \mathbf{y})$, along with the related uncertainty level.
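As a minimal illustration of this updating step (not taken from the paper), the following sketch performs a grid-based Bayesian update of a single stiffness-like parameter with a uniform prior and a Gaussian likelihood; the forward model, load value, and noise level are hypothetical.

```python
import numpy as np

# Minimal sketch of Bayes' theorem (Eq. (1)) for one parameter: grid-based
# update of a stiffness-like parameter theta under a Gaussian measurement error.
# The forward model and all numerical values are hypothetical.

theta_grid = np.linspace(24e9, 36e9, 500)                 # prior support [Pa]
prior = np.full_like(theta_grid, 1.0 / (36e9 - 24e9))     # uniform prior pdf

def forward(theta, load=1e5):
    # hypothetical model: measured displacement inversely proportional to stiffness
    return load / theta

sigma_eps = 1e-7                                          # measurement-noise std [m]
y_meas = forward(30e9) + np.random.default_rng(0).normal(0.0, sigma_eps)

likelihood = np.exp(-0.5 * ((y_meas - forward(theta_grid)) / sigma_eps) ** 2)
unnorm_post = likelihood * prior
posterior = unnorm_post / np.trapz(unnorm_post, theta_grid)   # normalize by the evidence

theta_map = theta_grid[np.argmax(posterior)]              # maximum a posteriori estimate
```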
Within this framework, the effectiveness of the sensor network can be quantified by following the decision-theoretic approach introduced in [18,19]: in the case at hand, prior to performing the measurements, the choice of the experimental settings, in terms of spatial sensor configuration, number of sensors, and type of sensors, must be made. An additional term, the design variable $\mathbf{d} \in \mathbb{R}^{n_d}$, with $n_d$ designating the dimension of the design variable vector, must accordingly be introduced in the formulation in order to parametrize the network topology and the sensor features. Bayes' theorem in Equation (1) is then modified as:
$$ p(\boldsymbol{\theta} \mid \mathbf{y}, \mathbf{d}) = \frac{p(\mathbf{y} \mid \boldsymbol{\theta}, \mathbf{d})\, p(\boldsymbol{\theta} \mid \mathbf{d})}{p(\mathbf{y} \mid \mathbf{d})}, \qquad (2) $$
where all the previously introduced pdfs are conditioned with respect to the design variable d , as both the measurements and the parameters to be estimated depend on the experimental settings. Here, the unknown variable d is supposed to define the spatial configuration of the sensors (e.g., in terms of spatial coordinates), for a constant number of measurements n y and type of sensors.
According to [18], the expected utility of one experiment can be quantified through:
$$ U(\mathbf{d}) = \int_{\mathcal{Y}} \int_{\Theta} u(\mathbf{d}, \mathbf{y}, \boldsymbol{\theta})\, p(\boldsymbol{\theta} \mid \mathbf{y}, \mathbf{d})\, p(\mathbf{y} \mid \mathbf{d})\, \mathrm{d}\boldsymbol{\theta}\, \mathrm{d}\mathbf{y}, \qquad (3) $$
where Y and Θ respectively represent the domains of the measurements y and of the parameters θ .
The function u ( d , y , θ ) is called utility function and defines a scalar measure of the usefulness of the experiment. That is, it quantifies the extent to which certain measurement values are preferable to attain the goal of SHM.
Within a stochastic environment and from a decision-theoretic perspective, the expected utility allows one to choose which action should be performed in order to achieve a certain goal. It can therefore be defined as the weighted average of the utilities of each possible consequence of a certain action, wherein the weights describe the probabilities that the action would lead to a certain outcome. For SHM applications, the action is represented by the design of the monitoring system, and the goal is the estimation of the unknown structural parameters.
The choice of u ( d , y , θ ) depends on the goal of the experiment. A thorough review of utility functions is presented in [20]. In the present case, the aim of the experiment is the inference (estimation) of the parameters θ . Therefore, following [21], a suitable utility function is the Kullback-Leibler divergence (KLD) [22,23] (also called relative entropy) between the prior and the posterior pdfs. Supposing that the structural response to loading is linear and the posterior pdfs are Gaussian, the optimization problem results in the so-called Bayesian D-optimality [24], which corresponds to the maximization of the determinant of the Fisher information matrix [25] of the measurements.
The KLD between two generic probability distributions $P$ and $Q$ of a random variable $\mathbf{x}$ is defined as:
$$ D_{KL}(P \,\|\, Q) = \int_{\mathcal{X}} p(\mathbf{x}) \ln \frac{p(\mathbf{x})}{q(\mathbf{x})}\, \mathrm{d}\mathbf{x}, \qquad (4) $$
where $\mathcal{X}$ is the domain of $\mathbf{x}$, and $p(\mathbf{x})$ and $q(\mathbf{x})$ are the pdfs related to $P$ and $Q$. The KLD thus measures the increase of information in moving from $Q$ to $P$. If the two distributions are identical (i.e., $P = Q$ almost everywhere), then $D_{KL}(P \,\|\, Q) = 0$.
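For intuition, and as a standard closed-form reference case (not part of the paper's derivation), the KLD between two univariate Gaussian distributions reads:

$$ D_{KL}\big(\mathcal{N}(\mu_1, \sigma_1^2) \,\|\, \mathcal{N}(\mu_0, \sigma_0^2)\big) = \ln\frac{\sigma_0}{\sigma_1} + \frac{\sigma_1^2 + (\mu_1 - \mu_0)^2}{2\sigma_0^2} - \frac{1}{2}. $$

For instance, if an experiment halves the standard deviation ($\sigma_1 = \sigma_0/2$) without shifting the mean, the information gain is $\ln 2 + 1/8 - 1/2 \approx 0.32$ nat.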
If the goal of the Bayesian inference problem is the design of the sensor network such that the measurements $\mathbf{y}$ provide as much information as possible on the parameters $\boldsymbol{\theta}$ to be estimated, the design variable $\mathbf{d}$ has to be optimized by maximizing the gain between the prior pdf $p(\boldsymbol{\theta} \mid \mathbf{d})$ and the posterior pdf $p(\boldsymbol{\theta} \mid \mathbf{y}, \mathbf{d})$. By specializing Equation (4) for the problem considered here, the resulting utility function is given as:
$$ u(\mathbf{d}, \mathbf{y}, \boldsymbol{\theta}) = D_{KL}\big( p(\boldsymbol{\theta} \mid \mathbf{y}, \mathbf{d}) \,\|\, p(\boldsymbol{\theta} \mid \mathbf{d}) \big) = \int_{\Theta} p(\boldsymbol{\theta} \mid \mathbf{y}, \mathbf{d}) \ln \frac{p(\boldsymbol{\theta} \mid \mathbf{y}, \mathbf{d})}{p(\boldsymbol{\theta} \mid \mathbf{d})}\, \mathrm{d}\boldsymbol{\theta}. \qquad (5) $$
It should be noted that in the integral of Equation (5) the parameter vector θ serves as a dummy variable. Therefore, u ( d , y , θ ) is not a function of θ . Thus, the expected utility function in Equation (3) can be written as:
$$ U(\mathbf{d}) = \int_{\mathcal{Y}} \int_{\Theta} p(\boldsymbol{\theta} \mid \mathbf{y}, \mathbf{d}) \ln \frac{p(\boldsymbol{\theta} \mid \mathbf{y}, \mathbf{d})}{p(\boldsymbol{\theta} \mid \mathbf{d})}\, p(\mathbf{y} \mid \mathbf{d})\, \mathrm{d}\boldsymbol{\theta}\, \mathrm{d}\mathbf{y}, \qquad (6) $$
where U ( d ) is called the expected Shannon information gain [26] or the Lindley information measure [18].

2.2. Optimal Design of the SHM System

In order to provide a comprehensive strategy to optimize the sensor network, the number of measurements $n_y$ and the pdf $p_\epsilon$ of the so-called prediction error $\boldsymbol{\epsilon} \in \mathbb{R}^{n_y}$ are taken into account as unknown variables of the relevant optimization problem.
Let the prediction error $\boldsymbol{\epsilon}$ be sampled from a zero-mean Gaussian noise $p_\epsilon = \mathcal{N}(\mathbf{0}, \boldsymbol{\Sigma})$, where $\boldsymbol{\Sigma}$ is the covariance matrix (in principle, however, the proposed method can be applied to any kind of $p_\epsilon$). The expected Shannon information gain then generally depends on the sensor configuration, the number of measurements, and the prediction error, i.e., $U = U(\mathbf{d}, n_y, \boldsymbol{\Sigma})$.
Within this framework, the prediction error accounts for the measurement errors related to the sensor characteristics and the model error associated with the intrinsic numerical approximations. Assuming independence between these two error sources, the covariance matrix can be written as:
$$ \boldsymbol{\Sigma} = \boldsymbol{\Sigma}_m + \boldsymbol{\Sigma}_n, \qquad (7) $$
where $\boldsymbol{\Sigma}_m$ and $\boldsymbol{\Sigma}_n$ respectively account for the model and the measurement error. In [27], it has been shown that the optimal sensor configuration can also be affected by the spatial correlation among different measurements, which can be taken into account in $\boldsymbol{\Sigma}_m$. In practice, the correlation between any pair of measurements decays exponentially with the distance between their locations; a spatial correlation length can then be introduced to constrain the optimal spatial configuration, as sketched below. $\boldsymbol{\Sigma}_n$ can instead be related to the type of sensors to be employed in the SHM system, that is, to the instrumental noise, which depends on their characteristics (e.g., the signal-to-noise ratio).
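To make the structure of $\boldsymbol{\Sigma}$ concrete, the following sketch (not from the paper; the sensor locations, correlation length, and noise levels are placeholders) assembles Equation (7) with an exponentially decaying spatial correlation for the model error and an uncorrelated measurement-noise term.

```python
import numpy as np

# Sketch of Eq. (7): prediction-error covariance as the sum of a spatially
# correlated model-error term and an uncorrelated measurement-noise term.
# Sensor coordinates, correlation length and noise levels are placeholders.

coords = np.array([[0.0, 0.0, 10.0],
                   [0.0, 0.0, 50.0],
                   [15.0, 0.0, 90.0]])      # hypothetical sensor locations [m]
sigma_model = 1e-4                          # model-error std
corr_length = 20.0                          # spatial correlation length [m]
sigma_noise = 1e-3                          # sensor (measurement) noise std

dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
Sigma_m = sigma_model**2 * np.exp(-dist / corr_length)   # exponential decay with distance
Sigma_n = sigma_noise**2 * np.eye(len(coords))           # unique sensor type: sigma^2 * I
Sigma = Sigma_m + Sigma_n
```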
For the sake of simplicity, it is assumed that the sensor type is unique, so that the measurement noise can be accounted for through $\boldsymbol{\Sigma}_n = \sigma^2 \mathbf{I}$, and that the model error $\boldsymbol{\Sigma}_m$ is a constant term which can affect the optimal configuration but is not handled as a further object of the optimization procedure. The resulting optimization statement thus reads:
$$ (\mathbf{d}^*, n_y^*, \sigma^*) = \arg\max\ U(\mathbf{d}, n_y, \sigma). \qquad (8) $$
Three types of constraints must be taken into account in the problem:
(a) identifiability constraint: $n_y > n_{iden}$, where $n_{iden}$ is the minimum number of measurements required to guarantee the identifiability of the parameters $\boldsymbol{\theta}$ (see [17,28,29]);
(b) technological constraint: $\sigma > \sigma_{sens}$, where $\sigma_{sens}$ is the lowest standard deviation of the measurement noise associated with the sensors available on the market and chosen to measure the structural output;
(c) cost constraint: $C(n_y, \sigma) \le B$, where $C(n_y, \sigma)$ is the cost model of the SHM system and $B$ is the maximum budget available for SHM.
The whole optimization problem can therefore be stated as:
$$ (\mathbf{d}^*, n_y^*, \sigma^*) = \arg\max\ U(\mathbf{d}, n_y, \sigma), \quad \text{subject to } n_y > n_{iden},\ \ \sigma > \sigma_{sens},\ \ C(n_y, \sigma) \le B. \qquad (9) $$
Regarding the cost model $C(n_y, \sigma)$, the simplest possible choice consists of a fixed sensor network cost $C_0$, which for example includes the data acquisition hardware, the database, and the assembly, plus a variable cost (i.e., the cost of all the sensors to be deployed over the structure). Accordingly:
$$ C(n_y, \sigma) = C_0 + c(\sigma)\, n_y, \qquad (10) $$
where $c(\sigma)$ is the unit cost per sensor.
In order to solve the optimization problem, a possible approach would be to embed the unknown variables $n_y$ and $\sigma$ into the design variable vector $\mathbf{d}$. An alternative approach, which is particularly suitable for real applications where only a limited set of sensor types is available, is to explore the function $\bar{U}(n_y, \sigma) = U(\mathbf{d}^*, n_y, \sigma)$ [30], which represents the maximum of the expected Shannon information gain over a search grid of points $\{n_y, \sigma\}$; $\mathbf{d}^*$ is the optimal configuration obtained by solving the relevant optimization statement with fixed values of $n_y$ and $\sigma$. Since $\mathbf{d}^*$ depends on the choice of $(n_y, \sigma)$, the function $\bar{U} = \bar{U}(n_y, \sigma)$ depends exclusively on $(n_y, \sigma)$.
In place of the preceding formulation, based on a budget constraint, a procedure based on a cost–benefit analysis can be followed (see [31,32]). In the problem at hand, the benefit is represented by the expected Shannon information gain. Although $U(\mathbf{d}, n_y, \sigma)$ cannot be directly converted into an expected monetary gain (benefit), it is possible to define a utility–cost index (UCI) through [33]:
$$ \mathrm{UCI}(\mathbf{d}, n_y, \sigma) = \frac{U(\mathbf{d}, n_y, \sigma)}{C(n_y, \sigma)}, \qquad (11) $$
whose measurement unit is [nat/€], [nat] standing for the natural unit of information. The associated optimization problem would then be:
$$ (\mathbf{d}^*, n_y^*, \sigma^*) = \arg\max\ \frac{U(\mathbf{d}, n_y, \sigma)}{C(n_y, \sigma)}, \quad \text{subject to } n_y > n_{iden},\ \ \sigma > \sigma_{sens},\ \ C(n_y, \sigma) \le B. \qquad (12) $$
This optimization formulation allows the most efficient SHM design to be obtained (i.e., to maximize the information per unitary cost).
The same considerations reported previously hold for the optimization problem in Equation (12). The optimal solutions can be obtained by maximizing the associated objective function $\overline{\mathrm{UCI}}(n_y, \sigma) = \mathrm{UCI}(\mathbf{d}^*, n_y, \sigma)$, where $\mathbf{d}^*$ is the optimal spatial configuration for each $\{n_y, \sigma\}$ pair in the search grid.
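A possible grid-search implementation of the two decision rules is sketched below; the utilities $\bar{U}$, the cost figures, the budget, and the unit-cost law $c(\sigma)$ are placeholders, since the paper does not prescribe specific values.

```python
import numpy as np

# Sketch of the two decision rules of Eqs. (9) and (12) over a discrete
# (n_y, sigma) search grid, assuming U_bar[i, j] = max_d U(d, n_y[i], sigma[j])
# has already been computed by the inner placement optimization.
# Utilities, cost figures and the unit-cost law c(sigma) are placeholders.

n_y_grid = np.arange(1, 11)
sigma_grid = np.array([1e-4, 3e-4, 1e-3])
U_bar = np.random.default_rng(0).random((len(n_y_grid), len(sigma_grid)))  # placeholder [nat]

C0 = 500.0                                   # fixed network cost [EUR]
def unit_cost(sigma):                        # hypothetical: quieter sensors cost more
    return 200.0 * (1e-3 / sigma) ** 0.5

cost = C0 + unit_cost(sigma_grid)[None, :] * n_y_grid[:, None]
budget = 2500.0
feasible = cost <= budget

# Rule 1 (Eq. 9): maximize information subject to the budget constraint.
U_constrained = np.where(feasible, U_bar, -np.inf)
i1, j1 = np.unravel_index(np.argmax(U_constrained), U_bar.shape)

# Rule 2 (Eq. 12): maximize the utility-cost index UCI = U_bar / cost.
UCI = np.where(feasible, U_bar / cost, -np.inf)
i2, j2 = np.unravel_index(np.argmax(UCI), UCI.shape)

print("max-U design:   n_y =", n_y_grid[i1], "sigma =", sigma_grid[j1])
print("max-UCI design: n_y =", n_y_grid[i2], "sigma =", sigma_grid[j2])
```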
A comparison between the results of the two strategies defined in Equations (9) and (12) is discussed in Section 6. It is important to underline now that, because the measurements y depend on the loading conditions, the optimal sensor placement depends on them as well. Therefore, in order to obtain a sensor network design which is robust with respect to the input loading, several optimizations under different loads should be performed, and the final sensor network configuration should be chosen as the one providing the maximum value of the objective function or, alternatively, as a kind of envelope of all the available solutions.

3. Numerical Approach

As explained in Section 2.1, the optimal design of the SHM system is obtained by maximizing the expected Shannon information gain $U(\mathbf{d}, n_y, \sigma)$ (see Equation (9)), or a function related to it (see Equation (12)). At given values of $n_y$ and $\sigma$, the optimal experimental design $\mathbf{d}^*$ defines the spatial configuration of the network for which the utility is maximized:
$$ \mathbf{d}^* = \arg\max_{\mathbf{d} \in \mathcal{D}} \int_{\mathcal{Y}} \int_{\Theta} p(\boldsymbol{\theta} \mid \mathbf{y}, \mathbf{d}) \ln \frac{p(\boldsymbol{\theta} \mid \mathbf{y}, \mathbf{d})}{p(\boldsymbol{\theta} \mid \mathbf{d})}\, p(\mathbf{y} \mid \mathbf{d})\, \mathrm{d}\boldsymbol{\theta}\, \mathrm{d}\mathbf{y} = \arg\max_{\mathbf{d} \in \mathcal{D}} U(\mathbf{d}), \qquad (13) $$
$\mathcal{D}$ being the design space, which is the domain of all the possible experimental settings (e.g., the locations where the sensors can be placed).
Because the experimental design has to be put in place before performing the measurements, the optimal solution $\mathbf{d}^*$ cannot be found by simply maximizing $u(\mathbf{d}, \mathbf{y}, \boldsymbol{\theta})$ with respect to $\mathbf{y}$ and $\boldsymbol{\theta}$, which are random variables. The optimal point is instead sought in the design space $\mathcal{D}$, by exploring the probability distributions $p(\boldsymbol{\theta} \mid \mathbf{y}, \mathbf{d})$ and $p(\mathbf{y} \mid \mathbf{d})$ over the domains $\Theta$ and $\mathcal{Y}$, respectively.
In order to solve the optimization problem, a strategy to compute U ( d ) is needed. Since the double integration in Equation (6) generally cannot be performed analytically, a numerical procedure has to be adopted. Following [34,35] and assuming that p ( θ | d ) = p ( θ ) (i.e., that the prior distribution is independent of the design variable), Equation (6) can be approximated through the associated Monte Carlo (MC) estimator:
$$ \hat{U}(\mathbf{d}) = \frac{1}{N_{out}} \sum_{i=1}^{N_{out}} \Big[ \ln p(\mathbf{y}_i \mid \boldsymbol{\theta}_i, \mathbf{d}) - \ln p(\mathbf{y}_i \mid \mathbf{d}) \Big], \qquad (14) $$
where N o u t is the number of samples θ i and y i to be respectively drawn from p ( θ ) and p ( y | θ = θ i , d ) .
The term p ( y i | d ) can be computed through an analogous MC estimator as:
$$ p(\mathbf{y}_i \mid \mathbf{d}) \approx \frac{1}{N_{in}} \sum_{j=1}^{N_{in}} p(\mathbf{y}_i \mid \boldsymbol{\theta}_j, \mathbf{d}), \qquad (15) $$
where N i n is the number of samples θ j to be drawn from p ( θ ) .
The computational cost of such an MC approach can be reduced by using the same batch of samples θ i = θ j in Equations (14) and (15). The resulting number of likelihood function evaluations then decreases from N i n × N o u t to N = N i n = N o u t (see [34]). The MC estimator of U ^ ( d ) is then obtained as:
$$ \hat{U}(\mathbf{d}) = \frac{1}{N} \sum_{i=1}^{N} \left[ \ln p(\mathbf{y}_i \mid \boldsymbol{\theta}_i, \mathbf{d}) - \ln \left( \frac{1}{N} \sum_{j=1}^{N} p(\mathbf{y}_i \mid \boldsymbol{\theta}_j, \mathbf{d}) \right) \right]. \qquad (16) $$

Model Response

In Equation (16), a major issue is represented by the evaluation of the likelihood function p ( y i | θ j , d ) . Let the structural system, whose SHM network has to be designed, be subjected to a set of forces and constraints. Since its response to the loads depends on the unknown parameters θ , the measurements can be linked to the design variables in accordance with:
$$ \mathbf{y} = \mathbf{L}(\mathbf{d})\, \mathbf{v}(\boldsymbol{\theta}) + \boldsymbol{\epsilon} = \mathcal{M}(\mathbf{d}, \boldsymbol{\theta}) + \boldsymbol{\epsilon}, \qquad (17) $$
where $\mathcal{M}(\mathbf{d}, \boldsymbol{\theta}): \mathbb{R}^{n_d} \times \mathbb{R}^{n_\theta} \to \mathbb{R}^{n_y}$ is the forward model operator which relates the model inputs (i.e., the design variables $\mathbf{d}$ and $\boldsymbol{\theta}$) with the measurements $\mathbf{y}$ under the considered loading, $\mathbf{L} \in \mathbb{R}^{n_y \times n_{dof}}$ is a Boolean operator which selects from $\mathbf{v}$ the actually measured response components, $n_{dof}$ is the number of degrees of freedom (DOFs) of the numerical model, and $\mathbf{v} \in \mathbb{R}^{n_{dof}}$ is the structural response (e.g., displacements, rotations, etc.) of the numerical model for all the $n_{dof}$ degrees of freedom.
Following [28], the likelihood function can then be expressed as:
$$ p(\mathbf{y}_i \mid \boldsymbol{\theta}_j, \mathbf{d}) = p_{\epsilon}\big( \mathbf{y}_i - \mathcal{M}(\mathbf{d}, \boldsymbol{\theta}_j) \big). \qquad (18) $$
It can be underlined that the same approach can be applied to dynamic cases as well. To this end, only Equation (17) has to be modified as follows:
$$ \mathbf{y} = \mathbf{L}(\mathbf{d})\, \boldsymbol{\Phi}(\boldsymbol{\theta}) + \boldsymbol{\epsilon}, \qquad (19) $$
where $\boldsymbol{\Phi} \in \mathbb{R}^{n_{dof} \times N_m}$ is a matrix containing the $N_m$ modal shapes of the structure and $\mathbf{y}$ gathers the relevant measurements. Equation (18) is accordingly changed to [28]:
$$ p(\mathbf{y}_i \mid \boldsymbol{\theta}_j, \mathbf{d}) = \prod_{m=1}^{N_m} p_{\epsilon}\big( \mathbf{y}_i - \mathbf{L}(\mathbf{d})\, \boldsymbol{\Phi}_m(\boldsymbol{\theta}_j) \big). \qquad (20) $$
Apart from the definition of the likelihood function in Equation (20), the rest of the framework also remains valid for the dynamic case. Since this application goes beyond the scope of the present paper, future work will be devoted to the implementation of such an approach to dynamic testing.
Returning to the static case and considering Equations (16) and (18) and knowing that y i = M ( d , θ i ) + ϵ i (see Equation (17)), the MC estimator of the expected Shannon information gain is obtained as:
$$ \hat{U}(\mathbf{d}) = \frac{1}{N} \sum_{i=1}^{N} \ln p_{\epsilon}(\boldsymbol{\epsilon}_i) - \frac{1}{N} \sum_{i=1}^{N} \ln \left[ \frac{1}{N} \sum_{j=1}^{N} p_{\epsilon}\big( \mathcal{M}(\mathbf{d}, \boldsymbol{\theta}_i) + \boldsymbol{\epsilon}_i - \mathcal{M}(\mathbf{d}, \boldsymbol{\theta}_j) \big) \right]. \qquad (21) $$
The MC estimator is thus offered as the sum of two terms: the first one depends on the prediction error only; the second one depends on the design variables as well as on the parameters.
If the prediction error $\boldsymbol{\epsilon}$ can be assumed independent of the design variable $\mathbf{d}$ (i.e., if $p_\epsilon$ does not depend on $\mathbf{d}$), the first term in Equation (21) turns out to be independent of $\mathbf{d}$. It can therefore be dropped from the computation, as we are interested only in the design vector $\mathbf{d}$ providing the optimum; in this way, the computing time required for the evaluation of the objective function is significantly reduced. This case occurs, for example, if a unique type of sensor is planned to be installed on the structure. If the standard deviation of the measurements instead depends on the design variable (i.e., if $\sigma = \sigma(\mathbf{d})$), the first term must be kept in the objective function, because it affects the optimal solution.
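A compact sketch of the estimator in Equation (21) for an isotropic Gaussian prediction error is reported below; `forward` stands for the (possibly surrogate) model $\mathcal{M}(\mathbf{d}, \boldsymbol{\theta})$ at a fixed candidate design, and the function signature is an assumption of this sketch rather than the authors' implementation.

```python
import numpy as np

# Sketch of the Monte Carlo estimator of Eq. (21) for the expected Shannon
# information gain, with an isotropic Gaussian prediction error of std sigma.
# `forward(theta)` returns the n_y measured response components for a fixed design d.

def expected_info_gain(forward, theta_samples, sigma, rng):
    N = theta_samples.shape[0]
    Y = np.array([forward(th) for th in theta_samples])       # (N, n_y) model responses
    n_y = Y.shape[1]
    eps = rng.normal(0.0, sigma, size=(N, n_y))                # prediction-error samples

    # First term: (1/N) sum_i ln p_eps(eps_i); constant w.r.t. d if sigma is fixed.
    log_p_eps = -0.5 * np.sum((eps / sigma) ** 2, axis=1) \
                - n_y * np.log(sigma * np.sqrt(2.0 * np.pi))
    term1 = log_p_eps.mean()

    # Second term: (1/N) sum_i ln [ (1/N) sum_j p_eps(M(theta_i)+eps_i - M(theta_j)) ].
    diff = Y[:, None, :] + eps[:, None, :] - Y[None, :, :]     # (N, N, n_y)
    log_like = -0.5 * np.sum((diff / sigma) ** 2, axis=2) \
               - n_y * np.log(sigma * np.sqrt(2.0 * np.pi))
    # (for very small sigma a log-sum-exp formulation would be numerically safer)
    term2 = np.mean(np.log(np.mean(np.exp(log_like), axis=1)))

    return term1 - term2
```

In an actual run, `forward` would be the PCA–PCE metamodel of Equation (28) evaluated for the candidate design $\mathbf{d}$, and the same batch `theta_samples` would be reused for every candidate design, as discussed in Section 5.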

4. Surrogate Modeling

The computational cost of the MC estimator in Equation (21), as is true of any other MC analysis, may be attributed to the repeated evaluation of the model response M ( d , θ i ) , for each of the N samples θ i = θ j drawn from p ( θ ) . From a practical point of view, the computation of U ^ ( d ) can become infeasible due to the high number of degrees of freedom (DOFs) in the numerical models (e.g., of real-life structures).
To reduce the overall computational costs of the evaluation of the model response in the optimization procedure, the exploitation of surrogate models has been proposed in [35] and applied to SHM sensor network optimization in [36]. A surrogate model (or metamodel) is aimed at providing the relationship between input and output through a more computationally efficient formulation. These approaches can therefore be classified as data-driven ones: the underlying physics of the problem is lost. Accordingly, if the physical behavior of the problem changes, a new surrogate model should be built by using the new relevant input–output data. Alternative methods to reduce the computational cost of model evaluation are model-based (e.g., [37,38]).
One of the most widely exploited types of surrogate models is based on polynomial chaos expansion (PCE). PCE was first introduced in [15,39] for standard Gaussian random variables, and was then generalized to other probability distributions in [40,41,42].
Following the investigation in [43], it is assumed that the input random vector is constituted by the unknown parameters only, featuring a joint pdf p ( θ ) , whereas the design variable d is not considered in the surrogate model.
Assuming a finite variance model, the PCE of the response v in Equation (17) reads:
$$ v_k = \mathcal{M}_k(\boldsymbol{\theta}) = \sum_{\boldsymbol{\alpha} \in \mathbb{N}^{M}} \phi_{\boldsymbol{\alpha}}\, \Psi_{\boldsymbol{\alpha}}(\boldsymbol{\theta}), \qquad k = 1, \ldots, n_{dof}, \qquad (22) $$
where the $\Psi_{\boldsymbol{\alpha}}$ are multivariate polynomials which are orthonormal with respect to $p(\boldsymbol{\theta})$; $\boldsymbol{\alpha} = \{\alpha_1, \ldots, \alpha_M\} \in \mathbb{N}^M$ is a multi-index associated with the components of $\Psi$, and the $\phi_{\boldsymbol{\alpha}} \in \mathbb{R}$ are the related coefficients. For real-life applications, the sum in Equation (22) is truncated by retaining only those polynomials whose total degree $|\boldsymbol{\alpha}|$ is less than a certain value $p$:
$$ v_k \approx \mathcal{M}_k^{PCE}(\boldsymbol{\theta}) = \sum_{|\boldsymbol{\alpha}| \le p} \phi_{\boldsymbol{\alpha}}\, \Psi_{\boldsymbol{\alpha}}(\boldsymbol{\theta}), \qquad k = 1, \ldots, n_{dof}, \qquad (23) $$
where $|\boldsymbol{\alpha}| = \sum_{i=1}^{M} \alpha_i$ and $\mathcal{M}_k^{PCE}$ is the surrogate model. The response $\mathbf{v}$ can be approximated in a component-wise fashion by building a set of $n_{dof}$ PCE surrogate models according to:
$$ \mathbf{v} \approx \mathcal{M}^{PCE}(\boldsymbol{\theta}) = \begin{bmatrix} \mathcal{M}_1^{PCE}(\boldsymbol{\theta}) \\ \vdots \\ \mathcal{M}_{n_{dof}}^{PCE}(\boldsymbol{\theta}) \end{bmatrix}. \qquad (24) $$
In order to compute the unknown polynomial coefficients ϕ α for each surrogate, both intrusive and non-intrusive methods can be adopted [44]. Intrusive approaches rely on the projection of the original computational model onto the subspace spanned by the PCE, through a Galerkin projection [45]. In such methods, the variables in the governing equations are replaced by their polynomial chaos expansions. For instance, for linear structural problems, the stiffness matrix and the response vector are approximated through a truncated expansion, leading to a linear system of equations to be solved [46]. While these methods demonstrate an increase of the computational cost which is linear with the number of basis polynomials, they are not suitable for the current purposes, since they require the custom modification of the computational solver.
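As a brief reminder (a textbook formulation, not developed in the paper), for a linear structural problem $\mathbf{K}(\boldsymbol{\theta})\,\mathbf{v}(\boldsymbol{\theta}) = \mathbf{f}$ the intrusive route leads to the coupled stochastic Galerkin system:

$$ \mathbf{K}(\boldsymbol{\theta}) \approx \sum_{i} \mathbf{K}_i\, \Psi_i(\boldsymbol{\theta}), \quad \mathbf{v}(\boldsymbol{\theta}) \approx \sum_{j} \mathbf{v}_j\, \Psi_j(\boldsymbol{\theta}), \quad \sum_{i}\sum_{j} \mathbb{E}\big[\Psi_i \Psi_j \Psi_k\big]\, \mathbf{K}_i\, \mathbf{v}_j = \mathbb{E}\big[\Psi_k\big]\, \mathbf{f} \quad \forall k. $$

Solving for all the coefficient vectors $\mathbf{v}_j$ at once requires assembling and solving this enlarged coupled system inside the solver, which is precisely the modification that the non-intrusive methods described next avoid.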
Non-intrusive methods instead allow the bases to be computed by simply processing a batch of sampled input variables θ and the corresponding model evaluations v , which form the so-called experimental design. No manipulations of the solver are needed, rendering this approach particularly suitable for general-purpose problems. Two methods can be used for non-intrusively computing the coefficients: the projection approach [47,48], where the computation of each coefficient is formulated as a multi-dimensional integral; and least-squares minimization [49].
The latter method is employed here, since an arbitrary number of samples can be used in order to estimate the coefficients of the expansion in Equation (23). The corresponding formulation of the least-squares minimization problem is:
$$ \hat{\boldsymbol{\phi}} = \arg\min_{\boldsymbol{\phi}}\ \mathbb{E}\Big[ \big( \boldsymbol{\phi}^T \boldsymbol{\Psi}(\boldsymbol{\theta}) - \mathcal{M}(\boldsymbol{\theta}) \big)^2 \Big], \qquad (25) $$
where ϕ ^ is the whole set of coefficients to be estimated. In order to reduce the computational cost of the least-squares approach, a method based on least angle regression introduced in [50,51] is adopted. The method relies on the selection of the most significant coefficients of the PC expansion, allowing a reduction in the number of model evaluations, which are required to build the experimental design for the coefficient estimation.
Following the previously described non-intrusive method, a set of $N_{PCE} \ll N$ samples of the input variable $\boldsymbol{\theta}$ must be drawn from $p(\boldsymbol{\theta})$, and the corresponding model responses $\mathbf{v}$ are numerically computed through the model $\mathcal{M}(\boldsymbol{\theta}, \mathbf{d})$. Once the metamodel $\mathcal{M}^{PCE}$ is built, the $N$ samples required for the estimation of $\hat{U}(\mathbf{d})$ can be computed through the surrogate. The number $N_{PCE}$ of input–output samples needed to build such a surrogate model should be chosen by considering the required accuracy of the metamodel in predicting the response of the original model.
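A minimal sketch of such a non-intrusive least-squares fit for a single output component is given below, using tensorized Legendre polynomials (orthogonal with respect to the uniform priors used in Section 6); the toy model, sample size, and truncation degree are illustrative assumptions, not the authors' settings.

```python
import numpy as np
from itertools import product
from numpy.polynomial.legendre import legval

# Minimal sketch of a non-intrusive, least-squares PCE fit (Eqs. 23 and 25)
# for a single output component. Tensorized Legendre polynomials are used
# (orthogonal, not orthonormal, which is fine for a least-squares fit).

M, p_deg, N_pce = 2, 3, 200                        # n. of inputs, total degree, design size
rng = np.random.default_rng(0)
theta = rng.uniform(-1.0, 1.0, size=(N_pce, M))    # inputs rescaled to [-1, 1]
v = np.cos(theta[:, 0]) * theta[:, 1] ** 2          # toy full-model output component

# Multi-indices alpha with total degree <= p_deg.
alphas = [a for a in product(range(p_deg + 1), repeat=M) if sum(a) <= p_deg]

def basis(theta_mat):
    # Evaluate each multivariate Legendre basis function on all samples.
    cols = []
    for a in alphas:
        col = np.ones(theta_mat.shape[0])
        for dim, deg in enumerate(a):
            coeff = np.zeros(deg + 1)
            coeff[deg] = 1.0
            col = col * legval(theta_mat[:, dim], coeff)
        cols.append(col)
    return np.column_stack(cols)

Psi = basis(theta)
phi, *_ = np.linalg.lstsq(Psi, v, rcond=None)       # least-squares PCE coefficients

def surrogate(theta_new):
    return basis(np.atleast_2d(theta_new)) @ phi     # cheap PCE evaluation
```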
According to the adopted formulation for surrogate modeling, $n_{dof}$ PCE surrogates would be required (see Equation (24)), thus making the computation intractable. Dimensionality reduction strategies are therefore required to overcome this computational issue. Principal component analysis (PCA) offers a statistical tool for handling large datasets; it was first introduced by Pearson [52] and Hotelling [53], and later developed in [54,55] for different fields of application: Karhunen–Loève decomposition [56,57] in signal processing, proper orthogonal decomposition (POD) [58] in mathematics, and singular value decomposition (SVD) in mechanical engineering [59]. Some examples of the application of POD in structural health monitoring can be found in [60,61,62,63], where the method has been employed for the order reduction of dynamical models feeding Bayesian updating schemes.
PCA allows the computational burden to be reduced according to the following procedure. Let the model parameters (i.e., the input variables of the surrogate) be sampled from the prior pdf, $\boldsymbol{\theta}_i \sim p(\boldsymbol{\theta})$, with $i = 1, \ldots, N_{PCE}$. The response vectors, which are the output variables of the surrogate, are computed as $\mathbf{v}_i = \mathbf{v}(\boldsymbol{\theta}_i)$ through the full-order numerical model, building the so-called experimental design of the surrogate model. The model response data are gathered in the matrix $\mathbf{V} = [\mathbf{v}_1 \cdots \mathbf{v}_{N_{PCE}}] \in \mathbb{R}^{n_{dof} \times N_{PCE}}$. $\mathbf{V}$ is then projected onto a new space of $n_{dof}$ uncorrelated variables:
$$ \mathbf{T} = \mathbf{W}\, \mathbf{V}, \qquad (26) $$
where $\mathbf{W} \in \mathbb{R}^{n_{dof} \times n_{dof}}$ is a square orthogonal matrix, whose rows are the eigenvectors of the matrix $\mathbf{V}\mathbf{V}^T$ and form an orthogonal basis; $\mathbf{T} \in \mathbb{R}^{n_{dof} \times N_{PCE}}$ is the matrix of the principal component scores (i.e., the representation of $\mathbf{V}$ in the principal component space).
The dimension of the response matrix is reduced through the PCA by retaining only the first $l \ll n_{dof}$ components in the solution:
$$ \mathbf{T}_l = \mathbf{W}_l\, \mathbf{V}, \qquad (27) $$
where $\mathbf{T}_l \in \mathbb{R}^{l \times N_{PCE}}$ is the reduced-order response matrix and $\mathbf{W}_l \in \mathbb{R}^{l \times n_{dof}}$ is orthonormal.
The formulation defined in Equation (24) is then modified by setting the output of the PCE surrogate model as the reduced-dimension response vector, thus establishing a relation between the parameters θ and the first principal components of v .
In conclusion, by combining the PCE surrogate and the PCA dimensionality reduction technique, the original model response M ( θ , d ) can be approximated through:
$$ \mathcal{M}(\boldsymbol{\theta}, \mathbf{d}) \approx \mathcal{M}^{META}(\boldsymbol{\theta}, \mathbf{d}) = \mathbf{L}(\mathbf{d})\, \mathbf{W}_l^T\, \mathcal{M}^{PCE}(\boldsymbol{\theta}), \qquad (28) $$
where $\mathcal{M}^{META}(\boldsymbol{\theta}, \mathbf{d})$ is the PCA–PCE-based metamodel.
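The following sketch illustrates Equations (26)–(28) on a placeholder snapshot matrix: the principal directions are obtained through an SVD (with mean-centering added for numerical robustness, a detail not spelled out above), and the reduced scores are what the PCE surrogates would be trained to predict.

```python
import numpy as np

# Sketch of the PCA reduction of Eqs. (26)-(28): the snapshot matrix V
# (n_dof x N_PCE) is compressed to l principal components; one surrogate per
# retained component would then map theta to the reduced scores.
# V is a random placeholder standing in for full-model responses.

n_dof, N_pce, l = 500, 100, 5
rng = np.random.default_rng(1)
V = rng.normal(size=(n_dof, N_pce))             # placeholder response snapshots

V_mean = V.mean(axis=1, keepdims=True)
U_svd, s, _ = np.linalg.svd(V - V_mean, full_matrices=False)
W_l = U_svd[:, :l].T                             # (l x n_dof): leading principal directions

T_l = W_l @ (V - V_mean)                         # reduced-order scores, Eq. (27)

# Reconstruction used inside the metamodel of Eq. (28):
#   M_meta(theta, d) ~ L(d) @ (W_l.T @ M_pce(theta) + V_mean.ravel()),
# where M_pce(theta) would return the l predicted scores for a new theta.
V_approx = W_l.T @ T_l + V_mean                  # rank-l approximation of the snapshots
rel_err = np.linalg.norm(V - V_approx) / np.linalg.norm(V)
```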
The resulting MC estimator then reads:
$$ \hat{U}(\mathbf{d}) = \frac{1}{N} \sum_{i=1}^{N} \ln p_{\epsilon}(\boldsymbol{\epsilon}_i) - \frac{1}{N} \sum_{i=1}^{N} \ln \left[ \frac{1}{N} \sum_{j=1}^{N} p_{\epsilon}\Big( \mathbf{L}(\mathbf{d})\, \mathbf{W}_l^T\, \mathcal{M}^{PCE}(\boldsymbol{\theta}_i) + \boldsymbol{\epsilon}_i - \mathbf{L}(\mathbf{d})\, \mathbf{W}_l^T\, \mathcal{M}^{PCE}(\boldsymbol{\theta}_j) \Big) \right], \qquad (29) $$
where, according to the formulation presented, the design variable $\mathbf{d}$ defining the spatial configuration of the sensor network is arranged as follows:
$$ \mathbf{d} = \big[ \mathbf{d}_1\ \cdots\ \mathbf{d}_s\ \cdots\ \mathbf{d}_{n_y} \big], \qquad (30) $$
where:
$$ \mathbf{d}_s = \{ x_1^s\ \ x_2^s\ \ x_3^s\ \ \delta^s \}^T, \qquad (31) $$
and $x_1^s, x_2^s, x_3^s$ are the coordinates of the location where the $s$-th measurement is supposed to be taken, while $\delta^s$ is a scalar integer value which defines the measured DOF, either a displacement or a rotation.
A possible alternative formulation relies on the nodal labeling of the numerical model DOFs. Despite the beneficial dimension reduction that could be achieved, the adoption of this formulation would be detrimental to the solution of the optimization problem, as it would lead to significant discontinuities of the objective function in the associated search space.
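A small sketch of how such a design vector could be handled in practice is given below: each candidate $(x_1^s, x_2^s, x_3^s, \delta^s)$ block is snapped to the nearest admissible FE node and to an integer DOF label. The node coordinates and the mapping are hypothetical and are not taken from the Pirelli tower model.

```python
import numpy as np

# Sketch of the design-vector encoding of Eqs. (30)-(31): each sensor is
# described by three coordinates plus an integer DOF label, and a candidate
# continuous design point is snapped to the nearest admissible FE node.
# Node coordinates and the number of DOF types are hypothetical.

rng = np.random.default_rng(2)
node_coords = rng.random((200, 3)) * np.array([70.0, 30.0, 130.0])   # mock FE node positions [m]
N_DOF_TYPES = 6                                                      # u_x1..u_x3, phi_x1..phi_x3

def decode(d_flat):
    """Map a flat design vector [x1, x2, x3, delta, ...] to (node index, DOF type) pairs."""
    sensors = []
    for x1, x2, x3, delta in d_flat.reshape(-1, 4):
        node = int(np.argmin(np.linalg.norm(node_coords - np.array([x1, x2, x3]), axis=1)))
        dof = int(np.clip(round(delta), 0, N_DOF_TYPES - 1))
        sensors.append((node, dof))
    return sensors

d = np.array([10.0, 5.0, 120.0, 1.2,      # sensor 1: near the top, measuring u_x2
              60.0, 25.0, 90.0, 0.3])     # sensor 2: measuring u_x1
print(decode(d))
```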

5. Optimization Procedure

The proposed procedure for optimal sensor placement is based on the estimation of the expected Shannon information gain U ( d ) through the MC estimator U ^ ( d ) .
According to [7], a sequential strategy can be adopted to solve the problem. At each iteration, only the position of one sensor is optimized, while all other sensors deployed in the previous algorithm steps are held fixed. This strategy was termed forward sequential sensor placement. On the contrary, with the backward sequential sensor placement strategy, the initial configuration is populated with sensors at all of the nodes, and they are later dropped from the optimal configuration one-by-one. In this regard, such sequential strategies are expected to yield sub-optimal solutions, since they cannot guarantee that the optimal solution (i.e., the global maximum of the objective function) is attained. Independently of the method adopted, the iterations are stopped when the desired number n y of sensors are placed over the structure.
Since the estimator $\hat{U}(\mathbf{d})$ is based on the Monte Carlo sampling of the measurement error $\boldsymbol{\epsilon} \sim p_\epsilon$ and of the parameter vector $\boldsymbol{\theta} \sim p(\boldsymbol{\theta} \mid \mathbf{d})$, the resulting objective function is noisy. As discussed in Section 2.1, the prior pdf can be assumed to be independent of the position of the sensors (i.e., $p(\boldsymbol{\theta} \mid \mathbf{d}) = p(\boldsymbol{\theta})$). The same batch of samples $\boldsymbol{\theta}$ can therefore be used for each realization of $\mathbf{d}$, resulting in a less noisy objective function. It should be underlined that, under this assumption, the objective function is affected by a constant bias, which therefore does not influence the resulting optimal solutions in terms of sensor configuration. Moreover, since there is no need to re-sample $\boldsymbol{\theta}$ and compute the corresponding structural response for each different sample of $\mathbf{d}$ at each iteration of the optimization procedure, a considerable reduction in the overall computational cost is achieved.
Due to the noisy objective function, and in order to avoid convergence to a false local optimum, the covariance matrix adaptation evolution strategy (CMA-ES) [64], introduced in [65,66], is adopted here; it is an iterative, derivative-free evolutionary algorithm suitable for stochastic optimization problems.
The pseudo-code of the CMA-ES is listed in Algorithm 1. The algorithm is based on an evolutionary strategy where, at each iteration $i$, a total number $N_{opt}$ of samples $\mathbf{d}$ are drawn from a multivariate normal distribution, $\mathbf{d}_j \sim \mathbf{m} + \sigma_c\, \mathcal{N}(\mathbf{0}, \mathbf{C})$, where $\mathbf{C} \in \mathbb{R}^{n_d \times n_d}$ is the covariance matrix, $\mathbf{m} \in \mathbb{R}^{n_d}$ is the mean of the distribution of design points, and $\sigma_c$ is the step size. Then, the values of $\mathbf{m}$, $\mathbf{C}$, and $\sigma_c$ are updated so that the population of new points $\mathbf{d}_{1:N_{opt}}$ moves towards the maximum of the objective function $\hat{U}(\mathbf{d})$. The evolution of the design variable (i.e., the sequence of consecutive steps of the mean $\mathbf{m}$), starting from the initial condition $\mathbf{d}_0$, is performed through the so-called cumulation technique detailed in Algorithm 1; $\mu$ and $\mu_w$ are parameters needed to control the update phase. In Algorithm 1, $c_c$, $c_{\sigma_c}$, $c_1$, and $d_{\sigma_c}$ are parameters which control the optimization procedure and have to be set empirically for each numerical application. The iterations are stopped whenever at least one of the following criteria is fulfilled:
$$ \big| \hat{U}(\mathbf{d}_k) - \hat{U}(\mathbf{d}_{k-1}) \big| \le \rho_U, \qquad \big\| \mathbf{d}_k - \mathbf{d}_{k-1} \big\| \le \rho_d, \qquad (32) $$
where $|\cdot|$ stands for the absolute value of the argument; $\|\cdot\|$ represents an appropriately chosen vector norm (here, the $L_2$ norm); and $\rho_U$ and $\rho_d$ are tolerances setting the accuracy of the solution in terms of the objective function and of the design variable, respectively. These parameters cannot be chosen a priori, as they depend on the specific application, on the model discretization, and on the desired accuracy. For further details on the algorithm, the interested reader may refer to [16,67].
Algorithm 1 Covariance matrix adaptation evolution strategy
[The full listing of Algorithm 1 is provided as an image in the original article.]
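Since the full listing is only available as an image, the following is a deliberately stripped-down sketch of the sampling/selection/adaptation loop: it keeps the candidate sampling $\mathbf{d}_j \sim \mathbf{m} + \sigma_c\, \mathcal{N}(\mathbf{0}, \mathbf{C})$ and a rank-$\mu$ covariance update, but omits the cumulation paths and step-size control of the actual CMA-ES, so it should not be read as the authors' Algorithm 1.

```python
import numpy as np

# Simplified sketch of the CMA-ES loop of Algorithm 1: sample candidates,
# rank them by the (noisy) objective, and adapt mean and covariance from the
# best candidates. Only a rank-mu covariance update with a fixed learning
# rate is shown; cumulation and step-size control are omitted for brevity.

def cma_es_sketch(objective, d0, sigma_c=1.0, n_pop=20, n_iter=100, rho_d=1e-3, seed=0):
    rng = np.random.default_rng(seed)
    n_d = d0.size
    mu = n_pop // 2
    weights = np.log(mu + 0.5) - np.log(np.arange(1, mu + 1))
    weights /= weights.sum()                              # recombination weights

    m, C = d0.astype(float), np.eye(n_d)
    for _ in range(n_iter):
        # Sample candidate designs d_j ~ m + sigma_c * N(0, C).
        D = rng.multivariate_normal(m, sigma_c**2 * C, size=n_pop)
        fitness = np.array([objective(d) for d in D])
        best = D[np.argsort(fitness)[::-1][:mu]]          # keep the mu best (maximization)

        m_new = weights @ best                            # weighted recombination of the mean
        steps = (best - m) / sigma_c
        C = 0.8 * C + 0.2 * (steps * weights[:, None]).T @ steps   # rank-mu update

        if np.linalg.norm(m_new - m) <= rho_d:            # design-variable criterion of Eq. (32)
            m = m_new
            break
        m = m_new
    return m

# Usage on a smooth toy objective standing in for U_hat(d):
d_opt = cma_es_sketch(lambda d: -np.sum((d - 3.0) ** 2), d0=np.zeros(4))
```

In practice a complete implementation, such as the CMA-ES code acknowledged by the authors, should be preferred; this sketch only conveys the structure of the loop.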
The overall procedure for computing the optimal sensor configuration is listed in Algorithm 2, and the corresponding flowchart is shown in Figure 1. First, the parameter vector $\boldsymbol{\theta}$ is sampled from the prior pdf $p(\boldsymbol{\theta})$, which is chosen a priori. For each sample $\boldsymbol{\theta}_i$ (with $i = 1, \ldots, N_{PCE}$), the corresponding response $\mathbf{v}_i = \mathbf{v}(\boldsymbol{\theta}_i)$ is computed through the numerical model. Then, the dimension of the response vector is reduced from $n_{dof}$ to $l$ by performing the PCA of $\mathbf{V} = [\mathbf{v}_1 \cdots \mathbf{v}_{N_{PCE}}]$. A total number $l$ of model surrogates is built by considering $\boldsymbol{\theta}$ as the input variable and the components of the reduced-space matrix $\mathbf{T}_l$ as the output variables. A fresh batch of $N \gg N_{PCE}$ samples $\boldsymbol{\theta}$ is drawn from the prior $p(\boldsymbol{\theta})$, and the corresponding system response is computed through the PCE surrogates. In the end, the optimal configuration is obtained through the CMA-ES optimization method listed in Algorithm 1, where the evaluation of the objective function is performed through the MC estimator defined in Equation (16).
Algorithm 2 Algorithm for the optimization of SHM sensor networks through Bayesian experimental design
[The full listing of Algorithm 2 is provided as an image in the original article.]

6. Results: Application to the Monitoring of a Tall Building

The framework described in the present work was applied to a real large-scale structure: the Pirelli Tower, a 130 m tall building in Milan (Italy). The building consists of 35 stories above ground, approximately 70 m long and 30 m wide in plan (see Figure 2). The structural system is entirely made of reinforced concrete. Four symmetric triangular cores at the two extremities of each storey are connected by T-shaped beams (see Figure 2b). The structure was modeled using the commercial software SAP2000 v19 (Computers and Structures, Inc., Berkeley, CA, USA); the associated finite element (FE) model, shown in Figure 2a, consists of 4106 nodes with 6 DOFs each (the 3 displacements $u_{x_1}, u_{x_2}, u_{x_3}$ and the 3 rotations $\varphi_{x_1}, \varphi_{x_2}, \varphi_{x_3}$ about the axes of the reported orthonormal reference frame), resulting in a total number of DOFs $n_{dof}$ = 24,500. The structure is supposed to be subjected to a horizontal force acting in the $x_2$ direction on the top floor (Figure 2b); see also [14,38,68,69]. The force was assumed to be eccentric, as shown in Figure 2b, in order to induce a complex bending–torsional mechanical response of the tower. For further details on the structural characteristics and on the FE model, the reader may refer to [68,70,71].
The sensor network was optimized with respect to the $n_\theta = 6$ parameters listed in Table 1. The parameters were chosen to render the example as general as possible: both mechanical and geometrical properties are handled, associated with both vertical and horizontal structural members. The chosen parameters were the Young's moduli of column groups LC and RC, the Young's moduli of beam groups LB, CB, and RB, and the beam thickness of group CB. The prior pdfs of each parameter are also reported in Table 1. The prior pdfs of the concrete Young's moduli were assumed to be uniform, with lower and upper bounds respectively equal to 24 GPa and 36 GPa. The prior pdf of the beam thickness was considered to be uniform as well, with lower and upper bounds respectively equal to 0.7 m and 0.9 m.
Since the structural model features both displacement and rotation DOFs at each node, the design variable d must be defined such that both the spatial position of the sensors and the physical quantity to be measured are taken into account, in accordance with Equations (30) and (31).
As discussed in Section 2.2, in the optimization procedure it is assumed that only the standard deviation $\sigma$ associated with the measurement error can be varied, while the model error is supposed to be a constant term. As $\sigma$ is assumed to depend on the sensor characteristics, the procedure also allows the optimal type of sensor to be chosen in order to better estimate the selected parameters and, in turn, possible variations of the estimated properties from the initial health state of the structure.
The contour plot of the objective function $\bar{U}(n_y, \sigma) = U(\mathbf{d}^*, n_y, \sigma)$ is shown in Figure 3. Here, the objective function is computed at the discrete grid points $(n_y, \sigma^2) \in \{1, 2, \ldots, 10\} \times \{10^{-8}, 10^{-7.5}, 10^{-7}, 10^{-6.5}, 10^{-6}\}$. As expected, the maximum value of the expected Shannon information gain increases as the number of sensors increases, as analytically proven in [6] and numerically shown in [72], and as the standard deviation decreases, since more information is provided by the SHM system. The associated optimal sensor configuration, which corresponds to the maximum of the objective function, is shown in Figure 4. A further discussion of the optimal sensor placement obtained with this method can be found in [43].
It can also be observed that the increase in the expected Shannon information gain reduces as more measurements are allowed for; the marginal gain $\partial U / \partial n_y$ is therefore a decreasing function of $n_y$. From a decision-making perspective, it is interesting to underline that this behavior corresponds to the so-called “law of diminishing marginal utility”, also known as Gossen's First Law [73], which is used in economics for the optimization of resource allocation. The law states that the marginal utility of each unit decreases as the supply of units increases. In the problem of optimal SHM system design, the utility of the sensor network is quantified by the expected Shannon information gain [18], and the unit is represented by each measurement. Applications of this law to sensor network optimization in different engineering fields can also be found in [74,75,76].
As a simple linear cost model is assumed (see Equation (10)), the red lines in Figure 3 represent different budget constraints (i.e., the solutions $\sigma$–$n_y$ of the equation $B = C_0 + c(\sigma)\, n_y$, where $B$ is the total available budget). By using this chart, it is possible to optimally design the SHM network: the optimal point $(\sigma, n_y)$, for which $\bar{U}(n_y, \sigma)$ is maximum, is ruled by the available budgetary constraint and is uniquely associated with the corresponding optimal configuration $\mathbf{d}^*$.
A different approach for decision-making is to define a Pareto-like graph, as shown in Figure 5. Each line corresponds to the optimal design for a certain standard deviation (i.e., a certain type of sensor). The cost saving is defined in order to normalize the cost function with respect to the chosen budget. Accordingly, any solution point located to the left of the vertical line represents a non-optimal design, since the associated cost does not correspond to the best choice of $\mathbf{d}$, $n_y$, and $\sigma$. The oscillations of the objective functions in Figure 5 are due to the possible presence of local maxima. Although this problem cannot be solved a priori, it can be mitigated by running the optimization algorithm several times with different initial points $\mathbf{d}_0$, and choosing the configuration which corresponds to the maximum value of the objective function among the resulting solutions.
This graph can be particularly useful for appropriately allocating economic resources. For a chosen budget, it is possible to select the type of sensors resulting in the highest accuracy, as indicated in Figure 5, together with the number of sensors and their locations, associated with the maximum possible expected information gain. The trend of each Pareto front provides an indication of the change in maximum utility due to a variation of the budget, and thus helps to decide whether additional spending is justified. Moreover, given the value of $U$, it is possible to compare, from an economic point of view, different solutions in terms of the number and type of sensors.
An alternative design approach, discussed in Section 2.2 with Equation (11), is based on the maximization of the ratio $\mathrm{UCI}(n_y, \sigma) = U(\mathbf{d}^*, n_y, \sigma) / C(n_y, \sigma)$. The resulting optimal solution depends on the cost model. In Figure 6a the SHM system is supposed to have a low initial cost (i.e., $C_0 = 500$ €), while in Figure 6b it is supposed to have a high initial cost (i.e., $C_0 = 1000$ €). In both cases, the most efficient employment of resources is reached if the best sensor in terms of measurement noise is chosen, while the optimal number of sensors depends on the cost model.
Note that, while the objective function $\bar{U}(n_y, \sigma)$ always increases as $n_y$ grows and as $\sigma$ decreases, the function $\mathrm{UCI}(n_y, \sigma)$ attains a maximum at a finite $n_y$. This is because, as previously discussed, the increase in information associated with each additional sensor decreases as more sensors are added to the monitoring system. From a cost–benefit point of view, it is therefore not worthwhile to add sensors (i.e., to increase the SHM cost) if the resulting benefit, for example in terms of the additional expected Shannon information gain, is very low.

7. Conclusions

The present paper has introduced a stochastic cost–benefit methodology for the optimal design of structural health monitoring systems.
The benefit or usefulness of the SHM system is quantified through the expected Shannon information gain between the prior and the posterior pdfs of the parameters to be estimated. By maximizing this objective function, it is possible to choose the best position, type, and number of sensors, which guarantees the minimization of the uncertainties associated with the quantities to be estimated, or in other words, the maximization of the information obtained through the measurements.
The objective function can be numerically approximated through a Monte Carlo sampling approach. The resulting estimator is expressed as a double sum of terms, which depend on the likelihood function. Since a high number of model response evaluations is required, a procedure based on surrogate models and model order reduction strategies is proposed. The combination of the PCE surrogate model and a model order reduction technique (PCA) allows a computationally efficient meta-model to be built in order to mimic the relation between input and output variables. Since the resulting objective function is affected by noise, leading to possible undesired local maxima, an evolutionary strategy (CMA-ES) suitable for stochastic problems is used.
In order to find the optimal solution, the cost, the identifiability of the model parameters, and the technological constraints have to be taken into account. A further optimization problem is considered, established by maximizing the information gain per unit cost by means of a cost–benefit analysis.
Application of the framework to a large-scale numerical model demonstrates that the maximum expected Shannon information gain of the SHM system increases as more sensors are added to the system and lower standard deviations of the prediction error are considered. The optimal solution, in terms of maximum information gain, does not necessarily correspond to the most efficient one (see Figure 3), in terms of the ratio between information and cost. This is because the increase in information gain due to additional sensors is reduced as more measurements are considered. A Pareto-front approach can also be followed in order to choose the best solution, both in terms of maximum information and minimum cost (Figure 5).
An alternative procedure based on the maximization of the utility–cost ratio can be implemented to optimally allocate the available resources. In this case, the optimal solution depends on the variation of the sensor network cost with respect to the number of measurements and the sensor type (see Figure 6a,b). It is worth noting that the same consideration also holds for the case where only a few types of sensors are available and a cost model therefore cannot be established: the optimization can be performed in the same way, that is, by computing the maximum values of the objective function (which correspond to the optimal spatial configurations) over the discrete search grid.
The proposed strategy is completely non-intrusive, in that it does not require the computation of the gradient of the objective function, but instead relies exclusively on evaluations of the model response. Moreover, the method is general, and no restrictive assumptions, such as linearity or Gaussianity, are made.
Further work will be dedicated to the application of this framework to dynamic testing and to more complex structural models.

Author Contributions

The authors contributed equally to this work.

Funding

Eleni Chatzi would like to acknowledge the support of the ERC Starting Grant award WINDMIL (ERC-2015-StG # 679843) on the topic of “Smart Monitoring, Inspection and Life-Cycle Assessment of Wind Turbines”.

Acknowledgments

Giovanni Capellari acknowledges the financial support by IDEA League through a Ph.D. Student Grant. The authors also acknowledge the Chair of Risk, Safety and Uncertainty Quantification and the Computational Science and Engineering Laboratory at ETH Zürich for having provided the MATLAB-based software UQLab and CMA-ES, used in the implementation of the method. The authors are indebted to Gianluca Barbella and Federico Perotti, who provided the numerical model of the Pirelli tower.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
SHM  Structural Health Monitoring
PCE  Polynomial Chaos Expansion
PCA  Principal Component Analysis
CMA-ES  Covariance Matrix Adaptation Evolutionary Strategy
KLD  Kullback–Leibler Divergence
UCI  Utility–Cost Index
MC  Monte Carlo
MI  Mutual Information
DOF  Degree of Freedom
FE  Finite Element

References

1. Balageas, D.; Fritzen, C.P.; Güemes, A. Structural Health Monitoring; Wiley-ISTE: London, UK, 2006.
2. Farrar, C.; Worden, K. An introduction to structural health monitoring. Philos. Trans. R. Soc. A 2007, 365, 303–315.
3. Beck, J.; Katafygiotis, L. Updating models and their uncertainties. I: Bayesian statistical framework. J. Eng. Mech. 1998, 124, 455–461.
4. Heredia-Zavoni, E.; Esteva, L. Optimal instrumentation of uncertain structural systems subject to earthquake ground motions. Earthq. Eng. Struct. Dyn. 1998, 27, 343–362.
5. Sobczyk, K. Theoretic information approach to identification and signal processing. Reliab. Optim. Struct. Syst. 1987, 33, 373–383.
6. Papadimitriou, C.; Beck, J.; Au, S.K. Entropy-based optimal sensor location for structural model updating. J. Vib. Control 2000, 6, 781–800.
7. Papadimitriou, C. Optimal sensor placement methodology for parametric identification of structural systems. J. Sound Vib. 2004, 278, 923–947.
8. Yao, L.; Sethares, W.; Kammer, D. Sensor placement for on-orbit modal identification via a genetic algorithm. AIAA J. 1993, 31, 1922–1928.
9. Chisari, C.; Macorini, L.; Amadio, C.; Izzuddin, B. Optimal sensor placement for structural parameter identification. Struct. Multidiscip. Optim. 2017, 55, 647–662.
10. Kammer, D. Sensor placement for on-orbit modal identification and correlation of large space structures. J. Guid. Control Dyn. 1991, 14, 251–259.
11. Eftekhar Azam, S.; Mariani, S. Dual estimation of partially observed nonlinear structural systems: A particle filter approach. Mech. Res. Commun. 2012, 46, 54–61.
12. Chatzi, E.N.; Fuggini, C. Online correction of drift in structural identification using artificial white noise observations and an unscented Kalman filter. Smart Struct. Syst. 2015, 16, 295–328.
13. Eftekhar Azam, S.; Chatzi, E.; Papadimitriou, C. A dual Kalman filter approach for state estimation via output-only acceleration measurements. Mech. Syst. Signal Process. 2015, 60, 866–886.
14. Capellari, G.; Eftekhar Azam, S.; Mariani, S. Towards real-time health monitoring of structural systems via recursive Bayesian filtering and reduced order modelling. Int. J. Sustain. Mater. Struct. Syst. 2015, 2, 27–51.
15. Ghanem, R.; Spanos, P. Polynomial chaos in stochastic finite elements. J. Appl. Mech. 1990, 57, 197–202.
16. Hansen, N. The CMA evolution strategy: A comparing review. In Towards a New Evolutionary Computation. Advances on Estimation of Distribution Algorithms; Springer: Berlin, Germany, 2006; pp. 75–102.
17. Katafygiotis, L.; Beck, J. Updating models and their uncertainties. II: Model identifiability. J. Eng. Mech. 1998, 124, 463–467.
18. Lindley, D. On a measure of the information provided by an experiment. Ann. Math. Stat. 1956, 27, 986–1005.
19. Raiffa, H.; Schlaifer, R. Applied Statistical Decision Theory; Wiley-Interscience: Hoboken, NJ, USA, 1961.
20. Chaloner, K.; Verdinelli, I. Bayesian experimental design: A review. Stat. Sci. 1995, 10, 273–304.
21. De Groot, M. Uncertainty, information, and sequential experiments. Ann. Math. Stat. 1962, 33, 404–419.
22. Kullback, S.; Leibler, R. On information and sufficiency. Ann. Math. Stat. 1951, 22, 79–86.
23. Kullback, S. Statistics and Information Theory; Courier Corporation: Chelmsford, MA, USA, 1959.
24. Bernardo, J.M. Expected information as expected utility. Ann. Stat. 1979, 7, 686–690.
25. Schervish, M.J. Theory of Statistics; Springer: Berlin, Germany, 2012.
26. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423.
27. Papadimitriou, C.; Lombaert, G. The effect of prediction error correlation on optimal sensor placement in structural dynamics. Mech. Syst. Signal Process. 2012, 28, 105–127.
28. Yuen, K.V. Bayesian Methods for Structural Dynamics and Civil Engineering; John Wiley & Sons: Hoboken, NJ, USA, 2010.
29. Capellari, G.; Chatzi, E.; Mariani, S. Parameter identifiability through information theory. In Proceedings of the 2nd ECCOMAS Thematic Conference on Uncertainty Quantification in Computational Sciences and Engineering (UNCECOMP), Rhodes Island, Greece, 15–17 June 2017.
30. Capellari, G.; Chatzi, E.; Mariani, S. Optimal design of sensor networks for damage detection. Procedia Eng. 2017, 199, 1864–1869.
31. Khoshnevisan, M.; Bhattacharya, S.; Smarandache, F. Utility of choice: An information theoretic approach to investment decision-making. arXiv, 2002; arXiv:math/0212134.
32. Parnell, G.; Driscoll, P.; Henderson, D. Decision Making in Systems Engineering and Management; John Wiley & Sons: Hoboken, NJ, USA, 2011; Volume 81.
33. Capellari, G.; Chatzi, E.; Mariani, S. Cost-Benefit Optimization of Sensor Networks for SHM Applications. Proceedings 2018, 2, 132.
34. Ryan, K. Estimating expected information gains for experimental designs with application to the random fatigue-limit model. J. Comput. Graph. Stat. 2003, 12, 585–603.
35. Huan, X.; Marzouk, Y. Simulation-based optimal Bayesian experimental design for nonlinear systems. J. Comput. Phys. 2013, 232, 288–317.
36. Capellari, G.; Chatzi, E.; Mariani, S. An optimal sensor placement method for SHM based on Bayesian experimental design and Polynomial Chaos Expansion. In Proceedings of the VII European Congress on Computational Methods in Applied Sciences and Engineering, Crete, Greece, 5–10 June 2016; Volume 3, pp. 6272–6282.
37. Eftekhar Azam, S.; Mariani, S.; Attari, N. Online damage detection via a synergy of proper orthogonal decomposition and recursive Bayesian filters. Nonlinear Dyn. 2017, 89, 1489–1511.
38. Eftekhar Azam, S.; Mariani, S. Online damage detection in structural systems via dynamic inverse analysis: A recursive Bayesian approach. Eng. Struct. 2018, 159, 28–45.
39. Wiener, N. The homogeneous chaos. Am. J. Math. 1938, 60, 897–936.
40. Xiu, D.; Karniadakis, G. The Wiener–Askey polynomial chaos for stochastic differential equations. SIAM J. Sci. Comput. 2002, 24, 619–644.
41. Xiu, D.; Lucor, D.; Su, C.H.; Karniadakis, G. Stochastic modeling of flow-structure interactions using generalized polynomial chaos. J. Fluids Eng. 2002, 124, 51–59.
42. Xiu, D.; Karniadakis, G. Modeling uncertainty in flow simulations via generalized polynomial chaos. J. Comput. Phys. 2003, 187, 137–167.
43. Capellari, G.; Chatzi, E.; Mariani, S. Structural Health Monitoring Sensor Network Optimization through Bayesian Experimental Design. ASCE-ASME J. Risk Uncertain. Eng. Syst. Part A Civ. Eng. 2018, 4, 04018016.
44. Le Maître, O.; Knio, O.M. Spectral Methods for Uncertainty Quantification: With Applications to Computational Fluid Dynamics; Springer: Berlin, Germany, 2010.
45. Ghanem, R.; Spanos, P. Stochastic Finite Elements: A Spectral Approach; Courier Corporation: London, UK, 2003.
46. Herzog, M.; Gilg, A.; Paffrath, M.; Rentrop, P.; Wever, U. Intrusive versus non-intrusive methods for stochastic finite elements. In From Nano to Space; Springer: Berlin, Germany, 2008; pp. 161–174.
47. Ghiocel, D.M.; Ghanem, R.G. Stochastic finite-element analysis of seismic soil–structure interaction. J. Eng. Mech. 2002, 128, 66–77.
48. Le Maitre, O.; Reagan, M.; Najm, H.; Ghanem, R.; Knio, O. A stochastic projection method for fluid flow: II. Random process. J. Comput. Phys. 2002, 181, 9–44.
49. Berveiller, M.; Sudret, B.; Lemaire, M. Stochastic finite element: A non intrusive approach by regression. Eur. J. Comput. Mech. 2006, 15, 81–92.
50. Blatman, G. Adaptive Sparse Polynomial Chaos Expansions for Uncertainty Propagation and Sensitivity Analysis. Ph.D. Thesis, Blaise Pascal University, Clermont-Ferrand, France, 2009.
51. Blatman, G.; Sudret, B. Adaptive sparse polynomial chaos expansion based on least angle regression. J. Comput. Phys. 2011, 230, 2345–2367.
52. Pearson, K. On lines and planes of closest fit to systems of points in space. Lond. Edinb. Dublin Philos. Mag. J. Sci. 1901, 2, 559–572.
53. Hotelling, H. Analysis of a complex of statistical variables into principal components. J. Educ. Psychol. 1933, 24, 417.
54. Liang, Y.; Lee, H.; Lim, S.; Lin, W.; Lee, K.; Wu, C. Proper Orthogonal Decomposition and Its Applications, Part I: Theory. J. Sound Vib. 2002, 252, 527–544.
55. Wu, C.; Liang, Y.; Lin, W.; Lee, H.; Lim, S. A note on equivalence of proper orthogonal decomposition methods. J. Sound Vib. 2003, 265, 1103–1110.
56. Karhunen, K. Über lineare Methoden in der Wahrscheinlichkeitsrechnung. Ann. Acad. Sci. Fenn. Math. 1947, 37, 1–79.
57. Loeve, M. Asymptotical Study of Dependent Random Variables. Ph.D. Thesis, Universite de Paris, Paris, France, 1941.
58. Kosambi, D. Statistics in function space. J. Indian Math. Soc. 1948, 7, 76–88.
59. Mees, A.; Rapp, P.; Jennings, L. Singular-value decomposition and embedding dimension. Phys. Rev. A 1987, 36, 340–346.
60. Eftekhar Azam, S. Online Damage Detection in Structural Systems; Springer: Berlin, Germany, 2014.
61. Capellari, G.; Eftekhar Azam, S.; Mariani, S. Hybrid Reduced-Order Modeling and Particle-Kalman Filtering for the Health Monitoring of Flexible Structures. Available online: https://www.researchgate.net/publication/269045554_Hybrid_Reduced-Order_Modeling_and_Particle-Kalman_Filtering_for_the_Health_Monitoring_of_Flexible_Structures (accessed on 11 June 2018).
62. Mirzazadeh, R.; Eftekhar Azam, S.; Jansen, E.; Mariani, S. Uncertainty quantification in polysilicon MEMS through on-chip testing and reduced-order modelling. In Proceedings of the 18th International Conference on Thermal, Mechanical and Multi-Physics Simulation and Experiments in Microelectronics and Microsystems (EuroSimE), Dresden, Germany, 2–5 April 2017; pp. 1–8.
63. Mirzazadeh, R.; Eftekhar Azam, S.; Mariani, S. Mechanical Characterization of Polysilicon MEMS: A Hybrid TMCMC/POD-Kriging Approach. Sensors 2018, 18, 1243.
64. Hansen, N.; Müller, S.D.; Koumoutsakos, P. Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES). Evol. Comput. 2003, 11, 1–18.
65. Hansen, N.; Ostermeier, A.; Gawelczyk, A. On the adaptation of arbitrary normal mutation distributions in evolution strategies: The generating set adaptation. In Proceedings of the 6th International Conference on Genetic Algorithms, Pittsburgh, PA, USA, 15–19 July 1995; pp. 57–64.
66. Hansen, N.; Ostermeier, A. Completely derandomized self-adaptation in evolution strategies. Evol. Comput. 2001, 9, 159–195.
67. Hansen, N. The CMA evolution strategy: A tutorial. arXiv, 2016; arXiv:1604.00772.
68. Eftekhar Azam, S. Dual Estimation and Reduced Order Modelling of Damaging Structures. Ph.D. Thesis, Politecnico di Milano, Milano, Italy, 2012.
69. Capellari, G.; Eftekhar Azam, S.; Mariani, S. Damage detection in flexible plates through reduced-order modeling and hybrid particle-Kalman filtering. Sensors 2016, 16, 2.
70. Barbella, G. Frequency Domain Analysis of Slender Structural Systems under Turbulent Wind Excitation. Ph.D. Thesis, Politecnico di Milano, Milano, Italy, 2009.
71. Barbella, G.; Perotti, F.; Simoncini, V. Block Krylov subspace methods for the computation of structural response to turbulent wind. Comput. Methods Appl. Mech. Eng. 2011, 200, 2067–2082.
72. Capellari, G.; Chatzi, E.; Mariani, S. Optimal sensor placement through Bayesian experimental design: Effect of measurement noise and number of sensors. Proceedings 2017, 1, 41.
73. Gossen, H. The Laws of Human Relations and the Rules of Human Action Derived Therefrom; MIT Press: Cambridge, MA, USA, 1983.
74. Lee, R.W.; Kulesz, J.J. A risk-based sensor placement methodology. J. Hazard. Mater. 2008, 158, 417–429.
75. Marbukh, V.; Sayrafian-Pour, K. Mobile sensor networks self-organization for system utility maximization: Work in progress. In Proceedings of the Fifth International Conference on Wireless and Mobile Communications (ICWMC ’09), Nice, France, 23–27 July 2009; pp. 416–419.
76. Tan, L.; Zhang, Y. Optimal resource allocation with principle of equality and diminishing marginal utility in wireless networks. Wirel. Pers. Commun. 2015, 84, 671–693.
Figure 1. Flowchart of the proposed procedure. CMA-ES: covariance matrix adaptation evolution strategy; FE: finite element.
Figure 2. Structural details of the Pirelli Tower: (a) 3D view and (b) plan representation.
Figure 3. Contour plot of $\bar{U}(n_y, \sigma)$, and curves representing the budget constraints $B = C(\sigma, n_y)$, with $B_1 = 2000$, $B_2 = 2500$, and $B_3 = 3000$.
Figure 4. Optimal sensor placement and physical quantity to be measured, with $n_y = 10$ and $\sigma = 10^{-8}$.
Figure 5. Pareto fronts of the structural health monitoring (SHM) sensor network optimization problem, for different values of the standard deviation $\sigma$.
Figure 6. Contour plot of $UCI(n_y, \sigma) = \bar{U}(\mathbf{d}, n_y, \sigma) / C(n_y, \sigma)$, with (a) $C_0 = 500$ and (b) $C_0 = 1000$. UCI: utility–cost index.
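As a rough illustration of how a utility–cost index surface like that of Figure 6 could be scanned over candidate designs, the following Python sketch evaluates $UCI = \bar{U}/C$ on a grid of sensor numbers and noise levels. It is a minimal sketch under loudly stated assumptions: `expected_information_gain` is a toy placeholder standing in for the PCE/PCA-based Monte Carlo estimator of the expected information gain, and the linear cost model in `total_cost` (fixed cost plus per-sensor cost) is hypothetical and not taken from the paper.

```python
import numpy as np

# --- Hypothetical stand-ins (NOT the paper's models) -----------------------
def expected_information_gain(n_y: int, sigma: float) -> float:
    """Toy surrogate for the expected Shannon information gain U(n_y, sigma):
    it saturates as sensors are added and shrinks as measurement noise grows."""
    return n_y / (n_y + 5.0) / (1.0 + 1.0e6 * sigma)

def sensor_unit_cost(sigma: float) -> float:
    """Assumed per-sensor cost, larger for more accurate (lower-noise) sensors."""
    return 50.0 + 5.0e-6 / sigma

def total_cost(n_y: int, sigma: float, C0: float) -> float:
    """Assumed network cost model: fixed deployment cost plus per-sensor costs."""
    return C0 + n_y * sensor_unit_cost(sigma)

# --- UCI = U / C over a grid of candidate designs ---------------------------
n_sensors = np.arange(1, 31)            # candidate numbers of sensors n_y
noise_std = np.logspace(-8, -4, 25)     # candidate noise standard deviations
C0 = 500.0                              # fixed cost, cf. Figure 6a

uci = np.array([[expected_information_gain(n, s) / total_cost(n, s, C0)
                 for s in noise_std] for n in n_sensors])

i, j = np.unravel_index(np.argmax(uci), uci.shape)
print(f"max UCI at n_y = {n_sensors[i]}, sigma = {noise_std[j]:.1e}")
```

In the actual framework, the inner evaluation would be replaced by the surrogate-based estimator of the expected information gain and by the adopted cost model; the grid scan is only meant to mirror how the contour plots of Figure 6 are read.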
Table 1. Definition of parameters $\theta$ (see Figure 2) and related prior probability density function (pdf) $p(\theta)$.

Position                          Physical Quantity          Prior pdf
20th floor left columns (LC)      Young's modulus E (GPa)    U(24, 36)
20th floor right columns (RC)     Young's modulus E (GPa)    U(24, 36)
20th floor left beams (LB)        Young's modulus E (GPa)    U(24, 36)
20th floor right beams (RB)       Young's modulus E (GPa)    U(24, 36)
20th floor central beams (CB)     Young's modulus E (GPa)    U(24, 36)
20th floor central beams (CB)     Beam thickness t (m)       U(0.7, 0.9)
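For completeness, the short Python sketch below shows how prior samples of the parameter vector implied by Table 1 could be drawn, as needed for a Monte Carlo estimate of the expected information gain. The parameter labels, ordering, sample size, and random seed are illustrative assumptions; only the uniform bounds come from Table 1.

```python
import numpy as np

rng = np.random.default_rng(0)  # illustrative seed

# Uniform prior bounds from Table 1, one entry per component of theta
prior_bounds = {
    "E_LC (GPa)": (24.0, 36.0),   # 20th floor left columns
    "E_RC (GPa)": (24.0, 36.0),   # 20th floor right columns
    "E_LB (GPa)": (24.0, 36.0),   # 20th floor left beams
    "E_RB (GPa)": (24.0, 36.0),   # 20th floor right beams
    "E_CB (GPa)": (24.0, 36.0),   # 20th floor central beams
    "t_CB (m)":   (0.7, 0.9),     # 20th floor central beams, thickness
}

n_samples = 10_000  # Monte Carlo sample size (illustrative)
theta = np.column_stack([rng.uniform(lo, hi, size=n_samples)
                         for lo, hi in prior_bounds.values()])
print(theta.shape)  # (10000, 6): one row per prior sample of theta
```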
