Attention Mechanism-Driven Sensor Placement Strategy for Structural Health Monitoring

Kim, Joo-Wang; Torzoni, Matteo; Corigliano, Alberto; Mariani, Stefano

doi:10.3390/ecsa-9-13354

Open AccessProceeding Paper

Attention Mechanism-Driven Sensor Placement Strategy for Structural Health Monitoring^†

Dipartimento di Ingegneria Civile e Ambientale, Politecnico di Milano, Piazza L. da Vinci 32, 20133 Milano, Italy

^*

Author to whom correspondence should be addressed.

^†

Presented at the 9th International Electronic Conference on Sensors and Applications, 1–15 November 2022; Available online: https://ecsa-9.sciforum.net/.

Eng. Proc. 2022, 27(1), 43; https://doi.org/10.3390/ecsa-9-13354

Published: 1 November 2022

(This article belongs to the Proceedings of The 9th International Electronic Conference on Sensors and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Automated vibration-based structural health monitoring (SHM) strategies have been recently proven to be promising in the presence of aging and material deterioration threatening the safety of civil structures. Within such a framework, ensuring high-quality and informative data is a critical aspect that is highly dependent on the deployment of the sensors in the network and on their capability to provide damage-sensitive features to be exploited. This paper presents a novel data-driven approach to the optimal sensor placement devised to identify sensor locations that maximize the information effectiveness for SHM purposes. The optimization of the sensor network is addressed by means of a deep neural network (DNN) equipped with an attention mechanism, a state-of-the-art technique in natural language processing (NLP) that is useful in focusing on a limited number of important components in the information stream. The trained attention mechanism eventually allows for quantifying the relevance of each sensor in terms of the so-called attention scores, thereby enabling to identify the most useful input channels to solve the relevant downstream SHM task. With reference to the damage localization task, framed here as a classification problem handling a set of predefined damage scenarios, the DNN is trained to locate damage on labeled data that had been simulated to emulate the effects of damage under different operational conditions. The capabilities of the proposed method are demonstrated by referring to an eight-story shear building, characterized by damage states possibly located at any story and of unknown severity.

Keywords:

attention mechanism; optimal sensor placement; sensor networks; structural health monitoring; deep learning; damage identification

1. Introduction

Civil structures such as buildings, highways, tunnels and bridges are the backbone of our modern society [1]. Aging and ever-increasing extreme loading conditions threaten such systems, stressing the need of SHM strategies to detect and identify any deviation from the damage-free state, ultimately allowing for reducing maintenance costs and avoiding potential tragic events.

Traditionally, the condition assessment of civil structures is carried out through nondestructive testing and visual inspection, which can provide only local health assessment and highly depend on personal expertise. Inservice remote vibration-based SHM is instead a standard and widespread approach for the continuous and automated global health monitoring (see, e.g., [2,3]), allowing for assessing damage from the vibration response in terms of, e.g., acceleration or displacement multivariate time series acquired with pervasive sensing systems [4,5]. As these SHM techniques rely on their capability to extract damage-sensitive features from raw sensor recordings, ensuring a satisfactory quality and the informativeness of recorded data is a critical aspect. Besides the limited amount of available sensors due to installation costs, the optimization of sensor deployment in the network is a key aspect in order to maximize the information effectiveness for SHM purposes.

The optimal sensor placement (OSP) problem was systematically addressed in the literature; for an overview, interested readers can refer to [6]. Notable contributions in this field have been achieved by means of the Fisher information matrix and its related metrics [7,8], information entropy [9,10], and the value of information [11,12].

This work proposes a novel approach to the OSP, leveraging on data-driven methods empowered by deep learning (DL) algorithms. Its key component is the use of an attention mechanism [13,14] in a neural network, trained in a supervised fashion to resolve an SHM task by exploiting structural response data from a set of feasible sensor locations. Besides allowing for addressing the considered SHM task, the trained DNN also enables to identify the most useful input channels by assigning an attention score to each sensor.

The use of DL in SHM is very effective in automatizing the feature engineering stage required to improve the effectiveness of a damage detection strategy. Indeed, DL allows for automatizing the selection and extraction of optimized damage-sensitive features through an end-to-end learning processes, to ultimately relate them with the corresponding structural states. Nevertheless, supervised techniques require labeled data referring to the possible damage states of the structure that cannot be obtained for real civil structures. To address this, we resorted to a simulation-based approach (see, e.g., [15,16]), by adopting the physics-based model of the structure to be monitored, allowing for systematically simulating the effect of damage on the structural response under different operational conditions.

The proposed methodology was investigated through the virtual monitoring of an eight-story shear building, with reference to the damage localization task. The latter was framed as a classification problem involving a set of predefined damage states, possibly located at any story. The obtained results confirm the capabilities of the proposed approach in terms of both damage localization and optimal sensor placement.

2. SHM Methodology

The proposed methodology is detailed as follows: the composition of the training set is specified in Section 2.1; the working principle of the attention mechanism is described in Section 2.2; the setup of the proposed OSP approach is explained in Section 2.3.

2.1. Datasets Definition

Considering an observation time window

(0, T)

that is short enough to assume invariant operational and damage conditions, a training set

D

is assembled by collecting vibrational data from a virtual sensing network deployed to feature

N_{u}

feasible sensor locations, and a sampling period

Δ t

. The training set

D

is built from the assembly of I instances as follows:

D = {(U_{i}, y_{i})}_{i = 1}^{I},

(1)

with each instance consisting of vibrational time histories

U_{i} = U (x_{i}, y_{i}, δ_{i}) = {[u_{1}, \dots, u_{N_{u}}]}_{i} \in R^{N_{u} \times L}

shaped as

N_{u}

arrays of

L = 1 + T / Δ t

measurements. This was obtained from a numerical model of the structure to be monitored for the corresponding

N_{par}

input parameters

x_{i} \in R^{N_{par}}

defining the operational conditions (for instance the loadings acting on the structure), and for the relevant damage state characterized by

y_{i}

and

δ_{i}

, with

y_{i} \in {0, \dots, Y}

that labels the specific damage scenario that the structure undergoes while collecting the i-th instance from among a set of predefined Y damage states, each referring to a different damage location, and with

y_{i} = 0

identifying the damage-free baseline. Damage was modeled as a selective reduction in the material stiffness of amplitude

δ_{i}

, taking place within the predesignated region associated to

y_{i}

. In this work,

x_{i}

and

δ_{i}

were not considered to be part of the label, as only the damage localization task is addressed. To populate

D

, the parametric input space was assumed to display a uniform probability distribution for each parameter, and it was sampled via the latin hypercube rule. Unless necessary, index i is dropped in the remainder of the paper for ease of notation.

2.2. Attention Mechanism for Data Analytics in SHM

In the neural network community, attention is a mechanism to mimic the cognitive attention behavior that is useful in adaptively focusing on a few but important components of the data stream. This is achieved by means of learnable weights optimized through gradient descent algorithms that can change at runtime as a function of the input data. Originally proposed for neural machine translation problems [13], attention is a state-of-the-art technique in NLP. The main reason behind its popularity is that it allows for coding the data stream into a series of embeddings and learning how to adaptively choose a subset of them, thus preventing early information from becoming lost, as is often the case when processing long sequences with sequence-to-sequence recurrent encoder–decoders.

The corresponding working principle can be described as mapping a query and a set of key-value pairs to an output, computed as a weighted sum of the values, and weights assigned by a compatibility function of the query with the corresponding key. Queries, keys, and values can be obtained in several ways, and most often are the output of previous layers in the neural network. In this work, the scaled dot-product attention introduced in [14] was employed as an effective and efficient form of self-attention. The input consisted of a set of

m_{Q}

queries

Q \in R^{m_{Q} \times d_{Q}}

of length

d_{Q}

, of a set of

m_{K}

keys

K \in R^{m_{K} \times d_{K}}

of length

d_{K}

, and of a set of

m_{K}

values

V \in R^{m_{K} \times d_{V}}

of length

d_{V}

. The output of the scaled dot-product attention was computed as follows:

A (Q W_{Q}, K W_{K}, V W_{V}) = Softmax (\frac{\bar{Q} {\bar{K}}^{⊤}}{\sqrt{s_{K}}}) \bar{V}, A (Q, K, V) \in R^{m_{Q} \times s_{V}},

(2)

where:

R^{m_{Q} \times s_{K}} ∋ \bar{Q} = Q W_{Q}

,

R^{m_{K} \times s_{K}} ∋ \bar{K} = K W_{K}

, and

R^{m_{K} \times s_{V}} ∋ \bar{V} = V W_{V}

are the projections of queries, keys, and values, respectively, onto different subspaces spanned by learnable matrices

W_{Q} \in R^{d_{Q} \times s_{K}}

,

W_{K} \in R^{d_{K} \times s_{K}}

and

W_{V} \in R^{d_{V} \times s_{V}}

; the scaled dot-product in brackets is the previously mentioned compatibility function measuring the alignment of each query with each key; the

Softmax

function serves to obtain a set of weights on the values, which are the so-called attention score, summing to 1 for each query.

There are only a few contributions in the SHM literature exploiting attention techniques; see e.g., [17,18,19]. However, to the best of our knowledge, this is the first application explicitly using the attention scores to address the OSP problem. In particular, each attention score is exploited to assess the informativeness of the corresponding sensor for the downstream damage location task. That is, attention is applied across a fictitious sensor dimension, comprised by the set of feasible sensor locations, deprived of any geometrical notion of spatial location.

2.3. Attention-Mechanism-Driven Sensor Placement

The DNN adopted to address the OSP problem for damage localization purposes is reported in Figure 1. The architecture is composed of two main branches, namely, the query branch and the key/value branch. The former takes vibrational recordings

U

from all the available channels and runs them through three one-dimensional (1D) convolutional units, each comprising a

ReLU

-activated 1D convolutional layer and a max pooling layer. The resulting output is then passed through a fully connected layer to obtain a query

Q \in R^{1 \times d_{Q}}

, representing the current structural response. The key/value branch instead consists of a stack of

N_{u}

base neural networks operating in parallel, each receiving input data

u_{j}

from the corresponding j-th sensor, with

j = 1, \dots, N_{u}

, but all sharing the same set of tuneable parameters. The base neural network features three

ReLU

-activated 1D convolutional layers and a normalization layer. The output of each base neural network is a key

k_{j} \in R^{d_{K}}

, with

m_{K} = N_{u}

, that coincides with the associated value

v_{j} \in R^{d_{V} = d_{K}}

, and accounts for sensor-specific damage-sensitive features extracted by acting separately on each input channel.

Query

Q

and keys

K

then feed the scaled dot-product attention to compute the attention scores and the attention module output

V \in R^{1 \times s_{V}}

, according to Equation (3). Each query can be interpreted as: “where should I look for to most sensitive answer to the damage localization problem given the current structural behavior?” Similarly, we can look at the keys as “the possible answers to the query, i.e., sensor locations, with good answers aligned to the question, i.e., high attention score, and bad answers orthogonal to it, i.e., low attention score”.

The remainder of the DNN architecture simply addresses the downstream damage localization task, and consists of a normalization layer and of two fully connected layers; the first is

ReLU

-activated, while the second is activated by a

Softmax

function, which is the standard choice for classification problems.

During training, the set of tuneable weights parametrizing the DNN is optimized by minimizing the categorical cross-entropy between the predicted and target label classes using the Adam algorithm, which is a first-order stochastic gradient descent optimizer. Once the DNN is trained, the OSP is addressed by processing a testing set not seen during training, and by extracting the corresponding attention scores from the attention module. These attention scores can then be used in several ways to rank the sensors according to their relevance to locate the damage. In the present work, we simply computed the mean attention score for each channel under different operational conditions and looked for the channels featuring the highest values.

3. Results: Eight-Story Shear Building

The proposed approach was assessed on the eight-story shear building model depicted in Figure 2a, adapted from [20]. Each story featured a mass

m = 625

t with an interstory stiffness

k_{s h} = 106

kN/m. Structural damping was introduced through a Rayleigh model, accounting for a

1 %

damping ratio on each vibrational mode. By neglecting the axial deformability of the elements, only the horizontal degrees of freedom were considered. The structure was excited by harmonic loads, acting on each floor with the same frequency and phase according to:

p_{j} (t) = \frac{j}{8} P_{0} sin (2 π f t), j = 1, \dots, 8,

(3)

where:

P_{0} \in [2, 3] kN

is the load amplitude;

f \in [0, 13] Hz

is the load frequency sampled in a range including all the natural frequencies of the structure; factor

\frac{j}{8}

shapes a triangular load distribution along the building elevation, with j growing from the bottom. Therefore, the parametrization ruling the operational conditions was based on

x = {P_{0}, f}^{⊤}

.

The possible damage states were defined by a

δ = 25 %

reduction in the corresponding interstory stiffness, with associated labels

y = 1, \dots, 8

from the ground interstory to the roof one, and with

y = 0

labeling the undamaged case.

Displacement time histories

U (x, y, δ) = [u_{1}, \dots, u_{8}]

were recorded from a virtual sensing system made of

N_{u} = 8

sensors, placed at each floor. The recordings were provided for a time interval characterized by

T = 5

s and with a sampling period of

Δ t = 0.01

s, thus consisting of

L = 501

measurements each.

The dataset

D

was assembled from

I = 9999

instances generated for different values of the parameters selected via the latin hypercube sampling rule. Before training, data were polluted by adding an independent identically distributed Gaussian noise featuring a signal-to-noise ratio equal to 100. Moreover, the data were preprocessed via discrete Fourier transform and subsequently standardized to improve the damage localization performance of the DNN.

In terms of damage localization capabilities, the classifier achieved a satisfactory

88.6 %

classification accuracy against a testing set without showing any particular misclassification trend. The obtained results are summarized by the confusion matrix in Figure 2b, characterized by high values along the main diagonal.

Given the good damage localization capability of the DNN, the corresponding attention scores can be considered to be optimized. The average attention score for each input channel is reported in Figure 2c, showing a clear trend with increasing values from the ground to the top floor. This is reasonable, as the response of the upper floors is expected to be more sensitive to damage on a floor below it than that to damage on a floor above it.

4. Conclusions

This paper presented an approach to the optimal sensor placement for structural health monitoring purposes. By relying on deep neural networks, the strength of the procedure stems from the interpretability of the attention scores associated to a set of feasible sensor locations. The method rests on a numerical model of the structure, which is useful in obtaining labeled data pertaining to specific damage conditions. With reference to a damage localization case study, the obtained results showed the capability of the attention mechanism to identify the most informative input channels to locate damage.

Future studies will investigate the proposed method while exploiting multiple attention heads, as dealing with features from different representation subspaces is expected to improve the overall performance. Moreover, the effect of a strong

L^{1}

regularization will also be analyzed with the aim of inducing sparsity in the attention score vector.

Author Contributions

Conceptualization, M.T., A.C. and S.M.; methodology, J.-W.K., M.T. and S.M.; software, J.-W.K. and M.T.; validation, J.-W.K.; formal analysis, J.-W.K., M.T., A.C. and S.M.; investigation, J.-W.K. and M.T.; data curation, J.-W.K.; writing—original draft preparation, M.T.; writing—review and editing, M.T. and S.M.; visualization, M.T.; supervision, A.C. and S.M.; project administration, A.C. and S.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data generated during the study are available from the corresponding author upon reasonable request.

Acknowledgments

The authors would like to thank Luca Rosafalco (Politecnico di Milano) for having provided the numerical model of the shear frame. M.T. acknowledges the financial support by Politecnico di Milano through the interdisciplinary Ph.D. grant “Physics-Informed Deep Learning for Structural Health Monitoring”.

Conflicts of Interest

The authors declare no conflict of interest.

References

Torzoni, M.; Manzoni, A.; Mariani, S. Structural health monitoring of civil structures: A diagnostic framework powered by deep metric learning. Comput. Struct. 2022, 271, 106858. [Google Scholar] [CrossRef]
Rosafalco, L.; Torzoni, M.; Manzoni, A.; Mariani, S.; Corigliano, A. Online structural health monitoring by model order reduction and deep learning algorithms. Comput. Struct. 2021, 255, 106604. [Google Scholar] [CrossRef]
Torzoni, M.; Manzoni, A.; Mariani, S. A Deep Neural Network, Multi-fidelity Surrogate Model Approach for Bayesian Model Updating in SHM. In European Workshop on Structural Health Monitoring; Springer International Publishing: Berlin/Heidelberg, Germany, 2023; pp. 1076–1086. [Google Scholar] [CrossRef]
Rosafalco, L.; Torzoni, M.; Manzoni, A.; Mariani, S.; Corigliano, A. A Self-adaptive Hybrid Model/data-Driven Approach to SHM Based on Model Order Reduction and Deep Learning. In Structural Health Monitoring Based on Data Science Techniques; Springer International Publishing: Berlin/Heidelberg, Germany, 2022; pp. 165–184. [Google Scholar] [CrossRef]
García-Macías, E.; Ubertini, F. Integrated SHM Systems: Damage Detection Through Unsupervised Learning and Data Fusion. In Structural Health Monitoring Based on Data Science Techniques; Springer International Publishing: Berlin/Heidelberg, Germany, 2022; pp. 247–268. [Google Scholar] [CrossRef]
Ostachowicz, W.; Soman, R.; Malinowski, P. Optimization of sensor placement for structural health monitoring: A review. Struct. Health Monit. 2019, 18, 963–988. [Google Scholar] [CrossRef]
Shi, Z.Y.; Law, S.S.; Zhang, L.M. Optimum Sensor Placement for StructuralDamage Detection. J. Eng. Mech. 2000, 126, 1173–1179. [Google Scholar] [CrossRef]
Penny, J.E.T.; Friswell, M.I.; Garvey, S.D. Automatic choice of measurement locations for dynamic testing. AIAA J. 1994, 32, 407–414. [Google Scholar] [CrossRef]
Capellari, G.; Chatzi, E.; Mariani, S. Structural Health Monitoring Sensor Network Optimization through Bayesian Experimental Design. ASCE-ASME J. Risk Uncertain. Eng. Syst. 2018, 4, 04018016. [Google Scholar] [CrossRef]
Capellari, G.; Chatzi, E.; Mariani, S. Cost-benefit optimization of structural health monitoring sensor networks. Sensors 2018, 18, 2174. [Google Scholar] [CrossRef] [PubMed]
Malings, C.; Pozzi, M. Value-of-information in spatio-temporal systems: Sensor placement and scheduling. Reliab. Eng. Syst. 2018, 172, 45–57. [Google Scholar] [CrossRef]
Kamariotis, A.; Chatzi, E.; Straub, D. Value of information from vibration-based structural health monitoring extracted via Bayesian model updating. Mech. Syst. Signal Process. 2022, 166, 108465. [Google Scholar] [CrossRef]
Bahdanau, D.; Kyung Hyun, C.; Bengio, Y. Neural machine translation by jointly learning to align and translate. In Proceedings of theInternational Conference on Learning Representations, San Diego, CA, USA, 7–9 May 2015; Volume 3. [Google Scholar]
Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention is All you Need. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; Volume 30. [Google Scholar]
Taddei, T.; Penn, J.D.; Yano, M.; Patera, A.T. Simulation-based classification; a model-order-reduction approach for structural health monitoring. Arch. Comput. Methods Eng. 2018, 25, 23–45. [Google Scholar] [CrossRef] [Green Version]
Torzoni, M.; Rosafalco, L.; Manzoni, A.; Mariani, S.; Corigliano, A. SHM under varying environmental conditions: An approach based on model order reduction and deep learning. Comput. Struct. 2022, 266, 106790. [Google Scholar] [CrossRef]
Lei, X.; Xia, Y.; Wang, A.; Jian, X.; Zhong, H.; Sun, L. Mutual information based anomaly detection of monitoring data with attention mechanism and residual learning. Mech. Syst. Signal Process. 2023, 182, 109607. [Google Scholar] [CrossRef]
Li, G.; Ma, B.; He, S.; Ren, X.; Liu, Q. Automatic Tunnel Crack Detection Based on U-Net and a Convolutional Neural Network with Alternately Updated Clique. Sensors 2020, 20, 717. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Pan, Y.; Ventura, C.E.; Li, T. Sensor placement and seismic response reconstruction for structural health monitoring using a deep neural network. Bull. Earthq. Eng. 2022, 20, 4513–4532. [Google Scholar] [CrossRef]
Rosafalco, L.; Manzoni, A.; Mariani, S.; Corigliano, A. Fully convolutional networks for structural health monitoring through multivariate time series classification. Adv. Model. Simul. Eng. Sci. 2020, 7, 38. [Google Scholar] [CrossRef]

Figure 1. Scheme of the DNN architecture adopted to address the OSP problem.

Figure 2. Eight-story shear building case study: (a) physics-based numerical model; (b) confusion matrix relevant to the classifier testing; (c) average attention scores for each monitored channel.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kim, J.-W.; Torzoni, M.; Corigliano, A.; Mariani, S. Attention Mechanism-Driven Sensor Placement Strategy for Structural Health Monitoring. Eng. Proc. 2022, 27, 43. https://doi.org/10.3390/ecsa-9-13354

AMA Style

Kim J-W, Torzoni M, Corigliano A, Mariani S. Attention Mechanism-Driven Sensor Placement Strategy for Structural Health Monitoring. Engineering Proceedings. 2022; 27(1):43. https://doi.org/10.3390/ecsa-9-13354

Chicago/Turabian Style

Kim, Joo-Wang, Matteo Torzoni, Alberto Corigliano, and Stefano Mariani. 2022. "Attention Mechanism-Driven Sensor Placement Strategy for Structural Health Monitoring" Engineering Proceedings 27, no. 1: 43. https://doi.org/10.3390/ecsa-9-13354

APA Style

Kim, J.-W., Torzoni, M., Corigliano, A., & Mariani, S. (2022). Attention Mechanism-Driven Sensor Placement Strategy for Structural Health Monitoring. Engineering Proceedings, 27(1), 43. https://doi.org/10.3390/ecsa-9-13354

Article Menu

Attention Mechanism-Driven Sensor Placement Strategy for Structural Health Monitoring^†

Abstract

1. Introduction

2. SHM Methodology

2.1. Datasets Definition

2.2. Attention Mechanism for Data Analytics in SHM

2.3. Attention-Mechanism-Driven Sensor Placement

3. Results: Eight-Story Shear Building

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Attention Mechanism-Driven Sensor Placement Strategy for Structural Health Monitoring †

Abstract

1. Introduction

2. SHM Methodology

2.1. Datasets Definition

2.2. Attention Mechanism for Data Analytics in SHM

2.3. Attention-Mechanism-Driven Sensor Placement

3. Results: Eight-Story Shear Building

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Attention Mechanism-Driven Sensor Placement Strategy for Structural Health Monitoring^†