A Time Series Autoencoder for Load Identification via Dimensionality Reduction of Sensor Recordings

Rosafalco, Luca; Manzoni, Andrea; Corigliano, Alberto; Mariani, Stefano

doi:10.3390/ecsa-7-08255

¹ Dipartimento di Ingegneria Civile ed Ambientale, Politecnico di Milano, Piazza L. da Vinci 32, 20133 Milano, Italy

² MOX, Dipartimento di Matematica, Politecnico di Milano, Piazza L. da Vinci 32, 20133 Milano, Italy

* Author to whom correspondence should be addressed.

† Presented at the 7th International Electronic Conference on Sensors and Applications, 15–30 November 2020; Available online: https://ecsa-7.sciforum.net/.

Eng. Proc. 2020, 2(1), 34; https://doi.org/10.3390/ecsa-7-08255

Published: 2 December 2020

(This article belongs to the Proceedings of 7th International Electronic Conference on Sensors and Applications)

Abstract

Current progress in sensor technology is paving the way toward satisfactory solutions to challenging engineering problems, such as system identification and Structural Health Monitoring (SHM). In civil engineering, SHM is often based on the analysis of vibrational recordings, represented by time histories of displacements and/or accelerations, collected through pervasive sensor networks and shaped as Multivariate Time Series (MTS). Despite the great advances in soft computing techniques such as neural networks, inverse problems featuring regression tasks on raw vibrational measurements are still challenging. Developing dimensionality reduction tools, able to infer complex correlations within and across the recorded time series, is therefore of paramount importance. In this work, we designed an AutoEncoder (AE) capable of condensing MTS-shaped data into a reduced format featuring only a few latent variables. The obtained reduced data representation facilitates the solution of inverse problems, such as the identification of the parameters governing the dynamic load applied to a structural system. Numerical examples, aimed at the identification of the loading conditions on a shear-type building, are reported to assess the effectiveness of the proposed procedure.

Keywords: load identification; time series analysis; autoencoders; deep learning

1. Introduction

Data collected by pervasive sensor networks have to be processed, since they are usually unmanageable in their raw form. Their dimension is the principal obstacle to their use, while their information content is typically highly redundant. Synthetic features such as spectral peak frequencies, usually exploited when the acquired data are shaped as Time Series (TS), are extracted to solve engineering tasks, like load identification and Structural Health Monitoring (SHM) [1,2]. Deep Learning (DL) allows features to be extracted from the data according to the required task, avoiding any preliminary feature design [3,4,5,6]. Among DL techniques, AutoEncoders (AEs) are a special type of Neural Network (NN) able to obtain a reduced data representation [7], also called latent representation, without specifying the task the reduced data representation must be used for.

The NN architecture employed by an AE is usually deep or, in other words, involves multiple sequential transformations. The advantages of employing AEs are manifold: (i) no feature engineering is necessary; (ii) the obtained reduced data representation can be used for different tasks; (iii) for a given number of latent variables, they provide the most informative data representation or, at least, the one that allows the data to be reconstructed best. Thanks to their reduced number, latent variables are often interpretable, but only at the price of knowing something about what lies behind the variability of the collected data [8].

In the following, a novel TS AE is proposed for the dimensionality reduction of the pseudo-experimental Multivariate Time Series (MTS) recordings of the displacement response of a two-storey shear building. The effectiveness of the dimensionality reduction is judged by the ability of the AE to reconstruct the input signals from their latent representation. Despite the lack of any task-oriented feature engineering performed a priori, the obtained reduced data representation allows the identification of the load conditions applied to the building.

2. Methodology: A Deep Autoencoder for Load Identification

A Neural Network (NN) is a collection of units, called neurons. Each neuron performs, in its basic form, a linear combination of its input $V \in \mathbb{R}^{L}$ (which reads $v_{b}$ for the AE input channels, see below) via a weight vector $\omega$, and applies a nonlinear activation function $\zeta$. If a set of $L$ neurons, called a layer, is applied to $V$, the output becomes a vector $U(V, \Omega) \in \mathbb{R}^{L}$, where $\Omega = [\omega_{1}, \dots, \omega_{L}]$. Many layers can be stacked one after another, making the NN architecture deep.

A special type of NN layer is the convolutional one, which allows correlations to be inferred within and across the inputs, whenever the inputs are shaped as a collection of one-dimensional arrays. In this work, the inputs are MTSs $V = [v_{1}, \dots, v_{N}] \in \mathbb{R}^{L \times N}$ acquired by a sensor system employing $N$ sensors, each sampling $L$ displacement recordings within a time interval $(0, T)$. The output $U(V, \Omega) = [u_{1}, \dots, u_{N^{\mathrm{out}}}]$ of a one-dimensional convolutional layer then reads

$$u_{n}(V, \Omega_{n}) = \sum_{b=1}^{N} \omega_{n}^{b} * v_{b}, \quad n = 1, \dots, N^{\mathrm{out}}, \qquad (1)$$

where:

$* : (\mathbb{R}^{H^{\mathrm{out}}} \times \mathbb{R}^{L}) \to \mathbb{R}^{L}$ is the discrete convolution operator [9];

$\Omega_{n} = [\omega_{n}^{1}, \dots, \omega_{n}^{N}] \in \mathbb{R}^{H^{\mathrm{out}} \times N}$ are the weights applied to $v_{b}$ (with $b = 1, \dots, N$);

$\Omega = [\Omega_{1}, \dots, \Omega_{N^{\mathrm{out}}}] \in \mathbb{R}^{H^{\mathrm{out}} \times N \times N^{\mathrm{out}}}$ collects all the layer weights;

$H^{\mathrm{out}}$ is the kernel dimension; $N$ also represents the number of channels of the input layer;

$N^{\mathrm{out}}$ is the number of channels of the output layer.
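As an illustration of Equation (1), the following is a minimal NumPy sketch of a one-dimensional convolutional layer, not the authors' implementation. It assumes zero padding so that each convolution returns $L$ samples (consistent with the operator mapping into $\mathbb{R}^{L}$), and it omits the bias and the activation function $\zeta$. The function name `conv1d_layer` and the array layout of `Omega` are hypothetical choices made for this example.

```python
import numpy as np

def conv1d_layer(V, Omega):
    """Apply Equation (1): u_n = sum_b omega_n^b * v_b.

    V     : array of shape (L, N), one column per input channel v_b.
    Omega : array of shape (H, N, N_out), kernel omega_n^b stored in Omega[:, b, n].
    Returns U of shape (L, N_out).
    """
    L, N = V.shape
    H, N_in, N_out = Omega.shape
    if N_in != N:
        raise ValueError("Omega and V disagree on the number of input channels")
    U = np.zeros((L, N_out))
    for n in range(N_out):          # loop over output channels
        for b in range(N):          # sum over input channels
            # mode="same" keeps L samples (assumed zero padding at the ends)
            U[:, n] += np.convolve(V[:, b], Omega[:, b, n], mode="same")
    return U

# Usage with the dimensions of this work (L = 250, N = 2) and random weights
rng = np.random.default_rng(0)
V_demo = rng.normal(size=(250, 2))
Omega_demo = rng.normal(size=(5, 2, 8))   # H = 5 and N_out = 8 are arbitrary here
U_demo = conv1d_layer(V_demo, Omega_demo)
```

Each output channel mixes all the input channels, which is how the layer can capture correlations across sensors as well as along time.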

One-dimensional convolutional layers are the building blocks of the proposed AE. The latter is composed of an encoder $\mathrm{enc}$ and a decoder $\mathrm{dec}$. The encoder maps the input $V$ into a latent representation $z = z(V) \in \mathbb{R}^{P}$, with $P \ll (L \times N)$, while the decoder maps $z$ into a two-dimensional array $U = U(z) \in \mathbb{R}^{L \times N}$. Since $U$ has the same shape as $V$, we can force the AE to reconstruct $V$ from $z$ by defining

$$c(V, U) = \sum_{l=1}^{L} \sum_{n=1}^{N} (v_{ln} - u_{ln})^{2}, \qquad (2)$$

as the loss function to be minimised during training, which consists of tuning the weights $\Omega$ ruling the layer operations.
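The shapes involved and the loss of Equation (2) can be sketched as follows. This is only an illustration: the encoder and decoder below are plain linear maps with random weights (`W_enc`, `W_dec`, both hypothetical), standing in for the convolutional encoder and decoder, whose actual architecture is not reproduced here.

```python
import numpy as np

def reconstruction_loss(V, U):
    """Equation (2): sum of squared differences over samples and channels."""
    return float(np.sum((V - U) ** 2))

rng = np.random.default_rng(0)
L, N, P = 250, 2, 4                      # P << L * N

# Stand-in linear encoder/decoder (assumption, not the convolutional AE)
W_enc = rng.normal(scale=0.01, size=(P, L * N))
W_dec = rng.normal(scale=0.01, size=(L * N, P))

def enc(V):
    """Map V in R^{L x N} to a latent vector z in R^P."""
    return W_enc @ V.reshape(-1)

def dec(z):
    """Map z in R^P back to an array shaped like V."""
    return (W_dec @ z).reshape(L, N)

V = rng.normal(size=(L, N))
z = enc(V)
U = dec(z)
loss = reconstruction_loss(V, U)         # the quantity minimised during training
```

With untrained weights the loss is large; training would adjust the weights so that `dec(enc(V))` approaches `V`.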

The latent representation $z$ can be used to solve a regression problem involving the identification of a parameter vector $\eta \in \mathbb{R}^{Q}$, e.g., governing the loads applied to the structure. If the decoder can (almost perfectly) reconstruct $V$ starting from $z$, then $z$ condenses all the relevant information in $V$. As shown in Figure 1, an NN-based regression model $r$ is employed to retrieve $\eta$ from $z$, thus accomplishing the load identification task. To train $r$, a loss function $c_{r}(\eta, u_{r})$ is defined as in Equation (2), where $u_{r} \in \mathbb{R}^{Q}$ is the prediction of $r$. The AE and $r$ are trained sequentially, first minimising $c(V, U)$ and then minimising $c_{r}(\eta, u_{r})$. A popular first-order stochastic gradient descent algorithm, called Adam [10], has been employed for both training tasks.
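To make the second training stage concrete, the sketch below applies the Adam update rule [10] to a squared-error loss of the same form as $c_{r}$. It rests on strong simplifications that are assumptions of this example: the regression model is linear, $u_{r} = zW$, instead of an NN; the latent vectors `Z` and targets `Eta` are synthetic; the full gradient is used rather than mini-batches; and the step size and iteration count are arbitrary.

```python
import numpy as np

def adam_minimise(grad_fn, theta0, lr=0.05, beta1=0.9, beta2=0.999,
                  eps=1e-8, steps=2000):
    """Minimise a function with the Adam update rule, given its gradient."""
    theta = theta0.astype(float).copy()
    m = np.zeros_like(theta)             # first-moment estimate
    v = np.zeros_like(theta)             # second-moment estimate
    for t in range(1, steps + 1):
        g = grad_fn(theta)
        m = beta1 * m + (1.0 - beta1) * g
        v = beta2 * v + (1.0 - beta2) * g ** 2
        m_hat = m / (1.0 - beta1 ** t)   # bias corrections
        v_hat = v / (1.0 - beta2 ** t)
        theta -= lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta

# Synthetic latent vectors and targets (hypothetical data, P = 4, Q = 2)
rng = np.random.default_rng(0)
n, P, Q = 64, 4, 2
Z = rng.normal(size=(n, P))
W_true = np.array([[1.0, -2.0], [0.5, 1.5], [-1.0, 0.8], [2.0, -0.6]])
Eta = Z @ W_true

def c_r(W):
    """Squared-error loss, summed over samples and parameters."""
    return float(np.sum((Eta - Z @ W) ** 2))

def grad_c_r(W):
    """Gradient of c_r with respect to the weights W."""
    return -2.0 * Z.T @ (Eta - Z @ W)

W0 = np.zeros((P, Q))
W_fit = adam_minimise(grad_c_r, W0)
```

In the sequential procedure described above, the same kind of loop would first be run on the AE weights with the loss $c(V, U)$, and only afterwards on the weights of $r$ with the encoder kept fixed.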

3. Results and Discussion

The lateral displacements of a two-storey building, shown in Figure 2, are monitored by a sensor system employing two sensors (one per floor), recording $L$ samples within the time window $(0, T)$. The output of the monitoring system is therefore an MTS $V \in \mathbb{R}^{L \times N}$, with $L = 250$ and $N = 2$. The dynamic response of the structure is simulated by means of a two-dimensional shear building model wherein, due to the mass distribution and the load-bearing elements, torsional effects have been disregarded. Damping has not been modelled, as it has a negligible effect on the identification of continuously excited structures [11,12]. We assumed that the applied lateral loads consist of forces applied at the floor levels, featuring a sinusoidal time dependence, ruled by the parameter $\phi$, and a linearly increasing amplitude along the building height, governed by the parameter $\alpha$, i.e., $A_{c} = 0.5\, c\, \alpha \sin(2 \pi \phi t)$ with $c = 1, 2$. Therefore, the parameter vector $\eta = \{\alpha, \phi\}$ is sufficient to fully describe the loading conditions. A uniform probability density function was associated with each parameter: $U_{\alpha}(\alpha) = \frac{1}{(6.25 - 0.625) \cdot 10^{3}\ \mathrm{N}}$ for $\alpha$, and $U_{\phi}(\phi) = \frac{1}{(15 - 1)\ \mathrm{Hz}}$ for $\phi$. Regarding the structural properties of the building, the same values of mass $m = 625$ ton and interstorey stiffness $k = 10^{6}$ kN/m have been assumed for the two floors. Consequently, the resonance frequencies of the building are $f^{\mathrm{str}} = [3.93, 10.3]$ Hz, while the structural periods are $T^{\mathrm{str}} = [0.255, 97.1 \cdot 10^{-3}]$ s.
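The resonance frequencies quoted above can be checked with a short eigenvalue computation. The sketch assumes the standard mass and stiffness matrices of a two-storey shear building clamped at the base, which is an assumption of this example since the matrices are not written out in the text; the variable names are hypothetical.

```python
import numpy as np

m = 625e3          # floor mass in kg (625 ton)
k = 1e9            # interstorey stiffness in N/m (1e6 kN/m)

# Assumed shear-type model: equal floor masses, fixed base
M = np.diag([m, m])
K = np.array([[2.0 * k, -k],
              [-k,       k]])

# Solve K x = omega^2 M x; M is a multiple of the identity, so M^{-1} K is symmetric
omega_sq = np.linalg.eigvalsh(np.linalg.solve(M, K))   # ascending order
f_str = np.sqrt(omega_sq) / (2.0 * np.pi)               # natural frequencies in Hz
T_str = 1.0 / f_str                                     # structural periods in s

print(f_str, T_str)
```

Under these assumptions the computed frequencies agree with the reported $f^{\mathrm{str}}$ to the stated precision, and both lie inside the sampled range of $\phi$ (1 Hz to 15 Hz), so near-resonant load cases are present in the dataset.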

A dataset collecting 12,000 MTSs has been assembled to train the AE and $r$; 4000 additional MTSs, forming the validation set, have then been employed to avoid overfitting. The training dataset is processed several times, each pass being called an epoch. If the loss function computed on the validation set has not decreased for 50 epochs in a row, training is stopped early. A test set, gathering 512 MTSs, has then been employed to verify the reconstruction capacity of the AE and the performance of the proposed load identification procedure. The reconstruction capacity has been evaluated through two error measures, employing either a standardised $L^{2}$ norm or a standardised $L^{\infty}$ norm. The error measures have been computed for each reconstructed signal, and standardisation has been done by dividing the reconstruction error (either the $L^{2}$ or the $L^{\infty}$ norm) by the standard deviation of the original signal. Without standardisation, small inaccuracies in reconstructing large displacements would have counted more than large inaccuracies at smaller scales.
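The two standardised error measures and the early-stopping rule can be sketched as below. The discrete norms are taken as the plain Euclidean and maximum norms of the error vector, without any scaling by the number of samples; this is an assumption, since the exact discretisation is not stated. The function names and the demo signal are hypothetical.

```python
import numpy as np

def standardised_errors(v, u):
    """Return the L2 and L-infinity norms of v - u, each divided by std(v)."""
    v = np.asarray(v, dtype=float)
    u = np.asarray(u, dtype=float)
    err = v - u
    sigma = np.std(v)                          # standard deviation of the original signal
    l2 = np.linalg.norm(err, 2) / sigma
    linf = np.linalg.norm(err, np.inf) / sigma
    return l2, linf

def early_stopping_epoch(val_losses, patience=50):
    """Return the epoch at which training stops, or None if it never does."""
    best = np.inf
    epochs_without_improvement = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best:
            best = loss
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
        if epochs_without_improvement >= patience:
            return epoch
    return None

# Demo on one synthetic channel with L = 250 samples
t = np.linspace(0.0, 1.0, 250)
v_demo = np.sin(2.0 * np.pi * 4.0 * t)
u_demo = v_demo + 0.1                          # reconstruction with a constant offset
l2_demo, linf_demo = standardised_errors(v_demo, u_demo)
```

Dividing by the standard deviation makes the two measures insensitive to the overall amplitude of the signal, so cases with very different values of $\alpha$ can be compared on the same scale.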

A thorough investigation has been carried out to study how the number $P$ of latent variables and the parameter $\phi$, ruling the time dependence of the loading, affect the reconstruction capacity of the AE; conversely, no correlation between the reconstruction error and $\alpha$ has been found in our experiments. Indeed, the mean value and the spread of the reconstruction error cannot be modelled as a function of $\alpha$, but rather as a function of $\phi$. Figure 3 and Figure 4 depict the reconstruction error measured, respectively, by the standardised $L^{2}$ and $L^{\infty}$ norms, when the input signals are taken from the test set. The graphs for $P = 5$ (not reported for brevity) are analogous to those obtained for $P = 6$, although they show slightly higher values of the reconstruction error. Increasing $P$ does not lead to a monotonic improvement of the AE reconstruction capacity. This is somewhat unexpected: even if a larger $P$ does not allow more information on the system to be retained, a more redundant representation should not be detrimental.

A clear relation between the error and $\phi$ can be observed. Looking at the standardised $L^{2}$ norm, the reconstruction capacity of the AE appears worse when $\phi \approx f_{1}^{\mathrm{str}}$ and $\phi \approx f_{2}^{\mathrm{str}}$. This result is not surprising: the beats produced in the displacement recordings, when $\phi$ is close to the structural frequencies of the building, are additional signal characteristics that the AE struggles to account for. Focusing on the standardised $L^{\infty}$ norm, the reconstruction error is still large for $\phi \approx f_{2}^{\mathrm{str}}$, while it gets smaller for $\phi \approx f_{1}^{\mathrm{str}}$.

In Figure 5, a qualitative assessment of the reconstruction capacity of the AE is reported, to better highlight the meaning of the two error norms: the good signal reconstruction obtained for $\phi \lessapprox f_{1}^{\mathrm{str}}$ points toward the $L^{\infty}$ norm as the more appropriate error measure. On the other hand, we are convinced that both error measures give meaningful information, because the standardised $L^{2}$ norm addresses inaccuracies in reproducing the frequency content of the input signal, while the standardised $L^{\infty}$ norm highlights the inability to capture its peaks. Still referring to Figure 5, we observe that the amplitude of the signal in Figure 5a is an order of magnitude greater than the one in Figure 5b, despite $\alpha = 702$ N in the first case and $\alpha = 4341$ N in the second. The reason is that we are exciting an undamped dynamic system with $\phi$ closer to $f_{1}^{\mathrm{str}}$ in Figure 5a than to $f_{2}^{\mathrm{str}}$ in Figure 5b.
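The amplitude argument can be illustrated with the forced part of the undamped response, $x_{p}(t) = (K - \omega^{2} M)^{-1} F \sin(\omega t)$ with $\omega = 2 \pi \phi$. This is only a sketch under several assumptions: the shear-type matrices with fixed base are assumed as standard; the load vector follows $A_{c} = 0.5\, c\, \alpha$; the two values of $\phi$ (3.8 Hz and 9.5 Hz) are hypothetical, chosen slightly below the two structural frequencies because the actual values used in Figure 5 are not reported; and the free-vibration part responsible for the beats is ignored.

```python
import numpy as np

m, k = 625e3, 1e9                      # kg, N/m
M = np.diag([m, m])
K = np.array([[2.0 * k, -k],
              [-k,       k]])          # assumed fixed-base shear building

def forced_amplitude(alpha, phi):
    """Amplitude of the forced response per floor, in metres."""
    omega = 2.0 * np.pi * phi
    F = 0.5 * alpha * np.array([1.0, 2.0])       # load amplitudes at floors c = 1, 2
    return np.abs(np.linalg.solve(K - omega ** 2 * M, F))

# alpha values from the text; phi values are hypothetical
amp_a = forced_amplitude(alpha=702.0, phi=3.8)   # phi slightly below f_1
amp_b = forced_amplitude(alpha=4341.0, phi=9.5)  # phi slightly below f_2

print(amp_a, amp_b)
```

With these illustrative frequencies the forced amplitude at the second floor is markedly larger in the first case, even though its load amplitude $\alpha$ is about six times smaller, which is in line with the observation made on Figure 5.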

On the basis of the obtained latent representation $z$, we performed the regression of the parameters $\eta$ governing the loading conditions. As shown in Figure 6b, the regression of the load frequency $\phi$ has been rather successfully accomplished: the graph has been obtained with the latent space dimension featuring the highest reconstruction capacity, i.e., $P = 4$. An analogous result has been obtained for the regression of the load amplitude $\alpha$, shown in Figure 6a, confirming that the proposed strategy, involving dimensionality reduction of the input and the use of a regression model, allows a correct load identification for the case at hand. It is also worth mentioning that the largest errors in the $\phi$ prediction have been obtained for the frequency range featuring the highest reconstruction error in the $L^{\infty}$ norm.

4. Conclusions

The use of a time series autoencoder was proposed for the dimensionality reduction of sensor recordings, typically acquired for the SHM of civil structures. Thanks to the obtained latent representation, the regression of the parameters governing the loading conditions can be successfully carried out. Two error norms have been used to quantitatively assess the signal reconstruction capacity of the autoencoder, evaluated for different dimensions of the latent space. The capability of the autoencoder to reconstruct the input signals has also been assessed qualitatively, by comparing the input and reconstructed signals in the less accurate cases.

In future work, we aim to understand the role of the latent space dimension in the autoencoder reconstruction capacity, and to investigate how to set it automatically and optimally.

Acknowledgments

The authors warmly thank Matteo Torzoni for valuable discussions about several aspects of this research.

References

1. Farrar, C.R.; Worden, K. An introduction to structural health monitoring. Philos. Trans. R. Soc. Math. Phys. Eng. Sci. 2007, 365, 303–315.
2. Capellari, G.; Chatzi, E.; Mariani, S. Structural Health Monitoring Sensor Network Optimization through Bayesian Experimental Design. ASCE-ASME J. Risk Uncertain. Eng. Syst. Part A Civ. Eng. 2018, 4, 04018016.
3. Entezami, A.; Shariatmadar, H.; Mariani, S. Fast unsupervised learning methods for structural health monitoring with large vibration data from dense sensor networks. Struct. Health Monit. 2020, 19, 1685–1710.
4. Entezami, A.; Shariatmadar, H.; Mariani, S. Early damage assessment in large-scale structures by innovative statistical pattern recognition methods based on time series modeling and novelty detection. Adv. Eng. Softw. 2020, 150, 102923.
5. Entezami, A.; Sarmadi, H.; Behkamal, B.; Mariani, S. Big Data Analytics and Structural Health Monitoring: A Statistical Pattern Recognition-Based Approach. Sensors 2020, 20, 2328.
6. Rosafalco, L.; Manzoni, A.; Mariani, S.; Corigliano, A. Fully convolutional networks for structural health monitoring through multivariate time series classification. Adv. Model. Simul. Eng. Sci. 2020, 7, 38.
7. Bengio, Y.; Lamblin, P.; Popovici, D.; Larochelle, H. Greedy Layer-Wise Training of Deep Networks. In Proceedings of the 19th International Conference on Neural Information Processing Systems (NIPS’06), Vancouver, Canada, 4–9 December 2006; MIT Press: Cambridge, MA, USA, 2006; pp. 153–160.
8. Locatello, F.; Tschannen, M.; Bauer, S.; Rätsch, G.; Schölkopf, B.; Bachem, O. Disentangling Factors of Variations Using Few Labels. In Proceedings of the 8th International Conference on Learning Representations (ICLR), Addis Ababa, Ethiopia, 26–30 April 2020.
9. Hu, J.; Shen, L.; Sun, G. Squeeze-and-Excitation Networks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 7132–7141.
10. Kingma, D.; Ba, J. Adam: A Method for Stochastic Optimization. In Proceedings of the International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015; pp. 1–13.
11. Corigliano, A.; Mariani, S. Parameter identification in explicit structural dynamics: Performance of the extended Kalman filter. Comput. Methods Appl. Mech. Eng. 2004, 193, 3807–3835.
12. Eftekhar Azam, S.; Mariani, S.; Attari, N. Online damage detection via a synergy of proper orthogonal decomposition and recursive Bayesian filters. Nonlinear Dyn. 2017, 89, 1489–1511.

Figure 1. Proposed procedure for the regression of $\eta$ on $z$. First (black part), the AE is trained by minimising $c(V, U)$; next (orange part), $r$ is trained by minimising $c_{r}(\eta, u_{r})$. TS AE stands for Time Series AutoEncoder.

Figure 2. Two-storey building: applied load and monitored displacements.

Figure 3. Effects of $\phi$ and $P$ on the signal reconstruction error, which is measured via the standardised $L^{2}$ norm and is computed with the second floor displacements belonging to the test set. The plots in Figure 3a–d have been obtained for $P = 2, 3, 4, 6$, respectively.

Figure 4. Effects of $\phi$ and $P$ on the signal reconstruction error, which is measured via the standardised $L^{\infty}$ norm and is computed with the second floor displacements belonging to the test set. The plots in Figure 4a–d have been obtained for $P = 2, 3, 4, 6$, respectively.

Figure 5. Reconstructed signal (orange lines) using $P = 4$ and input signal (grey lines) belonging to the test set, for (a) $\phi \lessapprox f_{1}^{\mathrm{str}}$ and (b) $\phi \lessapprox f_{2}^{\mathrm{str}}$.

Figure 6. Outcome of the regression of $\alpha$, Figure 6a, and of $\phi$, Figure 6b, for the test set: the ground-truth values of $\alpha$ and $\phi$ are reported along the horizontal axis, while the predicted values are along the vertical axis.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
