Next Article in Journal
Modelling the Influence of the 2.4 GHz Electromagnetic Field on the User of a Wearable Internet of Things (IoT) Device for Monitoring Hazards in the Work Environment
Previous Article in Journal
Reducing Food Waste with a Tiny CMOS-MEMS Gas Sensor, Dubbed GMOS
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

A Time Series Autoencoder for Load Identification via Dimensionality Reduction of Sensor Recordings †

1
Dipartimento di Ingegneria Civile ed Ambientale, Politecnico di Milano, Piazza L. da Vinci 32, 20133 Milano, Italy
2
MOX, Dipartimento di Matematica, Politecnico di Milano, Piazza L. da Vinci 32, 20133 Milano, Italy
*
Author to whom correspondence should be addressed.
Presented at the 7th International Electronic Conference on Sensors and Applications, 15–30 November 2020; Available online: https://ecsa-7.sciforum.net/.
Eng. Proc. 2020, 2(1), 34; https://doi.org/10.3390/ecsa-7-08255
Published: 2 December 2020
(This article belongs to the Proceedings of 7th International Electronic Conference on Sensors and Applications)

Abstract

:
Current progress in sensor technology is setting the ground to push toward satisfactory solutions to challenging engineering problems, like e.g., system identification and Structural Health Monitoring (SHM). In civil engineering, SHM is often based on the analysis of vibrational recordings, represented by time histories of displacements and/or accelerations, collected through pervasive sensor networks and shaped as Multivariate Time Series (MTS). Despite the great advances in soft computing techniques such as neural networks, inverse problems featuring regression tasks on raw vibrational measurements are still challenging. Developing dimensionality reduction tools, able to infer complex correlations within and across the recorded time series, is then of paramount importance. In this work, we designed an AutoEncoder (AE) capable of condensing MTS-shaped data in a reduced format featuring a few latent variables only. The obtained reduced data representation enhances the solution of inverse problems, like e.g., the identification of the parameters governing the dynamic load applied to a structural system. Numerical examples, aimed at the identification of the loading conditions on a shear-type building, are reported to assess the effectiveness of the proposed procedure.

1. Introduction

Data collected by pervasive sensor networks have to be processed, since they are usually unmanageable in their raw forms. Their dimension is the principal obstacle making their use extremely difficult, while the information content is typically highly redundant. Synthetic features like spectral peak frequencies, usually exploited when the acquired data are shaped as Time Series (TS), are extracted to solve engineering tasks, like load identification and Structural Health Monitoring (SHM) [1,2]. Deep Learning (DL) allows extracting features from the data according to the required task, avoiding any preliminar feature design [3,4,5,6]. Among DL techniques, AutoEncoders (AEs) are special type of Neural Networks (NN) able to obtain a reduced data representation [7], also called latent representation, without specifying the task the reduced data representation must be used for.
The NN architecture employed by an AE is usually deep or, in other words, involves the use of multiple sequential transformations. The advantages of employing AEs are manifold: (i) no feature engineering is necessary; (ii) the obtained reduced data representation can be used for different tasks; (iii) they provide the most informative data representation by setting the number of latent variables or, at least, the one that allows to reconstruct data at best. Thanks to their reduced number, latent variables are often interpretable, but only at the price of knowing something about what stays behind the variability of the collected data [8].
In the following, a novel TS AE is proposed for the dimensionality reduction of the pseudo-experimental Multivariate Time Series (MTSs) recordings related to the displacement response of a two-storey shear building. The effectiveness of the dimensionality reduction is judged by the AE ability of reconstructing the input signals from their latent representation. Despite the lack of any a priori performed task-oriented feature engineering, the obtained reduced data representation allows the identification of the load conditions applied to the building.

2. Methodology: A Deep Autoencoder for Load Identification

A Neural Network (NN) is a collection of units, called neurons. Each neuron performs, in its basic form, a linear combination of its input V R L (which reads v b for the AE input channels, see below) via a weight vector ω , and applies a nonlinear activation function ζ . If a set of L neurons, called layer, is applied to V , the output becomes a vector U V , Ω R L , where Ω = ω 1 , , ω L . Many layers can be stacked one after another, making the NN architecture deep.
A special type of NN layer is the convolutional one, which allows to infer correlations within and across the inputs, whenever the inputs are shaped as a collection of one-dimensional arrays. In this work, the inputs are a set of MTSs v = v 1 , , v N R L × N acquired by a sensor system employing N sensors, and sampling L displacement recordings within a time interval 0 , T . The output U V , Ω = u 1 , , u N o u t of a one-dimensional convolutional layer then reads
u n V , Ω n = b = 1 N ω n b v b , n = 1 , , N o u t ,
where: : R H o u t × R L R L is the discrete convolution operator [9]; Ω n = ω n 1 , , ω n N R H o u t × N are the weights applied to v b (with b = 1 , , N ); Ω = Ω 1 , , Ω N o u t R H o u t × N × N o u t collects all the layer weights; H o u t is the kernel dimension; N also represents the number of channels of the input layer; N o u t is the number of channels of the output layer.
One-dimensional convolutional layers are the building blocks of the proposed AE. This latter is composed by an encoder e n c and by a decoder d e c . The encoder maps the input V into a latent representation z = z V R P , with P L × N , while the decoder maps z into a two dimensional array U = U z R L × N . Being U shaped as V , we can enforce the AE to reconstruct V from z by defining
c V , U = l = 1 L n = 1 N v l n u l n 2 ,
as loss function to be minimised by the NN during the training, which consists in tuning the weights Ω ruling the layer operations.
The latent representation z can be used to solve a regression problem, involving the identification of the parameter vector η R Q e.g., governing the loadings applied to the structure. If the decoder can (almost perfectly) reconstruct V starting from z , it means that z condenses all the relevant informations of V . As shown in Figure 1, a NN-based regression model r is employed to retrieve η starting from z , accomplishing this way the load identification task. To train r, a loss function c r η , u r is defined as done in Equation (2), where u r R Q is the prediction of r. The training of the AE and of r takes place sequentially, first minimising c V , U , and then minimising c r η , u r . A popular first-order stochastic gradient descend algorithm, called Adam [10], has been employed for these procedure tasks.

3. Results and Discussion

The lateral displacements of a two-storey building, shown in Figure 2, are monitored by a sensor system employing two sensors (one per floor), recording L samples within the time window 0 , T . Then, the output of the monitoring system is an MTS V R L × N , with L = 250 and N = 2 . The dynamic response of the structure is simulated by means of a two-dimensional shear building model wherein, due to the mass distribution and load bearing elements, torsional effects have been disregarded. Damping has not been modelled, having a negligible effect on the identification of continuously excited structures [11,12]. We assumed that the applied lateral loads consist of forces enforced at the floor levels, featuring a sinusoidal time dependence, ruled by the parameter ϕ , and a linearly increasing amplitude along the building height, governed by the parameter α , i.e. A c = 0.5c α sin 2 π ϕ t with c = 1 , 2 . Therefore, the parameter vector η = { α , ϕ } looks sufficient to fully describe the loading conditions. A uniform probability density function was associated with each parameter: U α α = 1 6.25 0.625 10 3 N for α , and U ϕ ϕ = 1 15 1 Hz for ϕ . Regarding the structural properties of the building, the same values of mass m = 625 ton and interstory stiffness k = 10 6 kN m have been assumed for the two floors. Consequently, the resonance frequencies of the building are f s t r = 3.93, 10.3 Hz , while the structural periods are T s t r = 0.255, 97.1· 10 3 s .
A dataset, collecting 12 , 000 MTSs, has been assembled to train the AE and r; 4000 additional MTSs, forming the validation set, have been then employed to avoid overfitting. The training dataset is processed several times, or epochs. If the loss function computed with the validation set has not reduced for 50 epochs in a row, the training has been early stopped. A test set, gathering 512 MTSs, has been then employed to verify the reconstruction capacity of the AE, and the performance of the proposed load identification procedure. The reconstruction capacity has been evaluated through two error measures, employing either a standardised L 2 norm or a standardised L norm. The error measures have been computed for each reconstructed signal, and standardisation has been done by dividing the reconstruction error (either the L 2 or L norm) by the standard deviation of the original signal. Without standardisation, small inaccuracies in reconstructing large displacements would have counted more than large inaccuracies at smaller scales.
A thorough investigation has been carried out to study how the number P of latent variables and the parameter ϕ ruling the time dependence of loading, affect the reconstruction capacity of the AE; the other way around, no correlation between the reconstruction error and α has been found in our experiments. Indeed, the mean value and the spread of the reconstruction error can not be modelled as a function of α , but rather as a function of ϕ . Figure 3 and Figure 4 depict the reconstruction error measured, respectively, by the standardised L 2 and L norms, when the input signals have been taken from the test set. The graphs for P = 5 (not reported for brevity) are analogous to those obtained for P = 6 , even if showing slightly higher values of the reconstruction error. An increasing value of P does not lead to a monotonic enhancement of the AE reconstruction capacity, despite the intuition that a larger latent space should make reconstruction easier. Indeed, even if increasing the value of P has not led to retain more information on the system, we do expect that a more redundant representation should not be detrimental.
A clear relation between the error and ϕ can be underlined. Looking at the standardised L 2 norm, the reconstruction capacity of the AE seems worse when ϕ f 1 s t r and ϕ f 2 s t r . This result is not surprising: the beats produced in the displacement recordings, when ϕ is close to the structural frequencies of the building, are additional signal characteristics that the AE must struggle to account for. Focusing on the standardised L norm, the reconstruction error is still large for ϕ f 2 s t r , while it gets smaller for ϕ f 1 s t r .
In Figure 5, a qualitative assessment of the reconstruction capacity of the AE is reported, to better highlight the meaning of the two error norms: the good signal reconstruction obtained for ϕ f 1 s t r points toward the L norm as a more appropriate error measure. On the other hand, we are convinced that both these error measures give meaningful information, because the standardised L 2 norm addresses inaccuracies in reproducing the frequency content of the input signal, while the standardised L norm highlights the inability of catching its peaks. Still referring to Figure 5, we observe that the amplitude of the signal in Figure 5a is an order of magnitude greater than the one in Figure 5b, despite α = 702 N in the first case, and α = 4341 N in the second case. The reason is that we are exciting an undamped dynamic system with ϕ closer to f 1 s t r in Figure 5a than to f 2 s t r in Figure 5b.
On the basis of the obtained latent representation z , we performed the regression of the parameters η governing the loading conditions. As shown in Figure 6b, the regression of the load frequency ϕ has been rather successfully accomplished: the graph has been obtained with the latent space dimension featuring the highest reconstruction capacity, linked to P = 4 . An analogous result has been obtained for the regression of the load amplitude α , shown in Figure 6b, confirming that the proposed strategy, involving dimensionality reduction of the input and the use of a regression model, allows a correct load identification for the case at hand. It is also worth mentioning that the largest errors in the ϕ prediction have been obtained for the frequency range featuring the highest reconstruction error in the L norm.

4. Conclusions

The use of a time series autoencoder was proposed for the dimensionality reduction of sensor recordings, typically acquired for the SHM of civil structures. Thanks to the obtained latent representation, the regression of the parameters governing the loading conditions can be successfully carried out. Two error norms have been used to quantitatively assess the signal reconstruction capacity of the autoencoder, evaluated for different dimensions of the latent space. The capability of the autoencoder to reconstruct the input signals has been assessed also qualitatively, through comparison of the input and reconstructed signals in the less accurate cases.
In future works, we aim to understand the role of the latent space dimension on the autoencoder reconstruction capacity, and to investigate how to set it automatically and optimally.

Acknowledgments

The authors warmly thank Matteo Torzoni for sharing valuable discussions about several aspects of this research.

References

  1. Farrar, C.R.; Worden, K. An introduction to structural health monitoring. Philos. Trans. R. Soc. Math. Phys. Eng. Sci. 2007, 365, 303–315. [Google Scholar] [CrossRef] [PubMed]
  2. Capellari, G.; Chatzi, E.; Mariani, S. Structural Health Monitoring Sensor Network Optimization through Bayesian Experimental Design. ASCE-ASME J. Risk Uncertain. Eng. Syst. Part A Civ. Eng. 2018, 4, 04018016. [Google Scholar] [CrossRef]
  3. Entezami, A.; Shariatmadar, H.; Mariani, S. Fast unsupervised learning methods for structural health monitoring with large vibration data from dense sensor networks. Struct. Health Monit. 2020, 19, 1685–1710. [Google Scholar] [CrossRef]
  4. Entezami, A.; Shariatmadar, H.; Mariani, S. Early damage assessment in large-scale structures by innovative statistical pattern recognition methods based on time series modeling and novelty detection. Adv. Eng. Softw. 2020, 150, 102923. [Google Scholar] [CrossRef]
  5. Entezami, A.; Sarmadi, H.; Behkamal, B.; Mariani, S. Big Data Analytics and Structural Health Monitoring: A Statistical Pattern Recognition-Based Approach. Sensors 2020, 20, 2328. [Google Scholar] [CrossRef] [PubMed]
  6. Rosafalco, L.; Manzoni, A.; Mariani, S.; Corigliano, A. Fully convolutional networks for structural health monitoring through multivariate time series classification. Adv. Model. Simul. Eng. Sci. 2020, 7, 38. [Google Scholar] [CrossRef]
  7. Bengio, Y.; Lamblin, P.; Popovici, D.; Larochelle, H. Greedy Layer-Wise Training of Deep Networks. In Proceedings of the 19th International Conference on Neural Information Processing Systems (NIPS’06), Vancouver, Canada, 4–9 December 2006; MIT Press: Cambridge, MA, USA, 2006; pp. 153–160. [Google Scholar]
  8. Locatello, F.; Tschannen, M.; Bauer, S.; Rätsch, G.; Schölkopf, B.; Bachem, O. Disentangling Factors of Variations Using Few Labels. In Proceedings of the 8th International Conference on Learning Representations (ICLR), Addis Ababa, Ethiopia, 26–30 April 2020. [Google Scholar]
  9. Hu, J.; Shen, L.; Sun, G. Squeeze-and-Excitation Networks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 7132–7141. [Google Scholar] [CrossRef]
  10. Kingma, D.; Ba, J. Adam: A Method for Stochastic Optimization. In Proceedings of the International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015; pp. 1–13. [Google Scholar]
  11. Corigliano, A.; Mariani, S. Parameter identification in explicit structural dynamics: Performance of the extended Kalman filter. Comput. Methods Appl. Mech. Eng. 2004, 193, 3807–3835. [Google Scholar] [CrossRef]
  12. Eftekhar Azam, S.; Mariani, S.; Attari, N. Online damage detection via a synergy of proper orthogonal decomposition and recursive Bayesian filters. Nonlinear Dyn. 2017, 89, 1489–1511. [Google Scholar] [CrossRef]
Figure 1. Proposed procedure for the regression of η on z . First (black part), the AE is trained by minimising c V , U ; next (orange part), r is trained by miminising c r η , u r . TS AE stays for Time Series AutoEncoder.
Figure 1. Proposed procedure for the regression of η on z . First (black part), the AE is trained by minimising c V , U ; next (orange part), r is trained by miminising c r η , u r . TS AE stays for Time Series AutoEncoder.
Engproc 02 00034 g001
Figure 2. Two-storey building: applied load and monitored displacements.
Figure 2. Two-storey building: applied load and monitored displacements.
Engproc 02 00034 g002
Figure 3. Effects of ϕ and P on the signal reconstruction error, which is measured via the standardised L 2 norm and is computed with the second floor displacements belonging to the test set. The plots in Figure 3a–d have been obtained for P = 2 , 3 , 4 , 6 respectively.
Figure 3. Effects of ϕ and P on the signal reconstruction error, which is measured via the standardised L 2 norm and is computed with the second floor displacements belonging to the test set. The plots in Figure 3a–d have been obtained for P = 2 , 3 , 4 , 6 respectively.
Engproc 02 00034 g003
Figure 4. Effects of ϕ and P on the signal reconstruction error, which is measured via the standardised L norm and is computed with the second floor displacements belonging to the test set. The plots in Figure 4a–d have been obtained for P = 2 , 3 , 4 , 6 respectively.
Figure 4. Effects of ϕ and P on the signal reconstruction error, which is measured via the standardised L norm and is computed with the second floor displacements belonging to the test set. The plots in Figure 4a–d have been obtained for P = 2 , 3 , 4 , 6 respectively.
Engproc 02 00034 g004
Figure 5. Reconstructed signal (orange lines) using P = 4 and input signal (grey lines) belonging to the test set, for (a) ϕ f 1 s t r and (b) ϕ f 2 s t r .
Figure 5. Reconstructed signal (orange lines) using P = 4 and input signal (grey lines) belonging to the test set, for (a) ϕ f 1 s t r and (b) ϕ f 2 s t r .
Engproc 02 00034 g005
Figure 6. Outcome of the regression of α , Figure 6a, and of ϕ , Figure 6b, for the test set: the ground-truth values of α and ϕ are reported along the horizontal axis, while the predicted values are along the vertical axis.
Figure 6. Outcome of the regression of α , Figure 6a, and of ϕ , Figure 6b, for the test set: the ground-truth values of α and ϕ are reported along the horizontal axis, while the predicted values are along the vertical axis.
Engproc 02 00034 g006
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Rosafalco, L.; Manzoni, A.; Corigliano, A.; Mariani, S. A Time Series Autoencoder for Load Identification via Dimensionality Reduction of Sensor Recordings. Eng. Proc. 2020, 2, 34. https://doi.org/10.3390/ecsa-7-08255

AMA Style

Rosafalco L, Manzoni A, Corigliano A, Mariani S. A Time Series Autoencoder for Load Identification via Dimensionality Reduction of Sensor Recordings. Engineering Proceedings. 2020; 2(1):34. https://doi.org/10.3390/ecsa-7-08255

Chicago/Turabian Style

Rosafalco, Luca, Andrea Manzoni, Alberto Corigliano, and Stefano Mariani. 2020. "A Time Series Autoencoder for Load Identification via Dimensionality Reduction of Sensor Recordings" Engineering Proceedings 2, no. 1: 34. https://doi.org/10.3390/ecsa-7-08255

Article Metrics

Back to TopTop