1. Introduction
With the continuous increase of traffic load, the safety of bridges as important and most complex traffic components has received more attention. The safety of beam end components (including dampers, expansion joints, bearings, etc.) is one of them. Under the influence of long-term external environment (mainly ambient temperature, followed by wind and traffic loads, etc.), the beam end of the bridge will undergo frequent reciprocating longitudinal displacements, which will lead to damage to the beam end components [
1,
2,
3]. Therefore, health monitoring of beam end longitudinal displacement (LDBD) is particularly important for bridge safety assessment. However, when sensor failure, storage failure, transmission failure, or equipment inspection and replacement occur, data loss will occur [
4,
5]. It can be said that data loss is an inevitable problem of bridge health monitoring systems [
6,
7]; especially for health monitoring systems using wireless sensors, data loss is more frequent [
8,
9,
10]. When the data loss rate is too high, the health assessment of the structure cannot be carried out normally, which may lead to serious consequences [
11]. Therefore, this paper reconstructs the missing data of LDBD.
This paper selects suspension bridges as the research object because suspension bridges are large-span bridges with greater flexibility, and their LDBD changes more dramatically. Guo et al. [
12] compared the beam end displacements of a suspension bridge (Jiangyin Yangtze River Bridge, main span 1385 m) and a cable-stayed bridge (Sutong Bridge, main span 1088 m) in my country and found that the fluctuation amplitude and daily accumulation of the former were one order of magnitude larger than those of the latter. Okuda et al. [
13] reported that fatigue cracks appeared in the expansion joint connecting pins of the Akashi Kaikyo Bridge (suspension bridge, main span 1991 m) in Japan after 3 years of use. The expansion joints of the Jiangyin Bridge and Runyang Bridge in my country also showed damage after 3–4 years of use. It can be seen that the damage caused by LDBD in the suspension bridge is greater.
At present, many scholars have reconstructed the missing data of the structural health monitoring system. Niu Yanwei et al. [
14] established a finite element model based on bridge health monitoring data and analyzed the mechanism of induced bridge deformation under different temperature fields. Li et al. [
15] proposed two multi-scale finite element models to reconstruct structural responses. Zhong Guoqiang et al. [
16] established the completed measurement temperature field of the entire bridge based on the finite element model and finite measured temperature data. Wang Xu, Xie Guilin, and others [
17] predicted the long-term vertical displacement of concrete bridges based on meteorological data and an optimized GRU model. However, this method of repairing data by relying on finite element models has extremely high requirements on the accuracy of the model. When finite element modeling is performed on complex building structures, it is difficult to accurately simulate the various uncertainties encountered in actual buildings, such as changes in material properties, structural defects, and changes in boundary constraints during long-term use of the building [
18]. Therefore, the reconstruction of data by finite element models can only be applied to some simple model structures, and is not applicable to actual large-scale building structures. By using intelligent algorithms such as neural networks, connections between different sensors can be directly established based on historical monitoring data, eliminating the complex modeling process. Although the RGU model performs well in terms of computing efficiency and implementation simplification, it is not as good as the LSTM model in terms of long-range dependency capture capabilities, flexibility, empirical support, and adaptability. LSTM solves the problem of gradient vanishing or gradient exploding in RNN neural networks by adding a forgetting mechanism. It has long-term memory and is convenient for time series data analysis [
19,
20]. Compared with traditional time series processing methods (such as ARIMA [
21]), LSTM has good stability and is more effective in dealing with nonlinear problems [
22].
In summary, this paper takes the suspension bridge as the research object. First, the monitoring data related to LDBD, such as temperature and beam end rotation angle data, (RDBD) are analyzed and screened. Then, the temperature data at different positions of the bridge are used as the input of the LSTM model to compare and analyze the prediction effect of LDBD. Next, RDBD is used as the input of the LSTM model to observe the prediction effect of LDBD; then, temperature and RDBD are used as the input of the LSTM model to observe whether the prediction effect of the LSTM model is improved; finally, the prediction method of LDBD is summarized.
3. Case Study
3.1. Analysis of Influencing Factors of Beam End Longitudinal Displacement (LDBD)
First, the end displacement of a suspension bridge is very sensitive to environmental conditions, especially changes in ambient temperature. Studies have shown that changes in the structural static response (TSR) caused by temperature changes sometimes exceed those caused by traffic or other loads [
32]. The effect of temperature on the end displacement of a long-span bridge cannot be ignored [
33]. Consequently, it is possible to use temperature sensors in the bridge health monitoring system to predict LDBD.
Secondly, LDBD may be related to beam end rotation angle data (RDBD). The main beam of a suspension bridge can be regarded as a flexible cube, as shown in
Figure 2. When the rotation angles at both ends change, the longitudinal displacement in the length direction will also change. Therefore, it can be thought that RDBD can be used to predict LDBD.
3.2. Source of Data
The data of this study uses the data of the health monitoring system of Chongqing Wanzhou Fuma Bridge. Chongqing Wanzhou Fuma Bridge is located in the main urban area of Wanzhou. The main span is a single-span 1050 m supported steel box girder suspension bridge. The total length of the bridge is 2030 m. The standard width of the bridge is 26.5 m, and the width of the main bridge plus the wind nozzle is 32 m. Chongqing belongs to the tropical monsoon humid climate, with an average annual temperature of 16~18 °C. Its climate is characterized by hot summers and cool autumns, warm winters and early springs, and four distinct seasons. The highest temperature in a year can exceed 40 °C, and the lowest temperature can drop to below zero. Due to the large temperature difference, the box girder will produce a large deformation, resulting in a large change in the LDBD of the bridge. This requires close monitoring of the beam end displacement to see whether it is within a reasonable range. Therefore, the stability and accuracy of the beam end displacement data monitoring are essential.
Table 1 shows the types, quantities, and corresponding representative symbols of the sensors selected in the structural health monitoring system.
Figure 3 shows the arrangement position of using
Table 1 symbols instead of sensors.
3.3. Initial Data
Due to the symmetry of the bridge, LDBD at one position is selected for research to reduce the workload. This study determines LDBD in the upstream south bank cable tower. Similarly, the temperature sensors at different monitoring locations are chosen from the one closest to the LDBD sensor in the upstream south bank cable tower as a representative for research. Since the anchor room temperature is too poor, it has not been studied. The selected temperature sensor numbers and positions are shown in
Table 2. The RDBD sensor used is the RDBD sensor that is closest to the selected LDBD sensor. The raw data collected by each sensor is shown in
Figure 4. It can be seen that there are a series of problems with the raw data.
(1) Missing values: Since the sensors in the bridge health system work externally for a long time, they are very susceptible to equipment failures caused by external or internal factors, resulting in interruptions in the collected data. As shown in
Figure 4a,e,g, a flat line segment suddenly appears in the curve in the Figure, and the data in this period is missing;
(2) Abnormal values: Since the monitoring of the bridge health monitoring system is long-term, the monitoring data has a certain trend of change, but some data are obviously out of the overall change trend. These data need to be eliminated. As shown in
Figure 4a,e,g,i, a few data points in the curve suddenly drop, which deviates from the overall trend of change.
3.4. Sample Data Preprocessing
(1) Data missing processing
For the part with continuous missing data, it is unrealistic to fill all of them, and the accuracy of such filling is not high, so this part is directly discarded; for the data with a small number of missing data, the moving average method with a window length of 30 is used to fill.
(2) Outlier processing
The 3σ criterion applies to large sample data and provides a clear criterion for judging outliers. When the data error exceeds the range of plus or minus 3 times the standard deviation, these data are considered outliers. Therefore, this study adopts this criterion to identify outliers. After deleting the outliers, the deleted positions are filled with the moving average method with a window length of 30.
(3) Data Frequency Unification
Since different types of sensors collect data at various frequencies, and the input and output of LSTM should correspond one to one, it is necessary to unify the frequency of data collection by different sensors to ensure that the time when the data is generated can correspond.
First, establish a timetable with a time interval of 10 min, and then interpolate the data collected by other sensors based on the time corresponding to this timetable. This study adopts the spline interpolation method, which can produce a smooth interpolation curve and better retain the overall characteristics of the data.
Figure 4b,d,f,h,j,l show the data obtained after preprocessing each sensor data. It can be seen that the outliers and missing values significantly improve after the data is processed.
3.5. Sample Data Denoising
3.5.1. Correlation Analysis and Evaluation Principle
Although the deep learning model is powerful, it cannot find the pattern between two unrelated data. Therefore, before putting the data into the deep learning model training, it is necessary to analyze the correlation coefficient between the beam end displacement and other sensor data to determine whether there is any correlation between the data.
The correlation evaluation model analyzes the correlation between beam end displacement and temperature by linear regression analysis. The fitting equation of linear regression analysis is as follows:
p1 and
p2 are the regression coefficients of the fitting equation,
x is the independent variable, and
y is the dependent variable. The regression coefficients are obtained using the least squares method:
In the formula, xi and yi are the actual values of the independent variable x, and the dependent variable y; and are the average values of the samples of these variables.
The correlation coefficient
R is used to verify the rationality of the calculated linear relationship between the independent variable
x and the dependent variable
y. The value of
R can reflect the linear correlation between the independent and dependent variables, as shown in
Table 3. It can be seen that when the absolute value of
R is closer to 1, the correlation between the independent variable and the dependent variable is stronger. When the absolute value of
R is closer to 0, the correlation between the independent and dependent variables is weaker.
3.5.2. Correlation When the Signal Is Not Denoised
The correlation between temperature at different positions and beam end displacement is evaluated using the above evaluation principle, as shown in
Figure 5. It can be seen from the Figure that the temperature of A-T and B-T have the best correlation with the data of L-D, and the correlation coefficient is above 0.960. Secondly, the temperature of C-T and D-T correlates well with the data of L-D, and the correlation coefficient is about 0.820. Finally, the data of R-D correlate well with L-D. It can be seen from the correlation that L-D decreases as the absolute value of R-D increases, which also confirms that the beam end angle in
Section 3.1 can predict the beam end displacement.
3.5.3. Correlation When the Signal Is Denoised
The health monitoring system is affected by the uncertainty of the monitoring environment and is inevitably affected by various factors, which reduces the monitoring accuracy of the sensor [
11]. As shown in
Figure 6, although the LDBD data is relatively good overall, after magnification, it can be found that there are a lot of “burrs” in the data curve. These “burrs” are the noise in the sensor data collection process. This study uses wavelet denoising [
34] to reduce the noise of the collected signal. Wavelet denoising is an upgrade of the traditional Fourier transform. Wavelet denoising has high efficiency, adaptability, robustness, sparsity, and multi-resolution analysis capabilities and can better identify signal characteristics of different scales and time intervals [
35]. This study selects the wavelet of the Symlets series. The performance of the SymN wavelet is relatively comprehensive, with good orthogonality, biorthogonality, compact support, symmetry, etc. N represents the vanishing moment, and N is taken as 6 in this study. As shown in
Figure 6, the displacement curve of the beam end after denoising by wavelet transform retains the change trend of the original data and becomes smooth.
As shown in
Figure 7, the correlation between the temperature in each area has been improved after noise reduction. The distribution of the displacement of the beam end and other data after noise reduction is more regular. The points after noise reduction are located on both sides of the linear fitting line and overlap to form a spiral curve.
4. Using Temperature as Input to the LSTM Model
4.1. Model Parameter Analysis
The previous article analyzed the correlation between different regions’ temperatures and the beam end’s displacement. The following uses the temperature of the above four areas to predict the displacement of the beam end. To make the model convincing, this study divides the data of the temperature of different regions and the beam end into three parts, training set, validation set, and test set, according to the ratio of 7:1:2. The training set trains the LSTM model, optimizes the parameters of the model according to the prediction of the validation set, and finally uses the test set to predict and test the model after parameter optimization.
The structure of the LSTM neural network, the model training optimization method, and the model parameter optimization method described in
Section 1 are adopted. The specific LSTM parameters and operating environment settings are shown in
Table 4.
Other parameters, such as the number of hidden layers, the number of neurons in the hidden layers, the batch size, the learning rate, and the threshold size of the drop layer, are optimized using Bayesian optimization. The specific implementation steps are as follows:
(1) Determine the parameters to be optimized and set each parameter’s range so that forms the parameter space. Bayesian optimization will randomly generate a set of parameters in this search space to train the LSTM model. After the training is completed, the error between the predicted value and the true value of the model can be obtained by inputting the validation set into the model (this error is used as the objective function of Bayesian optimization). The error is returned to the Gaussian model, and the Gaussian model is corrected;
(2) According to the corrected Gaussian model, the acquisition function will obtain the next most likely parameter combination, and the LSTM model will be trained with this parameter combination to obtain a new error. The latest mistake will update the Gaussian model and the parameter combination;
(3) Repeat the second step. After the set number of optimizations is completed, the parameter combination corresponding to the minimum error of the validation set will be obtained. This combination is the better parameter combination. Finally, the LSTM model is trained, verified, and predicted according to the optimized parameter combination.
According to the model input dimension and output dimension, the search space of the approximate range of the optimal parameters is set, as shown in
Table 5.
4.2. Prediction Effect
4.2.1. Parameter Regulation
The model is established according to the above content. The temperature data of the four regions of the suspension bridge after noise reduction in
Section 3 and the LDBD of the upstream south bank beam end are used as the input and output of the model for training. The model iteration time of each round of Bayesian optimization is 50 times, and 200 rounds of Bayesian optimization are carried out to obtain the optimal parameter combination of each region’s prediction, as shown in
Table 6.
4.2.2. Model Training
The optimized parameters were used to set and train the model. After 100 iterations of training, the training error curve was obtained, as shown in
Figure 8.
In
Figure 8, both the error curve and the verification error curve slowly decline and eventually level off, indicating that the training of each model has converged. However, the bottom spacing of the training and verification curves of
Figure 8c,d is significantly larger than that of
Figure 8a,b, indicating that the regularity between the corresponding temperature and displacement of the latter two is worse than that of the former two. The lowest verification errors are (b), (a), (d), and (c) in
Figure 8, in order from small to large. It can be seen that
Figure 8b has the best training effect.
4.2.3. Predicted Results
After training, the above test set of temperature in different regions was used to predict LDBD, and the predicted results were obtained, as shown in
Figure 9. As can be seen from the Figure, LSTM can use the temperature of each part of the bridge to predict LDBD. However, different parts of the bridge temperature prediction effect are different. To facilitate the comparison of the impact of varying prediction results, statistical analysis was carried out on the evaluation indicators of different prediction results, as shown in
Table 7.
Combining the meaning of the evaluation indicators mentioned in
Section 2 and
Table 7 shows that the best prediction effect is achieved by using the data of B-T to predict LDBD. The values of RMSE and MAE are as low as 0.033 and 0.024, respectively, which are closest to zero; the values of SCC and R2 are 0.963 and 0.958, respectively, which are very close to 1. It can be seen that the prediction effect of using the data of B-T to predict LDBD is very good; secondly, the prediction effect is good when the data of A-T is used to predict LDBD; finally, the prediction effect is relatively poor when the data of C-T and D-T are used to predict LDBD.
5. Using Beam End Rotation Data as Input to the LSTM Model
As shown in
Section 3.1 of this article, if the main beam of a suspension bridge is regarded as a long and flexible cube, the longitudinal displacement in the length direction will also change when the rotation angles at both ends change. However, the rotation angle at the end of the main beam is not limited to one direction. The bridge health monitoring system monitors the beam end’s horizontal and vertical rotation angles. When the beam end rotates in two different directions, as shown in
Figure 10, the longitudinal length of the beam is affected by the superposition of the rotation angles in the two directions.
Therefore, this chapter predicts LDBD through the displacement in these two directions. The prediction model was established using the method in
Section 3, and the hyperparameters were adjusted using the same method, as shown in
Table 8.
The training loss curves and prediction results are obtained after 100 iterations based on the parameter adjustment results in
Table 8, as shown in
Figure 11. The evaluation indicators of this forecast are shown in
Table 9. It can be seen that the prediction results of this time are more excellent than the previous prediction results using temperature as the model input. Although the correlation between two RDBDs and LDBD was only −0.772 and 0.465, which was lower than the correlation coefficient between the temperature and LDBD, the R
2 and SCC of the predicted results were as high as 0.970 and 0.973, respectively. This shows that the two RDBDs as the model’s input can better predict LDBD.
6. Temperature and RDBD as Model Input
It can be seen from the above that both temperature and RDBD can predict LDBD. Temperature causes the expansion and contraction of the main beam, causing its length to change, thus changing LDBD. When the size of the main beam is unchanged, the rotation angle of the beam end can change the longitudinal length of the beam end. It can be seen that the influence of these two effects on LDBD may be superimposed, and the temperature and the rotation angle of the beam end can change LDBD by changing the length and shape of the main beam, respectively. Therefore, in this study, the temperature of the box girder in the span of the main beam and RDBD are used as the model inputs to predict LDBD.
The prediction model was established using the method in
Section 4, and the hyperparameters were adjusted using the same method, as shown in
Table 10:
The model is set using the above hyperparameters. After 100 training iterations, the training loss curve, prediction graph, and prediction result correlation graph are obtained, as shown in
Figure 12. The model’s evaluation indicators are shown in
Table 11.
As can be seen from
Figure 12, the correlation coefficient of the prediction results reaches 0.996, which is very close to 1, indicating that the prediction effect is very good. As shown in
Table 12, the temperature input, beam end angle input, and the model evaluation indexes obtained by both of them as inputs were summarized and compared. It was found that the prediction effect of the model was increased by 3.210% and 1.923%, respectively, when temperature and beam end angle were used as the input of the model, compared with them as separate inputs to the model. This shows that compared with using the temperature or angle of the beam end separately to predict LDBD, combining the two as the input of the model can improve the prediction accuracy of LDBD.
7. Comparison Validation
In order to verify the correctness of the above-mentioned LDBD prediction method, the temperature and RDBD data of the middle span box girder in the health monitoring data of the Xintian Yangtze River Bridge, which is also a suspension bridge, were selected. Their correlation with LDBD data is first analyzed, as shown in
Figure 13. From the figure, we can see that the temperature inside the span box girder and the correlation characteristics of RDBD and LDBD are more consistent with the correlation between the front bridge.
Use the same method to predict LDBD. Obtain the evaluation indexes corresponding to the prediction results of different input conditions. Their summary is shown in
Table 13. From the table, we can see that using B-T and RDBD in the bridge health monitoring system can also make good predictions of LDBD. When combining temperature with RDBD as input to the LSTM model, the prediction accuracy of LDBD can be improved. This verifies the reliability of the aforementioned LDBD prediction method.
Through the cases of the two suspension bridges, we can see that the prediction error of LDBD in this paper is within 2% when combined with RDBD as the prediction method of the LSTM model. Compared with Hui Wang et al. [
36] the prediction error of LDBD was 8.1% when using the full-size test method; the error of the method in this paper is reduced by nearly 75%.
8. Conclusions
By analyzing the influencing factors of LDBD, temperature data from different locations in RDBD and health monitoring systems were selected. The original data was processed to improve data quality. Combined with the LSTM model, different input values were used to predict LDBD, and a comparative analysis was performed. The optimal input for LDBD prediction was analyzed. The main conclusions are as follows:
(1) Through data denoising, the correlation between temperature, RDBD, and LDBD can be improved, which is convenient for model learning and training.
(2) In the prediction of LDBD by temperature at different locations, the prediction effect is from best to worst: the temperature in the mid-span box girder at the beam end (B-T), the temperature at the mid-span guardrail (A-T), the saddle cover at the top of the upstream south bank tower (C-T), and the cable tower at the upstream south bank (D-T).
(3) Although the correlation between the RDBD in different directions and LDBD is much worse than the correlation between temperature and LDBD, using the RDBD in two directions as the model input can better predict LDBD than using temperature as the model input. This shows that using the RDBD data in two different directions has a good superposition effect on the prediction of LDBD.
(4) The prediction effect of using temperature and RDBD as the model input is 3.210% and 1.923% higher than that of using them separately as the model input.