1. Introduction
Earthquakes are a major natural hazard that impact urban sustainable development and infrastructure safety [
1]. Bridges are the most vulnerable elements in the urban transportation system. The damage state evaluation of regional bridges informs bridge managers about the possible damages and risks in a seismic event [
2,
3,
4]. Due to traffic loads and environmental effects, seismic demand and the capacity of in-service bridges are different from the original conditions [
5,
6]. To make an informed decision regarding pre-earthquake maintenance and post-earthquake recovery, it is critical to evaluate the seismic damage states of bridges based on their real-time conditions.
With the development of structural health monitoring, quite a few structural health monitoring systems (SHMS) are installed on infrastructures to record the long-term behaviors [
7,
8]. Information obtained from SHMS is mostly used to assess the long-term deterioration process due to physical aging and traffic loads, while there are still some limitations in identifying the location and degree of structural damages under the earthquake: (1) nonlinear system identification: ground motions lead to structural nonlinear failure, and existing techniques are not easily able to identify strong nonlinear behaviors; (2) distributed damage: multiple local damages appear in a large number of components after the earthquake; (3) signal interruption: signal cables and electric cables might be cut off in the earthquake, which obstructs engineers from obtaining the in-earthquake and post-earthquake data. In view of the limitations of seismic monitoring, it would be appropriate to update the structural conditions with SHMS before the earthquake.
Data-driven methods are approaches that apply machine learning algorithms to directly use the measured data as input to evaluate the structural conditions. Sajedi and Liang [
9] developed a method to diagnose structural real-time damages using the support vector machine (SVM). It precisely predicted the existence, location and severity of damages. Clustering methods [
10] and neural networks [
11] were used for condition assessments and damage detections. Jang and Smyth [
12] explored the three most popular machine learning models to describe the relationship between environmental effects and modal properties, where random forests (RF) obtained higher efficiency with the same prediction accuracy. Since data-driven models directly identify the data features of structural condition changes, they do not need any prior knowledge.
Vibration-based structural condition assessment is a conventional and effective approach to detecting structural damages only using structural mechanism characteristics [
13,
14,
15], such as natural frequencies, modal damping, curvature mode shapes, etc. Based on the first three natural frequencies, Morassi and Rollo [
16] developed a method for assessing the condition of a simply supported beam under flexural vibrations. Lin and Cheng [
17] presented a technique with curvature mode shapes to detect the damages of a free–free beam. Kbiem and Lien [
18] derived frequency change indicators and used them to determine the structural condition and detect the locations of cracks. The availability of vibration-based structural condition assessments provides a unique opportunity to evaluate seismic damage states based on updated structural conditions.
Previous studies used comprehensive geometrical and material parameters to reflect the real performance of each component [
19,
20]. Material stiffnesses and boundary conditions affect the structural seismic performance, but their correct determination is labor-consuming. Abutments provide a great source of energy dissipation and decrease the probability of the unseating of the bridge beams [
21]. Shear keys assist in constraining the relative lateral displacement between the beam and the abutments, and two adjacent beam segments [
22]. Bearings transfer the horizontal friction forces between superstructures and substructures. Bridge foundations transfer structural loads to the underlying soil. The bridge column is one of the main components in resisting the earthquake, and its seismic performance depends on the area of reinforcement, the strength of core concrete, and built-in reinforcement. Although these detailed material stiffness and boundary condition parameters make a great contribution to enhancing the model’s accuracy, it is labor-consuming to get these real values. Since the structural dynamic characteristics can reflect the material and boundary conditions, it is convenient for the monitored bridges to use the measured structural dynamic parameters to represent their real-time seismic performances.
Earthquake excitations of each bridge are diversified across regional scales [
23,
24]. Sensors installed on the bridges and the nearby seismographs record the time-history of the excreted ground motion, and detailed characteristics of ground motions could be easily extracted to reflect their intensities. In previous studies, most of the research selects only one intensity measure (IM) to represent the seismic hazard; peak ground acceleration (PGA) is the most commonly used. Padgett [
25] suggested that the PGA is a suitable selection to represent the severity of the earthquake ground motion, due to its efficiency and ease-of-implementation. The relative efficiency of PGA with some other IMs [peak ground velocity (PGV), peak ground displacement (PGD) and spectral acceleration at 1 s] has been carried out. However, since a large PGA value does not always induce severe structural damage, other IM, such as PGV, PGD, the time duration of strong motion and spectrum intensity, should also be considered.
Seismic damage state evaluation is a kind of classification problem [
26]. RF is an ensemble learning method, and it consists of bagging and random feature selection techniques to generate a large quantity of independent decision trees [
27,
28]. The trained model is more robust than a single decision tree, and less likely to overfit. Jia [
29] used RF to train the model with the data of the Wenchuan earthquake and the Tangshan earthquake. The results showed a good performance for assessing the damage states of two bridges. Kiani [
30] compared multiple classification algorithms for assessing the damage states of buildings. RF has the highest efficiency in predicting the structural seismic damages compared with other methods. RF can robustly handle high dimensional, large datasets with outliers and non-linear data. Its parallel computing can split the process into multiple machines to save computation time. Each decision tree has a high variance, but low bias. Since RF averages the variances of all the trees, we could get a low bias and moderate variance model, even for an unbalanced dataset [
31].
This paper proposes a vibration-based damage state evaluation method for concrete beam bridges using the RF method. The models contain bridge design parameters, structural dynamic characteristics and ground motion parameters. Design parameters represent the bridge configuration. Instead of traditional structural material strength and boundary stiffness parameters, dynamic characteristics are included in the models to represent bridge real-time conditions. These structurally related parameters are easy to be obtained for a bridge installed with SHMS. As for ground motions, several parameters, related to the peak effect, spectral characteristics and time duration of strong motions, are included in the models. RF classification methods are used to predict the damage state. To verify the effectiveness of the proposed method, the prediction accuracy of the proposed models and traditional models are compared. Besides, the confusion matrix and receiver operating characteristic curves of the proposed models are illustrated to manifest their high efficiency. For each bridge component, the relative significant parameters are also identified.
2. Numerical Modeling Techniques
This study selects the short- and medium-span beam bridges to establish regional damage state evaluation models. The three-dimensional simplified numerical models are developed by the Open System for Earthquake Engineering Simulation Platform (OpenSEES), incorporating the nonlinear material and geometrical behaviors. A typical layout of the selected short- and medium-span beam bridge model employed in this study is illustrated in
Figure 1.
In general, there are four common types of the superstructure for short- and medium-span beam bridges, that is, RC simply supported slab bridge, prestressed concrete simply supported hollow slab bridge, prestressed T-shaped concrete beam bridge, and prestressed box-shaped concrete beam bridge. The design references of short- and medium-span beam bridges in each country are relatively similar. Taking the design references in China as an example, the Ministry of Transport issued the standard drawings to guide the design of each type of bridge. The section dimensions, reinforcement layouts, reaction of superstructures, and other details for standard span length beam are given. This information can provide an efficient basis for the establishment of numerous standardized short- and medium-span beam bridges.
2.1. Model Establishment
The beam is modeled with the elastic beam-column element, since this would remain elastic during the earthquake. The mass of the superstructures acts as the inertial force for the whole bridge, so it is precisely calculated by the reaction of the superstructures. Due to the proposed model containing the dynamic characteristics, the stiffness of superstructures is also calculated with the given section’s dimensions. The transverse beam is modeled with the massless rigid link element to illustrate the torsion of the beam. Most of the short- and medium-span bridges are designed with laminated rubber bearings, and their stiffness impacts the bridge’s nonlinear response and dynamic characteristics. Bearing is assumed to be a perfectly elastic model [
32], where its stiffness is determined by the bridge’s seismic design code [
33]. Beam caps remain elastic during the earthquake, so its mass is modeled on the top of the columns. Columns are modeled with the fiber-based displacement beam-column element. In the fiber sections, the Steel02 material model with a hardening factor of 0.01 is used to simulate the reinforcement behavior. The Concrete01 and Concrete02 material models are used to account for the cover and core concrete behavior, respectively. Each bridge foundation is modeled with three translational linear springs and three rotational linear springs.
The influence of the abutment on the seismic damage of the bridge is significant; quite a few studies [
21,
34,
35] have presented the intrinsic issues in abutment modeling. In this study, a complex series and parallel spring system account for the abutment’s dynamic behaviors, and these are all modeled with zero-length elements. In the longitudinal direction, the effects of elastomeric bearing, gap, abutment piles (active soil) and soil backfill material (passive soil) are considered. The gap is modeled with the pounding springs [
36], which contain a gap and bilinear high stiffness. Before the gap closure, the force transmits from the beam to the bearings and gaps, and then to the abutment piles and backfill soils [
37]. After the gap closure, the beam, along with the bearing systems, collide directly with the abutment, which initiates the full passive earth pressure. In the transverse direction, the effects of elastomeric bearing, concrete shear keys and abutment piles (active soil) are considered. The elastomeric bearing and shear keys act in parallel. This combined parallel system is in series with the abutment piles (active soil). According to Caltrans [
38], the ultimate strength of the shear key is determined to be 30% of the superstructure dead load. A tri-linear hysteric backbone curve is defined for shear keys.
2.2. Model Verification
Accurate bridge dynamic models are an effective basis for the following seismic damage analysis. There are quite a few short- and medium-span bridges installed with SHMS [
39]. Since the occurrence of earthquakes is extremely rare, it is quite hard to measure the seismic response. This paper compares the bridge dynamic characteristics between measured real bridges and the corresponding numerical models, to ensure the reliability of the proposed modeling techniques.
Nanli river bridge is located on the Xinglin Highway in Hebei province, China. It is a typical prestressed box-girder skew bridge with a span of 30 m. The bridge cross-section consists of four small box longitudinal beams, and there are three transversal beams at the quarter and middle of the span to link all these longitudinal beams. The layout of the monitored span and installed sensors is shown in
Figure 2. Five sections are installed with different kinds of sensors, and static levels are used to measure the displacement of the bridge; thermometers can compensate the temperature error of the measured data; and dynamic characteristics can be identified from strain gauges and accelerometers.
Using the Stochastic Subspace Identification (SSI) method to analyze the response data of the accelerometers, the bridge frequencies can be accurately measured. Using the proposed numerical modeling techniques combined with real bridge parameters, the frequency of the numerical model can be extracted. Besides, the accuracies of mode shapes are calculated with the modal assurance criterion (MAC). The first three-order frequencies and MACs are compared in
Table 1. It can be seen that the difference between these two sources is very small, indicating that the proposed numerical modeling techniques are reliable.
3. Uncertain Parameters
This paper proposes a seismic damage evaluation method for regional monitored concrete beam bridges. Unlike the traditional seismic damage evaluation method, the selected parameters in the proposed method are conveniently obtained for a monitored bridge. To ensure the high accuracy of the damage prediction, the traditional method employs the material stiffness and boundary condition parameters to simulate the real condition of bridges, however, the proposed method uses bridge real-time dynamic characteristics.
3.1. Material and Boundary Parameters for Traditional Unmonitored Bridges
In the traditional method, many bridge geometrical and material parameters are included, as listed in
Table 2. Geometrical parameters selected from the main design parameters illustrate the bridge configuration. Span (
) and Number of Spans (
) depict the longitudinal layout of bridges. Number of Beams (
) depicts the transversal layout of bridges. Since the proposed modeling technique is based on the aforementioned standard drawings, the mass and stiffness of the superstructures are easy to determine with these three parameters. Column height (
), Diameter of Column (
) and Number of Columns (
) are used to determine the layout of substructures. Bearing stiffness is determined with the mass of superstructures and seismic design codes. Skew angle (
) makes it possible to simulate the skew bridges.
Material parameters and boundary stiffness parameters are included to modify the bridge response. Since the column plays an important role in the earthquake event, longitudinal reinforcement ratio (), concrete compressive strength () and reinforcement yield strength () are used to calibrate the real performance of the column. As for the foundation, rotational stiffness () and translational stiffness () are the key stiffness. Abutment pile stiffness and backfill stiffness are determined with the soil condition. These parameters () are determined with the original design values, so they can not reflect the real-time structural conditions.
This paper establishes 672,000 numerical models with these modeling parameters, to comprehensively consider all kinds of regional beam bridges. The Latin Hypercube Sampling (LHS) method [
40] is a stratified sampling method for generating a near-random sample of parameter values from a multidimensional distribution. To perform the stratified sampling, the cumulative probability is divided into segments. It randomly selects samples from each segment using a uniform distribution, and then maps to the correct representative value the variable’s actual distribution. Once each variable has been sampled using this method, a random grouping of variables is selected with the independent uniform selection. LHS aims to spread the sample points more evenly across all possible values. In this paper, the samples of the numerical three-dimensional bridge model are generated by sampling across a certain range of uncertain parameters using the LHS method.
3.2. Dynamic Characteristic Parameters for Proposed Monitored Bridges
In the proposed method, the bridge dynamic characteristics parameters are included to replace some material and stiffness parameters. Since the bridge geometrical parameters (
~
) are easy to obtain from the design documents or technical reports, they are preserved in the model to illustrate the bridge configuration. Bridge dynamic characteristics are important indicators of the bridge condition, as their fluctuations could reflect the deterioration of the material and stiffness performances. According to the seismic design code, the maximum acceleration response spectrum appears between 0.1 s and the characteristic period (
). This interval generally contains the first five orders of bridge frequencies (
~
), as illustrated in
Figure 3. With the help of modern signal processing techniques, frequencies (
~
) of the monitored bridge can be identified. These parameters mainly aim to represent the real-time condition of the measured bridges. Although the dynamic characteristic parameters might be influenced by the bridge design parameters, these parameters can be used together to effectively determine the configurations and real-time conditions of the measured bridges. The proposed method uses theses 11 parameters (
Table 2) to predict seismic damage states for regional monitored beam bridges.
Generally, the proposed dynamic characteristic parameters can be measured either before or after the earthquake. If they are measured before the earthquake, this could have implications on the post-earthquake emergency traffic and recovery operations; If they are measured after the earthquake, the post-earthquake damage states could be identified without much delay.
3.3. Intensity Measures
From the regional view, each bridge undertakes different ground motions in the same earthquake scene. Analysis of recorded signals from installed sensors or nearby seismographs could provide the time-history and detailed characteristics of the ground motions. In addition to considering the structural and material uncertainties, the ground motion uncertainties should also be fully considered. Earthquake ground motion is typically characterized by three main aspects: peak effect, response spectrum and acceleration duration [
30]. This study selects eight IMs (
) to represent these three aspects in the seismic damage state evaluations. PGA and cumulative absolute energy (CAV) are selected to represent the peak effect of ground motion. Acceleration spectrum intensity and spectral acceleration, at the periods of 0.5, 1, and 3 s, are used to show the spectrum intensity characteristics. 5–75% and 5–95% significant durations are chosen as the representatives of time history characteristics. The definitions of these eight selected IMs are listed in
Table 3.
The ground motion suite used in this study should be informative considering the established 672,000 numerical models. It contains 1000 ground motions that are developed from the Pacific Earthquake Engineering Research (PEER) ground motion database. The PGAs of these 1000 ground motions are evenly distributed between 0 g and 1.0 g, and there are 100 ground motions contained in every 0.1 g interval. Among them, ground motions with PGA lower than 0.4 g are natural ground motions, while the other 600 ground motions are scaled ground motions. Since the ground motions with high PGA are rare in the PEER database, they are scaled from the other natural ground motions to populate sufficient response data for the strong earthquake scene. The IMs of selected ground motions with PGA between 0.5 g to 0.6 g is illustrated in
Figure 4. This study randomly pairs 672,000 numerical models with these 1000 ground motions, in both longitudinal and transverse excitations.
4. Vibration-Based Seismic Damage State Evaluation Methodology
There are two main stages in the proposed evaluation frameworks, including the regional bridge seismic simulation stage and the seismic damage state evaluation stage. The framework for vibration-based seismic damage state evaluations is represented in
Figure 5. The seismic simulation stage aims to label the structural seismic damage state for a certain bridge–earthquake pair, and it provides the labeled dataset for the following supervised model training. In the evaluation stage, the labeled 672,000 bridge–earthquake pairs are split randomly in this study into a training set (70%) and a test set (30%). The RF is trained with the training set to minimize the prediction error. The evaluation of the model using the test set informs the prediction performance and prevents overfitting. This study uses the cross-validation strategy to evaluate predictive models. In practice, the training set is randomly partitioned into
n equal-sized subsamples. Of the
n subsamples, a single subsample is retained as the validation data for testing the model, and the remaining
n–1 subsamples are used as training data. The cross-validation process is then repeated
n times, with each of the
n subsamples used exactly once as the validation data. The
n results from the folds can then be averaged to produce a single estimation. The advantage of this method is that all observations are used for both training and validation, and each observation is used for validation exactly once. The ideal trained models are expected to imply the general law of structural seismic damage, and are effectively generalized to other regional bridges.
4.1. Random Forest
RF is an ensemble machine learning method operated by constructing a large number of decision trees (DT). Unlike DT, which uses all features to generate a tree-like graph for classification, RF uses an effective “feature bagging” learning algorithm, which combines the random feature selection and bagging techniques. If one or a few features are very strong predictors for the target output, this subset of features will be selected to construct a tree-like classification graph sample. This kind of sample is known as the bootstrap sample. Using bagging techniques, these models are fitted with the above bootstrap samples and combined by voting. RF improves stability and accuracy, reduces variance, and helps to avoid overfitting.
With the size and nature of the training set, an optimal number of trees are determined by bootstrap aggregating or bagging. By averaging the predictions from the individual regression trees, the RF prediction can be expressed as:
where
denotes the RF prediction from the total of
trees, and
denotes the prediction of each individual tree with the input
. Additionally, an estimate of the uncertainty of the prediction can be made as the standard deviation of the predictions from all the individual trees, and can be expressed as:
4.2. Component Demands and Damage State Classification Labeling
RF is a supervised learning algorithm, which infers the function between input features and output results based on the labeled training data. Following the material performance and bridge design codes, this study suggests three damage states for each bridge component and the whole bridge system, illustrated in
Figure 6. The component demand obtained from the nonlinear seismic analysis of the bridges is compared with their capacities, and three damage states are defined as listed in
Table 4, where
denotes the structural seismic response,
denotes the yield strain, and
denotes the ultimate strain. Since the seismic demand value of each bridge component in the regional scale is different, this paper only gives the principle of classifying the seismic damage states.
The defined damage states are associated with material performance, structural damage and regional traffic influence. This paper selects five bridge components to make the seismic damage state evaluation. For abutments, the stiffness of longitudinal direction (AbutX) and transversal direction (AbutY) are different, and the yield points and ultimate points are determined via the soil condition and previous research. For bridge bearing (Bear), since it is assumed to be a perfectly elastic model, displacement controls the damage states. Bridge columns (Column) play an important role in seismic analysis. Typical sectional moment-curvature analyses for 672,000 samples are carried out; at the yield point, the built-in rebars start to yield, and cover concrete begins spalling; at the ultimate point, core concrete crashes. The damage state of beam unseating (Beam) is defined via the bridge transverse structure configuration. The whole bridge system is considered as a series system of these five components, and the damage state of the system is determined via the most damaged component.
4.3. Predict Performance Indicators
To establish a predictive model for classifying the seismic damage state of components and bridge systems, the RF algorithm is carried out as mentioned in
Section 4.1. The confusion matrix is a table for the visualization of the predicted performance, in which each row of the matrix depicts the cases in an actual class, while each column depicts the cases in a predicted class. It is usually constructed with
rows and
columns, where
equals the number of classes. There are four kinds of classification of the analytic results, including true positives (
TP), true negatives (
TN), false positives (
FP) and false negatives (
FN), shown in
Figure 7.
TP and
TN are the outcomes where the model correctly predicts the positive and negative class, respectively.
FP and
FN are two kinds of prediction error, indicating that the model incorrectly predicts the positive and negative class, respectively.
The efficiency of the predictive model is evaluated using the
Accuracy,
Precision,
Recall and
F1-score (
) of the test data set.
Accuracy is the most intuitive performance indicator; it defines a ratio of correctly predicted conditions to the total conditions.
Accuracy is a good performance indicator only when FP and FN have a similar cost.
Precision defines the ratio of correctly predicted positive conditions to the total predicted positive conditions. In other words,
Accuracy denotes the closeness of the predictions to the target value, while
Precision denotes the closeness of the predictions to each other.
Recall defines the ratio of correctly predicted positive conditions to all conditions in a true condition.
denotes the weighted average of
Precision and
Recall. It is usually more useful in an uneven class distribution [
30]. Since the damage state distributions of each component are extremely uneven,
is an important indicator in this study. The equations of the above indicators are listed below:
The receiver operating characteristic curve (ROC Curve) and area under the curve (
AUC) are performance indicators that depict the diagnostic ability of the established classifier. ROC Curve defines a probability curve plotting the true positive rate (
TPR) against the false positive rate (
FPR), at various setting values. An excellent prediction method would approach the point at the upper left corner of the ROC space, representing no FN and FP. Under this circumstance, the associated
AUC is near to 1, as shown in
Figure 8. The equations of
TPR and
FPR are listed below:
6. Conclusions
Regional transportation networks play an important role in urban areas. The efficient seismic evaluation of regional bridges can identify the potential damage of components before the earthquake, and guide the recovery operation after an earthquake. With the aid of a structural health monitoring system (SHMS), the bridge’s real-time condition can be accurately identified. SHMS could also identify ground motions that are exerted on the monitored bridges during the earthquake. Based on the measured information and some bridge design parameters, this paper proposes an effective seismic damage evaluation method for regional monitored concrete beam bridges with machine learning techniques.
The proposed seismic damage evaluation method is demonstrated for short- and medium-span beam bridges, which are the dominant bridge classes on a regional scale. 672,000 bridge numerical models are probabilistically generated, representing the structural and material diversity of regional beam bridges, and 1000 ground motions are developed to representing the regional ground motion diversity. The non-linear time history analysis of the bridges is carried out to estimate the seismic damage state of selected bridge components and systems. The seismic damage states are labeled with three tags: Open (structural safe), Restrict (open for emergencies) and Close (collapse or the potential to collapse). Using the selected bridge design parameters, bridge dynamic parameters, intensity measures and associated labeled damage states in the dataset, the models are trained with RF. The performance of the proposed machine learning models is explored in this study. RF can predict the seismic damage states with an accuracy ranging from 89% to 97%, depending on the bridge components and system. The precision, recall and F1-score of most bridge components and systems account for at least 90%. The area under the receiver operating characteristic curve for bridge components and systems yields over 0.99. It is noted from these performance indicators for bridge components and systems that RF has great performance potential in evaluating the seismic damage states of regional beam bridges.
RF is also applied to identify the significant parameters of each bridge component and system in the seismic damage state evaluation. Since the measured structural dynamic characteristics could reflect the structural configuration and real-time conditions, most of them are proved to have a significant influence on the seismic damage state, while some bridge design parameters are neglected. Along with these bridge dynamic parameters, this study also indicates that the skew angle and some ground motion intensities have a major impact on seismic damage states. It is of great importance to have the multi-parameter seismic fragility models available to assess the damage risk and loss of regional bridges. The proposed RF-based damage evaluation method could rapidly and precisely evaluate the seismic damage states. Since the numerical models established in this study are based on the Chinese bridge design code and Chinese official recommended standard drawings, the findings in this study should be carefully applied in other areas. In addition, further studies will apply the proposed method to other area bridges, and compare the seismic characteristics of bridges in different regions.