1. Introduction
An earthquake is known as a devastating natural disaster leading to huge damage to buildings and infrastructures. Therefore, engineers must design buildings to protect them from seismic vibrations. Various technologies are available in the modern construction industry to reduce the financial and human losses in buildings due to ground shaking. Among these technologies, the installation and use of energy-dissipating devices is widely recognized as one of the most effective ways to lower the damage risk of the buildings exposed to an earthquake [
1,
2,
3,
4,
5,
6,
7,
8,
9]. Using energy-dissipating devices can offer many advantages over the standard techniques for seismic upgrading. For instance, these modern devices can reduce both the total acceleration and inter-story drift in the buildings experiencing an earthquake, leading to the effective protection of both the non-structural and structural components of buildings [
10].
Base isolation (BI) is a well-known energy-dissipating tool installed in many buildings. Buildings equipped with BI have shown an acceptable performance during some recent, strong, seismic events. For instance, in the 2023 earthquake event in Turkey, there was good performance recorded for hospital buildings with BI systems, which verified the seismic protection capabilities of the BI devices [
11]. However, due to the complicated nature of seismic loads, using BI systems can sometimes have some challenges. These challenges can be observed both in far-field and near-field seismic events [
12,
13]. Consequently, base isolation has been widely studied in recent years given its associated advantages and challenges. Perez-Rocha et al. conducted research for studying the performance of BI for mid-rise buildings considering soil–structure interaction effects [
14]. Li et al. evaluated the performance of a nonlinear hybrid base isolation system under the ground motions [
15]. Wang et al. studied methods for enhancing the performance of base-isolated structures by employing an optimized, configurable friction isolator-tuned inerter damper system [
16]. Liu et al. examined the seismic response of three-dimensional base-isolated structures, while considering the influence of rocking and P-D effects [
17].
Confirming the effectiveness of the base isolation system requires a thorough evaluation of the structural response. Such evaluations may be conducted by means of experimental tests, numerical simulations, and modeling, or advanced artificial intelligence (AI) methods. Among the methods, experimental testing is typically the most expensive, whereas numerical modeling and AI-based approaches are usually favored, except in cases where the structural system demonstrates highly complex or unpredictable behavior. Nonlinear time-history analysis is known as one of the best techniques for evaluating the behavior of structures when an earthquake occurs. However, it can be a time-consuming process with huge numbers of computations, which requires creating a finite element model of structure [
18]. The advent of complicated AI and soft computing techniques such as machine learning (ML) has made it possible to predict the response of structures under seismic events. Therefore, during recent years, many scholars have focused on evaluating different structural systems using AI tools [
19,
20,
21,
22,
23]. Nguyen employed ML for predicting the seismic drift responses of planar steel moment frames [
24]. Mangalathu et al. used ML interpretability techniques for seismic performance assessment of infrastructure systems [
25]. Hwang et al. estimated the economic seismic loss of steel moment-frame buildings using an ML algorithm [
26]. Zhang et al. presented data-driven ML methods for seismic response prediction of a damped structure [
27]. Yu et al. employed a hybrid approach based on AI to evaluate the condition of bridge decks [
28]. Yu et al. also used an EBA-optimized, chemistry-informed, interpretable deep learning model to predict the compressive strength of fly ash/slag-based geopolymer concrete [
29].
Numerous studies have investigated the assessment of building damage under seismic events [
30,
31,
32,
33,
34]. However, despite the extensive research on predicting building damage caused by earthquakes, limited attention has been given to buildings equipped with BI systems. Moreover, among the many studies employing ML techniques to predict the seismic performance of buildings, very few have focused on evaluating the response of retrofitted structures, particularly those retrofitted with BI [
19,
20,
21,
22,
23]. Therefore, new studies seem essential to explore the capability of ML models in assessing the seismic response of BI-retrofitted buildings. Accordingly, the present study focuses on applying ML methods to predict the seismic damage of BI-equipped buildings. It is evident that accurate prediction of damage in BI-retrofitted reinforced concrete buildings can support engineers in rapid post-earthquake assessment, optimization of isolation design, and cost-effective retrofit decision-making, thereby enhancing the resilience of the built environment [
24,
25,
26,
27].
2. Methodology
In the current study, a six-step systematic procedure was employed. This method has been used in several previously published papers. The steps were performed as follows:
Design the RC buildings;
Design base isolation for seismic retrofitting;
Conduct nonlinear time-history analysis to gain datasets;
Classify the damage using proposed machine learning models;
Check the accuracy of learning models for damage prediction;
Train machine learning models named RF, ANN, DT, SVM, and XGBoost 3.1.1.
An overview of the research workflow is as follows: First, various reinforced concrete (RC) buildings, originally designed according to older versions of commonly used seismic codes in the United States, were considered. The proposed code references were AISC 7 and ACI 318, both published in 2002. Subsequently, BI systems were designed following the provisions of widely recognized standards, such as ASCE 7-22, and installed beneath the bottom columns of these buildings. Various seismic ground motions with differing intensities were then applied to the BI-retrofitted buildings. A portion of the resulting data, representing damage levels, was used to train ML models, while the remaining data was reserved for validating the accuracy of the ML algorithms. Finally, the damage levels were determined, and the results of the ML models were compared with those obtained from time-history analysis. Further details on each step are provided in the following sections.
The dataset produced for the machine learning models covers a wide range of configurations, including the number of stories, type of moment-resisting frame, span length of the RC buildings, and various ground motions with different intensities, distance from the fault, and frequency characteristics.
The machine learning models used in this study—DT, RF, XGBoost, SVM, and an ANN—were selected because they can effectively capture nonlinear and interaction effects among input parameters which are characteristic of the seismic response of base-isolated buildings [
35,
36,
37].
The features used in the machine learning algorithms were classified into three main groups: building parameters (e.g., natural period), seismic ground motion parameters (e.g., peak ground acceleration), and soil property parameters (e.g., soil type). The feature selection process was guided by previous studies, correlation analysis, and feature importance ranking to remove redundant variables and retain only the most influential ones. This approach enhances both model accuracy and interpretability [
22,
23,
24,
25]. The dataset was divided into training (87%) and testing (13%) subsets using a fixed random seed to ensure reproducibility. Five-fold cross-validation was applied on the training data to achieve stable and unbiased model performance.
Evaluation metrics, including Precision, Recall, and F1-score, were used to assess the performance of the classification models. Precision measures the proportion of correctly predicted positive cases, Recall quantifies the model’s ability to detect all actual positive cases, and the F1-score represents their harmonic mean. These metrics together provide a comprehensive and balanced evaluation of classification accuracy and reliability, especially for datasets with imbalanced class distributions.
The study is organized as follows:
Section 2 provides a brief overview of the ML algorithms employed.
Section 3 describes the modeling of the buildings and the time-history analysis conducted using finite element (FE) software.
Section 4 discusses the properties of the seismic events, while
Section 5 presents the generation and training of the datasets.
Section 6 evaluates the accuracy of the ML algorithms, and
Section 7 concludes the study.
3. Overview of Machine Learning Methods for Damage Prediction
In this section, a brief description is presented about the ML techniques employed in this study. Techniques used include an Artificial Neural Network (ANN), Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), and Extreme Gradient Boosting (XGBoost). Presenting a detailed discussion about these methods is beyond the scope of this study, and more information can be found in the cited articles [
38]. These methods were selected because of their popularity and widespread application in the articles published about using ML for predicting the seismic behavior of civil engineering structures.
The working mechanism of ANNs was inspired by the structure of animal brains. The core structure of an ANN consists of interconnected nodes (or neurons), where each connection is assigned a specific weight [
38]. During training, ANNs adjust the weights of connections between nodes to minimize the prediction error. Once trained, the network uses these learned weights to predict outputs based on input data. ANNs work by employing a feed-forward process to compute outputs and are typically trained using a backpropagation algorithm that updates the weights based on error minimization [
19]. Three types of layers known as input, output, and hidden layer(s) exist in an ANN. Information is fed into the ANN through the input layer, while the output layer is responsible for presenting the results. The hidden layers, located between the output and input layers, learn internal representations that enable the ANN to perform complex mappings from inputs to outputs [
38].
SVMs are supervised learning algorithms commonly used for data classification due to their high accuracy and low computational cost [
39,
40]. SVMs work by identifying the optimal hyperplane that maximizes the margin between data points of different classes. The data points that lie closest to this hyperplane are known as support vectors, and they play a key role in defining the decision boundary. SVMs can be classified into linear and nonlinear models, depending on the complexity of the data distribution [
38].
A DT is a powerful and simple tool used for data classification. It classifies the data by creating a tree-like model. The main elements of this method are leaves, nodes, and branches. The outcomes of the tests conducted by DT are presented by each branch, while internal nodes are responsible for testing the features. Using the DT can sometimes lead to overfitting issues [
38].
RF is an ensemble learning method that produces several decision trees in the training process and combines their outputs to improve accuracy and mitigate overfitting. Each tree is trained based on a random subset of the data with randomly selected features. The final classification of the data is determined through majority voting across all trees [
19].
XGBoost is known as one of the developed gradient boosting algorithms designed for improving efficiency and performance. In this method, a group of decision trees are produced, and each decision tree proposes to correct the errors made by its predecessors. Similarly to RF, XGBoost can be a good choice for reducing the overfitting issues of the DT [
19].
The selected ML algorithms were specifically chosen to address the nonlinear and multi-parameter nature of the seismic response in base-isolated reinforced concrete buildings. The behavior of these systems depends on a combination of the base-isolated building’s behavior and ground motion characteristics, resulting in complex nonlinear relationships that are difficult to represent analytically. Tree-based ensemble models such as RF and XGBoost were adopted for their robustness, ability to capture nonlinear feature interactions, and capability to rank variable importance—allowing identification of the most influential parameters governing structural response [
41,
42,
43]. An ANN was included because it can approximate highly nonlinear mappings and learn hidden relationships between seismic input parameters and response outputs [
44,
45]. An SVM was used to benchmark performance on relatively small but high-quality datasets due to its strong generalization capability [
46,
47], while DT provides interpretability and transparency in identifying key decision boundaries [
48].
4. Modeling and Design Details
The focus of this study is to evaluate the performance of retrofitted RC moment-resisting frame buildings using BI. The BI employed in this study is the lead rubber bearing (LRB). The selected buildings for retrofitting range from 5 to 8 floors with three-, four-, and five-bay configurations. In each building, columns are evenly spaced, with span lengths of 5 m, 6.25 m, and 8.33 m for buildings with five, four, and three bays, respectively. It is further assumed that the bay spacing is uniform in both the X and Y directions, and all modeled structures have a symmetrical rectangular plan.
For the purposes of this study, the RC buildings were originally designed without BI. It is assumed that the seismic load is calculated based on an older version of the ASCE 7 code, while the retrofitting was performed according to the latest version. Specifically, the older version refers to the ASCE 7 code published in 2002, while the new version refers to the ASCE 7 released in 2022. Although the general methodology of the two versions is similar, certain parameters, such as the base shear coefficient, have increased in some regions of the United States. For example, in Salt Lake City and Seattle, the base shear coefficient for site class C has increased by more than 25% between the older and newer versions of the ASCE 7 [
49,
50].
In the design of the RC buildings, the dead loads and live loads are as follows: The live load is assumed to be 200 kg per square meter, while the dead load is assumed to be 500 kg per square meter, excluding the RC structural members. The weight of the RC structural members, including beams and columns, is accounted for separately. Furthermore, the seismic mass of the building is considered as the sum of the dead loads and 20% of the live loads. According to ASCE 7-02, the base shear coefficient can then be calculated as follows [
49]:
where
I represents the importance factor, which is selected to be 1.0 in the designed buildings,
R denotes the
R factor, and
SDS is the design spectral acceleration. The value of
CS should not exceed
, where
T denotes the natural period of the building and
SD1 represents the design spectral at the period of 1.0 s.
The RC buildings modeled in this study are located on sites B and C according to the ASCE 7 classification. The buildings have R-factors of 6 and 8, corresponding to intermediate and special RC moment-resisting frames, respectively. Note that the R-factors of 6 and 8 correspond to RC structures with medium and high ductility, respectively, and in the design process, the detailing requirements associated with both ductility levels are considered. In summary, the buildings differ in the number of stories, bay lengths, site location, and R-factor, resulting in a total of 48 RC buildings modeled and analyzed.
The nonlinear time-history analysis in this study was conducted using the OpenSees software platform. Both OpenSees and OpenSeesPy provide frameworks for simulating the seismic response of structural systems. OpenSeesPy was selected because it operates within the Python 3.12 programming environment, which offers convenient libraries for ML applications. A brief overview of the modeling details in OpenSeesPy is provided below.
The structural models used for the time-history analysis were two-dimensional. Beams and columns were modeled using nonlinearBeamColumn elements. The materials selected for modeling the structural concrete and steel rebar were Concrete02 and Steel02, respectively, which allow consideration of the cyclic degradation in RC sections. The properties of the proposed steel are listed in
Table 1. In
Table 1,
Fy and
E denote the yield stress and Young modulus of the steel rebar, whereas the parameters
R0,
R1, and
R2 control the transition from elastic to plastic branches [
51]. Moreover, the properties of the concrete materials used in the numerical modeling are listed in
Table 2. Note that the parameters
f’c and
denote the compressive strength and corresponding compressive strain, while
and
represent the ultimate compressive strength and ultimate strain of the concrete material. Moreover,
is defined as the ratio between the unloading slope at
and initial slope [
51]. The mechanical parameters of the concrete are defined for both core and cover concrete, and, as seen in
Table 2, the core concrete has a higher compressive strength and ultimate strain. Additionally, the stiffness and strength of the concrete under tension are ignored in modeling [
52].
It is noted that the type of BI selected for this study is the lead rubber bearing (LRB). According to the provisions of ASCE 7-22, several factors must be considered in the design of the LRBs. The design of LRBs as seismic isolation devices under ASCE 7-22 focuses on ensuring adequate energy dissipation, sufficient vertical and lateral load-carrying capacity, and stability under both seismic and gravity loads. Another critical aspect of LRB design under ASCE 7-22 is the evaluation of displacement demand under the Design Earthquake (DE) and Maximum Considered Earthquake (MCE) levels. The isolation system must accommodate the maximum expected displacement without instability or excessive strain [
50].
The nonlinear hysteretic behavior of the BI system is modeled as a bilinear hysteretic material as illustrated in
Figure 1. The LRB material is characterized by a post-yielding shear stiffness of 60 psi (0.4136 MPa) [
53].
In this study, the output of the ML models is the classification of damage levels following seismic excitations. Damage levels are defined based on maximum inter-story drift ratios (MISDRs), using a five-grade classification: null to slight, moderate, heavy, very heavy, and collapse [
54]. The MISDR range corresponding to each damage level is shown in
Table 3.
The RC buildings modeled in the current study represent typical mid-rise structures retrofitted by installing BIs. The dataset covers a wide range of building heights, fundamental periods, span lengths, site classes, and ductility levels to ensure representativeness of realistic design variations. The isolation system parameters, including yield strength, post-yield stiffness ratio, and effective isolation period, were selected within ranges reported in design guidelines and validated experimental studies, ensuring that the models reflect realistic isolation behavior observed in practice [
50,
54,
55].
During the design stage, the variability of the BI system and the superstructure properties was explicitly incorporated to represent the expected range of physical and construction uncertainties. The mechanical properties of the isolators—such as effective stiffness, post-yield stiffness ratio, yield strength, and damping—were varied within the upper and lower bounds recommended by ASCE 7-22 to reflect manufacturing tolerances, aging effects, and testing uncertainties inherent in practical isolation systems. Similarly, a ±10% variation in the first fundamental period of the base-isolated building was introduced to account for plausible deviations in material stiffness, concrete strength, and reinforcement detailing that typically influence global dynamic characteristics. These variations ensure that the modeled structures represent realistic design scenarios while capturing the influence of parameter uncertainty on the predicted seismic performance of base-isolated reinforced concrete buildings.
The nonlinear analyses in this study were conducted using two-dimensional (planar) models of the base-isolated structure. While three-dimensional (3D) effects such as torsion and out-of-plane coupling can influence seismic response, the use of planar (2D) modeling remains widely accepted in design codes and research for plan-regular buildings with isolation systems. For example, recent reviews of base isolation systems emphasize that much of the parametric- and performance-based research continues to employ 2D idealizations to balance computational cost and input-motion sampling [
56,
57]. By adopting this simplified modeling framework, we were able to conduct an extensive parametric and machine learning investigation.
In this study, the soil–structure interaction (SSI) effects were neglected to focus on developing and validating the ML framework for predicting the seismic response of base-isolated buildings. The primary objective was to evaluate the capability of the proposed ML model to capture the nonlinear behavior of isolated superstructures under various ground motions, assuming a fixed-base condition. It is noted that there are many studies in the field of using machine learning to predict the behavior of base-isolated buildings in which the effects of SSI are ignored [
58].
It should be noted that during the application of ML, the mechanical properties of the base isolation system and the structural parameters were treated as deterministic values. This assumption is consistent with the purpose of the study, which focuses on developing and validating an ML framework for predicting the seismic response of base-isolated buildings rather than performing a full probabilistic risk analysis. The variability of isolator and structural properties had already been represented during the design stage through code-based parameter bounds and the ±10% variation in the fundamental period, ensuring that the dataset used for training adequately spans the realistic design domain. Using deterministic inputs allows the ML models to learn clear cause–effect relationships between seismic demand parameters and structural responses without introducing additional stochastic noise [
59,
60,
61].
Table 3.
The range of maximum inter-story drift ratios for each class of damage in the retrofitted RC buildings [
62].
Table 3.
The range of maximum inter-story drift ratios for each class of damage in the retrofitted RC buildings [
62].
| Damage Level | Null to Slight (Level 1) | Moderate (Level 2) | Heavy (Level 3) | Very Heavy (Level 4) | Collapse (Level 5) |
|---|
| Range of MISDR | 0 to 0.25% | 0.25% to 0.5% | 0.5% to 1% | 1% to 2% | >2.0% |
5. Seismic Excitation Properties
Eleven seismic events were selected for the time-history analysis. The acceleration history of each seismic event is presented in
Appendix A, with all seismic events scaled to a maximum peak ground acceleration (PGA) of 1.0 g. For the Kobe (1995) and Tabas (1978) earthquakes, two ground motions from two different stations are considered. The selected seismic events comprise both near-field and far-field earthquakes, which can result in differing performance of base isolation systems. Other properties of the ground motions are listed in
Table 4.
To enable a more detailed comparison, the displacement, acceleration, and velocity response spectra of the seismic events were analyzed for a structure with 5% of critical damping. In the derivation process of each spectrum, the PGA was set to be 1.0 g. The resulting spectral displacement, velocity, and acceleration are shown in
Figure 2. As observed, near-field events such as Loma Prieta (1989), Kobe I (1995), and Chi-Chi (1999) exhibit higher spectral velocity peaks.
As seen in
Figure 2,
Table 4, and
Appendix A, the selected ground motions represent a wide variety, which allows the acquisition of a suitable insight about the buildings’ performance under various seismic vibrations with different properties. The unscaled magnitude of the ground motions varies from 6.5 to 7.62, while the PGA of the original events covers a band from 0.09 g to 1.23 g. Furthermore, the selected ground motions include both far-fault and near-fault seismic events, where the events El Centro, Northridge, Tabas I, Kobe II, Tabas II, San Fernando, and Friuli are far-field ground motions, while Kobe I, Chi-Chi, Duzchi, and Loma Prieta are near-field ground motions. Moreover, according to
Figure 2c the main peak of the acceleration spectrum covers a range from the period (
T) of near 0.1 s to 0.85 s, showing the wide range of the dominant period and frequency contents in the proposed seismic ground motions.
For dataset generation, each seismic event was scaled across a range of PGAs from 0.05 g to 0.70 g. Initially, each proposed seismic event was considered at a maximum PGA of 0.05 g, and the corresponding damage level was recorded. Incremental dynamic analysis (IDA) was then conducted, increasing the PGA in 0.05 g increments until structural collapse occurred. This procedure resulted in the generation of more than 7000 datasets.
6. Dataset Generation for Model Training
This study considers both the structural system and the seismic excitation properties used to train the ML models, enabling the prediction of damage levels in retrofitted RC buildings.
As seen in
Table 5, 28 parameters were considered to train the datasets in the ML algorithms. Among these 28 variables, 15 variables referred to ground motions, while 12 referred to the properties of the retrofitted buildings, such as height and vibration period in the first mode, and 1 of them represented the site type of the buildings.
It is noted that the input parameters shown in
Table 5 were not explicitly normalized or standardized prior to model training. As the majority of the employed algorithms (DT, RF, and XGBoost) are tree-based and inherently insensitive to feature scaling, normalization was not required for those models. However, the SVM and ANN can be sensitive to input magnitudes. In this study, since all input features were within a comparable numerical range (after checking for outliers and unit consistency), the models achieved stable convergence without additional normalization [
63,
64,
65,
66,
67].
Prior to training the ML models, the datasets were classified into two groups: one group for training the models and the other for evaluating their performance. It is a well- established fact that the accuracy of ML models is highly dependent on the quality and composition of the training data. In this study, approximately 83% of the dataset was used for training the ML models, while the remaining 17% was reserved for testing and validating model accuracy.
To ensure robust performance, the training data were carefully selected to represent the full range of structural and seismic conditions. Subsequently, the performance of each ML algorithm was evaluated using the selected random data. As previously described, the modeled buildings vary in the number of stories, number of bays, and type of RC moment-resisting frame. For each frame with a specified number of stories, the data corresponding to one bay configuration and one type of moment-resisting frame were randomly excluded from the training dataset. The bay configurations included three, four, and five bays, while the moment-resisting frame types were designated as Type 1 and Type 2, corresponding to special and intermediate frames, respectively.
The damage data derived from the time-history analyses were subsequently examined.
Figure 3 presents the percentage distribution of each damage level across the entire dataset. As observed, damage levels 3 and 4 are the most prevalent. It is important to note, based on
Table 3, that the MISDR range corresponding to these damage levels is wider than that of other levels, which explains their higher frequency in the dataset. The percentage of datasets corresponding to damage level 5 was about 13%, and it can be increased if the number of datasets produced with higher PGAs is also increased. However, earthquakes with PGAs higher than 0.70 g are very rare, and in the time-history analysis conducted in this study, the PGAs value do not exceed 0.70 g.
As shown in
Figure 3, levels 3–4 have higher frequencies. In this study, no oversampling or under-sampling technique was applied, as the models were trained on the original dataset to preserve its natural distribution.
To mitigate potential bias, a stratified train–test split was adopted, ensuring that each damage level was proportionally represented in both subsets. Furthermore, class-wise performance metrics (Precision, Recall, and F1-score) were examined, which are presented in the next sections of this paper.
Although this approach maintains the physical realism of the data, it may introduce a minor bias toward the more frequent classes (levels 3–4). This reflects the fact that heavy to very heavy damage states are more prevalent in the simulated scenarios and thus represent realistic but slightly imbalanced conditions. The influence of this imbalance was carefully evaluated through class-wise metrics and confusion matrices to confirm that model performance remained robust across all damage levels.
It is noted that the dataset was generated using numerical simulations based on well-established material and isolation models. To evaluate the sensitivity of the results to modeling assumptions, a range of structural and isolation parameters, such as stiffness, natural period, and height, were used within realistic bounds recommended by design codes. This parametric variability allowed the dataset to capture diverse structural behaviors and reduced dependence on specific modeling choices. Nevertheless, some simplifications were adopted to maintain computational efficiency, such as two-dimensional representation of the buildings and deterministic material properties. These assumptions may influence absolute response values but are not expected to affect the overall trends learned by the machine learning models [
68].
Figure 4 shows the histogram of the dataset produced after conducting the time-history analysis. The data are approximately uniformly distributed for MISDR values between 0.1% and 1.5%. Beyond MISDR = 1.5%, the density of data decreases, reflecting the lower probability of very strong seismic events. Consequently, the number of cases exhibiting severe damage declines in the dataset.
7. Results of Machine Learning Classification
To achieve high accuracy in the ML algorithms, the hyperparameters of each model need to be selected within an appropriate range. In this study, the hyperparameters selected for each ML algorithm are listed in
Table 6. The hyperparameters listed in
Table 6 were determined through a systematic random search process aimed at minimizing the prediction error on the validation dataset. Several combinations of parameters were tested, and the configuration yielding the best performance metrics was selected as optimal for model training. It is noted that the depths of the DT and RF models were varied between 10 and 25, and the maximum reduction in their accuracy was less than 1%, indicating that both models were relatively insensitive to overfitting. Furthermore, the lower accuracy of the DT model demonstrates its limited ability to capture the complex nonlinear behavior of structures under earthquake excitation.
The performance of ML algorithms for the prediction of damage levels in the retrofitted buildings using BI must be evaluated using appropriate criteria. Based on published papers in the field of ML application in the prediction of damage level of buildings exposed to earthquakes, three criteria known as Precision, F1-score, and Recall were selected to assess and compare the accuracy of each model. The mentioned criteria are defined in
Table 7. Note that in
Table 7, True Positives (TPs) refer to instances where the model correctly predicts a given class. False Positives (FPs) and False Negatives (FNs) represent cases where the model incorrectly assigns a class or fails to identify it, respectively.
Table 8 presents the F1-score of the employed ML algorithms for predicting different damage levels (DLs) in the retrofitted buildings. Note that the averages shown in the table denote the weighted averages. Overall, the average accuracy for all algorithms exceeds 84%. The maximum accuracy is achieved by the XGBoost method, while the minimum accuracy is observed in the performance of DT with a difference of 5% between the two algorithms. For most ML methods, the maximum accuracy is observed in DL1, while the minimum accuracy is seen for the results corresponding to DL5. It is noteworthy that approximately 13% of the dataset represents DL5, which may contribute to the reduced accuracy for this class. Across all ML models, the F1-scores exhibit a non-monotonic trend: accuracy decreases from DL1 to DL2, increases from DL2 to DL4, and declines again from DL4 to DL5.
Figure 5 shows a bar plot of the accuracy metrics for the ML methods, and similar trends are observed for Precision and Recall.
In
Table 8, although the imbalanced data distribution affects predictive accuracy for extreme cases, the models still achieve acceptable performance: the difference between the overall average F1-score and that of damage level 5 was less than 10%. This suggests that the minority class was reasonably well captured despite its smaller representation.
To quantitatively confirm model differences, paired
t-tests were conducted on the 5-fold macro-F1-scores of all models. The results indicated that XGBoost achieved statistically higher performance than the Decision Tree, Random Forest, SVM, and ANN (
p < 0.01 for all comparisons). These results confirm that the superiority of XGBoost, although visually subtle in
Figure 5, is statistically significant. The corresponding
t-values and
p-values are reported in
Table 9.
Table 9 presents the
t-statistics and corresponding
p-values obtained from paired
t-tests on the 5-fold cross-validation macro-F1-scores. The
t-statistic indicates the magnitude of difference between models, while the
p-value denotes the probability that this difference occurred by chance. Values of
p < 0.05 were considered statistically significant.
The prediction accuracy of the ML techniques was further evaluated using confusion matrices as depicted in
Figure 6. The confusion matrices are presented for five algorithms: DT, RF, SVM, XGBoost, and an ANN. For instance, the algorithm XGBoost, which achieved the best performance among the ML techniques used in this paper, correctly labeled and predicted 103 out of 109 samples for DL1. In contrast, for the highest damage level (DL5), the number of correct predictions was lower with 91 correct predictions out of 117 samples. Again, the worst performance can be seen for the highest DL.
The superior performance of XGBoost and the ANN compared to DT and RF can be attributed to their ability to capture the strongly nonlinear relationships governing the seismic response of base-isolated RC structures. The behavior of such systems is highly dependent on the interaction between isolator stiffness, ground motion properties, and superstructure flexibility, which leads to complex patterns in drift and damage responses. XGBoost, through gradient boosting and adaptive weighting of misclassified samples, effectively learns these intricate dependencies, while the ANN benefits from its multi-layer architecture to approximate nonlinear mappings between input features and damage levels. In contrast, DT and RF, although interpretable, rely on axis-aligned splits that limit their ability to fully represent the coupled influence of multiple isolation parameters, resulting in comparatively lower prediction accuracy [
69].
Examination of the confusion matrices showed that most misclassifications occurred between adjacent damage levels, particularly between DL3 and DL4. This pattern is expected because structural damage evolves continuously rather than in discrete steps, and the boundary between moderate and severe damage is often ambiguous. Misclassification in the collapse category (DL5) stems mainly from the limited number of high-intensity samples and the inherent uncertainty of extreme responses in nonlinear dynamic analyses [
70]. Despite these challenges, the models—especially XGBoost and the ANN—maintained consistent predictive accuracy across the range of seismic intensities represented in the dataset. This robustness suggests that the learned relationships between isolation properties, seismic demand, and damage level are stable and generalizable under varying loading conditions.
Finally, the most effective parameters in damage level identification of RC retrofitted buildings by using a BI system are shown in
Figure 7. The ML models were trained considering 28 factors, and
Figure 7 shows the most important ones. The top three most effective factors are spectral velocity corresponding to the period of the first mode SV(T
1), peak ground velocity (PGV), and Housner intensity (HI). These three parameters are highly dependent on the velocity history of the seismic event and the natural period of the retrofitted buildings. Based on the ranking, the value of the acceleration spectrum in the period of T1 is much less important than the corresponding value of the velocity spectrum. The results shown in
Figure 7 support the fact that the seismic performance of base-isolated (BI) buildings is highly influenced by the distance of the seismic event from the fault. It is also important to note that near-field ground motions typically exhibit high maximum spectral velocity (MSV) due to the pulse-like characteristics present in their time-histories.
As seen in
Table 7, most of the ML algorithms exhibit their highest accuracy for the lowest damage level (DL1), followed by a noticeable drop from DL1 to DL2. This trend indicates that predicting the onset of nonlinear response in base-isolated buildings is more challenging than identifying the undamaged state. As shown in
Figure 7, the spectral velocity at the fundamental period, S
V(T
1), is the most influential parameter in determining the damage level. The observed decrease in accuracy can therefore be attributed to the shift in the fundamental period of the structure as the isolation system enters higher nonlinearity with increasing seismic demand. This period elongation alters S
V(T
1) and reduces the ability of models trained under lower-intensity conditions to accurately predict transitions from elastic to inelastic behavior.
8. Conclusions
This study focused on using ML models to predict the damage levels of retrofitted RC buildings. Initially, 48 RC buildings were designed based on previous versions of the ASCE. The buildings differed in the number of stories (5–8), number of bays (3–5), site classification (types B and C), and type of moment-resisting frame (special and intermediate).
After designing the buildings based on older versions of ASCE and ACI codes, the BI system was designed for upgrading the buildings. The selected type for the BI system was lead rubber bearing, and it was designed based on the provisions of ASCE-22. Afterward, a finite element model was produced for each building using the OpenSees software package. Seismic excitations with variable intensities were imposed on the modeled buildings, and nonlinear time-history and incremental dynamic analyses were conducted. The output of the time-history analysis was the damage level in each building, which was specified based on maximum inter-story drift. Five damage levels were defined based on the maximum inter-story drift.
Afterward, five ML models named DT, RF, SVM, XGBoost, and ANN were employed to predict the damage level in the buildings. The performance of ML models is dependent on the data used for training them. Therefore, a random part of data was used to train the ML models, and the minimum accuracy gained for each method was finally recorded. This process led to removing the effects of data used for training on the performance of ML models.
The performance of ML algorithms was compared by defining appropriate parameters named F1-score, Precision, and Recall. Furthermore, confusion matrices were assembled to present a better comparison of the ML methods in each damage level. Based on the results, the ML algorithms were able to predict the damage level with an accuracy between 84% and 89%. The minimum accuracy was derived for DT, while the maximum accuracy was derived for XGBoost. For each ML algorithm, the best performance was observed for damage level 1, while the worst performance was observed for damage level 5. Note that damage level 1 corresponds to minimum inter-story drift and damage, while damage level 5 corresponds to the maximum damage level.
The most influential parameters affecting the damage levels of the RC buildings retrofitted with BI systems were identified. Based on the results, the velocity time-history of the ground motion emerged as the dominant factor in the damage level of the retrofitted RC buildings using BI. The top three parameters affecting the damage level were the spectral velocity corresponding to the period of the first mode of the retrofitted building, peak ground velocity (PGV), and Housner intensity. These parameters are strongly influenced by the distance of the seismic events from the faults, distinguishing between far-field and near-field events, which is also a very important parameter in the design process of the BI systems for retrofitting the RC buildings. Furthermore, the velocity time-history was found to have a greater impact on damage prediction than the acceleration time-history.
In this study, it was demonstrated that the proposed ML models can predict the damage levels of RC buildings retrofitted with base isolation with suitable accuracy. Such predictive capability provides engineers with a practical decision-support tool when evaluating retrofit options for existing RC buildings, particularly those designed according to older seismic codes with lower base-shear capacity. By using the ML-based predictions, engineers can better assess whether base isolation will achieve the desired performance objectives, estimate the potential reduction in damage under different seismic intensities, and compare the cost-effectiveness of BI retrofitting against alternative strengthening methods. This highlights the practical relevance of the proposed approach for real engineering decision-making in seismic retrofit projects.
Finally, despite the large number of time-history analyses conducted in this study, several limitations remain and are recommended for future investigation. First, this research employed two-dimensional models of retrofitted RC buildings. Although 2D modeling is widely accepted in many well-known seismic design codes, extending the analyses to fully three-dimensional models would provide a more comprehensive understanding of the structural behavior, especially for seismically isolated systems. In addition, the effects of soil–structure interaction were not considered in the present work and should be examined in future studies to better capture the realistic structural response. Moreover, both the properties of the retrofitted buildings and the mechanical characteristics of the base isolation systems can exhibit significant variability. Incorporating sensitivity analyses and explicitly accounting for these uncertainties within the machine learning framework would enhance the robustness and generalizability of the predictive models.