Next Article in Journal
Development, Implementation, and Experimental Validation of a Novel Thermal–Optical–Electrical Model for Photovoltaic Glazing
Previous Article in Journal
The Indoor Environment During Swimming Competitions and Its Impact on Construction Materials: Airborne Trichloramine as a Degradation Factor
Previous Article in Special Issue
Performance-Based Maintenance and Operation of Multi-Campus Critical Infrastructure Facilities Using Supply Chain Multi-Choice Goal Programming
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integrating Environmental Conditions into Machine Learning Models for Predicting Bridge Deterioration

by
Papa Ansah Okohene
and
Mehmet E. Ozbek
*
Department of Construction Management, Colorado State University, Fort Collins, CO 80523, USA
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(22), 12042; https://doi.org/10.3390/app152212042
Submission received: 18 September 2025 / Revised: 7 November 2025 / Accepted: 10 November 2025 / Published: 12 November 2025
(This article belongs to the Special Issue Infrastructure Resilience Analysis)

Abstract

Accurate prediction of bridge deterioration is essential for optimizing inspection schedules, prioritizing maintenance activities, and ensuring infrastructure safety. This study developed machine learning-based predictive models to estimate the deterioration states of bridge decks, superstructures, and substructures using a comprehensive dataset from the Colorado National Highway System spanning 2014 to 2024. Structural, operational, and environmental parameters including freeze–thaw cycles, precipitation, condensation risk, and extreme temperatures were incorporated to capture both design-driven and climate-driven deterioration mechanisms. Decision Tree, Random Forest, and Gradient Boosting classifiers were trained and evaluated using Balanced Accuracy, Matthews Correlation Coefficient, Cohen’s Kappa, and macro-averaged F1-scores, with class imbalance addressed via SMOTETomek resampling. The Gradient Boosting classifier achieved the highest predictive performance, with balanced accuracy exceeding 97% across all components. Feature importance analysis revealed that sufficiency rating, year of construction, and environmental stressors were among the most influential predictors. Incorporating environmental variables improved predictive accuracy by up to 4.5 percentage points, underscoring their critical role in deterioration modeling. These findings demonstrate that integrating environmental factors into machine learning frameworks enhances the reliability of deterioration forecasts and supports the development of climate-aware asset management strategies, enabling transportation agencies to proactively plan maintenance interventions and improve infrastructure resilience.

1. Introduction and Purpose

Bridge infrastructure is a critical component of transportation networks that support the transit of people, goods and services. Bridges also play an important role in reducing the cost of transport, improving accessibility, and enabling trade and commerce between regions [1]. Bridges deteriorate as a result of both loads and time, with the former including those due to various types of vehicles; the latter includes but is not limited to the effects of material aging and environmental conditions such as variations in weather [2]. Heavy traffic, especially freight traffic, places constant loading on structural elements resulting in capacity reduction due to fatigue and damage accumulation [3]. Environmental conditions such as temperature fluctuations, moisture ingress and chemical agents (e.g., the use of deicing salts), all cause deterioration of bridge materials in particular concrete and steel [4]. For example, moisture containing chloride ions from deicing salts accelerates steel reinforcing corrosion, while freeze–thaw cycles are responsible for surface cracking and spalling of concrete [4]. In this manner, it impacts the structural integrity of bridges and underlines the necessity for routine maintenance of safety and prevention of catastrophic failures.
Traditionally, bridge maintenance planning has been based on a mix of periodic inspections and expert opinion as well as schedules defined by bridge age or amount of traffic load. Due to the limitations, there is a growing trend in bridge maintenance towards data-driven approaches, especially machine learning (ML) techniques for predictive modeling of bridge deterioration. The ability of ML to analyze large amounts of historical data (e.g., inspection records, structural parameters and traffic loads) allows to find patterns and trends that are not easily observable in a manual inspection [5]. The growing availability of data from smart sensors, remote sensing technologies and infrastructure monitoring systems further promoted the application of ML in bridge management [6]. Despite substantial progress in developing ML models to predict bridge deterioration, a lack of understanding regarding the effect of environmental conditions on the rate of deterioration means that there is still an important gap remaining.
Most ML models today in bridge maintenance work on structural parameters and inspection data while treating environmental factors either as a secondary consideration or they are indeed left out of the model completely [2,5,7]. This method’s lack of generalizability limits its application to structures like bridges in regions with intense or diverse climates. Moreover, adverse effects of climate change will most likely increase the environmental loads on bridges; hence, such conditions should be factored into predictive models [8]. Due to the increasing focus on data-driven decision-making in infrastructure management, it is crucial to improve ML models so they can learn from environmental data more effectively. This will give a comprehensive view of the factors leading to the deterioration of bridges resulting into an integrated prediction and reliable maintenance stratagem. This integration will not only enhance the reliability of predictions over diverse climatic regions but also allow one to develop region-specific maintenance protocols addressing both structural and environmental vulnerabilities.
This study addresses these gaps by integrating environmental variables with structural and inspection data into ML-based predictive models for bridge deterioration. Unlike prior studies that primarily relied on structural and inspection data, the innovation of this study lies in the explicit integration of environmental stressors such as freeze–thaw cycles, precipitation, humidity, condensation risk, and temperature extremes into machine learning models of bridge deterioration. This integration, coupled with the use of advanced resampling techniques to address severe class imbalance and a multi-metric evaluation framework, allows for the development of climate-aware predictive models that are more generalizable across regions and resilient to changing environmental conditions. The research focuses on the National Highway System bridges in Colorado, leveraging 11 years of historical data from the National Bridge Inventory (NBI) and climate records from the PRISM dataset.

2. Literature Review

2.1. Environmental Conditions Affecting Bridge Deterioration

Environmental conditions contribute to a major part of bridge structure deterioration, which greatly affects their lifespan and structural integrity. Interactions between the environmental and the inherent stresses in bridges can result in accelerated material degradation, thus leading to great challenges in maintenance and management practices. One major environmental factor in bridge deterioration is temperature changes. Severe temperature changes may cause thermal expansion and contraction in bridge materials that can lead to cracking and spalling, mainly in regions subjected to big seasonal variations.
Studies show that it is known that freeze–thaw cycles, which occur much more in colder climates, enhance these impacts by making materials degrade more with the cyclic stresses acted upon them [9,10]. Materials in different temperatures show expansion and contraction, creating stress concentrations that weaken structural components over time and finally compromise the integrity of the bridge [11,12]. Along with temperature, humidity also plays a role in the deterioration process. Higher the humidity, greater the exposure to moisture, and thus corrosion in steel members and weakening of concrete advance at a much faster rate. Research has indicated that moisture not only accelerates the electrochemical reactions that cause corrosion but also affects the rusting of steel reinforcement in reinforced concrete [13]. The interaction between humidity and temperature, therefore, has produced a corrosive ambient environment that further complicates the maintenance management of bridge infrastructures [14,15].
Water ingress and precipitation are also key contributors to bridge deterioration. Excessive rain contributes to foundation scouring and destabilizing structures, especially in flood regions. Water infiltration in bridge concrete and structural steel can accelerate the corrosion process, and the deterioration of the materials [16]. In addition, the accumulation of water can promote biological growth that cause loss of structural integrity of bridges [1,17]. Another important factor is exposure to corrosive agents, which includes chlorides and de-icing salts. In Colorado, where de-icing salts are normally used throughout winter months, the likelihood of corrosion increases significantly. Chlorides may seep through concrete and initiate the onset of corrosion of steel reinforcements, leading to spalling and cracking of the concrete cover [18]. This can especially be true in the presence of chlorides at high humidities and temperatures, forming a very corrosive environment that causes further deterioration of the components [19]. This could be more dangerous for bridges located in coastal areas where the salt spray accelerates the processes related to corrosion [10,16].
Long-term exposure to adverse environmental conditions can cause cumulative effects that result in the progressive weakening of the structural components. Over time, such a combined effect of temperature changes, humidity, precipitation, and corroding agents can significantly shorten the service life of bridges. Research indicates that these deteriorating effects are mostly nonlinear; hence, the damage produced by an individual environmental exposure is cumulative [11,20]. This underlines the importance of continuous monitoring and assessment of environmental conditions in informing maintenance strategies to extend the life span of bridge infrastructures [17].
The above-discussed environmental factors, i.e., temperature variations, freeze–thaw cycles, humidity, precipitation, and chlorides exposure due to de-icing, are some of the most frequent causes of accelerated bridge deterioration. The stresses have varying effects on bridges depending upon the varying climatic and structural conditions, and their overall effect can severely degrade materials over the years. Several studies have demonstrated that these environmental parameters are one of the significant sources of damage, and it is thus necessary to incorporate them into predictive models in order to achieve improved maintenance planning [4,21]. Although these conditions are some of the most prevalent environmental stressors, they are not the sole conditions influencing bridge deterioration. Some other conditions like ultraviolet radiation, wind-borne debris, air pollution, soil and groundwater chemistry, hydraulic erosion, vegetation growth, ice formation, and excessive heat from wildfires have the potential to influence bridge degradation. These conditions, either singly or in combination with other conditions, have the potential to exacerbate the degradation of bridge materials [22].

2.2. Traditional Approaches in Bridge Deterioration Prediction

Traditional approaches to modeling and forecasting of the deterioration of bridges have based infrastructure management on empirical models, statistical analysis techniques, and heuristic methods. Most of those are based on the adoption of predefined deterioration curves, material degradation rates, and some general hypotheses with respect to structural performance when assessing the bridge condition at any instant in time. For instance, Markov chain models have been widely used in predicting future states of bridge components based on past records, hence providing a probabilistic framework that captures the stochastic nature of deterioration processes [23,24,25].
The most common frameworks of traditional approaches include regression models, condition rating systems, and Markov chains. Regression models are mostly based on historical inspection data in determining relationships among the factors of deterioration, like age, traffic loads, and environmental conditions. In contrast, condition rating systems allow for the standardized assessment of states of bridge components and, therefore, the classification of conditions in discrete states [26,27]. Markov chains, particularly, have been instrumental in developing bridge management systems (BMS) by modeling the transition probabilities between different condition states, thus enabling the prediction of future deterioration based on current assessments [24,25,28,29]. Nevertheless, besides their utility, traditional approaches have some serious drawbacks when dealing with the complex and multifactorial features of deterioration in bridges. These models often oversimplify the interactions between environmental conditions, traffic loads, and structural aging, treating these factors as static or isolated variables. For example, the Markov-based model assumes a time-invariant transition probability. Such an assumption cannot model the dynamic nature of environmental influences, such as temperature fluctuations, freeze–thaw cycles, and exposure to corrosive agents, in an effective way [30]. This static treatment generally leads to inaccurate predictions and suboptimal maintenance strategies in practice because the models do not adequately represent variability and uncertainty under real-world conditions [27].
Moreover, traditional approaches relied on periodic inspections and condition assessment, mostly leading to reactive strategies in maintenance. In many cases, these strategies do not address deterioration until deterioration has reached critical levels, which may imply increased costs for repair and possible safety risks [31,32]. Another major problem is the reliance on historical data in predicting future conditions, which may not show current or future environmental conditions that are substantially influencing the deterioration patterns [33]. For instance, traditional models overlook climate change impacts leading to an underestimation of deterioration rates in some contexts [31,33].

2.3. Machine Learning Approaches in Bridge Deterioration

ML has emerged as the revolutionizing subset of artificial intelligence that enables systems to learn from data, recognize patterns, and make forecasts or decisions without explicit programming. ML deals with handling huge datasets and complicated nonlinear relationships of features that are usually hidden in traditional statistical methods. Such capability is particularly important for infrastructure management, in which the integration of diverse data sources may enhance modeling and decision-making. ML uses a variety of algorithms, such as supervised, unsupervised, and reinforcement learning approaches to address different types of data and raise predictive accuracy in a high number of applications, such as bridge maintenance and deterioration prediction [34,35].
Application of ML to bridge deterioration prediction presents a revolutionary change for maintenance planning. Traditional methods are mostly based on deterministic models that do not account for the various factors affecting bridge health. These methods generally result in oversimplification of the analysis, which cannot catch the dynamics involved in the deterioration processes of bridges. On the contrary, ML offers a data-driven alternative that can take advantage of vast amounts of both historical and real-time data by recognizing complex patterns and more precisely predicting the trend of deterioration with more reliability [36,37].
ML techniques in bridge deterioration modeling include a large number of methodologies that can be mainly grouped into supervised and unsupervised learning. The former has shown improvement in predictive accuracy by using techniques such as regression models, support vector machines (SVM), and ensemble techniques, including random forests (RF) and gradient boosting (GB). These models are good at using historical inspection and environmental data to identify patterns that can make predictions on future deterioration states. For example, some studies have shown that ensemble methods can be an effective approach in integration for improving overall accuracy of the predictions from multiple predictive models for more reliable bridge condition forecasts [38,39]. Furthermore, unsupervised learning techniques like clustering and anomaly detection have become relevant for the identification of deterioration patterns when labeled datasets are poor, enabling the finding of hidden insights in data [40,41].
One of the benefits associated with ML approaches involves the potential to integrate dynamic environmental conditions within predictive models. Whereas in traditional techniques, the consideration of environmental factors is generally secondary or static, in ML models, there is a potential for synthesizing real-time data that will accurately represent temperature shifts, humidity, freeze–thaw cycles, and exposure to corrosive agents. This would enable more holistic and site-specific assessments of bridge deterioration. For example, research have proven that the inclusion of environmental variables in predictive models increases the accuracy of deterioration forecasts made by maintenance teams, which can then prioritize interventions using real-time risk assessment [42].
Recent studies also indicate a growing trend toward leveraging societal trends such as smart cities and the Internet of Things (IoT) for dynamic bridge monitoring. These interconnected systems facilitate the continuous flow of data, enabling real-time analysis and management of structural conditions via machine learning [43]. Continuous and proactive monitoring can significantly reduce the risks of catastrophic failures associated with aging infrastructure. The evolution of artificial intelligence in civil engineering further supports long-term strategies for infrastructure maintenance and management. These technologies promise to optimize processes, reduce operational costs, and extend the functional lifecycle of critical bridge infrastructure [44]. As more practitioners recognize the significant impacts of AI-enhanced methodologies, the transfer of knowledge into practice could redefine the standards for structural health monitoring and predictive maintenance.
Recent advances in deep learning have also shown potential for bridge deterioration assessment. Frameworks such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), and graph neural networks (GNNs) are increasingly used for tasks like crack detection, spatiotemporal prediction, and integration of image and sensor data. While these approaches are better suited for unstructured data, they complement the tabular data–oriented machine learning models used in this study and highlight opportunities for future research.
Notwithstanding all the encouraging developments relating to ML for bridge deterioration modeling, a number of challenges persist. Among them is that there will be high demand for quality and diversity in data. The performance of ML models strongly depends on the quality and quantity of the data in training. Some complicated models require heavy computation resources to train the model, which may act as a barrier for a wide adaption of the practice in reality [45,46,47]. However, the opportunities presented by ML techniques in learning from enormous volumes of data, adapting to changes in conditions, and providing insights that can lead to practical actions underpinning their potential to revolutionize bridge maintenance.

3. Materials and Methods

Data for this research was collected from two main sources. The bridge data used for analysis were downloaded from the NBI databases. The NBI provides data on condition ratings of bridge components. It categorizes them into a scale of 0–9, where 9 means “excellent” and 0 means “failed”. It includes data on design characteristics of the bridge and other operational characteristics, such as ADT. In addition, the database provides data on the states (good, fair and poor) of the bridge elements. Environmental data were collected from the PRISM dataset in respect to the state of Colorado. It includes temperature variables of maximum, minimum, and mean, precipitation and relative humidity. Because both the NBI and climate datasets follow standardized formats, the framework can be readily extended to other regions by linking local bridge inspection records with corresponding environmental data.
The data preprocessing stage consists of two main phases: (1) data cleaning and preparation, and (2) data standardization and transformation. In order to merge datasets, identifiers such as bridge id, and geographic coordinates (latitude and longitude) were used to align datasets. Due to varying changes in environmental data for a particular location, geographic proximity was important to ensure that climate data corresponded to the exact or nearest location of bridges considered. Also, environmental data such as temperature and precipitation are often recorded at daily or hourly intervals, therefore these data needed to be synchronized with the timeline of our structural data (such as bridge age). All preprocessing steps were validated by examining distributions of key variables (e.g., sufficiency rating, traffic load) before and after imputation, outlier handling, and standardization. These checks confirmed that preprocessing improved data quality while preserving the underlying distributional characteristics across all 97 features. A clean dataset was subsequently created containing information about bridges and environmental condition data from the NBI as well as the PRISM datasets, respectively. During the cleaning process, missing data and inconsistencies such as missing data related to condition ratings were identified. These inconsistencies affect the analysis of the data and hence are addressed by handling missing data, removing duplicate records, detecting outliers that can skew the model’s understanding of the data, and treating outliers. The integration of bridge structural data with environmental conditions required a systematic approach to data standardization and transformation. The NBI condition ratings, ranging from 0 to 9, were standardized alongside diverse environmental measurements to ensure computational compatibility and meaningful analysis. Class imbalance was addressed using the SMOTETomek algorithm, which combines oversampling of minority classes with undersampling of borderline majority samples. The SMOTETomek resampling method has been widely applied in civil infrastructure datasets to handle class imbalance while preserving neighborhood structure in high-dimensional feature spaces [37]. For SMOTE, we set k = 5 nearest neighbors, ensuring that synthetic samples were generated within locally coherent neighborhoods in multi-dimensional feature space. Tomek Links were then applied to remove majority-class instances forming borderline pairs with minority samples, thereby improving class separability. Numerical features underwent standard scaling transformation, expressed as:
Z = x μ σ
where Z represents the standardized value, x is the original value, μ is the feature mean, and σ is the standard deviation [37]. This standardization was crucial for features such as bridge age, structure length, and traffic loads. To capture deterioration patterns effectively, a deterioration rate (DR) was calculated for each bridge component:
D R = 9 C R B A  
where CR represents the condition rating and BA denotes bridge age adapted from. Environmental measurements required specific transformations, including temperature standardization to Celsius and precipitation to millimeters. The categorical variables were one-hot encoded, ordinal variables maintained their hierarchical relationships through appropriate encoding schemes, and missing values were handled through domain-specific methods. Environmental data gaps were filled using spatial interpolation from nearby stations, and structured measurements were replaced on a similar bridge attribute basis.
Feature selection was carried out using machine learning algorithms that are well suited for assessing variable importance. Decision Tree (DT), RF, and GB were employed to compute importance scores for each feature. A composite importance index was then derived by aggregating the results from all three models. Highly correlated features, defined as those with a correlation coefficient above 0.9, were removed to reduce redundancy. Predictors that consistently ranked highly across the models were retained as the most significant variables for deterioration prediction.
The predictive modeling framework was built on three machine learning algorithms: DT, RF, and GB (implemented via XGBoost). DT provided a baseline due to their interpretability, while Random Forests offered improved performance through ensemble learning and feature interaction handling [48]. GB was selected for its strong predictive accuracy and robustness in handling class imbalance [49]. SVMs were initially considered but excluded due to computational challenges with the large dataset, which contained over 75,000 samples and 97 features, further expanded through synthetic oversampling. Hyperparameter optimization for each algorithm was performed using GridSearchCV with stratified three-fold cross-validation. To address class imbalance, the SMOTETomek technique was applied, and model calibration was conducted using Platt scaling to produce reliable probability estimates:
S   =   S M O T E X ,   N ,   k   T o m e k L i n k s X  
where S represents the balanced dataset, X is the original feature set, N is the desired sampling ratio, and k is the number of nearest neighbors used in the SMOTE algorithm [50].
The dataset was partitioned into training, validation, and test sets in a 64-16-20 ratio, stratified to preserve class distributions. Model-specific tuning included adjusting maximum depth, minimum samples per split, and Gini criterion for DT; selecting 100 estimators with bootstrap sampling for RF; and optimizing GB with a learning rate of 0.1, 100 estimators, and an early stopping criterion to prevent overfitting. To calibrate predicted probabilities, Platt scaling was applied by fitting a sigmoid function to decision scores on the validation set, which corrected for probability miscalibration commonly observed in imbalanced classification.
Model performance was evaluated using multiple metrics that are appropriate for class-imbalanced data. Balanced Accuracy, Cohen’s Kappa, MCC, and the Geometric Mean Score were employed alongside macro-averaged precision, recall, and F1-scores. Cohen’s Kappa coefficient metric is expressed as:
κ   = p o     p e 1     p e  
where po represents observed agreement and pe represents expected agreement by chance, providing a more robust measure of classification performance by accounting for random chance agreement [37]. To specifically address class imbalance challenges, MCC metrics was implemented:
M C C   = T P   ×   T N     F P   ×   F N T P   +   F P T P   +   F N T N   +   F P T N   +   F N
where FP represents false positives and FN represents false negatives [51]. To ensure robustness, stratified k-fold cross-validation (k = 3) was used, and performance estimates were bootstrapped 1000 times to provide confidence intervals. These metrics offered a comprehensive evaluation of the models’ ability to predict bridge deterioration under realistic conditions. The same evaluation framework was applied to all three algorithms (DT, RF, and GB) to allow direct performance comparisons while mitigating issues related to the naturally imbalanced characteristics of infrastructure maintenance data. From a methodological standpoint, this study also made several design choices to ensure robustness and scalability. A correlation threshold of 0.9 was applied to reduce redundancy among predictors while retaining sufficient explanatory features, a level consistent with prior infrastructure deterioration studies [39,52]. In addition, Support Vector Machines were excluded after preliminary testing showed minimal performance gains (<0.5% difference in balanced accuracy) compared to tree-based models, while requiring substantially greater computation time on the full dataset. These choices ensured that the models remained both transparent and computationally feasible while maintaining high predictive performance.

4. Results

4.1. Dataset Overview

The dataset used in this study combined structural, operational, and environmental parameters for bridges located on the Colorado National Highway System between 2014 and 2024. After preprocessing, the merged dataset comprised 75,063 samples and 97 unique features, incorporating condition ratings for decks, superstructures, and substructures, along with traffic data, structural attributes, and climatic variables such as mean temperature, precipitation, relative humidity, freeze–thaw frequency, and condensation risk.
Figure 1 illustrates the distribution of bridges across maintenance condition classes, showing that the majority of structures are in either “Good” (53.21%) or “Fair” (43.05%) condition, while only a small proportion fall into the “Poor” category (3.74%). This pronounced class imbalance necessitated the use of synthetic oversampling and undersampling techniques during model training, specifically SMOTETomek, to ensure that minority classes were adequately represented.

4.2. Feature Importance Analysis

Feature selection then cleaned the dataset by identifying useful predictors of bridge deterioration. Filtering at a high level of inter-feature correlation (threshold = 0.9) eliminated ten features, including ROADWAY_WIDTH_MT_051, BRIDGE_AGE, among others. Using DT, RF and GB classifiers to identify the most significant predictors of bridge deterioration feature selection was performed. Across all three models, the SUFFICIENCY_RATING emerged as the single most influential feature, with an average importance score of 0.6918, indicating its comprehensive representation of a bridge’s overall structural adequacy. Other prominent predictors included the year of construction (YEAR_BUILT_027), years since reconstruction, geographic coordinates (LAT_016, LONG_017), and approach roadway width (APPR_WIDTH_MT_051). Environmental factors, particularly freeze–thaw frequency, precipitation, and condensation risk, consistently ranked among the top contributors, highlighting the strong influence of climatic stressors on deterioration mechanisms [53]. The top ten predictive features and their average importance scores are presented in Table 1. The inclusion of geographic coordinates among leading predictors aligns with previous findings demonstrating the spatial variability of deterioration mechanisms driven by localized climatic and environmental conditions [37].
Among the environmental variables included, freeze–thaw cycles and extreme temperature events consistently ranked among the most influential predictors across models. These stressors are well-documented drivers of cracking, delamination, and accelerated fatigue, particularly in deck elements. Precipitation and humidity also contributed measurably, with higher importance in substructure condition predictions, reflecting moisture-driven mechanisms of scour and material degradation. While chloride exposure was not directly represented in this dataset due to data limitations, related variables (e.g., freeze–thaw and precipitation) indirectly captured correlated effects. These results support the practical value of including climatic stressors in deterioration models, as they mirror known physical mechanisms of degradation.
Figure 2 presents the DT-based modeling workflow applied for predicting deterioration states of bridge components. The model was developed to predict bridge maintenance needs, with key decision splits based on features such as bridge age, traffic load and freeze–thaw cycles. After preprocessing the dataset, feature selection was performed and the most significant predictors including bridge age, traffic load, freeze–thaw cycles, and precipitation were used to train component-specific DT for decks, superstructures, and substructures.
For Random Forest (RF), an ensemble of 100 decision trees was trained on bootstrap samples with replacement. At each split, a random subset of features (√p) was considered, which reduces correlation among trees and improves generalization. This configuration balances predictive stability with computational efficiency. The final prediction was determined by majority voting across all trees. For Gradient Boosting (GB), decision trees of maximum depth 3 were trained sequentially, where each tree was fit to the negative gradient of the loss function relative to current predictions. This process updates the ensemble in the steepest descent direction, incrementally improving predictive performance. A learning rate of 0.1 was used, and early stopping was applied (halting training after 50 rounds without improvement in validation loss) to ensure convergence and prevent overfitting.
Recent studies have reported comparable computational behavior for similar predictive tasks in bridge deterioration modeling. For instance, refs. [39,52] demonstrated that decision tree–based and ensemble methods achieve efficient convergence on multi-feature datasets with runtimes ranging from a few seconds to several minutes, depending on feature dimensionality and data volume. Ref. [37]. achieved comparable processing times when optimizing gradient boosting for statewide bridge datasets, highlighting its suitability for large-scale monitoring. Likewise, refs. [54,55] noted that moderate increases in computation time are justified by tangible gains in predictive accuracy and decision support value for maintenance planning. These comparisons confirm that the models developed in this study fall well within the practical limits of computational efficiency reported in the recent literature while maintaining engineering applicability and scalability.

4.3. Model Performance Evaluation

The predictive models were evaluated using balanced accuracy, Cohen’s Kappa, MCC, geometric mean, and macro-averaged precision, recall, and F1-scores. The results for DT, RF and RF classifiers are summarized in Table 2.
Among the three algorithms, the GB classifier consistently achieved the best performance across all bridge components, with balanced accuracy scores ranging between 0.972 and 0.978 and MCC values near 0.97. DT, while slightly less robust overall, achieved the highest single metric for deck components, attaining a balanced accuracy of 0.9875.
Random Forests demonstrated stable and competitive results, though they generally trailed GB slightly in predictive performance. In addition to the numerical results presented in Table 2, Figure 3 provides a visual comparison of DT, RF, and GB across key metrics and components, highlighting relative strengths and performance stability of the models. These findings are consistent with prior studies [38,39] demonstrating the effectiveness of ensemble-based approaches such as GB and RF in modeling deterioration patterns of complex infrastructure systems.
Across components, the GB classifier consistently achieved the highest or near-highest scores across all evaluation metrics. For deck prediction, GB achieved a Balanced Accuracy of 0.978, compared to 0.988 for DT and 0.938 for RF. Although DT marginally exceeded GB in this single case, GB outperformed both DT and RF in superstructure prediction (0.972 vs. 0.953 for DT and 0.929 for RF) and substructure prediction (0.974 vs. 0.975 for DT and 0.935 for RF). On average across all components, GB exceeded RF by approximately 3.8 percentage points in Balanced Accuracy and demonstrated greater stability in MCC and Macro F1-scores. These results confirm GB as the most reliable overall performer among the three models.

4.4. Effect of Integrating Environmental Variables

The excellent performance of the DT across all bridge components highlights the critical role of integrating environmental conditions with traditional structural metrics in predicting bridge deterioration (as shown in Table 3). In this study, a feature selection and importance analysis identified several key predictors: environmental variables driving deterioration trends, while bridge age (time in service since construction or last major rehabilitation) stood out as the most critical structural attribute, with traffic load measures (e.g., Average Daily Truck Traffic) and certain design attributes (material, span length, etc.) also contributing significantly. This finding is consistent with recent research. For example, ref. [54] reported that age is the single most important factor in bridge deck degradation, but closely followed by environmental stressors like freeze–thaw cycle counts and rainfall, as well as heavy truck traffic volumes. The study results align with that pattern: age and traffic set the baseline rate of wear, and environmental factors modulate that rate, often accelerating deterioration in harsh climates. The inclusion of climate variables in the DT model led to tangible improvements in predictive performance, as evidenced by the near-perfect metrics obtained, a clear contrast to models trained without those inputs. In practical terms, this means the integrated model was much better at catching those bridges that, due to environmental exposure, deteriorate faster than what one would expect from age or traffic alone. These improvements demonstrate that while the algorithms employed are established, the novelty of this work arises from enhancing them with climate-driven variables and advanced evaluation procedures, which significantly extend their utility for infrastructure asset management beyond prior approaches.
The RF results across all three bridge components underscore the critical importance of integrating environmental conditions with traditional structural metrics. Overall, the RF model attained very high balanced accuracies (≈0.93) and macro F1-scores (≈0.96) for deck, superstructure, and substructure alike, a performance level that was unattainable by the baseline model lacking environmental inputs (see Table 3). This performance validates the hypothesis that considering both climate exposure and structural characteristics yields more predictive power [54].
Table 3 presents the comparative results, showing that integrating environmental conditions improved balanced accuracy and macro F1-scores by 3.0 to 4.5 percentage points, corresponding to a 6–15% reduction in relative error rates for deterioration forecasting.
Traditional factors such as age and heavy traffic (ADT/ADTT) remain fundamental drivers of deterioration, as expected, but environmental stressors significantly influence the rate and severity of damage [54]. The synergy between these types of features is evident in the RF model’s performance, and each component’s condition is best predicted by a combination of intrinsic properties and external exposures [37]. The results here reinforce the consensus that comprehensive models provide superior predictions for infrastructure health. In practical terms, this means that bridge management agencies can achieve more reliable deterioration forecasts by incorporating climate data, which supports a shift toward proactive maintenance strategies. Studies have noted that predictive models leveraging environmental inputs enable earlier detection of critical deterioration, leading to optimized maintenance planning and improved lifecycle outcomes. The RF model’s success thus highlights the value of an integrated approach: environmental and structural metrics together produced a robust predictor of bridge deterioration, ultimately contributing to more resilient and cost-effective bridge maintenance programs.
Figure 4 presents the measurable performance gains achieved when environmental variables are included in the models. Results are shown separately for Decision Tree, Random Forest, and Gradient Boosting classifiers, across all bridge components (deck, superstructure, substructure) and evaluation metrics (Balanced Accuracy, MCC, and Macro F1). Improvement values are calculated as the difference between performance with environmental variables and performance without them. These results demonstrate that environmental stressors, particularly freeze–thaw cycles, annual rainfall, and extreme temperatures, play a critical role in accelerating bridge deterioration and should be explicitly accounted for in predictive frameworks [4,21]. The improved predictive capability offered by the integration of environmental variables supports the adoption of risk-informed maintenance strategies tailored to regional climatic conditions. Such approaches enable transportation agencies to better allocate resources and proactively schedule inspections for bridges most vulnerable to environmental degradation.

5. Discussion

This study developed predictive models to estimate the deterioration of bridge components using structural, operational, and environmental data from the Colorado National Highway System. The findings demonstrate that integrating environmental variables significantly improves predictive accuracy, underscoring the important role that climatic conditions play in the degradation of bridge infrastructure.
The feature importance analysis revealed that sufficiency rating is the most influential predictor across all models, corroborating findings from earlier studies where overall structural adequacy indices were shown to capture composite deterioration effects effectively [37,53]. Variables such as year of construction, geographical coordinates, and approach roadway width also ranked highly, consistent with prior evidence linking bridge age and design parameters to deterioration rates [38,39]. The consistent prominence of geographic coordinates reflects the spatial heterogeneity of deterioration mechanisms, reinforcing previous work that highlights regional differences in structural performance due to localized climate and environmental stressors [4]. The model outcomes are influenced to some extent by the selected correlation threshold and resampling parameters. However, these choices are consistent with prior studies in bridge deterioration prediction, and sensitivity checks conducted during development indicated that moderate variation in these parameters did not materially alter the ranking of predictor importance or relative model performance [37,39,52].
A notable contribution of this study is the explicit integration of environmental factors into the predictive framework. Climatic variables such as freeze–thaw cycles, precipitation, condensation risk, and extreme temperatures emerged as key predictors influencing deterioration patterns, which aligns with previous research emphasizing the vulnerability of transportation infrastructure to environmental conditions [21]. Our results demonstrate that including these variables improves balanced accuracy and F1-scores by up to 4.5 percentage points, confirming the hypothesis that environmental stressors accelerate deterioration processes in cold and variable climates. This finding reinforces the conclusions of [4], who demonstrated that freeze–thaw activity substantially contributes to deck cracking and delamination in regions with fluctuating winter temperatures.
Comparative model performance shows that the GB classifier outperformed DTs and RFs across most evaluation metrics. This supports earlier findings that ensemble-based learning techniques are highly effective for complex predictive tasks involving heterogeneous datasets [38,39]. While DTs offered interpretability and computational efficiency, their relatively lower predictive performance underscores the importance of leveraging advanced ensemble methods when dealing with highly imbalanced datasets such as this one. Furthermore, the successful application of SMOTETomek resampling to address class imbalance highlights the necessity of adopting robust data balancing strategies when predicting minority deterioration classes, which is a challenge often reported in bridge condition modeling literature [37].
From a methodological standpoint, this study also made several design choices to ensure robustness and scalability. A correlation threshold of 0.9 was applied to reduce redundancy among predictors while retaining sufficient explanatory features, a level consistent with prior infrastructure deterioration studies [39,52]. In addition, Support Vector Machines were excluded after preliminary testing showed minimal performance gains (<0.5% difference in balanced accuracy) compared to tree-based models, while requiring substantially greater computation time on the full dataset. These choices ensured that the models remained both transparent and computationally feasible while maintaining high predictive performance.
Beyond the numerical gains, the variations observed in predictor importance reflect meaningful physical and operational processes. For instance, fluctuations in the relative influence of environmental indicators such as freeze–thaw frequency and precipitation intensity are closely tied to the geographic and material heterogeneity of Colorado’s bridge inventory. Older concrete structures with lower sufficiency ratings exhibited stronger sensitivity to temperature extremes and moisture cycles, suggesting interaction effects between structural age, material composition, and local climate. Similar dependencies were reported by [37,54], who found that climatic exposure amplified deterioration rates in aging decks even under moderate traffic loads. In contrast, the relatively stable ranking of geometric variables such as approach width and span length indicates that design parameters exert consistent, long-term influence irrespective of environmental variation. These patterns demonstrate that deterioration is governed by both intrinsic structural factors and extrinsic environmental stressors, reinforcing the importance of multivariate modeling frameworks that capture such cross-dependencies.
From an engineering standpoint, the observed 3–4.5% improvement in accuracy yields tangible value when scaled to network-level decision-making. Even marginal gains can shift the prioritization of dozens of bridges toward timely maintenance, preventing cost escalation associated with deferred rehabilitation. This aligns with recent findings by [39,52], who emphasized that interpretability and deployment feasibility often outweigh raw algorithmic novelty in transportation asset management. Thus, the proposed climate-informed modeling approach balances predictive performance, transparency, and computational efficiency, offering a practical pathway for integrating environmental intelligence into existing maintenance workflows.
The implications of these findings are significant for infrastructure management and maintenance planning. Transportation agencies can utilize such predictive models to prioritize inspection schedules and optimize resource allocation, particularly for bridges most vulnerable to environmental stressors. In addition, identifying the sufficiency rating as the dominant predictor suggests that existing composite indices remain useful for deterioration assessment but should be supplemented with region-specific environmental metrics to improve forecasting reliability. The findings confirm that the methodological contribution of this study does not lie in proposing entirely new algorithms but in demonstrating how established models can be systematically enhanced through integration of environmental stressors, advanced imbalance handling, and rigorous evaluation. This innovation provides transportation agencies with climate-informed tools that improve the reliability and accuracy of deterioration forecasting.

6. Conclusions and Future Research

This study developed and evaluated machine learning models to predict the deterioration of bridge components using a combination of structural, operational, and environmental data for bridges on the Colorado National Highway System. Among the algorithms tested, the GB classifier consistently outperformed DTs and RFs across multiple evaluation metrics, achieving balanced accuracy values between 97.2% and 97.8% and Matthews Correlation Coefficients near 0.97 across bridge decks, superstructures, and substructures. In contrast, RF and DT models achieved balanced accuracy—between 93 and 95% and 95–98%, respectively—demonstrating the superior predictive stability of GB. Feature importance analysis revealed that the sufficiency rating remains the most influential predictor of deterioration, but environmental variables ranked highly across all models. Integrating these variables improved predictive accuracy by 3.0–4.5 percentage points compared to models without climate data, underscoring the importance of incorporating regional climatic stressors into deterioration modeling. The results support the adoption of climate-aware predictive frameworks capable of adapting maintenance strategies to evolving environmental conditions.
Despite the robustness of the results, some limitations warrant attention. First, while the models incorporated a rich set of environmental variables, other relevant stressors, such as chloride concentrations from de-icing salts, wind loading, and seismic activity, were excluded due to limited data availability. Future research should aim to integrate these factors where possible, particularly given their known impacts on corrosion and structural fatigue. Second, the models were developed using bridges within Colorado’s National Highway System, and while geographic coordinates partially capture spatial variability, generalizing findings to other regions with different climatic conditions will require external validation on broader datasets. Finally, while this study compared three prominent machine learning algorithms, exploring deep learning architectures and hybrid modeling approaches could further enhance predictive capabilities, particularly for large-scale national datasets.
Although this study focused on Colorado bridges, the methodology is broadly transferable. The standardized nature of NBI data and the availability of equivalent climate datasets in other regions make it possible to apply the same framework elsewhere. Minor adjustments for local conditions may be required, but the integration of structural and environmental variables ensures that the model design is generalizable. In future work, predictive frameworks could also be extended to incorporate real-time monitoring data from embedded sensors and Internet of Things (IoT) systems, allowing dynamic updating of deterioration forecasts. Such integration would facilitate proactive decision-making and improve the resilience of transportation infrastructure under evolving climatic conditions.

Author Contributions

Methodology, P.A.O.; Formal analysis, P.A.O.; Writing—original draft, P.A.O.; Writing—review & editing, P.A.O. and M.E.O.; Supervision, M.E.O.; Project administration, M.E.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Wang, T.; Liu, Y.; Li, Q.; Du, P.; Zheng, X.; Gao, Q. State-of-the-Art Review of the Resilience of Urban Bridge Networks. Sustainability 2023, 15, 989. [Google Scholar] [CrossRef]
  2. Srikanth, I.; Arockiasamy, M. Deterioration models for prediction of remaining useful life of timber and concrete bridges: A review. J. Traffic Transp. Eng. (Engl. Ed.) 2020, 7, 152–173. [Google Scholar] [CrossRef]
  3. Dinegdae, Y.H.; Birgisson, B. Effect of Heavy Traffic Loading on Predicted Pavement Fatigue Life. In 8th RILEM International Conference on Mechanisms of Cracking and Debonding in Pavements; Chabot, A., Buttlar, W.G., Dave, E.V., Petit, C., Tebaldi, G., Eds.; Springer: Dordrecht, The Netherlands, 2016; pp. 389–395. [Google Scholar] [CrossRef]
  4. Fujiu, M.; Minami, T.; Takayama, J. Environmental Influences on Bridge Deterioration Based on Periodic Inspection Data from Ishikawa Prefecture, Japan. Infrastructures 2022, 7, 130. [Google Scholar] [CrossRef]
  5. Jaafaru, H.; Agbelie, B. Bridge maintenance planning framework using machine learning, multi-attribute utility theory and evolutionary optimization models. Autom. Constr. 2022, 141, 104460. [Google Scholar] [CrossRef]
  6. Guan, S.; Bridge, J.A.; Li, C.; DeMello, N.J. Smart Radar Sensor Network for Bridge Displacement Monitoring. J. Bridg. Eng. 2018, 24, 04018102. [Google Scholar] [CrossRef]
  7. Liu, K.; El-Gohary, N. Fusing Data Extracted from Bridge Inspection Reports for Enhanced Data-Driven Bridge Deterioration Prediction: A Hybrid Data Fusion Method. J. Comput. Civ. Eng. 2020, 34, 04020047. [Google Scholar] [CrossRef]
  8. Kumar, P.; Kota, S.R. Machine learning models in structural engineering research and a secured framework for structural health monitoring. Multimed. Tools Appl. 2024, 83, 7721–7759. [Google Scholar] [CrossRef]
  9. Fom, P.B.; Imam, B.; Chryssanthopoulos, M.K. Long-term deterioration effects on the buckling strength of metallic bridge girders. In Proceedings of the Second International Conference on Performance-based and Life-cycle Structural Engineering (PLSE 2015), Brisbane, QLD, Australia, 16–18 December 2015; pp. 1515–1525. [Google Scholar] [CrossRef]
  10. Kallias, A.N.; Imam, B.M. Effect of Climate Change on the Deterioration of Steel Bridges. In Proceedings of the IABSE Conference, Rotterdam 2013: Assessment, Upgrading and Refurbishment of Infrastructures, Rotterdam, The Netherlands, 6–8 May 2013; pp. 548–549. [Google Scholar] [CrossRef]
  11. Feng, J.; Gao, K.; Wu, G.; Xu, Y.; Jiang, H. A deep learning-based interferometric synthetic aperture radar framework for abnormal displacement deformation prediction of bridges. Adv. Struct. Eng. 2023, 26, 3005–3020. [Google Scholar] [CrossRef]
  12. Nguyen, P.T.; Bastidas-Arteaga, E.; Amiri, O.; El Soueidy, C.-P. An Efficient Chloride Ingress Model for Long-Term Lifetime Assessment of Reinforced Concrete Structures Under Realistic Climate and Exposure Conditions. Int. J. Concr. Struct. Mater. 2017, 11, 199–213. [Google Scholar] [CrossRef]
  13. Zaki, A.; Chai, H.K.; Behnia, A.; Aggelis, D.G.; Tan, J.Y.; Ibrahim, Z. Monitoring fracture of steel corroded reinforced concrete members under flexure by acoustic emission technique. Constr. Build. Mater. 2017, 136, 609–618. [Google Scholar] [CrossRef]
  14. Garg, R.; Garg, R.; Singla, S. Experimental Investigation of Electrochemical Corrosion and Chloride Penetration of Concrete Incorporating Colloidal Nanosilica and Silica fume. J. Electrochem. Sci. Technol. 2021, 12, 440–452. [Google Scholar] [CrossRef]
  15. Stewart, M.G.; Wang, X.; Nguyen, M.N. Climate change adaptation for corrosion control of concrete infrastructure. Struct. Saf. 2012, 35, 29–39. [Google Scholar] [CrossRef]
  16. Obata, M.; Guotai, L.; Watanabe, Y.; Goto, Y. Numerical simulation of adhesion of sea-salt particles on bridge girders. Struct. Infrastruct. Eng. 2014, 10, 398–408. [Google Scholar] [CrossRef]
  17. Rizzo, P.; Enshaeian, A. Challenges in Bridge Health Monitoring: A Review. Sensors 2021, 21, 4336. [Google Scholar] [CrossRef] [PubMed]
  18. Gao, Y. Influence of Chloride Ion Corrosion on the Performance of Reinforced Concrete Beam Bridge in Offshore Environment. Arch. Civ. Eng. 2020, 66, 253–265. [Google Scholar] [CrossRef]
  19. Soliman, M.; Frangopol, D.M. Life-Cycle Cost Evaluation of Conventional and Corrosion-Resistant Steel for Bridges. J. Bridg. Eng. 2015, 20, 06014005. [Google Scholar] [CrossRef]
  20. Bastidas-Arteaga, E. Reliability of Reinforced Concrete Structures Subjected to Corrosion-Fatigue and Climate Change. Int. J. Concr. Struct. Mater. 2018, 12, 10. [Google Scholar] [CrossRef]
  21. Al-Rashed, R.; Abdelfatah, A.; Yehia, S. Identifying the Factors Impacting Bridge Deterioration in the Gulf Cooperation Council. Designs 2023, 7, 126. [Google Scholar] [CrossRef]
  22. Ibrahim, A.; Abdelkhalek, S.; Zayed, T.; Qureshi, A.H.; Abdelkader, E.M. A Comprehensive Review of the Key Deterioration Factors of Concrete Bridge Decks. Buildings 2024, 14, 3425. [Google Scholar] [CrossRef]
  23. Kobayashi, K.; Kaito, K.; Lethanh, N. A statistical deterioration forecasting method using hidden Markov model for infrastructure management. Transp. Res. Part B Methodol. 2012, 46, 544–561. [Google Scholar] [CrossRef]
  24. O’cOnnor, A.J.; Sheils, E.; Breysse, D.; Schoefs, F. Markovian Bridge Maintenance Planning Incorporating Corrosion Initiation and Nonlinear Deterioration. J. Bridg. Eng. 2013, 18, 189–199. [Google Scholar] [CrossRef]
  25. Hasan, S.; Setunge, S.; Law, D.W.; Koay, Y.C. Forecasting Deterioration of Bridge Components from Visual Inspection Data. Int. J. Eng. Technol. 2015, 7, 40–44. [Google Scholar] [CrossRef]
  26. Li, L.; Sun, L.; Ning, G. Deterioration Prediction of Urban Bridges on Network Level Using Markov-Chain Model. Math. Probl. Eng. 2014, 2014, 728107. [Google Scholar] [CrossRef]
  27. Weissmann, J.; Weissmann, A.J.; Montoya, A. Deterioration Models for Texas Bridges and Culverts. Transp. Res. Rec. J. Transp. Res. Board 2023, 2677, 307–316. [Google Scholar] [CrossRef]
  28. Ahmed, M.; Moselhi, O.; Bhwomick, A. Integration of NDE Measurements and Current Practice in Bridge Deterioration Modeling. In Proceedings of the 33th International Symposium on Automation and Robotics in Construction, Auburn, AL, USA, 18–21 July 2016. [Google Scholar] [CrossRef]
  29. Fernando, D.; Walbridge, S.; Wan, B. A Markovian-based methodology for the life-cycle cost analysis of bridge maintenance interventions under changing deterioration rates. J. Civ. Eng. Inter Discip. 2020, 1, 1–12. [Google Scholar] [CrossRef]
  30. De-León-Escobedo, D.; Delgado-Hernández, D.-J.; Martinez-Martinez, L.-H.; Rangel-Ramírez, J.-G.; Arteaga-Arcos, J.-C. Corrosion initiation time updating by epistemic uncertainty as an alternative to schedule the first inspection time of pre-stressed concrete vehicular bridge beams. Struct. Infrastruct. Eng. 2014, 10, 998–1010. [Google Scholar] [CrossRef]
  31. Lin, P.; Yuan, X.; Tovilla, E. Integrative modeling of performance deterioration and maintenance effectiveness for infrastructure assets with missing condition data. Comput. Civ. Infrastruct. Eng. 2019, 34, 677–695. [Google Scholar] [CrossRef]
  32. Santos, K.; Dias, J.P.; Amado, C. A literature review of machine learning algorithms for crash injury severity prediction. J. Saf. Res. 2022, 80, 254–269. [Google Scholar] [CrossRef] [PubMed]
  33. Liu, Y.; Su, D.; Duan, Q.; Cao, Z. Recommendations for Refined Preventive Maintenance Management of Concrete Bridges in China Based on Environmental Risk Zoning. Transp. Res. Rec. J. Transp. Res. Board 2022, 2676, 460–479. [Google Scholar] [CrossRef]
  34. Kim, K.H.; Nam, M.S.; Hwang, H.H.; Ann, K.Y. Prediction of Remaining Life for Bridge Decks Considering Deterioration Factors and Propose of Prioritization Process for Bridge Deck Maintenance. Sustainability 2020, 12, 10625. [Google Scholar] [CrossRef]
  35. Miao, P.; Yokota, H.; Zhang, Y. Deterioration prediction of existing concrete bridges using a LSTM recurrent neural network. Struct. Infrastruct. Eng. 2023, 19, 475–489. [Google Scholar] [CrossRef]
  36. Jiang, L.; Tang, Q.; Jiang, Y.; Cao, H.; Xu, Z. Bridge Condition Deterioration Prediction Using the Whale Optimization Algorithm and Extreme Learning Machine. Buildings 2023, 13, 2730. [Google Scholar] [CrossRef]
  37. Nasab, A.R.; Elzarka, H. Optimizing Machine Learning Algorithms for Improving Prediction of Bridge Deck Deterioration: A Case Study of Ohio Bridges. Buildings 2023, 13, 1517. [Google Scholar] [CrossRef]
  38. Kale, A.; Kassa, Y.; Ricks, B.; Gandhi, R. A Comparative Assessment of Bridge Deck Wearing Surfaces: Performance, Deterioration, and Maintenance. Appl. Sci. 2023, 13, 10883. [Google Scholar] [CrossRef]
  39. Li, Q.; Song, Z. Ensemble-Learning-Based Prediction of Steel Bridge Deck Defect Condition. Appl. Sci. 2022, 12, 5442. [Google Scholar] [CrossRef]
  40. Fereshtehnejad, E.; Gazzola, G.; Parekh, P.; Nakrani, C.; Parvardeh, H. Detecting Anomalies in National Bridge Inventory Databases Using Machine Learning Methods. Transp. Res. Rec. J. Transp. Res. Board 2022, 2676, 453–467. [Google Scholar] [CrossRef]
  41. Mete, F.; Corr, D.J.; Wilbur, M.P.; Chen, Y. Bridge Response and Heavy Truck Classification Framework Based on a Two-Step Machine Learning Algorithm. Transp. Res. Rec. J. Transp. Res. Board 2022, 2676, 454–467. [Google Scholar] [CrossRef]
  42. Veloso, B.; Gama, J.; Ribeiro, R.P.; Pereira, P.M. A Benchmark dataset for predictive maintenance. arXiv 2022, arXiv:2207.05466. [Google Scholar] [CrossRef]
  43. Xu, G.; Guo, T. Advances in AI-powered civil engineering throughout the entire lifecycle. Adv. Struct. Eng. 2025, 28, 1515–1541. [Google Scholar] [CrossRef]
  44. Alghurair, M.S.; Fahim, A.R. The Role of Artificial Intelligence in Civil Engineering Applications and Programs. J. Eng. Sci. Inf. Technol. 2023, 7, 54–67. [Google Scholar] [CrossRef]
  45. Dhyani, B. Predicting Equipment Failure in Manufacturing Plants: An AI-driven Maintenance Strategy. Math. Stat. Eng. Appl. 2021, 70, 1326–1334. [Google Scholar] [CrossRef]
  46. Raja, H.A.; Kudelina, K.; Asad, B.; Vaimann, T.; Kallaste, A.; Rassõlkin, A.; Van Khang, H. Signal Spectrum-Based Machine Learning Approach for Fault Prediction and Maintenance of Electrical Machines. Energies 2022, 15, 9507. [Google Scholar] [CrossRef]
  47. Agrawal, A.K.; Kawaguchi, A.; Chen, Z. City University of New York. City College. Dept. of Civil Engineering. Bridge Element Deterioration Rates. C-01–51, October 2008. Available online: https://rosap.ntl.bts.gov/view/dot/16746 (accessed on 12 June 2025).
  48. Ghafoori, M.; Abdallah, M.; Ozbek, M.E. Machine Learning–Based Bridge Maintenance Optimization Model for Maximizing Performance within Available Annual Budgets. J. Bridg. Eng. 2024, 29, 04024011. [Google Scholar] [CrossRef]
  49. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016. [Google Scholar] [CrossRef]
  50. Batista, G.E.A.P.A.; Prati, R.C.; Monard, M.C. A study of the behavior of several methods for balancing machine learning training data. SIGKDD Explor. Newsl. 2004, 6, 20–29. [Google Scholar] [CrossRef]
  51. Chicco, D.; Jurman, G. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genom. 2020, 21, 6. [Google Scholar] [CrossRef]
  52. Mia, M.; Kameshwar, S. Machine learning approach for predicting bridge components’ condition ratings. Front. Built Environ. 2023, 9, 1254269. [Google Scholar] [CrossRef]
  53. Elleathy, Y.; Ghaith, M.; Haggag, M.; Yosri, A.; El-Dakhakhni, W. Climate-induced deterioration prediction for bridges: An evolutionary computing-based framework. Innov. Infrastruct. Solut. 2024, 9, 114. [Google Scholar] [CrossRef]
  54. Yang, C.; Wang, X.; Nassif, H. Impact of Environmental Conditions on Predicting Condition Rating of Concrete Bridge Decks. Transp. Res. Rec. J. Transp. Res. Board 2024, 2678, 999–1012. [Google Scholar] [CrossRef]
  55. Sun, Z.; Santos, J.; Caetano, E. Data-driven prediction and interpretation of fatigue damage in a road-rail suspension bridge considering multiple loads. Struct. Control. Heal. Monit. 2022, 29, e2997. [Google Scholar] [CrossRef]
Figure 1. Distribution of Maintenance Classes.
Figure 1. Distribution of Maintenance Classes.
Applsci 15 12042 g001
Figure 2. Decision Tree flow for bridge deterioration prediction.
Figure 2. Decision Tree flow for bridge deterioration prediction.
Applsci 15 12042 g002
Figure 3. Comparative performance of DT, RF, and GB across Balanced Accuracy, MCC, and Macro F1 for deck, superstructure, and substructure components.
Figure 3. Comparative performance of DT, RF, and GB across Balanced Accuracy, MCC, and Macro F1 for deck, superstructure, and substructure components.
Applsci 15 12042 g003
Figure 4. Performance gains achieved by incorporating environmental variables into Decision Tree (DT), Random Forest (RF), and Gradient Boosting (GB) models. Improvements are shown across deck, superstructure, and substructure components for three metrics (Balanced Accuracy, MCC, and Macro F1).
Figure 4. Performance gains achieved by incorporating environmental variables into Decision Tree (DT), Random Forest (RF), and Gradient Boosting (GB) models. Improvements are shown across deck, superstructure, and substructure components for three metrics (Balanced Accuracy, MCC, and Macro F1).
Applsci 15 12042 g004
Table 1. Top predictive features identified with mean importance values and standard deviations across the three models (Decision Tree, Random Forest, Gradient Boosting).
Table 1. Top predictive features identified with mean importance values and standard deviations across the three models (Decision Tree, Random Forest, Gradient Boosting).
RankFeatureImportance (Average)SD
1SUFFICIENCY_RATING0.69180.0044
2YEAR_BUILT_0270.03750.0096
3YEARS_SINCE_RECONSTRUCTION0.03070.0076
4LAT_0160.01960.0064
5LONG_0170.01930.0024
6year0.01840.0024
7APPR_WIDTH_MT_0510.01800.0015
8DECK_STRUCTURE_TYPE_107_30.01620.0088
9CHANNEL_COND_0610.01440.0064
10TRAFFIC_LOAD0.01130.0074
Table 2. DT, RF, and GB Test-Set Metrics by Bridge Component.
Table 2. DT, RF, and GB Test-Set Metrics by Bridge Component.
ModelComponentBalanced AccuracyCohen’s KappaMCCGeometric MeanMacro F1Macro RecallMacro Precision
Decision TreeDeck0.98750.98940.98940.98750.98390.98750.9804
Superstructure0.95260.97790.97790.9510.95050.95260.9485
Substructure0.97490.98770.98770.97450.98110.97490.9875
Random ForestDeck0.93720.97680.97690.93390.95990.93720.9872
Superstructure0.92940.98060.98060.92470.95760.92940.9932
Substructure0.93540.97610.97610.9320.96070.93540.9916
Gradient BoostingDeck0.97840.97460.9750.9780.97730.97650.98
Superstructure0.97180.9690.96950.97250.9710.970.9738
Substructure0.97350.97050.97080.9730.9720.97150.9745
Table 3. Comparative test-set metrics for Decision Tree, Random Forest and Gradient Boosting classifiers.
Table 3. Comparative test-set metrics for Decision Tree, Random Forest and Gradient Boosting classifiers.
ModelComponentMetricWith Env. FeaturesWithout Env. FeaturesImprovement (%)
Decision TreeDeckBalanced Accuracy0.98750.95253.5
Macro F10.98390.94394
SuperstructureBalanced Accuracy0.95260.92263
Macro F10.95050.91553.5
SubstructureBalanced Accuracy0.97490.93993.5
Macro F10.98110.94114
Random ForestDeckBalanced Accuracy0.93720.90723
Macro F10.95990.92493.5
SuperstructureBalanced Accuracy0.92940.89443.5
Macro F10.95760.91764
SubstructureBalanced Accuracy0.93540.90043.5
Macro F10.96070.92074
Gradient BoostingDeckBalanced Accuracy0.97840.94343.5
Macro F10.97730.93234.5
SuperstructureBalanced Accuracy0.97180.93683.5
Macro F10.9710.9264.5
SubstructureBalanced Accuracy0.97350.93853.5
Macro F10.9720.9274.5
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Okohene, P.A.; Ozbek, M.E. Integrating Environmental Conditions into Machine Learning Models for Predicting Bridge Deterioration. Appl. Sci. 2025, 15, 12042. https://doi.org/10.3390/app152212042

AMA Style

Okohene PA, Ozbek ME. Integrating Environmental Conditions into Machine Learning Models for Predicting Bridge Deterioration. Applied Sciences. 2025; 15(22):12042. https://doi.org/10.3390/app152212042

Chicago/Turabian Style

Okohene, Papa Ansah, and Mehmet E. Ozbek. 2025. "Integrating Environmental Conditions into Machine Learning Models for Predicting Bridge Deterioration" Applied Sciences 15, no. 22: 12042. https://doi.org/10.3390/app152212042

APA Style

Okohene, P. A., & Ozbek, M. E. (2025). Integrating Environmental Conditions into Machine Learning Models for Predicting Bridge Deterioration. Applied Sciences, 15(22), 12042. https://doi.org/10.3390/app152212042

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop