Evaluating Pavement Deterioration Rates Due to Flooding Events Using Explainable AI

Lidan Peng; Lu Gao; Feng Hong; Jingran Sun

doi:10.3390/buildings15091452

,

and

¹

College of Traffic & Transportation, Chongqing Jiaotong University, Chongqing 400074, China

²

Department of Civil and Environmental Engineering, University of Houston, Houston, TX 77204, USA

³

Ingram School of Engineering, Texas State University, San Marcos, TX 78666, USA

⁴

Center for Transportation Research, The University of Texas at Austin, Austin, TX 78712, USA

Buildings2025, 15(9), 1452;https://doi.org/10.3390/buildings15091452

This article belongs to the Special Issue Advances in Road Pavements

Version Notes

Order Reprints

Abstract

Flooding can damage pavement infrastructure significantly, causing both immediate and long-term structural and functional issues. This research investigates how flooding events affect pavement deterioration, specifically focusing on measuring pavement roughness by the International Roughness Index (IRI). To quantify these effects, we utilized 20 years of pavement condition data from TxDOT’s PMIS database, which is integrated with flood event data, including duration and spatial extent. Statistical analyses were performed to compare IRI values before and after flooding and to calculate the deterioration rates influenced by flood exposure. Moreover, we applied explainable artificial intelligence (XAI) techniques, such as Shapley Additive Explanations (SHAP) and Local Interpretable Model-Agnostic Explanations (LIME), to assess the impact of flooding on pavement performance. The results demonstrate that flood-affected pavements experience a more rapid increase in roughness compared to non-flooded sections. These findings emphasize the need for proactive flood mitigation strategies, including improved drainage systems, flood-resistant materials, and preventative maintenance, to enhance pavement resilience in vulnerable regions.

Keywords:

pavement deterioration; flooding; explainable AI; pavement performance; resilience

1. Introduction

Pavement deterioration due to flooding leads to several types of damage, including reduced pavement strength, increased roughness, rutting, and cracking, which can lead to a loss of pavement service life and increased rehabilitation costs [1]. The economic impact includes direct repair costs, additional user costs related to delays and accidents, and the need for more frequent reconstruction [2]. Structurally, flooding and moisture infiltration weaken pavement layers, reduce the resilient modulus of unbound materials, and cause differential movement, leading to longitudinal cracks and heaves [3]. The severity of deterioration depends on factors such as the pavement structure, drainage, traffic, flood duration, and the condition of the pavement before flooding [4].

To better understand and mitigate these impacts, researchers have employed various methods, including field observations, experimental studies, numerical modeling, and mechanistic–empirical (M-E) simulations. Some studies develop predictive models based on experimental measurements of pavement responses before and after flooding [3]. Numerical tools are also employed to simulate pavement responses under various conditions, with the goal of creating algorithms that can generate reasonable results from a limited number of input parameters for network-level applications. M-E design tools such as UPDAPS and TxME are used to analyze pavement performance under flooding by modifying layer moduli to reflect moisture-induced weakening of materials [1]. Machine learning techniques are increasingly used to predict pavement performance after flood events, using historical pavement distress and flood data [5]. Studies have also explored the economic aspects, such as increased maintenance and rehabilitation costs, and proposed frameworks for cost analysis and resource allocation [6]. Furthermore, research has been conducted to define and evaluate pavement resilience, with the aim of developing methods to minimize loss and improve design and construction practices [7].

Despite these advances, assessing flood-induced pavement damage presents several challenges, including the difficulty in obtaining pre- and post-flood data for a comprehensive analysis [8]. Much damage is not visible on the surface immediately after flooding, making it difficult to assess the extent of the problem [9]. Current mechanistic–empirical (ME)-based approaches and climate data modeling may not accurately simulate flooding and extreme precipitation events due to limitations in capturing short-term events and moisture damage [1]. In addition, it is challenging to incorporate all flood-related stressors (flood depth, duration, velocity, debris, and contaminants) into a single model [10]. Factors such as varying pavement section lengths and heterogeneous flood data sources add complexity to network-level assessments [5]. Moreover, accurately predicting long-term deterioration and costs is difficult due to the complex interaction of factors and the variability in agency policies and priorities. Given these limitations, there is a critical need for more in situ data collection and research to improve the accuracy of pavement deterioration assessments and enhance resilience planning [2]. Motivated by these challenges, this research integrates explainable artificial intelligence (XAI) techniques, specifically SHAP and LIME, with advanced machine learning models. This approach not only enhances prediction accuracy but also provides transparent insights into the contributions of key variables.

The objective of this study is to assess flood-induced pavement degradation by analyzing 20 years of TxDOT PMIS data, and measuring changes in the International Roughness Index (IRI) caused by flood events. Various regression models are developed and evaluated, with explainable AI methods, SHAP, and LIME applied to highlight the contributions of key factors such as initial pavement condition, truck traffic, and flood exposure. The study focuses on isolating the impacts of flooding by comparing flooded and non-flooded pavement sections within the studied region, aiming to provide actionable insights for data-driven decision making regarding flood mitigation and maintenance planning.

2. Literature Review

Roads play a crucial role in facilitating transportation as critical infrastructure components. However, they are vulnerable to damage caused by natural disasters, such as floods. Research consistently demonstrates that rainfall negatively impacts pavement conditions over various time periods [11]. Therefore, it is important to understand the effects of floods on roads and pavements and implement appropriate systems and designs to prevent or minimize flood-related damage.

2.1. Impact of Flooding on Pavement Condition and Performance

Flooding significantly compromises pavement structure and functionality, exhibiting various deterioration patterns [12]. The intrusion of water into pavement structures elevates the moisture content within unbound materials, consequently diminishing their modulus and load-bearing capabilities [3]. This saturation process undermines the adhesive and cohesive forces that bind aggregates and asphalt, leading to expedited material degradation, such as asphalt stripping [13]. Moreover, excess moisture in the asphalt layer can result in weakening of aggregate–bitumen bonding and can lead to a higher rate of damage accumulation in the asphalt layer [1]. Therefore, controlling moisture infiltration and ensuring proper drainage are critical in preventing moisture-induced raveling in asphalt pavements [14]. The subsequent reduction in structural support intensifies deflections under load, predisposing the pavement to premature failure mechanisms such as rutting and fatigue cracking. The degree of deterioration is contingent on the attributes of the flood event, the structural composition of the pavement, and prevailing environmental conditions [13]. Furthermore, flood-induced debris can obstruct drainage pathways, exacerbating water retention and prolonging the saturation of pavement layers [1].

The type of damage caused by flooding can be broadly categorized as either visible or hidden [15]. Visible damage includes readily apparent issues such as cave-ins or washouts of the pavement. Hidden damage, on the contrary, encompasses the weakening of the pavement’s internal strength and stiffness. While less immediately evident, hidden damage precipitates rapid deterioration and potential catastrophic failures. The specific stressors associated with flooding events include flood depth, duration, velocity, debris, and contaminants [1]. Flood depth and duration contribute to increased saturation within pavement layers, while flood velocity can inflict immediate structural compromise. Debris accumulation can impede drainage efficiency, further extending water exposure. The impact of flooding events demonstrates distinct patterns, including delayed effects, jump effects (characterized by sudden performance decline), a combination of jump and delayed effects, and direct failure scenarios [13].

The existing state of a pavement significantly influences its response to flooding [4]. Pavements initially in good or very good condition experience a sudden decline in quality following a flood. Conversely, pavements already in fair or poor condition do not exhibit a substantial immediate decline but undergo accelerated deterioration compared to non-flooded pavements. Stochastic modeling techniques, such as Monte Carlo simulations within discrete-time Markov chains, can effectively represent these deterioration dynamics. These models facilitate the assessment of pavement condition changes over time under various flooding scenarios, providing valuable insights for infrastructure management.

Effective strategies are available to reduce the adverse effects of flooding on pavement infrastructure. Incorporating efficient drainage systems is critical in mitigating water infiltration and saturation [1]. Employing durable and flood-resistant materials enhances the pavement’s capacity to withstand long periods of water exposure. Adopting robust pavement structures reinforces the pavement’s capacity to endure flood-induced stresses [12]. Regular inspection, maintenance, and repair protocols are essential for proactively identifying and addressing vulnerabilities. Implementing comprehensive flood management strategies and actively monitoring weather patterns are also crucial for preparedness and responsiveness during flood events [2]. Innovative solutions, such as integrating geotextile-wrapped sand layers, offer enhanced drainage capabilities and effectively mitigate subgrade moisture fluctuations and swelling pressures [15].

2.2. Deterioration Rate Estimation

Estimating deterioration rates resulting from flooding requires a comprehensive analysis of the impact of flood events on the road network’s condition over time. This can be a complex task as the rate of deterioration depends on multiple factors, including the intensity and duration of the flooding, the road and infrastructure type, and the quality of maintenance and repair work. One effective approach to estimating road deterioration rates due to flooding involves utilizing historical data on flood events and their influence on the road network. Researchers have conducted different studies and proposed methods for predicting the extent of road degradation during floods and also emphasized the significance of consistent road condition monitoring for timely maintenance and reconstruction efforts.

For instance, Zhang et al. [16] collected data from pavement sections that were submerged due to flooding after Hurricane Katrina and from pavement sections that did not experience flooding. Subsequently, they conducted a systematic analysis and compared the deflection, effective structural number, and resilient modulus of both flooded and non-flooded pavement sections to measure the flood’s impact. Chen and Zhang [17] conducted an analysis of pavement performance data before and after Hurricanes Katrina and Rita in Louisiana. The authors calculated and compared the International Roughness Index values and rutting depth before and after flooding, as well as the difference in pavement performance between flooded and non-flooded road sections. Shamsabadi et al. [18] considered the impact of snowstorms and flooding events on pavement performance. They created a regression model incorporating various factors to effectively forecast the increase in IRI. The study highlights the importance of predicting post-natural disaster road conditions for effective road maintenance planning. Sultana et al. [19] examined data from both flooded and non-flooded pavement sections before and after a flooding incident. They observed the flooded section experienced a significant decline in structural strength. This study states the need for long-term monitoring and regular testing of pavements affected by floods to properly assess the impacts of floods on factors such as rutting and roughness. Khan et al. [20] assumed that the IRI value would jump significantly after flooding events and developed a risk-based framework to evaluate the impact of flooding on road conditions. Their findings indicate that well-constructed road with robust structures and high traffic loads, and which is maintained to a high standard, demonstrates great resilience to flooding. Sultana et al. [21] addressed the impact of flooding events on pavement deterioration and emphasized the necessity of developing new deterioration models specifically for flood-affected pavements, and presented two mechanistic–empirical models for rutting and roughness in flood-affected pavements.

Moreover, Valles-Valles and Torres-Machi [4] studied the 2013 Colorado floods by using statistical analysis and stochastic Markov chains with Monte Carlo simulations to quantify deterioration rates and service life loss based on the IRI, and their results highlighted accelerated deterioration in flooded pavements. Shariatfar et al. [5] employed a machine learning approach with historical distress data and 2016 Louisiana flood maps to predict post-flooding network-level pavement performance, focusing on key distress types such as roughness and cracking. Hong et al. [22] utilized a GIS analysis and a Markov process to estimate the reduction in network-level pavement performance due to a 100-year flood in Texas over a 10-year horizon, linking the findings to infrastructure resilience and maintenance planning.

2.3. AI Applications in Pavement Deterioration Modeling

AI and deep learning models are being increasingly applied to various aspects of pavement deterioration modeling, offering advancements in detection, prediction, classification, and data management [23,24]. These technologies are crucial for optimizing maintenance planning, extending pavement lifespan, and reducing road maintenance costs [25].

Recurrent neural networks (RNNs), particularly Long Short-Term Memory (LSTM) networks and Gated Recurrent Units (GRUs), are promising in predicting future pavement conditions by analyzing time-series data [26]. These models learn from historical pavement condition data, such as crack severity, rutting depth, and the International Roughness Index (IRI), to forecast future pavement condition indices. Studies have shown that RNN-LSTM models outperform traditional deep neural networks (DNNs) in pavement damage prediction, achieving lower root-mean-square error (RMSE) values [25]. DNNs are also employed to predict overall evaluation indices like the IRI, crack index (CI), and pavement condition index (PCI) with high accuracy, leveraging data from on-site tests and avoiding deviations from Linear Regression models [27].

In addition to prediction, convolutional neural networks (CNNs) and hybrid models combining CNNs with RNNs (such as CNN-LSTM) are essential for detection and classification tasks. These deep learning models can automatically extract features from raw data, using images captured by vehicle-mounted cameras, unmanned aerial vehicles (UAVs), or satellite imagery to detect pavement distress including cracks, potholes, and other surface deformations [28]. Deep learning techniques often surpass traditional image processing methods in terms of accuracy and efficiency in identifying such damage, as they do not require manual feature extraction [29]. AI models can also detect whether maintenance and reconstruction treatments have been applied to pavement sections by analyzing condition data over time [30]. Moreover, advanced AI techniques such as graph neural networks (GNNs) and convolutional graph neural networks (ConvGNNs) are being explored to improve deterioration modeling accuracy by incorporating the spatial structure of road networks, such as the condition of neighboring sections [31].

2.4. Research Gap

Previous studies have relied on mechanistic–empirical models, numerical simulations, and statistical analyses to examine the impact of flooding on pavement deterioration, but these efforts often face significant limitations. First, many are constrained to a single flood event or a short time frame, restricting their ability to capture long-term deterioration trends or the cumulative effects of multiple flooding incidents across a broader network. These studies often depend on small historical datasets tied to a single flood event, limiting the temporal window and geographic scope, thus reducing the generalizability of the findings to larger networks or diverse flood scenarios. Even studies adopting a network-level perspective or over extended periods tend to focus on probabilistic reductions in performance rather than directly estimating deterioration in flooded pavements, often failing to provide specific insights into deterioration rates or mechanisms affecting individual pavement sections.

To overcome these shortcomings, this study analyzes 20 years of pavement condition data from TxDOT’s PMIS database, which covers the entire Texas road network, to provide a comprehensive assessment of flood-induced pavement deterioration across multiple events at a statewide scale. This extensive temporal and geographic scope directly addresses the problems of limited time frames and narrow spatial coverage in prior research. Furthermore, this research integrates explainable AI (XAI) techniques, including SHAP and LIME, into modeling pavement deterioration caused by flooding. While traditional black-box machine learning models offer accurate predictions, they often lack explainability, making it challenging to understand the factors driving pavement degradation. By incorporating XAI, this study enables the identification of key contributors to pavement deterioration at both the aggregate and individual levels.

3. Methodology

In recent years, interpretability has gained increasing importance in machine learning and data science. Complex models such as deep neural networks or ensemble methods (e.g., Random Forest, Gradient-Boosted Trees) have achieved state-of-the-art performance across various domains but often lack transparency. Model-agnostic interpretability methods are designed to address this shortcoming without making assumptions specific to a particular type of model. Two popular methods in this category are Local Interpretable Model-Agnostic Explanations (LIME) [32] and Shapley Additive Explanations (SHAP) [33]. LIME aims to explain the model’s predictions by approximating it locally around a specific instance. SHAP, on the other hand, leverages game-theoretic concepts, specifically Shapley values, to attribute feature importance in a more globally consistent manner. In this section, we detail the underlying mathematical foundations of LIME and the Shapley value formulation in SHAP. We also discuss how to interpret the outputs of this methodology in a statistically coherent way.

3.1. LIME

LIME explains the prediction of a complex model f at a specific instance

x

by approximating f locally with a simpler and more interpretable model g, often a linear model in the neighborhood of

x

. The notion of “local” means that LIME focuses on the behavior of f in a small region around the instance

x

, rather than trying to explain the function f across the entire data distribution [32].

Suppose

x

is the instance we wish to explain. LIME samples data points in the vicinity of

x

(by perturbing features of

x

) to form a local neighborhood. Denote this neighborhood by

N (x)

. For each point

z

in

N (x)

, we can compute

π_{x} (z) = exp (- \frac{D {(x, z)}^{2}}{σ^{2}})

(1)

where

D (x, z)

is a distance measure (e.g., Euclidean or cosine distance) between the original instance

x

and the perturbed instance

z

. The term

π_{x} (z)

is often called a weighting kernel, and it assigns higher weights to samples closer to

x

[32].

Next, LIME fits a simple interpretable model g (e.g., a linear model) in this local neighborhood by solving

\underset{g \in G}{argmin} \sum_{z \in N (x)} π_{x} (z) {(f (z) - g (z))}^{2} + Ω (g) .

(2)

where

$f (z)$ = the prediction of the original complex model.
$g (z)$ = the prediction of the simple interpretable model (often linear in the feature space of $z$ ).
G = the set of all possible interpretable models under consideration (for instance, linear models).
$Ω (g)$ = a complexity measure for g, which ensures that among all possible local approximations, the simpler ones are favored [32].

In a linear interpretation, if

g (z) = w_{0} + \sum_{j} w_{j} z_{j}

, the coefficients

w_{j}

can be interpreted as the contribution of feature j to the prediction. A higher positive

w_{j}

indicates a stronger positive contribution of feature j, whereas a negative

w_{j}

indicates a negative contribution. The resulting coefficients from the linear model, fitted only to data points near

x

, provide insight into which features have the greatest impact on the prediction of f in that local region [34].

3.2. SHAP

SHAP is grounded in the concept of Shapley values, which originate from cooperative game theory. In this framework, each feature is regarded as a “player” contributing to the overall prediction of the model. The Shapley value for a given feature quantifies its contribution to the final prediction by considering all possible subsets (coalitions) of other features [33].

Let there be M features in total. For a particular instance

x

, let the model’s output be denoted by

f (x)

. The idea is to distribute

f (x) - f (0)

(the difference between the prediction and some baseline model output) among the M features in a fair manner, where

f (0)

is often chosen as the output of the model under baseline or reference conditions (e.g., when the features are set to average values) [33].

The Shapley value

ϕ_{j}

for feature j is given by

ϕ_{j} = \sum_{S \subseteq {1, \dots, M} ∖ {j}} \frac{| S |! (M - | S | - 1)!}{M!} [f (S \cup {j}) - f (S)]

(3)

where

S = any subset of the set of features not including feature j.
$f (S)$ = the model’s output using the subset S of features. In practice, this can be implemented by marginalizing, or in some cases by conditioning on, the other features.
$\frac{| S |! (M - | S | - 1)!}{M!}$ = a weighting factor that ensures each possible ordering of features is counted appropriately [35].

Conceptually,

ϕ_{j}

is computed by looking at the difference in model output when feature j joins every possible coalition S of other features. This is then averaged over all such subsets of features, giving a comprehensive measure of feature j’s importance to the final prediction. The Shapley values

ϕ_{j}

indicate how much each feature j shifts the model prediction from the baseline

f (0)

when feature j is introduced into all possible subsets of the other features. A positive Shapley value means that the feature contributes to increasing the prediction relative to the baseline, while a negative value indicates that the feature pulls the prediction below the baseline. Importantly, SHAP values have desirable fairness and consistency properties, making them appealing for interpretability.

4. Case Study

4.1. Data Collection

In this study, data collection and preprocessing is conducted through a sequence of steps, detailed below:

Information regarding road closures associated with flooding incidents was gathered from TxDOT. More than 100 significant flooding events were identified and evaluated. Example major flooding events include the post-Ike flooding in Southeast Texas in 2008, the flooding event in Montgomery County in 2014, the Lubbock County flooding incident in 2015, and another major flood occurrence in Montgomery County in 2017.
The geographic coordinates of both the beginning and ending points of each affected route were obtained via a Geographic Information System. These coordinates were subsequently cross-referenced with the Texas Department of Transportation’s (TxDOT’s) road reference system to identify the respective road reference markers defining the start and end of each route.
The relevant pavement sections (each section has a length around 0.5 miles) that fell within the range of the identified beginning and ending points were extracted from TxDOT’s Pavement Management Information System (PMIS).
Pavement sections that had International Roughness Index (IRI) values available prior to and following the year of the respective flooding event were selectively chosen. In the absence of such information, the sections were considered unsuitable for this analysis and consequently discarded from the dataset.

Table 1 shows sample road closures due to flooding collected in this case study. It includes the route names, flood years, start points, end points, and total lengths (in miles) of the affected road sections. These sample road routes include FM0481 in 2014 (from Eagle Pass to Uvalde, spanning 43.3 miles), FM1908 in 2014 (from Spofford to Quemado, covering 20.6 miles), and SH0131 in 2014 (from Eagle Pass to Brackettville, with a length of 33.1 miles). Figure 1 provides a visual representation of these sample locations within the study area.

Table 1. Sample flooded road routes.

Figure 1. Locations of sample routes. (a) US0180, FM0179, and US0087; (b) FM1908, SH0131, and FM0481; (c) FM2854; (d) BU0090Y; (e) FM0366; (f) FM0597.

4.2. Data Preprocessing

The TxDOT PMIS is a dataset containing a comprehensive pavement condition inventory that includes various pavement performance indicators such as potholes, cracking, and rutting. However, not all features are available for every road segment. The IRI is the only condition indicator available for all road segments and is used as the primary indicator in this case study. Moreover, we selected other variables that represent traffic and environmental factors to provide a more comprehensive analysis. The key features included in the dataset are as follows:

TX_CONDITION_SCORE: Pavement condition score (0–100). 100 represents the best score and 0 represents the worst score.
TX_DISTRESS_SCORE: Pavement distress score (0–100).
TX_IRI_AVERAGE_SCORE: International Roughness Index (IRI) average score.
TX_TRUCK_AADT_PCT: Percentage of daily traffic that is trucks.
TX_CURRENT_18KIP_MEAS: A measurement related to 18-KIP equivalent single-axle loads.
TX_PVMNT_TYPE_DTL_RD_LIFE_CODE: This variable represents the predominant travel lane pavement type during the data collection year for a given section, derived from RLS pavement layer information. It is stored as a string with a maximum length of 100 characters. The possible values include 01—continuously reinforced concrete (CRCP), 02—jointed reinforced concrete (JRCP), 03—jointed plain concrete (JPCP), 04—thick asphaltic concrete (greater than 5.5”), 05—medium-thickness asphaltic concrete (2.5–5.5”), 06—thin asphaltic concrete (less than 2.5”), 07—composite (asphalt-surfaced concrete or ACP on top of a heavily stabilized base), 08—widened composite pavement, and 09—overlaid and widened asphaltic concrete pavement.
CLIMATE_ZONES: Climate zones were defined based on temperature and precipitation data obtained from the National Oceanic and Atmospheric Administration (NOAA) database. A 30-year annual average of temperature and precipitation was used to represent the climate. For counties without a weather station, data from adjacent counties were averaged, while for counties with multiple weather stations, the average of those stations was used. Counties were grouped into climate zones corresponding to the west, east, north, south, and central regions.
TX_RURAL_URBAN_CODE: Categorical code indicating if a road section is rural or urban.
Flood: Binary indicator (1 if the road segment was recorded as flooded that year, 0 otherwise).

The target variable is next year’s TX_IRI_AVERAGE_SCORE. A total of 10,022 data records were collected for this case study. Each data point represents a pavement section of length around 0.5 miles. Table 2 summarizes the descriptive statistics of the numerical variables. This includes the mean, standard deviation, minimum, maximum, and quartiles. Among the 10,022 pavement sections, 501 (5%) were classified as flooded, while the remaining 9521 (95%) were identified as non-flooded. The data points were selected based on the availability of key features. Pavement sections with missing values for the features listed in Table 2 were excluded from the statistical analysis.

Table 2. Descriptive statistics of the numerical features.

Figure 2 presents the Pearson correlation heatmap for features used in this case study. A high positive correlation (0.87) is observed between TX_CONDITION_SCORE and TX_DISTRESS_SCORE, which indicates that the overall pavement condition rating is strongly influenced by the distress score. TX_CONDITION_SCORE shows a moderate negative correlation (−0.43) with TX_IRI_AVERAGE_SCORE. The negative relationship arises because the IRI value indicates worse pavement roughness as it increases, whereas a higher pavement condition score indicates better pavement condition. Traffic-related variables, such as TX_TRUCK_AADT_PCT, show weak correlations with most pavement condition indicators, with the highest correlation (0.22) observed with TX_IRI_AVERAGE_SCORE. The correlation between Flood and current pavement condition variables is relatively weak.

Figure 2. Correlation matrix of selected numerical features.

4.3. Statistical Analysis

4.3.1. IRI Comparison of Before and After Flooding

Figure 3 illustrates the changes in IRI values, measured in inches per mile (in/mi), across multiple roadway routes from before to after flooding events. Each subplot corresponds to a specific route (e.g., FMD481, SH0131, US0087) and displays pre-flood (one year prior to flooding) and post-flood (one year after flooding) IRI measurements for individual sections within the route. For instance, route FMD481 (pre-flood 2013, post-flood 2015) shows the post-flood IRI value of one section reaching up to 200 in/mi, significantly higher than the pre-flood level. Similar trends are evident in other routes, such as US0087 and FM0179, where post-flood IRI values rise sharply.

Figure 3. IRI before and after the flooding events.

While the overall trend confirms higher (worse) post-flood IRI values, variability exists among sections. For example, SH0131 (pre-flood 2013, post-flood 2015) exhibits minimal deterioration in some sections. This inconsistency may result from localized flood impacts, temporary route closures by TxDOT during floods, or post-flood maintenance interventions that restored certain segments.

To ensure data reliability, sections lacking complete IRI records during the two-year analysis window (one year before and after flood) were excluded. Also, sections with improved IRI values (potentially due to maintenance) were omitted to isolate flood-induced deterioration. In pavement performance modeling, when maintenance records are unavailable it is common to exclude pavement condition data that show improvement over consecutive years [36]. This is because pavement conditions are not expected to improve without maintenance intervention. In this case study, maintenance records for the analyzed road segments are unavailable. Therefore, if IRI values of a road segment decrease after flooding, it is assumed that maintenance was performed, and these data points are excluded from the statistical analysis.

The analysis reveals an average IRI increase of 8.46 in/mi over two years, with a high standard deviation (10.53 in/mi), indicating significant variability in deterioration severity across sections. This underscores that while flooding generally exacerbates pavement roughness, the extent of damage is highly dependent on localized factors such as flood intensity, road segment vulnerability, and maintenance responsiveness.

4.3.2. IRI Deterioration Rate: Comparison Before and After Flooding

Figure 4 illustrates the IRI values during three different years: three years prior to flooding, one year before flooding, and one year after flooding. The chosen interval of two years allows for a comparison of the deterioration rate before and after the flooding event. One-year intervals were not used due to the lack of precise data collection dates for the IRI values. Consequently, it is difficult to determine whether the IRI at the time of flooding was measured before or after the event occurred. The IRI values presented in Figure 4 represent the average values across sections in the same route. When calculating the average, sections lacking IRI data in these three years were excluded. Furthermore, sections where the IRI values improved, indicating potential maintenance effects, were also excluded. For this reason, not all routes were plotted in the figure. The figure shows that, in general, the IRI deterioration rate increases after flooding. However, the rate of deterioration varies across different segments. Some sections experience a much sharper decline in condition, as seen in certain data points for US0087, where roughness increases significantly after flooding. In contrast, there are cases like SH0131, where the impact of flooding appears minimal, with IRI values remaining relatively stable.

Figure 4. IRI before and after flooding events.

4.3.3. Flooded and Non-Flooded Section Comparison

Figure 5 shows a comparison of the IRI deterioration rate between sections affected by flooding and sections outside the flooding region nearby. In order to compare them under the same condition, both flooded and non-flooded sections are collected from the same route. The x-axis represents different sections and the y-axis represents the IRI difference between the year after flooding and the year before flooding, representing the IRI deterioration in two years. As can be seen from Figure 5, the differences between the flooded and non-flooded sections range from 0.3 to 15 in/mi. The average IRI change difference between flooded and non-flooded sections is 5.6 in/mi.

Figure 5. IRI deterioration rates: Flooded vs. non-flooded sections.

4.4. XAI Modeling

4.4.1. Machine Learning Models

A total of six regression models were trained and evaluated to predict next year’s IRI: Linear regression (LR), Ridge Regression (Ridge), Lasso Regression (Lasso), Decision Tree (DT), Random Forest (RF), and Gradient Boosting Regression (GBR). The dataset was divided into training (80%) and test (20%) subsets through random sampling to ensure a balanced representation of pavement conditions and external factors.

We implemented all models using the scikit-learn package and initially used default settings. Hyperparameter tuning was performed to optimize performance for certain models. For the Ridge and Lasso models, we adjusted the regularization strength (

α

) using a grid search over values from

10^{- 3}

to

10^{2}

. The DT model was optimized by tuning the maximum tree depth (2–20), the minimum samples required to split a node (2–10), and the minimum samples per leaf (1–5). RF tuning included adjusting the number of trees (50, 100, 200), maximum depth (5–20), and minimum samples per split (2, 5, 10), along with feature subsampling. GBR was optimized by adjusting the learning rate (0.001, 0.01, 0.1), number of estimators (100, 300, 500), maximum tree depth (3, 5, 10), and subsampling ratios (0.5, 0.75, 1.0). The best hyperparameters were determined using 5-fold cross-validation on the training set, selecting the combination that minimized mean squared error (MSE).

Each of the six regression models was evaluated using mean squared error (MSE), mean absolute error (MAE), and

R^{2}

score on the test set. Table 3 summarizes the performance. Among the models evaluated, GBR demonstrated the best performance, achieving the lowest mean squared error (MSE) of 247.0450, the lowest mean absolute error (MAE) of 8.6095, and the highest

R^{2}

score of 0.9310. This result suggests that GBR effectively captures complex relationships within the dataset. The LR, Ridge, and Lasso exhibited nearly identical performance, with MSE values around 263 and

R^{2}

scores of approximately 0.9263–0.9266. Lasso Regression achieved a slightly better MSE (262.8292) and

R^{2}

score (0.9266) compared to the standard Linear Regression model, though the improvement was marginal. The DT model performed significantly worse than the other models, with the highest MSE (360.7363) and MAE (10.6944), resulting in a notably lower

R^{2}

score of 0.8993. The RF model achieved slightly better performance than the DT, with an MSE of 264.0790 and an

R^{2}

score of 0.9262. Since GBR outperformed all other models in terms of predictive accuracy, we use it for further XAI analysis to better understand the contributions of different features to pavement deterioration.

Table 3. Comparison of regression models for predicting next year’s IRI.

4.4.2. Feature Importance via SHAP

SHAP values were computed to assess the global contribution of each feature to the model’s predictions. Figure 6 shows a summary plot of the SHAP values for all features. The SHAP summary plot provides insights into how key variables influence the predictions of a GBR model, which is trained to predict next year’s IRI. The features are ranked by their mean absolute SHAP values, with the most influential variables appearing at the top. The results indicate that TX_IRI_AVERAGE_SCORE is the most critical predictor, followed by CLIMATE_ZONES, TX_TRUCK_AADT_PCT, and TX_CURRENT_18KIP_MEAS.

Figure 6. SHAP summary plot for the gradient boosting model.

The SHAP values along the x-axis indicate how each feature affects the model’s predictions. Positive SHAP values increase the predicted IRI, meaning worse pavement conditions, while negative SHAP values decrease the predicted IRI, indicating better pavement performance. The color coding in the SHAP plot represents the feature values, where red denotes high values and blue denotes low values.

TX_IRI_AVERAGE_SCORE has the greatest impact on next year’s IRI value. Higher current IRI scores (red) significantly increase the predicted IRI, suggesting that roads that are already rough are likely to become even rougher next year. Lower current IRI scores (blue) tend to have a neutral or slightly negative impact, indicating that smoother roads are likely to remain relatively smooth or improve slightly. TX_TRUCK_AADT_PCT has a moderate impact. A higher percentage of truck traffic (red) increases the predicted IRI, indicating that roads with more truck traffic are likely to become rougher due to the heavy loads accelerating pavement deterioration. A lower percentage (blue) has a neutral or slightly negative impact, suggesting that roads with less truck traffic are likely to experience less increase in roughness or maintain smoother surfaces. For TX_CURRENT_18KIP_MEAS, the balanced distribution and moderate spread of SHAP values indicate that its influence varies depending on the specific load magnitude across data instances. Low values of TX_CURRENT_18KIP_MEAS (blue) are associated with higher SHAP values, indicating that weaker pavements deteriorate faster.

The Flood feature also shows a tendency toward positive SHAP values, suggesting that flood-prone areas experience higher predicted IRI values, indicating accelerated pavement deterioration. This outcome aligns with the physical effects of flooding on roadways. The SHAP distribution of the Flood variable indicates that flood-exposed roadways are more likely to suffer from worsening pavement conditions in the following year. The effect of flooding is evident, as instances with high flood exposure show increased SHAP values, indicating the role of extreme weather events in pavement degradation.

4.4.3. Local Interpretations via LIME

Figure 7 presents a LIME analysis for one pavement segment and the impact of individual features on the predicted IRI. The bars represent the direction and magnitude of each feature’s contribution to the model’s prediction, where green bars indicate a positive impact (leading to a higher predicted IRI, meaning worse pavement condition) and red bars indicate a negative impact (leading to a lower predicted IRI, meaning better pavement condition). The most influential feature in this instance is TX_IRI_AVERAGE_SCORE, which falls within the range of 90.00 to 100.00. Its green bar extending farthest to the right indicates that a high initial roughness significantly contributes to a higher predicted IRI value. The Flood feature, with a condition of Flood > 0.0, appears with a green bar positioned around +5 on the x-axis, indicating that flood occurrence contributes to an increase in IRI by approximately 5 units. This aligns with the expectation that flooding accelerates pavement deterioration. The moderate impact suggests that while flooding plays a role in roughness progression, other factors such as initial pavement condition may have a greater influence in this particular instance. TX_CONDITION_SCORE has a negative impact on IRI, which is expected since higher condition scores indicate better pavement quality and are associated with lower roughness. Similarly, TX_DISTRESS_SCORE follows the same trend: higher distress scores in this range correlate with lower IRI values. Moreover, TX_CURRENT_18KIP_MEAS has a minor negative impact, indicating that higher axle loads are linked to reduced roughness in this particular case. This could be due to the presence of more robust pavement structures on roads designed to accommodate heavy traffic, leading to better surface conditions. TX_TRUCK_AADT_PCT, representing truck traffic percentage, has a red bar, implying that in this specific case, the given level of truck traffic is associated with a lower predicted IRI. This may be due to well-maintained roads in high-traffic areas or the presence of more durable pavement designs. Overall, Figure 7 provides a localized interpretation of how various factors influence the IRI prediction for one specific pavement segment. It confirms that flood exposure significantly contributes to pavement deterioration, while some features, such as truck traffic percentage, may correlate with reduced roughness in this specific case.

Figure 7. Example LIME explanation for one pavement segment, indicating that flood occurrence contributes to higher IRI (indicating worse condition). Green bars indicate positive contributions to IRI; red bars indicate negative contributions.

4.5. Discussion

This research provides important insights into the impact of flooding on pavement deterioration. The analysis reveals an average IRI increase of 8.46 in/mi over two years, with a high standard deviation of 10.53 in/mi, indicating significant variability in deterioration severity across sections. While flooding generally exacerbates pavement roughness, the extent of damage largely depends on localized factors such as flood intensity, road segment vulnerability, and maintenance responsiveness. On average, the deterioration rate of IRI increased compared to the same two-year period before the flooding, which highlights the substantial impact of flood events. Moreover, the difference in IRI changes between flooded and non-flooded sections ranges from 0.3 to 15 in/mi, with an average difference of 5.6 in/mi. These findings align with previous studies, such as Shariatfar et al. [5], who observed a slightly higher rate of IRI increase in flooded areas after a flood. Similarly, Valles-Valles and Torres-Machi [4] reported a sudden drop in the Riding Index (RIDX) of 27 and 11 points for pavements in very good and good condition, respectively, corresponding to a rise in IRI and a quantified service life reduction of 14.3% for pavements in good condition, suggesting an accelerated deterioration rate due to flooding.

The application of SHAP analysis highlights that flooding, along with truck traffic percentage and initial pavement condition, is among the most influential factor contributing to pavement degradation. Moreover, LIME explanations confirm that flood exposure significantly contributes to increased roughness. The integration of XAI techniques into pavement deterioration analysis not only enhances model transparency but also facilitates data-driven decision making for transportation agencies. SHAP analysis allows for a global understanding of feature importance. On the other hand, LIME provides local interpretability, helping identify specific instances where unexpected deterioration trends occur. The combination of these techniques can bridge the gap between traditional black-box machine learning models and actionable insights.

The findings from this study can guide transportation agencies in prioritizing maintenance for flood-prone roads by focusing on sections with higher IRI increases and variability. Proactive scheduling of preventive maintenance, such as resurfacing and drainage improvements, can help extend the service life of pavement and prevent severe deterioration. Moreover, applying flood-resistant materials and improved drainage systems in high-risk areas can enhance infrastructure resilience.

5. Conclusions

This study evaluates the impact of flooding on pavement deterioration using 20 years of pavement condition data from TxDOT’s PMIS database, focusing on roughness changes measured by the IRI. The findings indicate that flood-affected pavement sections experience significantly faster deterioration than non-flooded roads. On average, the IRI increased by 8.46 in/mi over two years for flooded sections. Moreover, the deterioration rate after flooding was significantly higher than the rate observed in the two years before flooding. When comparing flooded and non-flooded pavement sections, the study found that flooded roads deteriorated by an additional 5.6 in/mi on average, with some sections even experiencing an extreme deterioration difference of up to 15 in/mi. To understand the underlying causes of pavement degradation, this research applied explainable AI (XAI) techniques, including SHAP and LIME. The results identified initial pavement roughness (IRI values), truck traffic percentage, flood exposure, and climate zone conditions as the most influential factors affecting pavement deterioration. These findings will help transportation agencies focus maintenance efforts on flood-prone roads with high IRI increases and deterioration variability.

This research has some limitations that should be acknowledged. First, the study is based on a limited number of flooding events in the case study, which may restrict the generalization of our findings. To enhance the robustness of our conclusions, it would be beneficial to include a more extensive range of flooding events with more diverse geographic locations in future studies. Second, the availability of data points posed a constraint on our research. As we only had access to PMIS data up to 2018, some recent flooding events had to be excluded from this study. Moving forward, efforts will be made to address these two limitations and expand the scope of the research to encompass a broader set of events and more up-to-date data. Third, maintenance activities that might have influenced post-flood IRI values could not be entirely accounted for, potentially affecting the precision of the deterioration estimates. Future research should aim to incorporate past maintenance work records. Fourth, while various pavement performance indicators could provide additional insights into the impact of flooding, these parameters (e.g., cracking, rutting, and raveling) were either not available in the dataset or were too incomplete to allow for statistical analysis in this study. Although data on potholes, cracking, and rutting were present, they covered only a small fraction of the analyzed sections. In contrast, IRI values were available for all sections, which is the reason why IRI was selected as the primary indicator. Future studies should explore datasets with more comprehensive pavement condition metrics to enhance the analysis.

Author Contributions

Conceptualization, L.G.; methodology, L.P.; software, L.G.; validation, L.P., L.G. and F.H.; formal analysis, F.H.; investigation, L.P.; resources, J.S.; data curation, L.P.; writing—original draft preparation, L.P.; writing—review and editing, L.G. and J.S.; visualization, L.P.; supervision, L.G.; project administration, L.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to legal reasons.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Abdollahi, S.F.; Kutay, M.E.; Lanotte, M. UPDAPS-Flood: A mechanistic-empirical flexible pavement analysis tool to evaluate the effect of flooding events on flexible pavement performance. Int. J. Pavement Eng. 2024, 25, 2387743. [Google Scholar] [CrossRef]
Hong, F.; Prozzi, J. Assessment of flooding impact on thin pavement structure in Texas coastal region. Int. J. Transp. Sci. Technol. 2024; in press. [Google Scholar]
Asadi, M.; Nazarian, S.; Mallick, R.B.; Tirado, C. Computational process for quantifying the impact of flooding on remaining life of flexible pavement structures. J. Transp. Eng. Part B Pavements 2020, 146, 04020060. [Google Scholar] [CrossRef]
Vallès-Vallès, D.; Torres-Machi, C. Deterioration of flexible pavements induced by flooding: Case study using stochastic Monte Carlo simulations in discrete-time Markov chains. J. Infrastruct. Syst. 2023, 29, 05022009. [Google Scholar] [CrossRef]
Shariatfar, M.; Lee, Y.C.; Choi, K.; Kim, M. Effects of flooding on pavement performance: A machine learning-based network-level assessment. Sustain. Resilient Infrastruct. 2022, 7, 695–714. [Google Scholar] [CrossRef]
Matini, N.; Qiao, Y.; Sias, J.E. Development of time–depth–damage functions for flooded flexible pavements. J. Transp. Eng. Part B Pavements 2022, 148, 04022011. [Google Scholar] [CrossRef]
Nivedya, M.; Tao, M.; Mallick, R.B.; Daniel, J.S.; Jacobs, J.M. A framework for the assessment of contribution of base layer performance towards resilience of flexible pavement to flooding. Int. J. Pavement Eng. 2020, 21, 1223–1234. [Google Scholar] [CrossRef]
Yu-Shan, A.; Shakiba, M. Flooded pavement: Numerical investigation of saturation effects on asphalt pavement structures. J. Transp. Eng. Part B Pavements 2021, 147, 04021025. [Google Scholar] [CrossRef]
Mallick, R.B.; Tao, M.; Daniel, J.S.; Jacobs, J.M.; Veeraragavan, A. Combined model framework for asphalt pavement condition determination after flooding. Transp. Res. Rec. 2017, 2639, 64–72. [Google Scholar] [CrossRef]
Achebe, J.; Oyediji, O.; Saari, R.K.; Tighe, S.; Nasir, F. Incorporating flood hazards into pavement sustainability assessment. Transp. Res. Rec. 2021, 2675, 1025–1042. [Google Scholar] [CrossRef]
Sultana, M.; Chai, G.; Martin, T.; Chowdhury, S. Modeling the postflood short-term behavior of flexible pavements. J. Transp. Eng. 2016, 142, 04016042. [Google Scholar] [CrossRef]
Chen, X.; Wang, H. Evaluation of pavement resilience to flooding with inverted pavement structure. Road Mater. Pavement Des. 2024, 25, 2709–2728. [Google Scholar] [CrossRef]
Lu, D.; Tighe, S.L.; Xie, W.C. Impact of flood hazards on pavement performance. Int. J. Pavement Eng. 2020, 21, 746–752. [Google Scholar] [CrossRef]
Do, T.C. Engineering properties-based parameters used for moisture damage evaluation of asphalt mixtures: A review. Can. J. Civ. Eng. 2023, 51, 390–398. [Google Scholar] [CrossRef]
Chen, Q.; Zhang, Z.; Gaspard, K. Case study on the impact of flooding and inundation on pavement performance. Transp. Res. Rec. 2023, 2677, 157–168. [Google Scholar] [CrossRef]
Zhang, Z.; Wu, Z.; Martinez, M.; Gaspard, K. Pavement structures damage caused by Hurricane Katrina flooding. J. Geotech. Geoenviron. Eng. 2008, 134, 633–643. [Google Scholar] [CrossRef]
Chen, X.; Zhang, Z. Effects of Hurricanes Katrina and Rita flooding on Louisiana pavement performance. In Pavement Materials, Structures, and Performance; ASCE Library: Reston, VA, USA, 2014; pp. 212–221. [Google Scholar]
Shamsabadi, S.S.; Tari, Y.S.H.; Birken, R.; Wang, M. Deterioration forecasting in flexible pavements due to floods and snow storms. In Proceedings of the EWSHM—7th European Workshop on Structural Health Monitoring, Nantes, France, 8–11 July 2014. [Google Scholar]
Sultana, M.; Chai, G.; Martin, T.; Chowdhury, S. A study on the flood affected flexible pavements in Australia. In Proceedings of the 9th International Conference on Road and Airfield Pavement Technology, Dalian, China, 9–13 August 2015; pp. 9–13. [Google Scholar]
Khan, M.U.; Mesbah, M.; Ferreira, L.; Williams, D.J. Assessment of flood risk to performance of highway pavements. In Institution of Civil Engineers-Transport; Thomas Telford Ltd.: London, UK, 2017; Volume 170, pp. 363–372. [Google Scholar]
Sultana, M.; Chai, G.; Chowdhury, S.; Martin, T.; Anissimov, Y.; Rahman, A. Rutting and roughness of flood-affected pavements: Literature review and deterioration models. J. Infrastruct. Syst. 2018, 24, 04018006. [Google Scholar] [CrossRef]
Hong, F.; Wu, H.; Li, J.; Paul, E. Evaluation of flood-vulnerable pavement network in support of resilience in pavement system management. Transp. Res. Rec. 2023, 2677, 474–482. [Google Scholar] [CrossRef]
Choi, S.; Do, M. Development of the road pavement deterioration model based on the deep learning method. Electronics 2019, 9, 3. [Google Scholar] [CrossRef]
Lebaku, P.K.R.; Gao, L.; Lu, P.; Sun, J. Deep Learning for Pavement Condition Evaluation Using Satellite Imagery. Infrastructures 2024, 9, 155. [Google Scholar] [CrossRef]
Lee, Y.; Sun, J.; Lee, M. Development of deep learning based deterioration prediction model for the maintenance planning of highway pavement. Korean J. Constr. Eng. Manag. 2019, 20, 34–43. [Google Scholar]
Hosseini, S.A.; Alhasan, A.; Smadi, O. Use of deep learning to study modeling deterioration of pavements a case study in Iowa. Infrastructures 2020, 5, 95. [Google Scholar] [CrossRef]
Guo, X.; Wang, N.; Li, Y. Enhancing pavement maintenance: A deep learning model for accurate prediction and early detection of pavement structural damage. Constr. Build. Mater. 2023, 409, 133970. [Google Scholar] [CrossRef]
Chen, X.; Zhang, X.; Li, J.; Ren, M.; Zhou, B. A new method for automated monitoring of road pavement aging conditions based on recurrent neural network. IEEE Trans. Intell. Transp. Syst. 2022, 23, 24510–24523. [Google Scholar] [CrossRef]
Inácio, D.; Oliveira, H.; Oliveira, P.; Correia, P. A low-cost deep learning system to characterize asphalt surface deterioration. Remote Sens. 2023, 15, 1701. [Google Scholar] [CrossRef]
Gao, L.; Han, Z.; Chen, Y. Deep learning–based pavement performance modeling using multiple distress indicators and road work history. J. Transp. Eng. Part B Pavements 2023, 149, 04022061. [Google Scholar] [CrossRef]
Gao, L.; Yu, K.; Lu, P. Considering the spatial structure of the road network in pavement deterioration modeling. Transp. Res. Rec. 2024, 2678, 153–161. [Google Scholar] [CrossRef]
Ribeiro, M.T.; Singh, S.; Guestrin, C. “Why should i trust you?” Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 1135–1144. [Google Scholar]
Lundberg, S. A unified approach to interpreting model predictions. arXiv 2017, arXiv:1705.07874. [Google Scholar]
Molnar, C. Interpretable Machine Learning; Lulu. com: Morrisville, NC, USA, 2020. [Google Scholar]
Shapley, L.S. A Value for n-Person Games. 1953. Available online: https://www.rand.org/content/dam/rand/pubs/papers/2021/P295.pdf (accessed on 5 February 2025).
Mishalani, R.G.; Madanat, S.M. Computation of infrastructure transition probabilities using stochastic duration models. J. Infrastruct. Syst. 2002, 8, 139–148. [Google Scholar] [CrossRef]

Figure 1. Locations of sample routes. (a) US0180, FM0179, and US0087; (b) FM1908, SH0131, and FM0481; (c) FM2854; (d) BU0090Y; (e) FM0366; (f) FM0597.

Figure 2. Correlation matrix of selected numerical features.

Figure 3. IRI before and after the flooding events.

Figure 4. IRI before and after flooding events.

Figure 5. IRI deterioration rates: Flooded vs. non-flooded sections.

Figure 6. SHAP summary plot for the gradient boosting model.

Figure 7. Example LIME explanation for one pavement segment, indicating that flood occurrence contributes to higher IRI (indicating worse condition). Green bars indicate positive contributions to IRI; red bars indicate negative contributions.

Table 1. Sample flooded road routes.

Route Name	Flood Year	Start Point	End Point
FM0481	2014	Eagle Pass	Uvalde
FM1908	2014	Spofford	Quemado
SH0131	2014	Eagle Pass	Brackettville
FM2854	2017	Loop 336	Pinewood Dr.
US0087	2015	FM 41 south of Lubbock	Lamesa
FM0597	2015	Cochran county line	FM 303
FM0179	2015	Lynn county line	US 87 in Lamesa
US0180	2015	FM 829	SH 137
FM0366	2008	Intersection with FM 347	N.A.
BU0090Y	2008	FM 1006	Interstate 10

Table 2. Descriptive statistics of the numerical features.

Feature	Mean	Std. Dev.	Min	25%	Max
TX_CONDITION_SCORE	93.91	13.87	0.00	97.00	100.00
TX_DISTRESS_SCORE	95.70	11.35	0.00	99.00	100.00
TX_IRI_AVERAGE_SCORE	100.61	54.17	26.00	57.00	314.00
TX_TRUCK_AADT_PCT	17.60	8.52	0.00	13.10	56.90
TX_CURRENT_18KIP_MEAS	1096.57	978.45	0.00	200.00	8123.00
TX_PVMNT_TYPE_DTL_RD_LIFE_CODE	8.74	1.98	1.00	6.00	10.00
CLIMATE_ZONES_encoded	1.83	0.98	0.00	2.00	3.00
TX_RURAL_URBAN_CODE	1.03	0.21	1.00	1.00	4.00
Flood	0.05	0.21	0.00	0.00	1.00

Table 3. Comparison of regression models for predicting next year’s IRI.

Model	MSE	MAE	R² Score
Linear Regression (LR)	263.8513	8.8476	0.9263
Ridge Regression (Ridge)	263.7546	8.8463	0.9263
Lasso Regression (Lasso)	262.8292	8.8517	0.9266
Decision Tree (DT)	360.7363	10.6944	0.8993
Random Forest (RF)	264.0790	8.5476	0.9262
Gradient Boosting (GBR)	247.0450	8.6095	0.9310

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Evaluating Pavement Deterioration Rates Due to Flooding Events Using Explainable AI

Abstract

1. Introduction

2. Literature Review

2.1. Impact of Flooding on Pavement Condition and Performance

2.2. Deterioration Rate Estimation

2.3. AI Applications in Pavement Deterioration Modeling

2.4. Research Gap

3. Methodology

3.1. LIME

3.2. SHAP

4. Case Study

4.1. Data Collection

4.2. Data Preprocessing

4.3. Statistical Analysis

4.3.1. IRI Comparison of Before and After Flooding

4.3.2. IRI Deterioration Rate: Comparison Before and After Flooding

4.3.3. Flooded and Non-Flooded Section Comparison

4.4. XAI Modeling

4.4.1. Machine Learning Models

4.4.2. Feature Importance via SHAP

4.4.3. Local Interpretations via LIME

4.5. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics