1. Introduction
Tunnels are indispensable components of transportation networks, facilitating the movement of people, goods, and services across regions. As these structures age, they become increasingly susceptible to various deterioration mechanisms due to environmental exposure and intensified load demands ranging from corrosion and fatigue to concrete spalling [
1,
2]. The American Society of Civil Engineers has repeatedly highlighted the critical condition of tunnel infrastructure, warning that deferred maintenance can lead to substantial economic and safety risks [
3]. The American Society of Civil Engineers recently rated U.S. transit infrastructure at “D-”, indicating severe deterioration and chronic underinvestment [
3]. As of 2023, the rehabilitation backlog for transit systems has dramatically increased to approximately USD 176 billion, and projections suggest this figure may exceed USD 270 billion by 2029 without substantial intervention [
3]. The Federal Transit Administration further highlighted that approximately 10% of transit assets, equivalent to roughly USD 140 billion in infrastructure value, are not in a state of good repair, emphasizing the urgent need for proactive maintenance and rehabilitation strategies [
4].
The functional use of tunnels contributes to their wear and tear. Tunnels subjected to heavy traffic volumes, especially those accommodating freight transport, experience repetitive loading, which can induce stress and material fatigue [
5,
6]. Moreover, exposure to vehicular emissions introduces chemical agents that can react with tunnel materials, leading to corrosion and other forms of degradation [
7,
8]. A study highlighted that tunnels with higher traffic densities exhibited accelerated rates of deterioration, emphasizing the need for regular maintenance in such infrastructures [
9]. In addition, tunnels are subjected to a variety of environmental, material, and operational factors that collectively influence their structural integrity. Water infiltration is a primary concern, as moisture can penetrate tunnel linings, leading to the corrosion of steel reinforcements and degradation of concrete [
10,
11]. Freeze–thaw cycles, prevalent in many North American regions, can cause concrete cracking and spalling, further compromising tunnel safety. Seismic activity, especially along tectonic plate boundaries, poses risks of structural deformation or collapse. For instance, the 1994 Northridge earthquake in California caused significant damage to several tunnels, underscoring the vulnerability of subterranean structures to seismic events [
12]. In addition, the choice and quality of construction materials significantly impact a tunnel’s durability. Utilizing substandard materials or improper mix designs can accelerate deterioration. For example, inadequate concrete mixes may exhibit increased permeability, allowing harmful agents to penetrate and degrade the structure. Over time, even high-quality materials can experience fatigue, leading to micro-cracks and reduced load-bearing capacity [
13,
14].
In light of widespread aging tunnel systems, proactive asset management strategies are imperative for extending service life and safeguarding public welfare. Deterioration modeling plays a pivotal role in infrastructure asset management by capturing how an asset’s condition evolves and forecasting future performance [
15]. Moreover, they stand as an important input and pillar of maintenance budget allocation models. With that in mind, accurately predicting tunnel deterioration is highly necessitated to guarantee timely, sustainable, and cost-effective predictive maintenance plans of tunnels [
16]. Hence, capturing the temporal deterioration of tunnel elements is crucial to ensure their integrity, safety, and longevity [
16]. In recent years, the world has witnessed a surge in data-driven methodologies to address the limitations of conventional approaches to structural deterioration analysis. Machine learning (ML) techniques have shown strong potential in handling high-dimensional data and unraveling intricate relationships [
17]. Despite promising outcomes, many ML-based methods, such as artificial neural networks [
18] and support vector machines [
19,
20], are susceptible to overfitting and limited generalizability, especially when faced with heterogeneous infrastructure datasets [
21,
22]. Furthermore, modeling tunnel deterioration requires a robust strategy to accommodate the inherent variability in environmental conditions, material properties, and traffic loads [
23].
To overcome these challenges, hybrid models that merge ML algorithms with metaheuristic optimization techniques have gained traction in various civil engineering applications. Metaheuristics, including particle swarm optimization (PSO), genetic algorithm (GA), and artificial bee colony (ABC), have proven effective in refining hyperparameters and bolstering predictive accuracy [
24,
25,
26,
27]. Despite these benefits, metaheuristic approaches for tunnel deterioration prediction remain insufficiently explored. In response to these gaps, this research pursues two main objectives through the developed hybrid metaheuristic-based regression tree model. First, it seeks to leverage a large dataset encompassing multiple tunnel elements and operational contexts to capture interrelated deterioration factors comprehensively. Second, it aims to integrate metaheuristic optimization for hyperparameter tuning and feature selection, thereby promoting more interpretable, robust, and readily applicable models in real-world scenarios. The combination of metaheuristics and regression trees offers a promising yet underexplored avenue. This approach could enhance prediction accuracy by effectively handling complex, nonlinear interactions among deterioration factors. Addressing this gap could lead to more robust maintenance strategies and prolong the operational lifespan of tunnels. Following this introduction,
Section 2 explores the literature on hybrid modeling strategies;
Section 3 details the methodology, data collection, and model formulation;
Section 4; enumerates and explains the used methods and performance indictors in this study;
Section 5 presents model implementation and discusses the results and compares the proposed model with existing benchmarks; and
Section 6 concludes the paper with a summary of key findings, practical implications, and directions for future research.
2. Literature Review
In the context of tunnel infrastructure, deterioration is influenced by myriad factors, including material properties, construction quality, environmental conditions, and maintenance practices. However, there remains a gap in integrating hybrid metaheuristic methods with regression tree models specifically for tunnel elements. This gap underscores the need for innovative approaches that can provide accurate and actionable insights for proactive maintenance and management strategies. Regression tree models, such as random forest (RF), have been widely used in predicting various aspects of infrastructure deterioration due to their ability to handle complex interactions between variables and manage large datasets. For instance, Zhang et al. [
28] employed RF models to predict tunneling-induced settlement, demonstrating superior performance over other machine learning algorithms in capturing the evolution of settlement patterns. Furthermore, regression tree models are widely recognized for their capability to handle nonlinear data and interactions between variables [
29]. However, standalone regression trees often suffer from overfitting and limited generalizability when applied to complex datasets. Hybrid metaheuristic techniques, such as shuffled frog leaping [
30], moth flame optimization [
31], and particle swarm optimization [
32], offer robust solutions by optimizing model parameters and improving prediction accuracy. When combined with regression tree models, these techniques can effectively leverage their respective strengths to enhance predictive performance in deterioration modeling [
33].
Metaheuristic algorithms are optimization techniques designed to find near-optimal solutions for complex problems. When integrated with regression tree models, they can optimize model parameters, thereby improving predictive performance. Commonly used metaheuristic algorithms include particle swarm optimization (PSO), genetic algorithms (GA), and grey wolf optimization (GWO).
The application of hybrid metaheuristic algorithms combined with regression tree models has gained significant attention in predicting the deterioration behavior of tunnel elements. This approach leverages the strengths of both metaheuristic optimization techniques and regression trees to enhance predictive accuracy and model robustness. Recent studies have explored various hybrid models to forecast tunnel-related parameters. For instance, Zhang et al. [
28] developed a hybrid model integrating particle swarm optimization with machine learning algorithms to predict tunneling-induced settlement. Their findings indicated that the integration of PSO effectively optimized hyperparameters, thereby improving the model’s predictive performance. In another study, Mahmoodzadeh et al. [
34] proposed six hybrid models combining Gaussian process regression (GPR) with different metaheuristic algorithms, including PSO and grey wolf optimization (GWO), to predict the duration and cost of road tunnel construction. The GPR-PSO model demonstrated superior accuracy in forecasting construction parameters, highlighting the efficacy of hybrid approaches in complex tunneling projects.
Focusing specifically on tunnel deterioration, a study by Hasan and Elwakil [
35] introduced a stochastic cluster regression-based model to predict the condition rating of highway tunnels. This model utilized operational data and demonstrated improved prediction accuracy, underscoring the potential of combining regression techniques with clustering methods in infrastructure assessment. In another attempt, Zhao et al. [
36] compared the performances of K-nearest neighbor, support vector machines, multi-layer perceptron, random forest, decision tree, and light gradient boosting machine in predicting service states of tunnels. Light gradient boosting machine was able to provide the highest performance with prediction accuracies of 77.3%, 96.4%, and 90.9% in non-structural, civil and structural sections, respectively. Furthermore, Gao et al. [
37] presented a genetic programming-based model to estimate service life of tunnels exposed to chloride-induced corrosion. The average annual relative humidity, stray current intensity, lining thickness of tunnel, and design speed of a tunnel are amongst the factors used in their study. The presented model was able to surpass artificial neural network. In addition, it was shown that the optimal values of population size and sample size are 300 and 20, respectively.
In addition to predictive deterioration modeling, significant research has been devoted to inspecting tunnel surface defects through computer vision approaches. Wang et al. [
38] proposed a U-net architecture for the detection of defects in subway tunnels. In this regard, all convolutional blocks are substituted with inception blocks, and the first convolutional layer is replaced with atrous spatial pyramid pooling. The proposed U-net model was able to provide recall, precision, F-measure, and intersection over union values of 0.66, 0.436, 0.525, and 0.356, respectively. Wu and Zhang [
39] adopted an improved VGG19 network for the segmentation of cracks in subway tunnels. In their model, improved bilateral filtering was employed for Retinex decomposition, and Zhang–Suen fast parallel thinning was implemented to create a crack skeleton diagram. Their model accomplished an accuracy and loss rate of 95.93% and 8.7%, respectively. In another attempt, Huang et al. [
40] created a mask region-based convolutional neural network (R-CNN) for the instance segmentation of cracks in tunnel linings. The developed R-CNN was composed of a backbone network, region proposal network, and head architecture. In addition, a morphological closing operation was executed to connect disjoined cracks. The model was trained using a multi-loss function that compiled mask loss, classification loss, and bounding-box regression. The introduced model attained an F1score, balanced accuracy, and intersection over union of 68.68%, 81.94% and 52.72%, respectively, for the testing images.
Xu et al. [
41] boosted crack detection performance by combining dynamic snake convolution and large separable kernel attention with you only look once (YOLO) V8. In addition, the scale invariant feature transform (SIFT) algorithm was implemented for image stitching and creating panoramic tunnel lining representation. In their work, images were captured using mobile acquisition system equipped with are-scan and line-scan cameras. Liu et al. [
42] introduced a transformer-based method that leverages a Swin Transformer architecture to capture both local and global features of cracks, resulting in improved segmentation performance even in complex tunnel environments. Additionally, Wang et al. [
43] developed a watershed feature fusion network that integrates conventional convolutional neural network (CNN) for segmentation with watershed image processing techniques to effectively differentiate crack pixels from background noise, demonstrating significantly enhanced segmentation accuracy on challenging tunnel images. Feng et al. [
44] introduced a two-stage model for defect recognition in a metro tunnel lining. In the first step, an image classification network was proposed by compiling a convolutional neural network with visual transformers-based blocks. The second step involved multiple detection heads and attention modules upon YOLO for differentiating between cracking, spalling, and water leakage.
Other studies exploited ground-penetrating radar (GPR) to analyze subsurface defects in tunnels. Xie et al. [
45] employed a bi-frequency back projection radar method to assess grouting quality behind tunnel segments, providing the precise detection of voids and anomalies. In addition, GPRmax 3.0 was utilized to study the correlation between domain frequencies of the radar signal, generating profiles of tunnel segment, grouting and soil. Harseno et al. [
46] explored GPR capabilities in identifying air-cavities behind concrete linings, using multi-frequency analysis for improved accuracy. Moreover, a cross-correlation analysis was leveraged to discriminate between defected regions from solid regions, and two-way travel time was used to cluster the severities of air cavities according to their thickness. Thirdly, Wu et al. [
47] advanced GPR applications by employing simulation-supported radar data to detect tunnel lining void defects with high reliability. They used the GPRmax software program based on the finite-difference time-domain method to simulate and study the responses of voids behind tunnel linings under different working conditions.
In reviewing the existing work on tunnel infrastructure, it becomes evident that most research has focused primarily on identifying and categorizing defects such as cracks, spalling, and voids. However, there remains a notable shortage of predictive deterioration models capable of simulating and forecasting the long-term performance and condition of individual highway tunnel elements. Additionally, existing deterioration models often overlook critical tunnel components that influence the structural integrity of the tunnel system, such as interior walls, portals, gaskets, cross passageway, and slabs on grade. This oversight could lead to inaccurate estimations and impractical budget allocations for tunnel maintenance and rehabilitation. Furthermore, some approaches forecast a single and unified condition rating for the whole tunnel, which can be misleading since each component ages differently, and failure often arises when only one crucial element enters a critical state. Therefore, a detailed assessment of deterioration at the element level is necessary for accurate predictions. Moreover, current models have overlooked critical explanatory variables such as tunnel length, structural shape, lane count, and functional classification, which are necessary to understand their influence on the deterioration behavior of highway tunnels. Finally, regression-based deterioration models have significant limitations in accurately representing the intricate relationships between various factors that influence deterioration and the long-term condition ratings of tunnels. This challenge is particularly pronounced due to the complex and multifaceted nature of the structural mechanisms that drive tunnel deterioration [
48]. In addition, it should be noted that some defect evaluation models rely on manual optimization to fine-tune the hyperparameters of the machine learning and deep learning models. Nonetheless, manual tuning is often tedious, computationally expensive, inaccurate and may lead to suboptimal solutions [
49,
50]. This study proposes a novel hybrid metaheuristic-based regression tree model to predict the deterioration behavior of tunnel elements. By leveraging the synergistic strengths of regression trees and metaheuristic optimization, the model aims to provide accurate, reliable, and scalable predictions that can support proactive asset management strategies. The framework is validated using extensive datasets from real-world tunnel projects, emphasizing its practical applicability and effectiveness.
3. Research Methodology
The foremost objective of this research study is to devise a data-driven model that is capable of mimicking the deterioration behavior of six key tunnel components, namely (1) cast-in-place concrete tunnel liners, (2) concrete cross passageways, (3) concrete interior walls, (4) concrete portals, (5) concrete slab on grade, and (6) gaskets. The framework of the developed deterioration prediction model is elucidated in
Figure 1. It is envisioned based on two main modules, namely (1) model design and training, and (2) model validation. Hitherto, an individual deterioration model was designed and validated for each tunnel element. Several guidelines and manuals were subsequently studied and scrutinized to diagnose potential factors that influence the temporal deterioration of tunnels [
51,
52,
53,
54].
Table 1 summarizes descriptions of the possible deterioration factors on tunnels. The output variable is the condition state of tunnel element and prospective input deterioration factors comprise age (X1), age from rehabilitation (X2), total number of lanes (X3), annual average daily traffic (X4), annual average daily truck traffic (X5), detour length (X6), service in tunnel (X7), NHS designation (X8), STRAHNET designation (X9), functional classification (X10), tunnel length (X11), roadway width (X12), hazardous material selection (X13), number of bores (X14), tunnel shape (X15), portal shape (X16), ground conditions (X17), and complexity (X18). The developed deterioration model is trained, tested, and evaluated by consolidating the National Tunnel Inventory (NTI) dataset for 2020–2024. The NTI is a structured database of all tunnels in the United States, and it contains a complete registry of inventory, maintenance, and element level inspection data. This inventory database is maintained and updated annually by the Federal Highway Administration in the United States, and it is a publicly available at [
55] in Excel format. The inspection of the tunnel components was performed according to the provisions and regulations of the tunnel inspection standards [
56].
Over the past decade, artificial intelligence (AI) has received widespread and significant attention from the research community of civil engineering, including material properties [
57,
58], structural analysis [
59,
60], concrete design [
61,
62], steel design [
63,
64], and construction management [
65,
66]. Hitherto, the advent of bio-inspired metaheuristics has successfully been implemented to augment the search mechanism of AI data-driven models. Some recent attempts in this direction involve the uses of particle swarm optimization and artificial neural networks for the estimation of elastic modulus [
67], support vector machines and artificial bee colony for pavement crack classification [
68], and multi-layer perceptron and genetic algorithm for slope stability design [
69].
With that in mind, the developed model herein is envisioned on the amalgamation of an ensemble of metaheuristics and regression trees to anticipate the future performance condition of tunnel components. Based on that premise, metaheuristics are leveraged for two fundamental purposes: (1) to autonomously optimize the hyperparameters of the regression tree model and (2) to simultaneously identify the most influential deterioration factor on each tunnel element. Regression trees are chosen in this research work owing to their low computational cost burden, simplicity, robustness against noise, high interpretability and predictive efficiency, and their ability to mimic complex nonlinear interactions in the dataset [
70,
71,
72]. This research work considers and evaluates the performance of ten state-of-the-art metaheuristics from relevant research studies, including whale optimization (WOA) [
73], black widow optimization (BWOA) [
74], teaching–learning-based optimization (TLBO) [
75], red-tailed hawk (RTH) [
76], particle swarm optimization (PSO) [
32], multi-verse optimization (MVO) [
77], mountaineering team-based optimization (MTBO) [
78], differential evolution (DE) [
79], dandelion optimization (DOA) [
80], and dragonfly optimization (DA) [
81]. In this regard, WOA, TLBO, PSO, and DE are selected because they proved their efficaciousness in several asset management-related practices such as sewer pipeline inspection [
82], bridge maintenance [
83], building rehabilitation planning [
84], and deterioration prediction of water pipes [
85]. In addition, DA is selected because of its fast convergence speed, short running time, robustness, and wide suitability in many engineering applications [
86]. As for MVO, it is able to render a proper balance between the exploration and exploitation of the search space [
77]. In addition, BWOA, RTH, MTBO, and DOA are new metaheuristics that were developed within the last few years, received broad recognition, and yet to be experimented in deterioration anticipation of infrastructure assets [
87,
88].
The entire dataset is divided into training and testing subsets, and the developed deterioration model is learned using 10-fold cross-validation of the training partition. In this regard, the mean squared error is utilized as an objective function which needs to be minimized during the iterative process of training. The developed metaheuristic-based regression tree model is validated through comparison against some well-known machine learning models: support vector machine (SVM), cascade neural network (CNN), generalized regression neural network (GRNN), Elman neural network (ENN), and feed forward neural network (FFNN). The performance of each deterioration model is evaluated and analyzed based on the statistical metrics of mean absolute error, mean absolute percentage error, root mean squared percentage error, root mean squared error, normalized root mean squared error, uncertainty at 95%, objective function criterion, Akaike information criterion, and performance strength criterion. In addition, target diagrams [
89] and error histograms [
90] are leveraged to visually synthesize the precision of the investigated deterioration models. Using these graphical diagrams, the optimal machine learning model can be interpreted.
5. Model Implementation
The model implementation section is divided into eight main subsections, namely (1) data description and parameter setting, (2) convergence behaviors, (3) error histograms, (4) performance evaluation metrics, (5) visual illustrations, (6) target diagrams, (7) influential deterioration factors, and (8) analysis and discussion. In each subsection, the results of each subsection are presented in the following order: cast-in-place concrete tunnel liners, concrete cross passageways, concrete interior walls, concrete portals, concrete slab on grade, and gaskets. The first subsection describes the training and testing subsets of the used National Tunnel Inventory. It also reports the control parameters of metaheuristics. The convergence curves of the utilized metaheuristics are plotted in the second subsection. The third subsection displays the error histograms of deterioration models. The fourth subsection involves recording the performance scores of deterioration models, which are MAE, MAPE, RMSPE, RMSE, NRMSE, U95, and OBJ. The fourth subsection displays graphic illustrations of AIC values, PSC values, and agreement plots (actual vs. predicted curves). The sixth subsection elaborates on the target diagrams of deterioration models. The seventh subsection enumerates the most influential deterioration factors impacting tunnel element condition. The eighth subsection reports the main findings of this research work.
5.1. Data Description and Parameter Setting
An entire dataset is constructed based on compiling the National Tunnel Inventory spanning from 2020 to 2024. In this context, the tunnel inspection records with missing values (incomplete data) were removed from the analysis. The consolidated dataset comprises 2244, 383, 1151, 1900, 1509, and 348 inspection records for cast-in-place concrete tunnel liners, concrete cross passageways, concrete interior walls, concrete portals, concrete slab on grade, and gaskets, respectively. The tunnel condition dataset is partitioned into training and testing subsets, with 80% of the dataset utilized for training the deterioration models and the remaining 20% used for their testing. Several metaheuristics, including WOA, BWOA, TLBO, RTH, PSO, MVO, MTBO, DE, DOA, and DA, were deployed in this study to identify (1) the hyperparameters of regression tree and (2) optimal deterioration factors of each tunnel element. The values of control parameters were retrieved from the literature [
32,
73,
74,
75,
76,
77,
78,
79,
80,
81]. The control parameter settings of metaheuristics are recorded in
Table 2. To maintain a fair comparison between metaheuristics, they were simulated using a population size and maximum number of iterations of 300 and 50, respectively. Each metaheuristic algorithm was independently run five times to address its stochastic nature and maintain reliability of the obtained results. Afterwards, the results of the best runs were used to predict the condition of tunnel elements.
5.2. Convergence Behaviors
Figure 2a illustrates the convergence behaviors of the metaheuristic-based regression tree models of cast-in-place concrete tunnel liners. As can be seen, REGT-TLBO converged at iteration 287 with average MSE of 0.0144. In addition, REGT-WOA and REGT-DOA performed well such that they converged to average MSE 0.0156 at iterations 128 and 158, respectively. As for concrete cross passageways, the convergence behaviors of the metaheuristic-based regression models are presented in
Figure 2b. It is revealed that REGT-WOA was able to swiftly converge at iteration 115 to average an MSE of 0.1332. On the other hand, the highest average MSE (0.1506) was linked with REGT-PSO at iteration 242.
Figure 2c demonstrates the convergence details of deterioration models of concrete interior walls. It is evident that REGT-PSO and REGT-DOA suffered from premature convergence. Furthermore, the smallest values of average MSE were obtained by REGT-DA (0.059) and REGT-RTH (0.0598). As for concrete portals, the results are explicated in
Figure 2d. It is evinced that REGT-RTH becomes well stabilized to average MSE of 0.0101 at iteration 214. Moreover, REGT-TLBO quickly converged to average MSE of 0.0102 at iteration 48. With respect to concrete slab on grade (see
Figure 3a), it is indicated that the lowest training average MSE was produced by REGT-DOA (0.0102), REGT-RTH (0.0104), and REGT-TLBO (0.0105). Nonetheless, the highest training average MSE was yielded by REGT-BWOA (0.0146) and REGT-PSO (0.0138). The details of gaskets are presented in
Figure 3b. As can be seen, it is manifested that REGT-TLBO, REGT-RTH, and REGT-MTBO achieved the best training performance such that their values of average MSE were equal to 0.0397, 0.0399, and 0.0418, respectively.
5.3. Error Histograms
Figure 4a,b show the error histograms for REGT-MTBO and SVM of cast-in-place concrete tunnel liners. It is worth mentioning that the error values are clustered into 20 bins and the orange line denotes zero error. As can be seen, the most frequent error of REGT-MTBO is 0.075, and it is 0.1691 for SVM. Similarly, the error histograms of deterioration models of the concrete cross passageway are displayed in
Figure 4c,d. It is clear that the majority of the error values are close to 0 and 0.1174 for REGT-MVO and SVM, respectively.
Figure 5a,b illustrate the error histograms of deterioration models of concrete interior walls. It is indicated that the most repeated error values of REGT-DA and SVM are 0.075 and 0.1042, respectively.
Figure 5c,d reflect the resulting error histograms for concrete portals. It is manifested that almost 95% of the testing instances of REGT-MTBO accommodate errors less than 0.05. For the FFNN, the most frequent error values are 0.05034 and 0.1505.
Figure 6a,b exemplify the error distributions of REGT-MVO and FFNN in concrete slab on grade. It is conceived that 91% of the error values of REGT-MVO are smaller than 0.075. Moreover, the most repeated error values of FFNN are 0.07596 and 0.2249. The error distributions of REGT-DE and FFNN can be seen in
Figure 6c,d (gaskets). It can be viewed that the most common error value of REGT-DE is 0.025, while the error values of FFNN generally span from 0.03668 to 0.6704.
5.4. Performance Evaluation Metrics
Table 3 depicts the results of the deterioration models of cast-in-place concrete tunnel liners and concrete cross passageways. In cast-in-place concrete tunnel lines, it is shown that REGT-MTBO is the best predictive model (MAE = 0.037, MAPE = 1.893%, RMSPE = 10.027%, RMSE-0.192, NRMSE = 0.203, and U95 = 0.57). It is also shown that REGT-MVO has the second lowest MAE (0.038), MAPE (2.209%), and RMSE (0.195). In the same vein, REGT-WOA achieved the second lowest values of RSMPE (10.948%) and U95 (0.572). Out of all models, SVM produced the highest MAE (0.194), MAPE (15.763%), RMSE (0.44), and U95 (0.972). As for concrete cross passageways, it is observed that the REGT-MVO provides the best performance with MAE, MAPE, RMSPE, RMSE, NRMSE, and U95 of 0.149, 10.173%, 26.077%, 0.386, 0.268 and 1, respectively. In addition, REGT-TLBO comes in the second place attaining the second lowest values of MAE (0.189), MAPE (13.528%), and RMSE (0.435). On the other hand, CNN achieved the lowest accuracies with MAE, MAPE, RMSPE, RMSE, NRMSE and U95 values of 0.534, 43.865%, 52.017%, 0.731, 0.467, and 1.585, respectively.
The results of the deterioration models of concrete interior walls and concrete portals are summarized in
Table 4. In concrete interior walls, it is noted that REGT-DA provides the highest prediction accuracies (MAE = 0.092, MAPE = 5.604%, RMSPE = 17.507%, RMSE = 0.303, NRMSE = 0.301, and U95 = 0.867). In contrast, SVM performed poorly with MAE, MAPE, RMSPE, RMSE, NRMSE and U95 values of 0.234, 18.495%, 23.923%, 0.484, 0.349, and 1.126, respectively. In relation to concrete portals, the results clarify that REGT-MVO and REGT-MTBO exhibit a better performance than the remaining models (MAE = 0.02, MAPE = 1.151%, RMSPE = 8.374%, RMSE = 0.14, NRMSE = 0.134, and U95 = 0.393). On the other hand, SVM is the least effective model such that it exhibited MAE, MAPE, RMPSE, RMSE, NRMSE, and U95 values of 0.196, 16.261%, 17.464%, 0.443, 0.264, and 0.96, respectively.
Table 5 reports the prediction results of the deterioration models of concrete slab on grade and gaskets. For concrete slab on grade, it is shown that REGT-MVO has the highest level of prediction accuracy (MAE = 0.028, MAPE = 1.587%, RMSPE = 9.308%, RMSE = 0.167, NRMSE = 0.193, and U95 = 0.513). Nonetheless, SVM results in the highest prediction error (MAE = 0.165, MAPE = 14.335%, RMSPE = 15.443%, RMSE = 0.406, NRMSE = 0.241, and U95 = 0.881). A similar evaluation was developed for the deterioration models of gaskets. It is revealed that the smallest values of MAE (0.097), MAPE (8.031%), RMSPE (23.361%), RMSE (0.311), NRMSE (0.225), and U95 (0.785) are associated with REGT-DE, thus signifying its high predictive capabilities. Conversely, SVM yields the weakest prediction performance such that it attains the highest MAE (0.353), highest MAPE (28.804%), second highest RMSPE (31.711%), highest RMSE (0.594), fifth highest NRMSE (0.343), and second highest U95 (1.238).
Table 6 records the OBJ values of the deterioration models of cast-in-place concrete tunnel liners and concrete cross passageways. For cast-in-place concrete tunnel liners, it is determined that REGT-MTBO is the best fitted model since it has the lowest OBJ value (0.0326). It is followed by REGT-MVO, REGT-WOA, and REGT-DE with OBJ values of 0.037, 0.0374, and 0.04, respectively. As for concrete cross passageways, it is manifested that REGT-MVO outclasses other deterioration models, whereas it achieves the smallest OBJ value (0.1264). In addition to that, REGT-PSO and REGT-TLBO induce the second (0.1872) and third (0.2001) highest values of OBJ.
Table 7 illustrates the results of concrete interior wall and concrete portals. According to the OBJ values of concrete interior walls, REGT-MVO is deemed the best prediction model followed by REGT-RTH, REGT-TLBO, and REGT-WOA. Their respective OBJ values are 0.4105, 0.4226, 0.5621, and 0.573, respectively. With regard to concrete portals, it can be seen that REGT-MVO and REGT-MTBO have the smallest OBJ values (0.0154), and hence, they outperform the remainder of the deterioration models.
Table 8 explicates the OBJ values of deterioration models of concrete slab on grade and gaskets. Concerning concrete slab on grade, it is found that REGT-PSO exhibits superior performance, whereas its OBJ value (0.0331) is the smallest among the investigated machine learning models. However, CNN (1.4635), GRNN (5.3867), and FFNN (7.7609) have the highest OBJ values, demonstrating their incapabilities of anticipating the condition of concrete slab on grade. With respect to gaskets, it is noticed that REGT-DE exhibits the smallest OBJ value (0.0899) and hence, it is the best model from this standpoint.
5.5. Visual Illustrations
Figure 7a demonstrates the AIC values of the deterioration models of cast-in-place concrete tunnel liners. REGT-MTBO exhibits the smallest AIC values in the training (−7759.63) and testing phases (−1447.34). Subsequently, it has strong generalization abilities when compared against other deterioration models. In relation to concrete cross passageways, AIC values of training and testing phases are illustrated in
Figure 7b. It is inferred that REGT-MVO attains the lowest AIC values (−825.35 for training and −110.41 for testing) while CNN has the highest values (−191.94 for training and −12.23 for testing). With regard to concrete interior walls, REGT-MTBO (see
Figure 8a) is deemed the optimal deterioration model (AIC = −3067.95 for training and AIC = −444.28 for testing) followed by REGT-RTH (AIC = −2383.37 for training and AIC = −420.29 for testing). As for concrete portals (see
Figure 8b), REGT-WOA obtains the smallest AIC value (−7183.62) in the training phase, while REGT-MVO and REGT-MTBO obtains the smallest AIC value (−1459.85) in the testing phase. In concrete slab on grade (see
Figure 9a), REGT-PSO yields the lowest AIC value (−5876.69) in the training stage while REGT-MVO yields the smallest AIC value (−1045.22) in the testing stage. With respect to gaskets (see
Figure 9b), it is derived that REGT-PSO accomplishes the smallest AIC value (−1012.43) in training, while REGT-TLBO sustains the smallest AIC value (−111.25) in testing.
Figure 10a depicts the PSC values of the deterioration models of cast-in-place concrete tunnel liners. The results show that REGT-MTBO has the largest PSC value (0.776) followed by REGT-MVO (0.743) and REGT-WOA (0.742). Consequently, REGT-MTBO is ranked in first position, while REGT-MVO and REGT-WOA come in second and third position, respectively. The concrete cross passageway results can be viewed in
Figure 10b. It can be observed that REGT-MVO is the superior deterioration model with the highest PSC value (0.766). On the contrary, CNN (−0.265) and FFNN (0.192) produce the smallest PSC values. The details of the concrete interior walls are listed in
Figure 11a. REGT-MTBO achieves the highest PSC value (0.562), followed by REGT-RTH and REGT-TLBO with the second (0.297) and third (0.257) highest PSC values, respectively. As for concrete portals (see
Figure 11b), the largest PSC value (0.262) is associated with REGT-RTH model and the second highest PSC value is attributed to REGT-WOA (0.238). In the concrete slab on grade (see
Figure 12a), it is observed that REGT-PSO sustains the highest value of PSC (0.784) as compared to the remaining deterioration models. With respect to gaskets (see
Figure 12b), REGT-MTBO produces the largest PSC value (0.877), and then REGT-PSO comes in second by a slight margin (0.876).
Figure 13,
Figure 14 and
Figure 15 show graphic depictions of the level of performance of some deterioration models across tunnel elements plotted against the actual data. The X-axis reflects the tunnel elements, and the Y-axis is the tunnel condition, either predicted or actual. The black line is the actual tunnel condition and the red line is the predicted tunnel condition by the deterioration model.
Figure 13a depicts visual representation of accuracies of REGT-MTBO in cast-in-place concrete tunnel liners. It is shown that REGT-MTBO could give closer values to the actual condition of cast-in-place concrete tunnel liners indicating its high predictive abilities.
Figure 13b shows the condition of concrete cross passageways as forecasted by REGT-MVO. It expounds that closer distributions are experienced between the observed and predicted values demonstrating the high predictive accuracy of REGT-MVO.
Figure 14a displays graphical illustrations of projected tunnel condition by REGT-DA. Results corroborate the accurate estimations rendered by REGT-DA.
Figure 14b depicts the predicted outcomes of concrete portals for REGT-MTBO. It is clearly seen that the values of predicted condition are pretty similar to the actual ones signifying REGT-MTBO as an efficacious deterioration model of concrete portals.
Figure 15a demonstrates the differences between actual and estimated condition by REGT-MVO. A notable agreement is attained between the observed and estimated condition of concrete slab on grade, affirming the superior predictive abilities of REGT-MVO.
Figure 15b exemplifies the predictive capabilities of REGT-DE for gaskets. A close relationship is shown between the observed and anticipated condition over the different data instances, showing that the performance of REGT-DE is very near to the actual condition.
5.6. Target Diagrams
Target diagrams are harnessed herein to characterize a visual representation of the unbiased root mean square difference (uRMSD) and bias of the deterioration models (see
Figure 16,
Figure 17 and
Figure 18). In this respect, the nearest deterioration model to the origin is regarded as the best fitted. From the resulting diagrams, it can be conferred that REGT-MTBO (urmsd = 0.1171, bias = 0.0047) is the most efficacious model in predicting the condition of cast-in-place concrete tunnel liners (
Figure 16a). Looking into the target diagrams of the concrete cross passageway (
Figure 16b), it can be immediately observed that REGT-MVO is the best performing model with urmsd and bias values of 0.3649 and −0.0325, respectively. On the other hand, ENN (urmsd = 0.5877, bias = −0.2832) and CNN (urmsd = 0.5937, bias = −0.2753) are found to perform poorly in such area of application. The target diagram of concrete interior walls (
Figure 17a) shows that REGT-DA (urmsd = 0.1819, bias = 0.0054) is the closest model to the origin. On the other hand, SVM failed to imitate the actual tunnel condition, whereas it obtained urmsd and bias values of 0.2862 and −0.1055, respectively. As for concrete portals (
Figure 17b), the target diagram manifests that REGT-MVO and REGT-MTBO (urmsd = 0.0889, bias = −0.0018) work better than the remaining prediction models. As for concrete slab on grade (
Figure 18a), it is propounded that the metaheuristic-based regression tree models sustain a far better performance than the typical machine learning models. For instance, the values of urmsd and bias of REGT-PSO are 0.1096 and −0.0011, respectively. As for gaskets (
Figure 18b), the metaheuristic-based regression tree models established notable improvements in the predictive performance over the orthodox machine learning models. For example, the values of urmsd and bias of REGT-MVO are corresponding to 0.1733 and −0.015, respectively.
5.7. Influential Deterioration Factors
The optimum structures of the best performing deterioration models are listed in
Table 9. It is propounded that the most influential factors affecting the condition of cast-in-place concrete tunnel liners are age, rehabilitation year, number of lanes, ADT, ADTT, detour length, service in tunnel, STRAHNET designation, functional classification, tunnel length, roadway width, number of bores, tunnel shape, portal shape, ground conditions, and complexity. As for the concrete cross passageway, the most significant deterioration factors include age, number of lanes, detour length, service in tunnel, NHS designation, STRAHNET designation, tunnel length, roadway width, hazardous material selection, tunnel shape, ground conditions, and complexity. In the same vein, concrete interior walls are found to be highly influenced by age, rehabilitation year, ADT, ADTT, detour length, service in tunnel, NHS designation, STRAHNET designation, functional classification, tunnel length, roadway width, hazardous material selection, number of bores, portal shape, ground conditions, and complexity. In addition, the top deterioration factors affecting concrete portals are the number of lanes, ADT, ADTT, detour length, service in tunnel, NHS designation, STRAHNET designation, functional classification, tunnel length, roadway width, hazardous material selection, number of bores, and ground conditions. with regard to concrete slab on grade, the most impacting deterioration factors are the number of lanes, ADT, NHS designation, STRAHNET designation, functional classification, hazardous material selection, tunnel shape, and complexity.
It is also revealed that the most affecting factors on the condition of gaskets encompass are rehabilitation year, number of lanes, service in tunnel, functional classification, tunnel length, roadway width, hazardous material selection, number of bores, tunnel shape, portal shape, and ground conditions. Generally speaking, the most paramount factors to pay attention to when studying tunnel deterioration are number of lanes, service in tunnel, tunnel length, and roadway width. The optimum structure of regression tree was investigated (type of predictor selection (psl), minimum parent size (ps), minimum leaf size (ls), and maximum number of splits (ns)). The optimum type of predictor selection method was either “all splits” or “interaction-curvature”. The optimum values of minimum parent size were either 2, 3, or 4. The optimum value of leaf size was 2, and the optimum values of maximum number of splits range between 800 and 2000. The optimum deterioration factors and hyperparameters can serve as useful references for the proper use of regression trees to accurately anticipate the condition of tunnel elements. For example, the optimum architecture of the regression tree used for the deterioration prediction of concrete interior walls is composed of “all splits” predictor selection and the minimum parent size, minimum leaf size, and maximum number of splits of 2, 2, and 2000, respectively.
5.8. Analysis and Discussion
In light of the foregoing, it can be underlined that no single metaheuristic-based regression tree was able to be the superior deterioration model across all tunnel elements. For example, REGT-MTBO was the best fitted deterioration model in cast-in-place concrete tunnel liners and concrete portals. In addition, REGT-MVO was the best-performing model in concrete cross passageways and concrete slab on grade. Moreover, REGT-DA and REGT-DE exhibited the highest accuracies in concrete interior walls and gaskets, respectively. In view of these findings, some conclusions can be drawn: (1) MTBO is the best metaheuristic in dealing with large dataset inventories, (2) DE achieves better accuracies with small datasets than with large ones, and (3) MVO performs nearly as well with small datasets as with the large ones. The study of deterioration factors revealed that the string of the most significant deterioration factors varies from one tunnel element to the other. However, there are four common factors that are total number of lanes, service in tunnel, tunnel length and roadway width. As for the optimum structures of regression trees, “all splits” was found to be the optimum type of predictor selection in five out of the six tunnel elements, and “interaction-curvature” was the optimal type in one tunnel element. The optimal values of minimum parent size ranged between 2 and 4, the minimum leaf size was 2, and the maximum number of splits extended from 800 to 2000.
Across the six tunnel elements, it is noted that the ten metaheuristic-based regression tree models outperformed classical machine learning models. This evinces the adaptability, robustness, high exploration/exploitation abilities, and fast convergence of the developed model. The high search abilities allowed us to effectively probe the design space without stagnating in local minima solutions and premature convergence. Another important feature of the developed model is its ability to automatically tune the hyperparameters of regression tree during the training process. This allowed us to alleviate the time-consuming nature of and suboptimal solutions to manually optimizing hyperparameters. A third important advantage of the developed model is its ability to perform automated feature selection of deterioration factors during the learning process of the regression tree, which aided in dodging the less predictive results of filter methods. The developed model consumes more training time than the classical machine learning models. However, it can be argued that the training process is carried out only once, and the model is well learned so that the optimum deterioration factors and features are appended and can be directly used for prediction.
6. Conclusions
This study introduced a novel hybrid metaheuristic-based regression tree model to predict the temporal deterioration behavior of six distinct tunnel components—cast-in-place concrete tunnel liners, concrete cross passageways, concrete interior walls, concrete portals, concrete slab on grade, and gaskets. Drawing upon a large-scale dataset sourced from the National Tunnel Inventory, the proposed framework leveraged ten metaheuristic algorithms (e.g., whale optimization, teaching–learning-based optimization, red-tailed hawk, particle swarm optimization, and differential evolution) to (1) optimize regression tree hyperparameters and (2) determine the most influential deterioration factors for each tunnel element. Comparative evaluations against conventional machine learning techniques, including support vector machines, neural networks, and ensemble methods, illustrated the superior predictive performance of metaheuristic-enhanced regression tree models in terms of lowered error metrics (MAE, RMSE, and MAPE), stronger generalization (via AIC and OBJ criteria), and more robust model stability (via target diagrams and PSC values).
Notably, while the metaheuristic-based regression trees generally outperformed standard machine learning methods, no single metaheuristic variant consistently dominated across all tunnel elements. This result underscores the complexity of tunnel deterioration processes: each tunnel component may respond differently to variations in environmental conditions, traffic loads, material properties, and other factors. The detailed feature-selection analysis revealed that number of lanes, service in tunnel, tunnel length, and roadway width emerged as critical predictors across multiple tunnel elements, suggesting that these factors warrant particular attention in maintenance planning and resource allocation. From a practical standpoint, this research study supplies transportation agencies with a reliable and robust platform to forecast the temporal deterioration rates of their tunnel infrastructure. It can also aid asset managers in customizing their own deterioration models based on the available dataset. Constituting a key pillar of budget allocation models, this in turn leads to more informed, sustainable, and cost-effective maintenance and rehabilitation programs, thereby enhancing public safety and the long-term operation of tunnel elements and extending their service life. This study also reports the most significant deterioration factors for each tunnel element, whereas collecting data about these factors can assist transportation agencies in building their tailored deterioration models, reducing the data collection burden, and understanding temporal tunnel performance. Future research can build on these insights by (1) devising non-destructive techniques-based deterioration models through incorporating multi-sensor technologies such as ground penetrating and digital photogrammetry, (2) integrating the developed model with internet of things for dynamic update and real time monitoring of condition data, and (3) leveraging metaheuristics with deep learning techniques such as convolutional neural network for tunnel deterioration prediction. Overall, the outcomes of this study demonstrate that combining regression trees with metaheuristic algorithms offers a powerful and versatile approach for advancing the state of the art in tunnel deterioration modeling and ensuring more resilient infrastructure systems.