Next Article in Journal
From Speech to Summary in Turkmen: A Parameter-Efficient Neural Pipeline
Previous Article in Journal
S-Band Klystron Intra-Pulse Phase Feedback Upgrade at SPARC_LAB Facility
Previous Article in Special Issue
A Mesoscopic Study on the Constraint Mechanism of Existing Pavement Rutting on the Mechanical Behavior of Sealcoat Based on Highways in China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Pavement Deterioration Prediction Under Data Scarcity: A Hybrid BiLSTM–XGBoost Approach

1
School of Mechanics and Engineering Science, Shanghai University, Shanghai 200444, China
2
China Academy of Transportation Sciences, Beijing 100029, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2026, 16(12), 5732; https://doi.org/10.3390/app16125732
Submission received: 21 April 2026 / Revised: 22 May 2026 / Accepted: 4 June 2026 / Published: 6 June 2026
(This article belongs to the Special Issue New Trends in Road Materials and Pavement Design)

Featured Application

The method proposed in this study can be used in pavement management systems (PMSs) to achieve accurate prediction of Pavement Condition Index (PCI) degradation in common situations where historical data are scarce. Through data augmentation and hybrid modeling, this method can provide reliable data and modeling support for preventive maintenance decision-making and funding optimization on road networks.

Abstract

To address the dual challenges of scarce historical time-series data and limited representational capacity of standalone models in pavement performance prediction, this study proposes an Engineering-heuristic-constrained Perturbation Data Augmentation Framework and a hybrid Bidirectional Long Short-Term Memory–Extreme Gradient Boosting (BiLSTM–XGBoost) model. The augmentation framework generates high-quality virtual samples by applying controlled perturbations aligned with engineering variability—to both covariates (e.g., traffic volume and layer thickness) and Pavement Condition Index (PCI) sequences—while enforcing the physical constraint of monotonic year-on-year deterioration. This expands 10 typical road sections into 1200 training samples. A two-stage prediction architecture is then developed: BiLSTM first extracts high-order temporal features from historical PCI sequences; these features are then fused with covariates and engineering features as input to XGBoost for final regression. Evaluated on an independent test set, the hybrid model outperforms the standalone models and the ANN model, achieving an R2 of 0.771, with RMSE, MAE, and MAPE as low as 2.043, 1.706, and 1.859%, respectively. This work provides an accurate and practical tool for pavement performance prediction under data scarcity, supporting informed decision-making in pavement management systems.
Keywords: BiLSTM; XGBoost; pavement performance prediction; data augmentation; hybrid prediction model BiLSTM; XGBoost; pavement performance prediction; data augmentation; hybrid prediction model

Share and Cite

MDPI and ACS Style

Zhou, X.; Li, L.; Zhu, J. Pavement Deterioration Prediction Under Data Scarcity: A Hybrid BiLSTM–XGBoost Approach. Appl. Sci. 2026, 16, 5732. https://doi.org/10.3390/app16125732

AMA Style

Zhou X, Li L, Zhu J. Pavement Deterioration Prediction Under Data Scarcity: A Hybrid BiLSTM–XGBoost Approach. Applied Sciences. 2026; 16(12):5732. https://doi.org/10.3390/app16125732

Chicago/Turabian Style

Zhou, Xinyu, Li Li, and Jie Zhu. 2026. "Pavement Deterioration Prediction Under Data Scarcity: A Hybrid BiLSTM–XGBoost Approach" Applied Sciences 16, no. 12: 5732. https://doi.org/10.3390/app16125732

APA Style

Zhou, X., Li, L., & Zhu, J. (2026). Pavement Deterioration Prediction Under Data Scarcity: A Hybrid BiLSTM–XGBoost Approach. Applied Sciences, 16(12), 5732. https://doi.org/10.3390/app16125732

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop