1. Introduction
Statistics show that 29% of aircraft accidents occur during landing or takeoff on wet runways [
1]. These accidents are mainly caused by rainfall accumulation, which creates a thick water film on the pavement surface. When aircraft land on wet runways, the water film significantly reduces friction [
2,
3], increasing the likelihood of hydroplaning or aquaplaning. This phenomenon heightens the risk of runway overruns, potentially resulting in substantial damage to the aircraft or passenger injuries and fatalities [
4]. Consequently, measuring the WFD to assess runway conditions is important for ensuring the safety of aircraft operations.
The International Civil Aviation Organization (ICAO) Annex 14 [
5] mandates the real-time reporting of standing water depths exceeding 3 mm on runways. This requirement, specified in the ICAO Global Reporting Format (GRF) for Runway Surface Conditions, has been in effect since November 2021 to reduce the risk of aircraft aquaplaning. In practice, measuring WFD on runways for landing or departing aircraft during adverse weather conditions is costly and hazardous for the staff involved. WFD measurement at specific pavement points using instruments is constrained by runway clearance requirements, making on-pavement installation challenging. The distance of instrument placement and adverse weather conditions also limit measurement accuracy. As a result, predictive models for predicting potential WFD in touchdown zones have gained increasing attention as an alternative.
Predictive models can be classified into two types: empirical and analytical models [
1]. Empirical models are developed using experimental data to formulate mathematical expressions, while analytical models are based on analyses of physical and mechanical principles. Over the past decades, research has focused largely on developing WFD predictive models. Prediction accuracy depends heavily on the suitability of the selected parameters regardless of model type. Most predictive models incorporate independent parameters, such as pavement slope, rainfall intensity, drainage length, and mean texture depth. In general, drainage length and rainfall intensity are positively correlated with WFD, whereas pavement slope and mean texture depth are negatively correlated.
Analytical models require validation through experimental methods, but the reliability of these experiments depends on a well-designed framework and precise data acquisition. Ensuring the quality of the predictive model necessitates thorough planning and execution. In this study, optimal sensors will be selected to measure WFD, and data will be collected through experimental methods. The gathered dataset will then be used to develop a predictive formula for WFD.
3. Methodology
This study involved experimental setup and environmental construction, data measurement and model evaluation, as shown in
Figure 1. The environmental setup included the installation of a water supply system, the construction of runway pavement, and the placement of sensors. A water supply system was used to simulate rainfall conditions, while an asphalt concrete pavement was prepared to replicate runways with different slopes and mean texture depths. Laser displacement sensors were installed to measure WFD dynamically.
Regarding the WFD prediction model, an empirical model proposed by Gallaway et al. has already been introduced. Additionally, various models with different levels of accuracy and reliability evaluation criteria have been adopted. Since using a combination of different comparison and evaluation approaches presents a significant challenge, this study focuses on exploring and analyzing commonly used empirical models and evaluation criteria.
Three evaluation metrics were employed, mean absolute error (MAE), relative absolute error (RAE), and relative root squared error (RRSE), to compare the predictive performance of the four existing WFD models with the model developed in this study.
3.1. Experimental Equipment
3.1.1. Water Supply System
As shown in
Figure 2, the layout of the water supply system was designed to simulate rainfall on pavement surfaces. This system consists of the following primary components: a water tank, water pump, water filter, steel drum, air compressor, water flow valve, flow meter, distribution device, water supply tubes, spray nozzles, and a drainage pipe. The water tank served as the reservoir, providing the necessary volume for rainfall simulation. Water was pumped from the water tank, filtered through a filtration device to remove impurities that could clog the nozzles, and directed into a high-pressure cylindrical steel drum. The air compressor, paired with a pressure regulator, a water flow valve, and a flow meter, maintained optimal water pressure to ensure steady flow to the distribution system. The water was then directed by the distribution device, which was mounted on a support frame, to the spray nozzles. Finally, excess water was collected through drainage pipes and returned to the water tank, allowing for efficient recycling within a closed-loop system.
3.1.2. Pavement System
Figure 3 illustrates the design of the pavement system for the runway. Given the symmetrical nature of runways, the pavement system in this experiment was designed to represent only one side of the actual runway structure along its centerline. The upstream section of the pavement corresponds to the runway centerline, while the downstream section represents the area near the runway edge. The key independent parameters, which include the transverse width of the pavement, transverse slope, and mean texture depth, varied in accordance with the experimental design to achieve the desired configurations.
3.1.3. Water Depth Measurement System
This study utilized specialized laser displacement sensors based on reflective principles to measure the WFD on the pavement with high accuracy. The laser beam from the sensor is projected at a fixed angle onto the surfaces of the water film and the pavement. The reflected light is collected by a receiver lens and directed to a sensor component. When the laser beam passes through the water medium, its propagation speed is slower than when passing through air, resulting in a measurable delay in the reflection time. This time delay varies with the depth of the water, enabling the prediction of water depth based on this principle. The configuration of the water depth measurement system is illustrated in
Figure 4. This study employed a probe-type water level gauge (as shown in
Figure 5) to measure the actual water depth, which corresponds to the depth measured by the laser displacement sensor.
Before the experiment, the probe-type water level gauge and laser displacement sensor were used to measure water depth, obtaining the correlation, Actual water depth = 2.9852 * Laser Displacement Sensor measured water depth, as shown in
Figure 6.
3.2. Data Processing
Data acquisition involves two primary steps, measurement and processing, to obtain the final analytical dataset. In this study, key parameters, such as mean texture depth, pavement slope, rainfall intensity, and drainage length, were controlled. WFD measurements were recorded throughout the experiment. Given that WFD measurements are inherently unstable due to disturbances from raindrop splashes and water flow fluctuations, the simple moving average (SMA) method was applied to smooth the data. The average results represent the WFD corresponding to specific experimental parameters throughout the measurement.
Equations (1) and (2) outline the data acquisition process for generating stable measurement results. During an interval P, at the
i-th iteration,
Wf denotes the WFD detected at time point
f, where
f ∈ P. At a given time point
x,
represents the average WFD measured over the interval. These procedures are repeated to calculate a moving average as the temporary WFD value for each time point
x. The iterations are performed n times, with
f(
x) denoting the average of all
iterations. The final
f(
x) value represents the WFD under the experimental design, which includes parameters such as average texture depth, pavement transverse slope, rainfall intensity, and drainage length.
where
x = the detected time point,
i = the
ith iteration for sample point
x,
b = the moving period,
Wf = the detected WFD at time points of
f,
= the mean value of
ith iteration, P = the total period,
n = the measuring times in P, and
f(
x) = the mean of all iterations of
.
3.3. Model Evaluation
After conducting experiments under various scenarios outlined in the experimental design, this study will develop a predictive model for WFD and compare its performance against existing models proposed in prior research. The existing models include the Pourhassan model (Pourhassan et al. [
12]), the Gallaway model (Gallaway et al. [
25]), and the Empirical PAVDRN model (David et al. [
13]), as presented in Equations (3)–(5). All of these models incorporate parameters common to this study, including rainfall intensity (I, in./h), mean texture depth (MTD, in), water path length (L, ft), and runway transverse slope (S, ft/ft). The WFD was measured using a gauge down to the pavement surface. Consequently, the measured WFD value was obtained by subtracting the MTD from the depth recorded by the gauge. As a result, the WFD value at the reference plane of the mean texture depth is zero.
3.4. Evaluation Criteria
MAE, RAE, and RRSE were employed, as defined in Equations (6)–(8), to compare the predictive performance of WFD models in this study.
MAE quantifies the average magnitude of errors between the observed (Wi) and the estimated values (). It is calculated as the average of the absolute differences between these values.
RAE serves as an accuracy measure for regression models by comparing the model’s performance to a naive baseline. A well-performing model will yield a RAE ratio of less than 1.
RRSE normalizes the error to the same scale as the original measurements. A robust predictive model will result in error values for MAE, RAE, and RRSE that are close to zero.
These criteria provide a comprehensive assessment of the predictive accuracy and reliability of the models under comparison.
where
= actual WFD for case
i,
= estimated WFD for case
i, and
= average actual value of WFD for case
i. 4. Experimental Results
4.1. Experimental Design
Several limitations in experimental setups were found in previous studies, such as the following:
Point-by-point observations using a water level gauge, which may introduce visual measurement errors;
Inability to obtain multi-point data simultaneously;
Difficulty in achieving uniform rainfall distribution due to the spray configuration and placement.
This study addressed these limitations through a carefully designed experiment focused on achieving uniform rainfall distribution by adjusting the spacing of the spray nozzles. Additionally, using laser measurement and electrical data collection instruments, the experiment allowed for the precise measurement of multiple WFD values at different points simultaneously along the eight-meter-wide runway pavement cross-section.
Figure 7 illustrates the configuration of the water supply system at the rear end. The spray nozzles are positioned 1.8 m above the pavement surface, and the water flow rate is controlled via a shutoff valve beneath the flow meter, adjusting the water flow rate from 25 to 50 L per minute. The water is delivered through hoses to the spray nozzles, corresponding to rainfall intensities of 6.43 mm to 12.87 mm per six minutes. Subsequently, the rainfall intensity recorded every 6 min will be multiplied by 10, representing the hourly rainfall intensity.
During typical aircraft landings, the main landing gear typically lands within 14 m of the runway centerline (the highest point of the runway cross-section), with a lateral distance of 7 m on each side. This study designed a 1.2 m wide, 8 m long asphalt pavement section positioned on a platform 1 m above the ground to conservatively simulate the runway cross-section. The pavement’s transverse slope is adjustable from 0.3% to 1.5%. The upstream end of the pavement represents the runway centerline, and the downstream end is located 8 m from the centerline. Two asphalt concrete pavement samples were fabricated with average mean texture depths of 0.38 mm and 0.98 mm, according to the measurement standards (ASTM E965-15) outlined by the American Society for Testing and Materials [
26].
In this experiment, eight spray nozzles were used, each with a maximum spray angle of 120° (
Figure 8). Theoretically, the spray distribution should be uniform under the same pressure and water flow conditions. To test the spray coverage, the experiment maintained a water supply at 4 kg/cm
2 air pressure and a flow rate of 45 L per minute.
Table 2 shows the average water output from each nozzle for over 6 min at various nozzle spacings, ranging from 96 to 180 cm. At a nozzle spacing of 120 cm, the average rainfall intensity per 6 min was 13.63 mm with a standard deviation of 1.1 mm and a variance of 0.08, representing the smallest variation among all spacing conditions. Therefore, the nozzle spacing was initially set at 120 cm and adjusted to optimize the uniformity of water distribution.
The pressure and flow loss differ because the distance from the pressurized steel drum to each nozzle varies, as well as the positioning of the branching pipes, necessitating fine-tuning of the nozzle spacing. After a trial-and-error process, the optimal spacing arrangement was selected at 23, 79, 219, 335, 469, 594, 704, and 784 cm from the upstream end. This configuration ensures the most uniform rainfall distribution across the pavement, from the upstream end (representing the runway centerline) to eight meters downstream.
Figure 9 illustrates the laser displacement sensors used for measuring WFD. Six measurement points were established at distances of 100, 190, 310, 440, 590, and 690 cm from the upstream end. A red semiconductor laser with a wavelength of 655 nm projects onto each measurement point, with a response time of five microseconds. On the pavement surface, the laser forms a visible red dot, and the sensor’s receiving component detects the time interval of the reflected waveform. The collected data are transmitted via an RS-422 interface to a programmable logic controller (PLC). WFD measurements are recorded twice per second and continuously logged for 6 min.
4.2. Data Collection
Based on the experimental design, this study collected 228 experimental data combinations for WFD measurements (NTU dataset). The combinations included variations in the transverse slope (S) at 0.3%, 0.5%, 1%, and 1.5%; outflow rates of 25, 30, 35, 40, 45, and 50 L per minute, corresponding to rainfall intensities (I) of approximately 64, 77, 90, 102.9, 115.8, and 128.7 mm/h, respectively; mean texture depths (T) of 0.38 mm and 0.98 mm; and drainage length (L) at 100, 190, 310, 440, 590, and 690 cm.
Figure 10 illustrates the measurement results for a transverse slope of 0.5%, mean texture depth of 0.38 mm and a water supply rate of 45 L per minute, collected at six measurement points (L) from the centerline. “O-100” represents the raw measurement data obtained from the laser displacement sensors, while “SMA-100” indicates the smoothed and averaged data after applying the SMA method to the original measurements.
WFD was recorded twice per second, resulting in 720 data points per measurement location over 6 min. Variability in the raw measurements was expected and deemed reasonable due to continuous rainfall on the pavement surface.
During each experiment, the WFD gradually increases before stabilizing as water flows downstream through the measurement points. Measurements were collected from the 300th to the 600th point (i.e., from two minutes thirty seconds to five minutes) of each test to ensure water flow stability. The average of these measurements was recorded as the WFD for the given experimental setup. This process was repeated across all six measurement points, yielding data for each measurement point in every experimental scenario.
4.3. Experimental Data Analysis
Table 3 presents the descriptive statistics of the parameters for the NTU dataset from this study and the Gallaway dataset, which originates from prior research (Gallaway et al. [
26]). The NTU dataset includes five parameters for each data point: mean texture depth (T), pavement transverse slope (S), rainfall intensity (I), drainage length (L), and water film depth (WFD). The units of measurement are millimeters for parameters T and WFD, centimeters for L, and millimeters per hour for rainfall intensity I. The pavement transverse slope values range from 0.3% to 1.5%, while the drainage length spans from a minimum of 100 cm to a maximum of 690 cm. In this study, WFD is measured as the distance from the water film surface to the highest point of the pavement texture, with the texture peak considered the zero point. As a result, all WFD values are positive, ranging from 0.31 mm to 3.22 mm.
In contrast, the Gallaway dataset comprises 960 data points, with WFD values ranging from −7.26 mm to 4.78 mm. In that dataset, the texture peak serves as the reference plane, with zero indicating the top of the texture. Negative WFD values represent depths below the texture peaks.
Figure 11 illustrates the histograms of WFD values for the NTU and Gallaway datasets, along with their corresponding normal distribution curves. In the NTU dataset, most WFD values fall within the range of 0 to 3 mm, with only about five data points exceeding 3 mm, accounting for approximately 2% of the entire dataset. This suggests that future predictive models may exhibit higher error rates for WFD values above 3 mm. In contrast, the Gallaway dataset shows the majority of WFD values ranging from −1 to 2 mm, including several negative WFD values, with the mean value close to zero.
The NTU dataset was analyzed, as illustrated in
Figure 12a. The possible WFD values were determined under varying drainage lengths, transverse slopes, and rainfall intensity conditions from the dataset for a fixed texture depth of 0.38 cm. These values are represented within the I-bar range shown in the figure, yielding an average WFD value of 1.25 cm. Similarly, a fixed texture depth of 0.98 cm, and varying drainage length, transverse slope, and rainfall intensity conditions, resulted in an average WFD value of 1.09 cm. These findings indicate an inverse relationship between higher average texture depth and lower WFD values.
Figure 12b demonstrates a positive correlation between longer drainage lengths and higher average WFD values.
Figure 13a reveals that higher pavement transverse slope values are associated with lower WFD values, mirroring the trend observed with average texture depth. In contrast,
Figure 13b shows that higher rainfall intensity is positively correlated with higher WFD values, a trend like that observed for texture depth.
Table 4 presents the Pearson correlations for the NTU and Gallaway datasets. For the NTU dataset, at a significance level of α = 0.05, the parameters T, S, I, and L are significantly correlated with WFD. Specifically, T and S exhibit a negative correlation with WFD, whereas I and L show a positive correlation with WFD.
Similarly, for the Gallaway dataset, at α = 0.05, T, S, I, and L are also significantly correlated with WFD. As observed in the NTU dataset, T and S have a negative correlation with WFD, while I and L have a positive correlation. Overall, the correlations between the parameter values and WFD are consistent across the two datasets.
4.4. Development of Prediction Model Based on Experimental Data
This study employs MLR to determine the best fit for the data. Using the NTU dataset, the NTU prediction formula (Equations (9) and (10)) was established, yielding an R
2 value (Coefficient of Determination) of 0.924566.
where
L = drainage length (cm),
I = rainfall intensity (mm/hr),
S = pavement transverse slope (%), and
T = average texture depth (mm).
where
L = drainage length (ft),
I = rainfall intensity (in/h),
S = pavement transverse slope (%), and
T = average texture depth (in).
4.5. Comparison of the Performance of Prediction Models
This study compares the performance of four prediction models, the Pourhassan model (Pourhassan et al. [
12]), Gallaway model (Gallaway et al. [
26]), Empirical PAVDRN model (Huebner et al. [
13]), and the NTU model, using the NTU and Gallaway datasets. The comparison utilizes three evaluation metrics: MAE, RAE, and RRSE. Larger values of MAE, RAE, and RRSE indicate poorer WFD prediction performance.
Table 5 and
Figure 14 illustrate that, for the NTU dataset, the NTU model provides the most accurate predictions among the five models, with MAE, RAE, and RRSE values of 0.01, 0.59, and 0.10, respectively, indicating superior WFD prediction performance.
Table 6 and
Figure 15 show that for the Gallaway dataset, the NTU model also achieves the best performance with MAE, RAE, and RRSE values of 0.02, 0.4, and 0.12, respectively. In conclusion, the NTU model demonstrates exceptional performance not only on the NTU dataset but also on the Gallaway dataset. Overall, the comparative analysis highlights the accuracy and robustness of the NTU model among the five evaluated models.
4.6. Sensitivity Analysis and Application of NTU Model Variables
Although the NTU model uses four variables to determine the WFD, in practice, airports can only reduce the WFD by adjusting the average texture depth or pavement slope. A sensitivity analysis is conducted to understand the impact of these two variables on the WFD, keeping the rainfall intensity and distance from the runway centerline fixed.
The average texture depths were set at 0.38, 0.58, 0.78, and 0.98 mm, corresponding to a division of 0.38 to 0.98 mm into three equal intervals. Similarly, the runway slope was set at 0.5%, 0.75%, 1%, 1.25%, and 1.5%, with a rainfall intensity of 100 mm per hour and 600 cm from the runway centerline. In each trial, only one variable was altered at a time, with the first setting value for each variable used as the baseline. When modifying one variable, the other three variables were kept at their baseline values, resulting in five WFD values.
For example, when varying the average texture depth, values of 0.58, 0.78, and 0.98 mm were associated with increases of 52%, 105%, and 157%, respectively, compared to the 0.38 mm baseline. The resulting WFD values were reduced by 2.21%, 4.32%, and 6.49%, respectively. Therefore, dividing the percentage change in WFD by the percentage change in texture depth yielded a sensitivity of 0.41. Using the same calculation method, the runway slope was found to have a sensitivity of 0.18, which is lower than that of the average texture depth.
This result suggests airport managers should prioritize increasing the average roughness of the texture, such as removing tire rubber deposits and/or grooving, as the most effective method for reducing the WFD.
4.7. Application of the NTU Model at Taiwanese Airports
This study applied the NTU model to three major international airports in Taiwan, referred to as Airports A, B, and C. The rough texture depths of the runway touchdown zones, influenced by tire rubber deposits, were set at 0.25, 0.4, and 0.78 mm, respectively. These measurements were taken at 8 m from the runway centerline, with runway slopes set at 1.4%, 1%, and 1.2%. The WFD threshold for hydroplaning was set at 3 mm (ICAO Annex 14, 2018). Using the NTU model, the critical rainfall intensities for each airport were calculated, yielding values of 212 mm/h, 193 mm/h, and 209 mm/h, respectively.
A comparison with the hourly rainfall data from the past 10 years for these three airports revealed the rainfall was significantly lower than the critical rainfall threshold, indicating that, based on the rainfall data from the past 10 years at these three international airports in Taiwan, the areas within 8 m of the runway centerline are unlikely to reach the 3 mm WFD threshold required for hydroplaning formation. Furthermore, a review of aviation incidents involving heavy rainfall at these airports showed no cases in which aircraft encountered a water film depth exceeding 3 mm upon landing, which would have led to hydroplaning.
5. Discussion and Conclusions
This study developed the NTU prediction model to forecast water film depth under various environmental conditions, providing a more reasonable and feasible predictive model and subsequent early warning measures for airports to assess whether the water film depth is sufficiently deep to endanger aircraft safety.
Water film depth on the pavement surface is one of the primary factors contributing to hydroplaning incidents. This study designed experimental equipment specifically for runway conditions to accurately predict WFD in real time. The custom-designed rainfall nozzles and adjustable positioning of the equipment ensured a uniform rainfall intensity across every point on the runway. Additionally, this study employed a laser displacement sensor and a programmable logic controller to obtain high-accuracy, high-sampling-rate WFD data. Modern automated data acquisition enables simultaneous measurement at multiple points and captures the complete WFD curve from zero to a stable depth, which was previously difficult to obtain.
Four commonly applied parameters, including mean texture depth, runway transverse slope, rainfall intensity, drainage length, and water film depth, were used to establish the NTU dataset. A multivariable linear regression approach was employed to build the predictive model (NTU model). The NTU and the Gallaway datasets were used to compare four models: NTU, POURHASSAN, GALLAWAY, and Empirical PAVDRN. The models were evaluated using MAE, RAE, and RRSE. The results indicated the NTU models exhibited lower prediction errors compared to the other models. Specifically, the NTU model not only performed well with the NTU dataset but also showed excellent performance with the Gallaway dataset, with the predicted water film depths being closer to the actual water film depths compared to other models.
This study uses laser displacement sensors to measure WFD based on reflections from texture peaks that allow a better estimation of the actual water film thickness. However, most empirical models calculate WFD from the reference plane. Using laser displacement sensors is relatively new and the measurements obtained from texture peak and reference plane are different; therefore, it is quite possible that the differences may affect model predictions and comparisons. It is suggested that future research can build on this study to develop more objective models for further analysis.
This study also found that among the controllable parameters affecting water film depth, the average texture depth of the pavement had a significant impact and should be a focus for airport management. Furthermore, when applying the NTU model to the runways of three international airports in Taiwan, it was found that, based on the hourly rainfall data from the past 10 years at these airports, the areas within 8 m of the runway centerline did not reach the 3 mm critical water film depth threshold. However, if the hourly rainfall exceeds 190 mm, special attention should be given to prevent the water film depth from reaching the critical threshold. However, this study did not consider the effect of wind on water film depth. Future experiments could further explore the effect of wind on water film depth.