1. Introduction and Background
During the warmer months (October to March), parts of eastern Australia experience severe thunderstorms. The regions most affected are northeastern New South Wales (NSW) and southeast Queensland (QLD) (
Figure 1). The major east coastal airports are Sydney and Brisbane. The regional airports referenced in this study are located within the hexagonal outline in
Figure 1. In this region, thunderstorm activity is fuelled by land heating and moisture from the adjacent warm Pacific Ocean.
The main danger to aircraft operations at take-off and landing are the density currents produced by thunderstorm downbursts [
1]. The key research question then is to see if machine learning (ML) analysis can identify dominant attributes involving heat and moisture that would lead to an increase in strong thunderstorm downbursts. As flight numbers are rising, the danger to aircraft safety and operations can also be expected to increase [
2]. A recent example shows the intense radar reflectivities indicating severe thunderstorms near Narrabri and Gunnedah, which occurred on 2 January 2025 (
Figure 2—left panel). Thunderstorm warnings were issued, including for damaging wind gusts (>90 km/h) close to the town of Moree (
Figure 2—right panel).
Studies have shown how global warming is increasing clear air turbulence (CAT) associated with jet streams in both hemispheres [
3,
4,
5,
6]. However, little research has focused on aircraft dangers during take-off and landing. Consequently, the present study emphasizes near-surface hazards close to airports, rather than CAT, cloud turbulence [
6], mountain waves [
7], and jet streams.
In 2022, Australia’s airports added
$105 billion to the national economy, supporting 690,000 full-time equivalent jobs [
8]. Beyond flights, airports provide critical infrastructure and services supporting numerous general aviation activities. For example, they assist region-based primary industries, connect workers across the country, and support tourism. In NSW, there are 70 main regional airports. In the 30 years from 30 October 1990, and prior to the COVID-19 pandemic, Australian domestic passengers grew five-fold, from 16.8 million passengers in 1988–1989 to 61 million in 2019 [
9]. Total domestic air passengers are projected to grow by 137 per cent between 2022–2023 and 2049–2050, from 66.2 billion passenger kilometers in 2022–2023 to 157.1 billion passenger kilometers in 2049–2050 [
2]. However, these projections do not explicitly model the potential direct effects of future global and regional climate warming on air travel [
2]. In a warming climate, severe thunderstorm environments are projected to increase in eastern Australia [
10].
In brief, thunderstorm occurrence depends on three ingredients: (i) sufficient moisture availability through the low- to mid-levels of the troposphere; (ii) instability to provide buoyancy of lifted parcels of air to extend high enough into the troposphere to form cumulonimbus clouds; (iii) a lifting mechanism such as a low-level, low-pressure trough, or a cold frontal system. These ingredients will lift air parcels so they become buoyant [
11]. In southeast Australia, during the warm months (October to March) when thunderstorms occur most often, thunderstorm precipitation is characterized by low-level, pre-frontal, low-pressure troughs that provide the lifting mechanism [
12]. Pre-frontal, low-pressure troughs are areas of low atmospheric pressure that often precede cold fronts and are associated with wind shifts in Australia from northeast to northwest. These troughs are elongated features on weather maps, typically marked by a dashed line, and are areas where two different air masses meet. As a trough moves, generally from the west, its drier, denser air can lift moist, warmer air, leading to cloud development, generating thunderstorms along the elongated wind change.
Downbursts from thunderstorms are especially dangerous during take-off and landing at airports, particularly in the form of wet or dry microbursts (<4 km in diameter) [
13,
14]. Dry microbursts occur in environments exhibiting convective instability. They frequently develop within environments exhibiting a deep, dry, atmospheric boundary layer (ABL) of at least 3 km in depth [
15]. Here, we focus on wet microbursts. A key contributor to wet microbursts is precipitation loading, which occurs in thunderstorms when excessive water content within the cloud creates a downward force [
15]. This effect induces a downward current of air or enhances descending air within a downflow. Isolated microbursts are driven by the weight of precipitation downdrafts as well as by evaporation and melting, thereby cooling descending air, which further accelerates the downdraft. As descending, heavier, cooler air currents reach the ground, an outburst of diverging winds is created. Another mechanism contributing to surface outflow is the downward transport of strong horizontal winds in the mid-level atmosphere. As the thunderstorm downburst descends towards the surface, it generates a horizontal component of similar magnitude to the mid-level winds. Sudden, unexpected, losses in altitude by descending aircraft, from the low-level vertical wind shear created by downbursts, have resulted in numerous aircraft accidents [
14].
All thunderstorms have downdrafts, with non-severe thunderstorms producing wind gusts less than 90 km/h, some of which are wet microbursts, according to the Australian Bureau of Meteorology’s wind speed threshold for severe thunderstorm warnings. The initiation and strength of wet microbursts depend on the 3-dimensional wind structure and moisture profile from which a thunderstorm forms. Wet microbursts occur in environments with sufficient available moisture in low- to mid-level atmosphere. This usually occurs in eastern Australia during the warmer months, when moisture content is readily available from the Tasman/Coral Seas adjacent to the Australian east coast. In the warm season from October to March, the six inland airports located in an area prone to convective rainfall formed from frontal troughs like the example discussed above. Hence, thunderstorms generated by pre-frontal troughs or subtropical inland low-pressure troughs in the easterlies are climatologically dominant.
The vertical wind profiles associated with wet microbursts typically show convergence of low-level northeast winds ahead of the pre-frontal trough with northwest to westerly winds after the pre-frontal trough passage. At Brisbane airport, a wet microburst with the vertical wind profile mentioned above produced a maximum recorded wind gust of 157 km/h in November 2016 [
16]. The maximum recorded wind gust at Brisbane airport was 185 km/h in January 1985, during a thunderstorm with a similar wind structure. In extremely dry environments, such as in central Australia, wet microbursts are less likely to occur than in the moist, subtropical environment of southeast QLD and northeast NSW during the warm season months from October to March. The characteristics of wet microbursts were present for the squall line thunderstorms investigated in this study. They produced wet microburst gusts at the northeastern NSW airports of Bourke, Coonamble, Gunnedah, Moree, Narrabri, and Walgett (see location
Figure 1).
Over the last 40 years, short bursts of heavy rainfall lasting about 10 min have increased near Sydney [
17] and worldwide in general [
18]. Wet microburst thunderstorms produce short-period, heavy rainfall bursts [
19]. Increased atmospheric moisture due to global warming is amplified because warmer air holds more water vapor. In ML related to climate applications, attributes commonly referred to as physical processes and their interactions are represented by indices or data values of components of the climate system. In this study, using ML techniques, attributes consisting of meteorological and other climate drivers in the form of indices involving, for example, atmospheric and oceanic climate drivers, are assessed to identify increased moisture availability in the low- to mid-level troposphere. Increased heating and low-level atmospheric moisture are known to have increased in recent decades, where the six airports are located (
Figure 3).
Figure 3 (left panel) shows the anomalous increase in low-level moisture (relative humidity) that has occurred in recent decades. Combined with the anomalous increase in low-level heat shown by the potential temperature at 850 hPa in
Figure 3 (right panel), low-level destabilization of the atmosphere is increased, which potentially leads to stronger convective and thunderstorm rainfall.
There are numerous studies on thunderstorm downbursts and associated hazards of severe near-surface wind gusts. Some of the more recent studies have employed ML techniques, in an emerging trend [
20,
21,
22]. However, the cited study that specifically targets airport thunderstorms does not use ML, in contrast with our study.
In the following section, machine learning (ML) techniques are described. In
Section 3, the ML techniques are analyzed with reference to an example of a squall line containing downbursts, at six airports over inland southeast Australia, caused by thunderstorm precipitation from October to March when precipitation occurs predominantly from thunderstorms. Notably, increased precipitation and stronger downbursts from wet microbursts are suggested to be the result of changes in the southern hemisphere (SH) and regional atmospheric circulation.
2. Materials and Methods
Recently, various ML techniques have been applied to precipitation to determine dominant attributes related to seasonal, monthly groupings and annual periods of Australian climate extremes such as drought, heat, and extreme rainfall [
23,
24,
25,
26]. It is not usually known a priori which technique should be optimal for a given example. Here, we applied eight techniques and identified the dominant attributes from an assessment of all eight techniques.
Numerous known climate drivers were potential attributes for October–March precipitation. They include the Atlantic Multi-decadal Oscillation (AMO); the Indian Ocean Dipole (IOD); global sea-surface temperature anomalies (GlobalSSTA); global temperature anomalies (GlobalT); Niño3.4; TPI; the Southern Annular Mode (SAM); the Southern Oscillation Index (SOI); and Tasman Sea sea-surface temperature anomalies (TSSST). Climate driver time series were provided by the Earth System Research Laboratory [
27], except for TSSST, which was obtained from the Australian Bureau of Meteorology [
28].
Two-way interaction terms combining the above predictors (e.g., Niño3.4*SAM), were also considered as potential predictors. These two-way interaction terms are considered as possible attributes, because one attribute can reinforce or reduce the influence of another. Additional relationships between possible attributes, such as additive and quotient relationships, assessed by Richman and Leslie [
29] for a single station, were not considered because some attributes in this study are known to be highly correlated with other attributes, while all ocean basins are represented. Furthermore, we restricted the combinations to those involving just two potential attributes, specifically because the number of combinations of the nine possible predictors (AMO, SOI, Niño3.4, SAM, IOD, TPI, Global temperature anomalies, and Global and Local SST anomalies) provides a large and comprehensive number (45) of possible attributes.
Because relationships between climate drivers and precipitation vary between weak and strong, linear and non-linear, and in combinations, this study developed both linear and non-linear machine learning models of October–March and annual precipitation for a range of climate drivers. From previous studies of inland areas within southeast Australia, permutation testing of precipitation and temperature have shown statistical significance before and after the mid-1990s [
24,
25].
To attribute the above climate drivers to the 1965 to 2018 precipitation period considered, ML was applied with an expanding window with a ten-fold cross-validation [
30], where the initial training window contained 10 years of data, and the testing window was fixed to contain the following 10 years of data. This method was applied to the 45 potential attributes mentioned above. The considered models were linear regression (LR), support vector regression (SVR) [
30] with both radial basis function (RBF) and polynomial (Poly) kernels [
31], and random forest (RF) regression [
32]. We also explored forward and backward selection in both SVR and RF [
29].
By selecting the attributes using SVR and RF machine learning, involving both forward and backward selection techniques, there were a total of eight techniques applied in this study. The model performances were assessed by comparing the eight techniques and expressing the dominant climate drivers (attributes) as percentages of the number of times each model appears in more than 50% of the ten-fold cross-validation. We used several ML techniques because the optimal technique is generally not known a priori and is likely to depend on the application being considered. Including the two-way interaction combinations, there were 45 possible attributes for predicting precipitation (
Table S1). Including all attributes would produce overfitting as mentioned above, leading to large errors when models are applied to a test data set in prediction mode. To select attributes that generalize well, the ten-fold cross-validation was applied as mentioned above, using both forward and backward selection of potential attributes as in Maldonado and Weber [
33]. In summary, the eight ML techniques used include two linear methods and six non-linear methods. The two linear methods are forward and backward selection using linear regression. Forward selection begins with one climate driver and sequentially adds another, based on which additional climate driver achieves minimal error. The process continues until the testing error is minimized. Backward selection starts with all predictors and continues to eliminate them until the testing errors rise above those with all predictors. Detailed results from all attributes in all models, together with error statistics, are provided in
Supplementary Table S1.
Strong wind gusts are well known to be difficult to forecast, with few operational methods reliably predicting thunderstorm wind speeds. The short-lived thunderstorm wind events associated with a long-lived squall line over inland eastern Australia on 13 December 2018 is presented. Conditions leading to damaging winds are analyzed by using high-resolution temporal and spatial Himawari-8 satellite images, an upper air sounding, surface observational data from six airports, Global Data Assimilation System (GDAS) meteorological data, and model output of predicted fields of meteorological variables. The meteorological analysis of the example regarding the dominant attributes revealed is discussed in the following Results and Discussion section.
Warm season (October to March), monthly precipitation data were obtained from the Australian Bureau of Meteorology’s homogeneous climate record through its climate change site network [
34]. There are ten precipitation stations with complete records comprising Augathella, Cunnamulla, Normandy, Miles, Surat, Bellata, Bingara, Curlewis, Brewarrina hospital, and Quambone (
Figure 1). These stations are representative of the six airports. In the few cases where several single monthly values were missing, averages of monthly values from either side were used. There was no need to use smoothing or anomaly standardization because station precipitation totals were used. Precipitation data were summed up to obtain a proxy for total precipitation across the area during October to March in the period 1965–2018. This period allows an equal split (1965–1991 and 1992–2018) centered on the early 1990s, after which climate change impacts have accelerated, e.g., [
18].
The meteorological case study describes how important heat and moisture are to the development of thunderstorm downbursts, which can potentially affect multiple airports when present in squall lines that move through eastern Australia during the warm season. ML approaches can determine how dominant or otherwise heat and moisture are by identifying the related attributes of potential climate drivers.
4. Conclusions
Eastern Australia thunderstorm activity is fueled by a combination of land heating and moisture influx from the adjacent warm Pacific Ocean off the east coast and from around the sea to the north of the Australian continent during the austral warm season from October to March. The major aircraft hazards are density currents generated from downbursts by the thunderstorms. The downbursts are particularly dangerous for aircraft during taking off and landing. Since the number of flights is increasing, the threat to aircraft safety and operations is also increasing.
A case study is presented of a squall line in January 2018 that produced numerous downbursts with associated damaging surface wind gusts at six airports over inland eastern Australia. Using eight ML regression methods, applied to 1965–2018 observational precipitation data, the five major October to March precipitation attributes were identified. These were the SOI, DMI*SAM, GlobalT*GlobalSSTA, and SOI*TPI. The atmospheric SOI influence arises when onshore winds dominate during La Nina episodes, while DMI*SAM implies anomalously warm east Indian Ocean SSTs off northwest Australia and combines upper tropospheric level moisture directed over southeast Australia, with onshore low-level moisture directed from the southwest Pacific Ocean when the SAM is positive. SAM was shown to be anomalously positive from the early to mid-1990s.
Notably, for the October to March period, the selected attributes are related to GlobalT and GlobalSSTA, highlighting the profound influence that GW now has on precipitation in the area containing the six airports. Lastly, the SAM also plays a role by modulating the influence of DMI.
For annual precipitation, the five major attributes identified by the ML methods were GlobalSSTA*TSSST, AMO*SAM, AMO*Niño3.4, TPI, and D MI*Niño3.4. The selected annual attributes are related to GW, GlobalT, GlobalSSTA, and TSSST, again highlighting the strong influence that GW now has on precipitation in the area containing the six airports. For October to March over the 27-year period from 1966 to 1992, a negative SAM was replaced by the dominance of a positive SAM for the following 27-year period from 1993 to 2018 [
34]. This abrupt SAM transition underlines the importance of changes in the large-scale SH atmospheric circulation that now provides an increased moisture flux over the eastern and northern parts of the Australian continent.
With the aim of developing improved sub-seasonal and seasonal hazardous weather outlooks, a promising next step likely will be combining high-resolution climate model projections with machine leaning models. This hybrid approach is expected to be necessary, as traditional machine learning approaches, using training and test data sets, can rapidly lose predictive skill if there is a high level of non-stationarity in the training (observational) data sets, especially if it continues into the test data sets.
In terms of the challenges in extrapolating findings under future climate change scenarios, an important limitation of the expanding window approach used in this study concerns the non-stationarity of the precipitation data and the need for a moving window approach, which can possibly deal with non-stationary time series data better. However, in other similar studies by the authors, only very minor differences were found between results from the expanding and moving window methods.
We acknowledge the limitations of ML models, which is why our approach uses a range of ML techniques. Potential biases in precipitation data would likely be minimal because Australian Bureau of Meteorology data, which are the precipitation data that were used, are highly quality controlled.