Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100

Durmus, Simla; Demirhan, Deniz; Gultepe, Ismail; Durmus, Onur

doi:10.3390/forecast8010013

Open AccessArticle

Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100

¹

Department of Flight Training, University of Turkish Aeronautical Association, Ankara 06790, Türkiye

²

Department of Meteorological Engineering, Istanbul Technical University, Istanbul 34467, Türkiye

³

Faculty of Engineering and Applied Sciences, Ontario Technical University, Oshawa, ON L1G 0C5, Canada

⁴

Department of Aeronautical Engineering, University of Turkish Aeronautical Association, Ankara 06790, Türkiye

⁵

Department of Civil and Environmental Engineering and Earth Science, University of Notre Dame, Notre Dame, IN 46556, USA

⁶

Ankara Aviation Vocational School, University of Turkish Aeronautical Association, Ankara 06790, Türkiye

^*

Author to whom correspondence should be addressed.

Forecasting 2026, 8(1), 13; https://doi.org/10.3390/forecast8010013

Submission received: 26 December 2025 / Revised: 3 February 2026 / Accepted: 6 February 2026 / Published: 10 February 2026

(This article belongs to the Section Weather and Forecasting)

Download

Browse Figures

Versions Notes

Highlights

What are the main findings?

High-top CMIP5 models systematically underestimate the energetic intensity MPS (Main Phase Strength) and spatial extent TEA (Threshold Exceedance Area) of SSW events by 61% to 82% compared to ERA5 reanalysis.
A ‘teleconnection gap’ is identified where models successfully represent tropical stratospheric variability (10° N) but fail to effectively transmit this momentum forcing to the polar vortex (60° N).

What are the implications of the main findings?

The CMIP5 models fail to accurately represent the sensitivity of the T_s-U_h relationship and polar vortex responses. This suggests that current climate projections may be insufficient to predict anomalies of the stratospheric thermal budget and climate change.
The urgent improvement of the CMIP models’ vertical and horizontal resolutions for future climate studies is found to be critical for accurately prediction of SSW events and Arctic vortex development. Furthermore, the re-analyzed datasets should be enhanced using remote sensing observations to better represent wave–mean flow interactions in the upper troposphere and stratosphere.

Abstract

The main objective of this work is to characterize Sudden Stratospheric Warming (SSW) conditions and their impact on local weather forecasting and climate change, using SSW definition criteria. The SSWs strongly affect Arctic vortex structure and midlatitude weather conditions. This work evaluates the frequency, amplitude, and dynamical–thermal characteristics of SSWs under historical and Representative Concentration Pathway (RCP) 4.5 scenarios, focusing on stratospheric air temperature (T_s) and zonal wind speed (U_h) at the 10° N and 60° N latitudes. The fifth-generation ECMWF atmospheric reanalysis (ERA5) is employed as the reference dataset. Simulations of five Coupled Model Intercomparison Project Phase 5 (CMIP5) models, represented by M1 to M5, are analyzed. The primary group of models included 1) the Australian Community Climate and Earth-System Simulator, version 1.3 (ACCESS1-3, M1), 2) the Hadley Center Global Environmental Model, version 2—Carbon Cycle (HadGEM2-CC, M2), and 3) the Max Planck Institute Earth System Model—Medium Resolution (MPI-ESM-MR, M3). The analysis period covers SSW events related to the Quasi-Biennial Oscillation (QBO) in the Northern Hemisphere (NH) from 1980 to 2100. The key findings indicate that while M1, M2, and M3 simulate SSW occurrence correctly for the 21st century, they exhibit significant systematic deficiencies in capturing the structural dynamics of SSW events. Specifically, the M1, M2, and M3 models underestimate the polar stratospheric temperature amplitude (T_amp) by approximately 75–80% and zonal wind amplitude (U_amp) by more than 60% compared to the ERA5 analysis. Furthermore, ERA5 exhibits a strong negative correlation (R ≈ −0.8) between U_h and T_s that is not estimated accurately using the present models. The importance of the horizontal resolution of the models and wave–mean flow interactions in determining SSW intensity and occurrence is also found to be a critical metric. Results suggest that SSW definition criteria affect Arctic and midlatitude weather system prediction at a rate of 61–82%. It is concluded that the primary configurations of CMIP5 models for accurately capturing the dynamical structure and evolution of QBO–SSW interactions are needed, and that they affect future projections of SSW events.

Keywords:

sudden stratospheric warming (SSW); quasi-biennial oscillation (QBO); stratospheric variability; CMIP5 climate models; zonal wind–temperature coupling; ERA5 reanalysis

1. Introduction

The stratosphere begins at approximately 8–10 km altitude near the polar regions and 15–20 km in the tropical regions, reaching an average height of up to 50 km [1]. Sudden stratospheric warming (SSW) is defined as a sudden increase in stratospheric polar air temperature (T_s), mainly during winter months [2]. The Arctic tropopause height is at its lowest level over the poles compared to other regions due to the cold and denser airmass, especially in winter. In the Northern Hemisphere (NH), approximately every 28 months, extremely cold air temperatures and strong westerly zonal winds (U_h) suddenly weaken or reverse, causing the stagnant polar vortex to warm abruptly within a few weeks. During these events, the winds significantly weaken or switch to easterly U_h. In the literature, these events are known as SSWs [3].

SSW events were first observed in the early 1950s [4] and can be monitored in detail using satellite observations [3]. Over time, significant progress has been made in understanding the dynamic effects of SSWs through detailed observations and simulations. However, analyzing how SSW affects both the surface air temperature and upper atmosphere dynamics remains a challenge. Recent studies on stratospheric processes affecting operational forecasting have been conducted by [5,6]. These works, correspondingly, indicated that utilizing spatiotemporal memory flow networks for long-range SSW prediction and analyzing unprecedented winter extremes with direct surface impacts underscore the critical role of stratospheric diagnostics in operational forecasting. SSWs are generally observed in the polar regions (poleward of 60° N) in the NH, but their rarity in the Southern Hemisphere (SH) is due to weaker tropospheric planetary wave activity [7].

The first recorded SSW event was discovered by Richard Scherhag in January 1952 [8], using radiosonde measurements collected in Berlin, Germany. SSWs are atmospheric events that usually occur irregularly in the atmosphere. Using both observations and simulations, SSWs can be detected by a sudden increase in the average T_s (exceeding 30–40 K) in the mid-stratosphere at a height of approximately 30 km (10 hPa) [3,9,10]. However, these warmings sometimes exceed 70 K, creating extreme SSW events. Such warmings in the stratosphere result from the rapid intensification of tropospheric planetary waves propagating upward, which disrupt the stratospheric general circulation [9,11,12]. Tropospheric planetary waves provide the requisite dynamical forcing for SSWs, while the Quasi-Biennial Oscillation (QBO) effectively governs the stratospheric waveguide [13,14]. This indicates that while upward-propagating waves are the primary drivers, the QBO phase regulates the background conditions under which these waves interact with the polar vortex.

The World Meteorological Organization (WMO) established the Commission for Atmospheric Sciences (CAS) and launched the STRATALERT program in 1964 to monitor SSWs, following the January–February 1952 event. In this program, SSW phenomena were studied using radiosonde and rocketsonde observations [13]. The results of this field campaign have shown that SSWs are related to polar vortex disruptions attributed to several factors, such as wind reversals during QBO events [14,15]. The QBO is a periodic shift between easterly and westerly winds in the tropical stratosphere every 28 to 30 months near the equator [4]. These results suggested that SSW events play an important role in polar vortex development and their effect on mid-latitude weather systems.

Major studies since the 1970s analyzing T_s increase and U_h reversal pointed out that variable definitions and thresholds for SSWs exist [16,17]. Mclnturff et al. [9] suggest that a temperature anomaly (ΔT_s) of >25 K can be used as a criterion for SSW detection and monitoring. These studies suggested that major category SSWs were clearly associated with the QBO phase slowdown or reversal [9]. Charlton and Polvani [15] refined the definition provided by McInturff et al. [9]. This framework identifies major SSW events based on the reversal of the U_h at the 10 hPa level over 60° N latitude during winter months. While these events are fundamentally driven by the upward propagation of tropospheric planetary waves, the QBO acts as a key modulator that influences the stratospheric waveguide and the conditions under which these wave-driven reversals occur.

The relationship between the QBO and SSWs was widely used later to evaluate the dynamic consequences of SSWs and their interaction with the adjacent atmospheric layers. The SSW events are strongly related to the phases of the QBO [18]. Dependent on the QBO phase, the stability of the polar vortex and the likelihood of SSW events are modified. Typically, the westerly phase of QBO (QBO+) keeps the vortex stable, preventing the development of SSWs. On the other hand, the easterly phase of QBO (QBO−) likely increases wave activity, and the probability of SSW occurrence [4,19]. Such interactions highlight a robust coupling between the tropical stratosphere and polar vortex disruptions. Beyond the modulation of the stratospheric waveguide, the persistence of the polar vortex is further influenced by teleconnections from the Madden–Julian Oscillation (MJO) and El Niño–Southern Oscillation (ENSO), which exhibit varying signatures across climate models [20,21]. Integrating these diverse drivers into stratospheric diagnostics is essential for reducing uncertainties in seasonal forecasts [22]. Although the precise dynamical pathways of this tropical–polar teleconnection remain a subject of active research, the statistical relationship between the two is well-documented.

While several well-established definitions for SSW events exist, most notably the Charlton and Polvani criteria [15] based on U_h reversals, the historical absence of a single, universally accepted classification has led to varying results regarding SSW frequency and intensity in the literature. This diversity in criteria necessitates the use of more physically meaningful and integrated metrics, such as the Threshold Exceedance Area (TEA) and Main Phase Strength (MPS) metrics employed in this study, to capture the full structural and energetic evolution of the events. For this reason, the variable criteria based on wind reversals and T_s anomalies have led to conflicting results of SSW frequency and intensity [2]. Butler and Gerber [23] found that nine different major SSW definitions exist, and based on these definitions, annual SSW frequency varied between 40% and 90%. These large differences indicate that standardization and physically meaningful metrics are needed. Butler and Gerber [23] stated that the SSW definition should be free of the specific dataset, and its definition must capture the physical and dynamical characteristics of SSWs.

The newest approaches to the definition of SSWs have been provided lately. Li et al. [2] introduced the TEA concept, the Main Phase Duration (MPD), and Main Phase Area (MPA) metrics, and these are directly related to SSW intensity for quantitative assessment. Their approaches are adapted in this study. These concepts were applied for a specific time period (historical and future) to study SSW events of both T_s and U_h, together with reference data from Fifth Generation European Centre for Medium-Range Weather Forecast (ECMWF) Reanalysis (ERA5) and five large-scale model simulations.

The main objective of this study, using SSW definition criteria, is to investigate the relationship between QBO variability and SSW events during the historical (1980–2005) and future (2006–2100) periods. To better situate this study within the context of the existing literature, it is important to note that while recent multi-model assessments [24,25,26] suggest that the frequency of major SSWs will remain relatively stable in the 21st century, the physical and energetic fidelity of these events remains a critical research gap. Specifically, Ayarzagüena et al. [24] reported no robust evidence of future changes in the frequency or timing of major SSWs across different climate models. Similarly, Rao and Garfinkel [25] concluded that the statistical characteristics and the strength of stratospheric-tropospheric coupling of SSWs indicated negligible changes in the 21st century. Additionally, a study by Chávez-Pérez et al. [26] highlighted that while long-term averages appear to be stable, there is significant temporal variability in the distribution of events, and this suggests that long-term aggregated assessments might mask changes in sub-periods. Our aim is to move beyond simple frequency counts, a technical aspect also addressed in the context of wave forcing spread by [27], to evaluate the evolving spatial and energetic intensity of SSWs using TEA and MPS metrics while using various threshold criteria for SSW definition. Furthermore, in the content of our main objective, the quantitative assessment of the QBO–SSW coupling through a comparison of stratospheric variability at 10° N (serving as a tropical QBO proxy) and 60° N (polar response) is performed to identify whether model biases are rooted in tropical forcing or its high-latitude transmission. In this regard, the current work is significantly different than the studies of [24,25,26] and places more emphasis on the structural and energetic sensitivity analysis of threshold criteria effects on SSW analysis.

The dataset obtained using the Coupled Model Intercomparison Project Phase 5 (CMIP5) model simulations and high-resolution ERA5 reanalysis is detailed in Section 2. Section 3 presents the methodology, including statistical diagnostics and the TEA metric for SSW detection. Section 4 analyzes the comparative outcomes of past and future SSW characteristics while evaluating the quantitative link between tropical forcing and the polar vortex. Lastly, Section 5 and Section 6 offer a discussion of the findings and a conclusion.

2. Data

The datasets used in this study are obtained from ERA5 and from the CMIP5 under the Representative Concentration Pathway 4.5 (RCP4.5) scenario analysis. The RCP4.5 scenario assumes that greenhouse gas emissions stabilize in the second half of the 21st century. ERA5 provides relatively high spatial (0.25° × 0.25°) and temporal (hourly) resolution data that are used as the reference for the historical period (1980–2005) and as the verification reference dataset [28]. Recent studies have validated the utility of ERA5 reanalysis data for developing atmospheric models with parameter correction methodologies [29,30]. While ERA5 is recognized for reliably capturing planetary wave activity [31], addressing model-specific biases remains essential for a rigorous evaluation of simulated stratospheric variability [32].

The CMIP5 models provide daily outputs for both the historical (1980–2005) and future (2006–2100, RCP4.5) periods. Table 1 provides a technical comparison of the five models initially screened for this study. While all five models were considered, our methodology prioritizes models based on their vertical architecture, as detailed below. Models described as follows: M1 (ACCESS1-3) developed by the Australian Bureau of Meteorology and CSIRO, M2 (HadGEM2-CC) from the UK Met Office, M3 (MPI-ESM-MR) from the Max Planck Institute for Meteorology in Germany, M4 (GFDL-ESM2G) from NOAA/Geophysical Fluid Dynamics Laboratory in the United States, and M5 (FGOALS-g2) from LASG/IAP in China [33].

Despite the emergence of CMIP6, the CMIP5 model ensemble remains a suitable framework for this study as it provides a reliable baseline for diagnosing deep-seated structural and energetic biases. This selection is supported by recent literature, such as Rao and Garfinkel (2021) [25], which demonstrates no statistically significant differences in SSW frequency or basic characteristics between CMIP5 and the Coupled Model Inter-comparison Project Phase 6 (CMIP6). Most importantly, studies by Hall et al. (2021) [34] and Karpechko et al. (2024) [35] confirm that systemic biases in stratospheric polar vortex variability have remained essentially unchanged across model generations. Furthermore, current findings by Martínez-Andradas et al. (2025) [27] highlight that CMIP6 models continue to exhibit large spreads and persistent biases in SSW wave forcing and intensity. Therefore, utilizing CMIP5 allows for an objective comparison with established literature while addressing fundamental challenges common to both ensembles. Future work will extend this diagnostic framework to include the CMIP6 ensemble.

M1, M2, and M3 are designated as the primary group due to their extended vertical domains and enhanced resolution in the middle-to-upper stratosphere. These structural advantages facilitate a more realistic simulation of vertical wave–mean flow coupling, which is fundamental for capturing both the life cycle of SSW events and the periodic variability of the QBO. In contrast, M4 and M5 are assigned to the secondary group because their shallower model tops restrict their capacity to consistently replicate QBO–SSW interactions (Table 1). This methodological focus prevents potential bias in the analysis stemming from ‘low-top’ models, which are known to systematically underestimate stratospheric variability because of their inability to resolve vertical wave–mean flow coupling. Accordingly, results from M1, M2, and M3 constitute the basis of the main interpretation, whereas M4 and M5 remain beyond the scope of the present analysis.

The classification of models follows earlier evaluations of CMIP5 models [36,37] and emphasizes the importance of model top height in the simulation of the stratospheric variability. The models are selected based on their temporal coverage, data availability, and ability to resolve pressure levels up to 10 hPa, providing high-resolution daily data for the detection of SSW events [18].

3. Method

To investigate SSW events in the stratosphere, daily averages, variability, and long-term trends of T_s (S_Ts) and U_h (S_Uh) at the 10 hPa pressure level for the NH are examined. The data consists of ERA5 reanalysis and simulations from five CMIP5 models (M1–M5), but only the first three are utilized here due to the selection of their capability for detecting SSWs. The ERA5 dataset (originally at hourly resolution) is averaged to daily means to match the model time resolutions.

The current study covers two time periods: (1) the historical time- period from 1980 to 2005, and (2) the future time period from 2006 to 2100 under the RCP4.5 emission scenario. SSW detection relies on the TEA method, where a 30 K anomaly threshold should be exceeded for at least six consecutive days. Detected events are classified as minor, major, or extreme events. MPS metrics, which are related to both MPD and MPA, are used. More than 60% of SSWs typically occur during January–February; therefore, the analysis focuses on the November–April window to capture the full seasonal range of occurrences. The SSW characteristics related to different QBO phases are also emphasized in the analysis. A description of the TEA method and its metrics is provided in the following section.

Following earlier classifications [23,37], the models used in this work are grouped as higher-top or lower-top models according to the vertical extent of the pressure height. Higher-top models (M1, M2, and M3) extend into the upper stratosphere or lower mesosphere (above ~1 hPa), allowing a more realistic representation of planetary wave propagation, vortex breakdown, and associated SSW dynamics. Lower-top models (M4 and M5) terminate at 10–30 hPa, which limits vertical wave–mean flow coupling and leads to a systematic underestimation of stratospheric variability. Although M1 is included in the analysis, it should be noted that, in comparison to higher-top models, its lower top may result in an underestimation of major and extreme events.

On the other hand, due to their limited capacity in resolving stratospheric wave–mean flow interactions and the resulting insufficient number of detected SSW events for reliable statistical evaluation, M4 and M5 are removed from the analysis. Consequently, while M1 is retained for general frequency and time-series comparisons (Section 4.1, Section 4.2 and Section 4.3), its limited capacity to resolve coherent thermal–dynamical feedback necessitates its exclusion from the detailed amplitude and coupling diagnostics presented in Section 4.4, Section 4.5 and Section 4.6. This selective approach ensures that the high-fidelity assessment of tropical–polar coupling is based on models with adequate vertical resolution, prioritizing physical consistency over model quantity.

3.1. Profiling Temperature Anomalies for SSW Detection Using the TEA Method

To obtain SSW events, the analysis used the TEA technique [2], where ΔT_s must remain above 30 K for at least six consecutive days. This approach was originally introduced by [23] and then was subsequently adapted by the studies of Palmeiro et al. [38] and Garfinkel et al. [39]. In the analysis, the 30 K threshold at 10 hPa is utilized to identify large-scale stratospheric warmings events, and that criteria were consistent with earlier studies [9,17,40].

Following established definitions based on McInturff [9], Labitzke [17], Baldwin et al. [40], and Baldwin & Thompson [41], SSWs are identified when T_s anomalies at 10 hPa exceed 30 K. This threshold ensures consistency with prior studies while isolating large-scale warming episodes. As outlined in Table 2, T_s anomalies are derived by subtracting the long-term climatic average of T_s (T_sa) values from the daily mean T_s.

To account for model-specific mean-state biases, the baseline climatology is established independently for ERA5 and each CMIP5 simulation, a procedure essential for evaluating SSW events in climate models [32]. For the historical period (1980–2005), anomalies are calculated relative to the daily mean of that specific timeframe. Similarly, future anomalies (2006–2100) are derived from a distinct baseline established over the entire scenario window. For instance, the anomaly for 1 January 2050 is obtained by subtracting the climatological mean of all January 1st data points within the 2006–2100 period. This period-specific baseline approach isolates episodic dynamical warmings from their respective ‘climate normals’. By treating the historical and future eras independently, we ensure that anomalies represent episodic dynamical forcing rather than the background signal of long-term greenhouse-induced stratospheric cooling. This use of fixed, period-specific baselines aligns with standard diagnostic choices in major multi-model assessments [24], providing a consistent reference level to evaluate changes in dynamical variability relative to the projected mean state of the 21st century stratosphere.

The calculated anomalies that are used to detect SSW events are provided in Table 2. The TEA method distinguishes between persistent SSW events and fluctuations that do not meet the duration criteria. Then, the temporal length and geographic extent of each event are assessed separately. When ΔT_s < 30 K, the event is marked as not being an SSW event. This aligns with previous studies of Charlton & Polvani [15] and Blume et al. [42]. The start and end dates are assigned for all SSW events in the analyzed period. After SSW detection, the latitudinal and longitudinal coordinates are used to calculate the surface area influenced by anomalies exceeding the threshold values.

The spatial coverage of each SSW event is determined using surface areas of grid cells with a 30 K threshold, and then they are summed. The sum of these areas provides the daily TEA criteria value. These computations are used for the quantification of the daily spatial extent of the SSW.

3.2. MPD, MPA, and MPS Metrics

Standardized metrics of MPD, MPA, and MPS are used to evaluate SSW events in both observations (ERA5) and numerical model simulations (M1–M3). Table 3 presents the frequency, magnitude, and spatial distribution of SSWs. These metrics are derived using the TEA method and jointly account for the duration and spatial coverage of each event. MPD refers to the number of consecutive days with ΔT_s > 30 K, while MPA represents the average daily surface area affected by the ΔT_s > 30 K threshold [2].

MPS, representing the magnitude of each warming event in space and time, is calculated from the product of MPD and MPA. This metric allows us to discriminate between events with different durations and spatial extents. SSW episodes are then categorized as minor (<90 × 10⁶ km².day), major (90–180 × 10⁶ km².day), or extreme (>180 × 10⁶ km².day) events according to the MPS values [2]. Table 3 shows the sampling days from the model simulations and ERA5 analysis that cover the 1980–2005 time period. This classification provides frequency and intensity of SSWs as well as stratospheric variability and model accuracy. Trend analyses on the T_s and U_h are also performed to support the areal coverage statistics for (i) all days, (ii) SSW days, and (iii) each intensity classification level (minor, major, and extreme events).

The TEA method is used to analyze the relationship between U_h variability and T_s-based SSW types. The last phase of the analysis used is the U_h data averaged between ±10° N latitude at the 10 hPa level to focus on the QBO. This analysis considers the QBO features, including seasonal patterns, amplitude variations, phase transitions, positive and negative phases, and structural variations between models.

To investigate oscillation behavior and possible connections between QBO and SSW events, time series analyses are utilized. In addition, to quantify intra-annual variability and long-term changes in T_s and U_h, three statistical diagnostics were derived from the monthly mean series. First, range-based variability for T_s (T_rng) and U_h (U_rng) was computed using a 12-month moving averaging window as the monthly maximum–minimum spread divided by 12. Second, the amplitude of variability for T_s (T_amp) and U_h (U_amp) was defined as the standard deviation (sd) of the same 12-month windowed values. Third, long-term tendencies were expressed through trend slopes, defined in their basic finite-difference form as

S = \frac{Δ X}{Δ t} = \frac{X_{t_{2}} - X_{t_{1}}}{t_{2} - t_{1}},

(1)

where

X

denotes the respective variable (T_s or U_h). For robustness, we also estimated slopes using ordinary least-squares (OLS) regression on monthly means; these trends are denoted as S_Ts and S_Uh, reported in K.yr⁻¹ and m.s⁻¹ yr⁻¹, respectively. These specific notations (T_rng, U_rng, T_amp, U_amp, etc.) are used consistently throughout the analysis.

4. Results

This section evaluates SSW events with respect to their frequency, energy intensity, and temporal evolution, while examining their relationship with the QBO. The results are provided for two distinct periods: (1) the historical era (1980–2005) and (2) the future climate (2006–2100) under the RCP 4.5 emission scenarios. The occurrence of SSWs is related to QBO phase transitions, seasonal patterns, and long-term variability, and enables a comprehensive characterization of stratospheric dynamics. The results are prepared to show the differences between the ERA5 reanalysis (as reference) and CMIP5 model simulations.

The following subsections present detailed results from the selected model outputs and time series analysis, focusing on the U_h and T_s parameters.

4.1. Selection of Numerical Models

To evaluate the CMIP5 models’ response to SSWs, the frequency and total duration of events are compared against ERA5 (Figure 1). In the historical period (1980–2005), ERA5 identified 29 SSW events lasting a total of 337 days. Of these, 37% were classified as minor events (126 days), 17% as major events (57 days), and 46% as extreme events (154 days). The mean and sd of event duration were 11.6 ± 4.1 days, highlighting significant variability in event persistence. As clearly illustrated in Figure 1, the M4 and M5 models (secondary group) demonstrate a systematic failure to capture SSW dynamics, identifying fewer than three events throughout the historical period. This deficiency is primarily attributed to their low-top configurations, which limit their ability to resolve vertical wave–mean flow coupling and stratospheric variability. Due to this limited capacity and the resulting insufficient number of events for robust statistical evaluation, M4 and M5 were excluded from the detailed analytical diagnostics. Consequently, the primary analysis focuses on the high-top models M1, M2, and M3, which better simulate the dynamical and thermodynamical processes of the stratosphere. The November–April window was selected for the primary analysis as it encompasses more than 90% of total SSW days in both ERA5 and the models, providing a statistically representative seasonal range.

Figure 2 shows the temporal and seasonal distribution of SSWs during the historical period (1980–2005) for ERA5 and models (M1–M3), while Figure 3 shows the corresponding distributions under the RCP 4.5 scenario (2006–2100). In the analysis, M1, M2, and M3 models are selected for SSW analysis because of their better handling of the dynamical and thermodynamical processes that are provided below. This time frame was chosen to provide a consistent physical and statistical framework for evaluating how well the models capture mid-winter warming events.

The 62% of the SSWs based on ERA5 analysis occurred in January–February. Of these, 31% occurred in November–December and 7% in March–April. M1 closely reproduced ERA5’s pattern: corresponding values are found to be 58%, 34%, and 8%. M2 generated these corresponding values as 49%, 39%, and 12%. M3 produced 64% for January–February, 28% for November–December, and 8% for March–April, matching the observed seasonal phases most closely. Results suggested the mean event duration for M1, M2, and M3 were 9.2 ± 3.3, 8.7 ± 3.7, and 11.0 ± 3.9 days, respectively. Overall, compared to ERA5 analysis, it is found that M1 and M3 simulations reproduce the observed seasonality within ± 5% accuracy, whereas M2 exhibits a systematic bias toward short-lived and early winter warmings.

In summary, the comparison between historical (1980–2005) and future (2006–2100) periods under the RCP 4.5 scenario indicates that the frequency of SSW events remains relatively stable. No statistically significant increasing or decreasing trend was observed across the M1, M2, and M3 ensembles, suggesting that while the climate warms, the occurrence rate of these events does not deviate substantially from historical averages. This finding is in line with the statistical projections provided by Rao and Garfinkel (2021) [25].

However, while the recent literature indicates that SSW frequency shows little robust change under future scenarios, our results provide evidence that future climate change is closely associated with significant systematic biases in the simulated physical and dynamical intensity of these events. These biases, characterized by a 61% to 82% underestimation of warming magnitude, demonstrate that the primary impact of climate change in models is the misrepresentation of the energetic structure of SSWs, rather than a shift in their occurrence rate.

Future projections under RCP 4.5 (Figure 3) show how the simulated SSW seasonality evolves through the 21st century. Based on the same identification and classification criteria applied throughout history, it appears that the events are limited to the November–April period. M1 shows 53.4% of events in January–February months, 27.2% in November–December months, and 18.4% in March–April months, which is similar to the historical period, with a mean duration of 11 ± 5.1 days. M2 simulates 67.1% for January–February, 0% for November–December, and 32.9% for March–April, showing an early winter shift of approximately 10% and producing the shortest mean event duration of 10.7 ± 3.7 days. M3 maintains 65.8% for January–February, 26.1% for November–December, and 8.1% for March–April, with a mean event duration of 9.8 ± 3.4 days.

4.2. Temperature Time Series

In this section, T_s at 10 hPa 60° N latitude is plotted and interpreted using ERA5 reanalysis and CMIP5 models (M1–M3), along with historical period and RCP 4.5 projection time series.

4.2.1. Historical Period of T_s

Daily temperature series at 10 hPa at 60° N latitude for the period 1980–2005 are analyzed to assess how effectively three CMIP5 models (M1–M3) reproduce the observed stratospheric thermal variability (Figure 4). Statistical assessments of T_s, sd, percentile ranges (P_r), and S_Ts are summarized in Table 4.

According to ERA5 (Figure 4a), the polar stratosphere exhibits a weak but persistent cooling trend throughout the historical period. However, it displays significant seasonal and interannual thermal variability. As shown in Table 4, temperature variance decreases by approximately 46% during SSW events compared to non-SSW periods. This decrease confirms that temperature variability forms during midwinter warmings, indicating a significant thermal response.

Numerical simulations are applied using M1, M2, and M3. M1 (Figure 4b) overestimates the thermal range, producing a higher sd with a wider P_r than ERA5. It also overestimates the long-term cooling trend by about 2.9 times. M2 (Figure 4c) suppresses thermal variability with a narrower P_r. However, it overestimates the cooling rate with an inconsistent representation of stratospheric dynamics. M3 (Figure 4d) results are more comparable with ERA5 results and closely reproduce the magnitude of the mean T_s, sd, and the long-term cooling trend (Table 4).

In summary, all models successfully capture the thermodynamic signature of SSWs. The reduction in T_s and sd for all datasets ranges from 44% to 54% (Table 4). This confirms that SSW events play an important role in deriving polar cooling processes.

4.2.2. RCP 4.5 Future Climate Scenarios of T_s

In this section, the simulations are analyzed to determine how RCP 4.5 forcing modifies the characteristics of SSWs that include T_s, sd, P_r, and S_Ts (Table 5). As shown in Table 5, long-term cooling trends remain weak and negative, suggesting that polar stratospheric cooling will continue moderately throughout the 21st century.

M1 (Figure 5a) closely maintains its historical thermal structure but suggests significant stability in the long-term trend. The rate of cooling is weakened by about 76% compared to the historical baseline. In contrast, M2 (Figure 5b) shows slightly more variability with a narrower P_r. Similar to M1, the cooling rate is 60% weaker than the historical prediction. The M3 (Figure 5c) simulations for the climate scenarios are found to be similar to the historical ERA5 climate. The reduction in T_s variance during SSW events is estimated at about 49–52% for all models (Table 5), and the metrics are found to be comparable to each other. This consistency reveals that the polar vortex continues to experience events of warming with a consistent thermal structure of the historical period.

4.3. Zonal Wind Time Series

In this section, U_h at 10 hPa 60° N latitude are plotted and interpreted using ERA5 reanalysis and CMIP5 models (M1–M3), along with historical period and RCP 4.5 projection time series. SSW characteristics will be discussed using U_h characteristics similar to the structure given for T_s in the previous section.

4.3.1. Historical Period of U_h

In this subsection, daily U_h time series at the 10 hPa level above 60° N latitude for the period 1980–2005 were analyzed to assess the effectiveness of CMIP5 models (M1–M3) to produce SSW dynamical variability and evaluate the polar night jet (Figure 6). The assessment, using mean U_h, sd, S_Uh, and P_r representing dynamic amplitude, is presented in Table 6.

According to ERA5 (Figure 6a), polar stratospheric winds exhibit significant seasonal and interannual variability throughout the historical period, accompanied by significant slowing of westerly winds. The observed dynamics are a direct response to SSW activity. As shown in Table 6, U_h variance during SSW events decreases by 24% compared to non-SSW periods.

When the M1 model is examined (Figure 6b), the polar night jet’s intensity is overestimated, producing an amplified range of variability compared to ERA5 results. It also simulates a spurious strengthening of westerly U_h which is contrary to the observed deceleration (Table 6). M2 (Figure 6c) reproduces the dynamic amplitude and mean state reasonably well but also fails to capture the long-term trend direction as indicated by M1. M2 shows a systematic trend reversal, resulting in an artificial westerly U_h acceleration of the polar night jet. M3 (Figure 6d) provides the results closest in agreement with ERA5 simulations, reproducing the magnitude and variability structure of the mean U_h with the highest success among all models (Table 6). However, M3 shows a positive trend of U_h variation, but its value is weaker than that of M1 and M2.

Despite the inconsistencies in long-term trends, all three models capture the dynamic signature of SSWs. The reduction in U_h variance is found to be between 13 and 54% (Table 6). This confirms that vortex breakdowns lead the stratosphere into a state of reduced dynamic variability. However, the magnitude of this clustering varies significantly depending on the model.

4.3.2. RCP 4.5 Future Climate Scenarios of U_h

This subsection presents future climate scenarios based on SSW characteristics. Under the RCP 4.5 scenario, daily U_h time series at 10 hPa and 60° N for the period 2006–2099 are analyzed to assess how the predicted circulation changes are comparable to historical polar night jet stream intensity (Figure 7). The metrics of U_h, sd, P_r, and S_Uh are summarized in Table 7.

In M1 simulations (Figure 7a), the mean U_h shows increasing variability, with a slightly wider P_r (70.44 m.s⁻¹), at 8.5% compared to the historical average value. However, in the long-term trend, S_Uh weakens at about 82% compared to the historical period, becoming almost zero (Table 7). In M2 simulations (Figure 7b), the average U_h shows a modest 4.2% increase compared to its historical value, but the P_r range decreases. S_Uh shifts from a historically positive value to a weakly negative value (−0.004 m.s⁻¹.yr⁻¹). This represents a large relative decrease in the trend value that indicates the complete disappearance of the historical strengthening trend (Table 7). In M3 simulations (Figure 7c), an average circulation is simulated with a slightly narrow amplitude range, which remains unchanged from its historical value. The S_Uh trend weakens at about 48%, but it is not statistically significant (Table 7). M3 continues to exhibit distinct behavior during extreme events, such as winds, during which SSW days increase significantly when they are compared to the historical baseline value during SSW days.

Overall, regarding the dynamic response to SSWs, variability continues to narrow across the models compared to non-SSW conditions. As shown in Table 7, the variance reduction ranges from 4% to 43%, indicating that mid-winter warming remains abrupt and dynamically consistent under moderate future forcing. Under RCP 4.5 scenarios, each model preserves the underlying U_h climatology while clearly reducing the historical strengthening trends of the polar night jet. Specifically, none of the RCP 4.5 slopes deviates significantly from zero, implying a statistically stationary mean flow behavior throughout the 21st century. M3 provides the most consistent statistical representation of polar night jet variability as suggested by ERA5, while M1 provides a stronger mean flow. On the other hand, M2 provides weaker but more stable circulation.

4.4. Stratospheric Temperature Amplitude and Range

As justified in Section 3, while models M1, M4, and M5 were utilized for general frequency comparisons, the following high-fidelity assessments of stratospheric amplitude and thermal–dynamical coupling (Section 4.4, Section 4.5 and Section 4.6) focus exclusively on models M2 and M3 to ensure physical consistency in capturing the magnitude of stratospheric variability. The T_amp and T_rng are evaluated to highlight the SSW-related thermal variability based on daily temperature values shown in Figure 4 and Figure 5, respectively. Table 8 summarizes the mean values of T_amp and T_rng, their sd, and S_Ts for the ERA5 reanalysis and CMIP5 models (M2 and M3) at 10° N and 60° N during the historical and future periods. The daily values of the above parameters are averaged over the targeted years to obtain monthly averages.

In the tropical stratosphere (10° N), ERA5 analysis effectively characterizes thermal variability as stable, with low sd and a negligible long-term of T_s; however, CMIP5 models exhibit deviations. M2 underestimates T_rng by approximately 21% (Table 8). M3, contrary to previous evaluations, also underestimates these metrics, capturing about 91–92% of the observed variability, showing a much closer agreement with ERA5 than M2 but with a slight negative bias.

In the polar stratosphere, the observed T_amp variability at 60° N increases significantly, enhancing wave activity and polar vortex dynamics associated with SSWs. ERA5 exhibits the strongest variability at T_rng of 0.730 K and T_amp of 3.159 K. However, all models struggle to reproduce this polar T_amp magnitude. Table 8 shows that M2 captures only ~20% of the observed T_rng and underestimates T_amp by ~81%. This suggests that polar T_amp variability is reduced significantly. M3, similarly to M2, struggles to reproduce the polar magnitude, resolving only ~26% of T_rng and underestimating T_amp by ~75%. Based on the long-term evolution of T_amp, linear trends remain small for all data, reflecting model-dependent suppression of the processes rather than the real climate change signal. Under the RCP 4.5 future climate scenario, the T_amp variability structure is usually conserved for all latitudes. Deviations in projected T_amp and T_rng are found to be minimal for both models, varying by less than 4% relative to historical baselines. Furthermore, future linear trends with S_Ts < |0.001| are found to be negligible.

In summary, ERA5 exhibits the strongest T_s variability across all latitudes (Table 8). In contrast, M2 systematically underestimates both T_amp and T_rng at about 20% and 80%, respectively, whereas M3 underrepresents polar amplitude by approximately 50%. Long-term S_Ts values are found to be insignificant, indicating that there is no significant change in the SSW occurrence conditions. These finding suggest that while the tropical–polar contrast is preserved under RCP 4.5, CMIP5 models’ simulations cannot produce the magnitude of T_s variability, indicating limitations of model vertical resolution and wave–mean-flow coupling.

4.5. Zonal Wind Amplitude and Range

The U_amp and U_rng variables for U_h are analyzed at 10° N and 60° N latitudes at 10 hPa to evaluate the dynamical variability associated with SSW events, based on the daily evolution shown in Figure 6 and Figure 7. This comparison between tropical and polar dynamics provides a quantitative basis for evaluating the QBO–SSW coupling. Table 9 provides statistical measures, including the mean, median, sd, U_rng, U_amp, and long-term tendencies for the ERA5 reanalysis and CMIP5 model simulations (M2 and M3) during the historical and future periods.

In the tropical stratosphere (10° N), which acts as a proxy for the tropical QBO signal, ERA5 show an average U_rng of 3.021 m.s⁻¹ with a weak positive multi-decadal S_Uh. The models show contrasting abilities in reproducing these dynamics (Table 9). M2 follows up the ERA5 trend closely, with a ~2% and 8% difference in U_rng and U_amp, respectively. In contrast to previous expectations, M3 also aligns reasonably well with ERA5 in the tropics, showing only a slight overestimation of ~1.5% in U_rng and ~5.6% in U_amp. M2 has stable long-term trends found in ERA5 analysis, while M3 exhibits a clear increase in S_Uh which is consistent with its pronounced westerly bias.

In the polar stratosphere (60° N), ERA5 analysis shows a stronger dynamic variability, and that reflects the active wave–mean flow interactions related to mid-latitude westerly winds. Neither the M2 nor the M3 models can capture the full magnitude of this polar variability (Table 9). M2 simulations underestimate the U_rng value at about 60% and captures only 36% of the observed U_amp, and this indicates a significant suppression of polar jet dynamics. The discrepancy between the relatively lower bias in tropical amplitudes and the significant underestimation at 60° N suggests that the model deficiency is primarily associated with the representation of dynamic teleconnections linking tropical forcing to the polar stratosphere. M3 simulation performs similarly to M2 in this updated analysis, capturing only ~39% and ~37% of U_rng and U_amp, respectively. ERA5 shows a slight strengthening of S_Uh (positive) in polar westerly U_h. Both M2 and M3 models simulate negative trends at −0.0018 and −0.0080 m.s⁻¹ decade⁻¹), respectively, but fail to reproduce the observed strengthening of the polar U_h.

Under the RCP 4.5 future climate scenarios, the dynamic structure, represented with U_h metrics, remains generally stable across all latitudes. The changes in U_amp and U_rng (Table 9) are less than 4% of the historical baselines. In the tropics, M2 shows a slight increase in variability (~1%) but M3 shows a small decrease (~6%). At 60° N, both models retain their historical biases while M2 continues to significantly underestimate U_amp (~4.1 m.s⁻¹) compared to ERA5 (~11.4 m.s⁻¹). The model-based projected long-term trends become statistically insignificant because S_Uh becomes close to zero. This suggests a lack of robust dynamic evolution in U_h variability throughout the 21st century.

In summary, ERA5 analysis exhibits a stronger dynamic variability compared to CMIP5 models, particularly in the polar region (Table 9). M2 systematically underestimates both U_rng and U_amp by at about 40–70%, but M3 overestimates the tropical variability of U_amp while underrepresenting polar U_amp by about 25%. Discrepancies in the sign and magnitude of long-term U_amp trends indicate that the models inadequately capture the dynamic response of the polar night jet to interannual dynamical forcing. These persistent deviations in both historical and future simulations likely reflect structural limitations in vertical resolution as well as wave-flow coupling conditions, but this needs to be verified in future studies. Overall, these results imply that while models generate tropical forcing, the primary systematic error lies in the dynamic coupling necessary to propagate these signals poleward.

4.6. Thermal–Dynamical Coupling Analysis

To investigate the thermal–dynamical coupling in the stratosphere, scatter plots between daily mean T_s and U_h at 10 hPa are examined at 60° N latitude for both historical and future periods. Figure 8 corresponds to the historical period, and Figure 9 depicts projections under the RCP 4.5 scenario.

At 60° N latitude, ERA5 analysis (Figure 8a) shows that U_h increases with decreasing T_s, suggesting that warmer T_s is associated with weaker westerlies (or easterlies), whereas decreasing T_s is related to stronger westerlies. Both fits have similar slopes, separated at lower temperatures. The fit for the pre-1990 data had a slope of −1.62 m.s⁻¹ K⁻¹, r = −0.59; R² = 0.35. After 1990, the slope weakens to −1.13 m.s⁻¹ K⁻¹ (r = −0.49; R² = 0.24), corresponding to a roughly 30% reduction in magnitude. Overall, ERA5 shows a persistent inverse coupling that weakens in the later decades.

At 60° N latitude, M2 (Figure 8b) exhibits a much stronger and more linear inverse T_s–U_h relationship than ERA5. Both the pre- and post-1990 subsets yield regression slopes close to −2.0 m.s⁻¹ K⁻¹ (r ≈ −0.82; R² ≈ 0.65–0.68), with only minor differences between the two periods, indicating that the model maintains a persistently intense anticorrelation rather than the weakening seen in ERA5. The scatter of data points is tightly clustered along the regression axis, especially in the 220–235 K T_s range, where many SSW days occur. In addition, M2 produces a substantial number of cases with U_h < −20 m.s⁻¹, which are rare in ERA5, suggesting that the model overproduces very weak or reversed vortex states and exaggerates the dynamical response to warming anomalies.

In M3 simulations (Figure 8c), the overall T_s–U_h coupling is found to be negative with a weaker coherence than that of both ERA5 and M2. Regression slopes remain almost unchanged between the two periods (−1.40 to −1.42 m.s⁻¹ K⁻¹), implying a stable yet moderately underestimated inverse coupling with no clear post-1990 weakening. The scatter shows a broader spread in U_h (approximately −10 to 40 m.s⁻¹), indicating that M3 simulates a wider range of vortex states but with reduced sensitivity of U_h anomalies to T_s variations. Clustering in the 220–235 K band is less pronounced than in ERA5, and the model underestimates the amplitude of the thermal–dynamical link at about 15–20%.

For the RCP 4.5 projections, results of T_s–U_h coupling related to four successive time segments (i) 2006–2025, (ii) 2026–2050, (iii) 2051–2075, and (iv) 2076–2100 are shown using distinct color coding, with solid trend lines representing the least-squares fit for each period (Figure 9). This consistent visualization enables a direct comparison of reanalysis-based and model-simulated coupling characteristics across historical and future conditions. Results for each time period are provided below.

For time periods i, ii, and iv, M2 exhibits a clear inverse T_s–U_h relationship, with a slope of −1.84 m.s⁻¹.K⁻¹ (r = −0.42; R² = 0.18) (Figure 9a). The mean T_s is 224.8 ± 6.3 K and mean U_h is 17.4 ± 19.2 m.s⁻¹, indicating a conical warm–weak jet pattern. The correlation shows that the slope becomes steeper to −2.65 m.s⁻¹.K⁻¹ (r = −0.56; R² = 0.31) between 2026–2050, representing the period when the vortex is most susceptible to thermal anomalies. In 2051–2075, the relationship weakens substantially, with the slope flattening to −1.32 m.s⁻¹ K⁻¹ (r = −0.27; R² = 0.07), and by 2076–2100, it decreases further to −0.74 m.s⁻¹ K⁻¹ (r = −0.17; R² = 0.03). Although the average T_s rose by about 1 K over the time period, the U_h pattern persisted with a predominantly westward orientation. Overall, M2 maintained the correct sign of the coupling. However, towards the end of the century, it appears to indicate a gradual decrease in dynamic strength and consistency.

For time periods ii, iii, and iv, M3 shows a moderate inverse coupling with a slope of −1.21 m.s⁻¹ K⁻¹ (r = −0.36; R² = 0.13), mean values of T_s = 225.1 ± 6.5 K, and U_h = 20.6 ± 18.7 m.s⁻¹ (Figure 9b). The coupling strengthens during 2026–2050, as the slope steepens to −2.08 m.s⁻¹ K⁻¹ (r = −0.47; R² = 0.22) and reaches maximum intensity in 2051–2075 (slope = −2.42 m.s⁻¹ K⁻¹; r = −0.52; R² = 0.27), reflecting the period of strongest polarity between warming anomalies and response. In the final segment (2076–2100), the slope weakens sharply to −0.63 m.s⁻¹ K⁻¹ (r = −0.19; R² = 0.04), accompanied by increased scatter and higher mean T_s (~227 K). Thus, M3 shows a transient mid-century strengthening followed by a late century weakening, indicating a non-monotonic dynamical response to RCP 4.5 forcing.

Collectively, Figure 8 and Figure 9 demonstrate a robust and persistent inverse coupling between T_s and U_h in the polar stratosphere at 60° N across both the historical and RCP 4.5 periods. In historical records, ERA5 exhibits strong negative slopes and high correlation magnitudes, with a clear weakening after 1990, indicating reduced dynamical coherence of the polar vortex in recent decades. Both CMIP5 M2 and M3 models reproduce the correct sign of the coupling, albeit with distinct biases: M2 consistently amplifies the slope and correlation strength relative to ERA5, whereas M3 captures the overall structure but underestimates the dynamical amplitude.

5. Discussion

This study investigated the relationship between SSWs and QBOs using ERA5 reanalysis and CMIP5 model simulations, both in historical and future periods. Comparison of SSW frequency and intensity between the ERA5 and CMIP5 models revealed important differences, and here they are briefly discussed.

In the RCP 4.5 scenario, M1 and M3 maintained almost constant event frequencies (±5%) but M2 showed a tendency to occur in the early winter months with a decrease of approximately 10%. These differences are consistent with the outputs of CMIP5 models, but vertical resolution and planetary wave propagation characteristics directly affect eddy distortion frequency and intensity. In this respect, CMIP5 simulations need to focus more on turbulent heat fluxes in high resolution mode. Note that both M4 and M5 are not analyzed due to their inability to generate realistic SSWs (# < 3 events).

Analysis of T_s and U_h variability revealed structural biases in how the models preserved the polar vortex rather than the use of event frequency. Historically, ERA5 analysis exhibits a sustained cooling trend with a westward slowing trend. In contrast, M1 misleadingly simulates a strengthening of the polar night jet during which M2 suppresses dynamic variability, systematically reducing T_amp and U_rng magnitudes in the polar region at about 60−80%. M3 more accurately captures the variability in the structure of system dynamics and thermodynamics, reproducing approximately 75% of the observed amplitude of T_amp. However, M2 significantly overestimates tropical variability. Furthermore, the reduction in variance during SSW events acts as a key indicator of vortex breaking across the models (~24−54%). This confirms that the thermodynamic response of the SSW event remains robust despite mean-state deviations. These shortcomings in reproducing the magnitude of polar variability point to limitations of the models representing intrinsic dynamic forcing. These persistent biases reflect broader challenges identified in both CMIP5 and CMIP6 ensembles regarding stationary wave driving [34,35]. While high-top configurations in newer model generations show improvements in stratospheric dynamics [43], low-top models like M4 and M5 remain essential for demonstrating the structural sensitivity of the polar vortex to model vertical extent.

The QBO processes are analyzed in determining the frequency and intensity of SSWs due to their effect on planetary wave propagation and background horizontal wind shear. Both QBO− and QBO+ phases affect the upward transmission of Rossby waves into the polar vortex, altering the vertical refractive index of the tropical stratosphere [4,44]. Previous studies using reanalysis have shown that SSWs occur twice as often in QBO− phases when compared to QBO+ phases [10,15,45]. This reflects that when upward wave flow increases, the polar vortex is further disrupted. In the CMIP5 models, QBO dynamics are not explicitly resolved because tropical momentum forcing is parameterized rather than dynamically generated. Of the three selected models, only M2 and M3 exhibit intermittent equatorial wind direction changes resembling semi-biennial behavior. However, these oscillations are weak and irregular. M1 shows a limited semi-annual oscillation in the lower stratosphere. This is a known limitation of the upper–lower CMIP5 configurations [36]. Because this study focuses on only the 10 hPa level, direct QBO–SSW coupling cannot be retrieved because the QBO is not dynamically expressed. Therefore, the interpretation relies on established dynamical pathways and previous intercomparisons.

M2 predicts fewer SSW days and a narrower U_rng, while M3 shows a modest upward trend with slightly stronger event classification. The contrast between the monotonic attenuation in M2 and the non-monotonic transient peak in M3 highlights significant structural ambiguities in the representation of future wave–mean flow feedback under radiative forcing. Phase-space analyses of T_s–U_h relationships further emphasize these contrasts, showing a weakening of the post-1990 inverse correlation in ERA5 (r ≈ −0.8). This is partly captured by M2 simulations but remains inconsistent with M3. This inconsistency suggests that CMIP5 models may have overestimated the dynamic stability of the polar vortex, as they generally support a strong linear response, in contrast to the recent ‘decoupling’ seen in the reanalysis data.

The representation of stratospheric variability in CMIP5 models remains persistent, and significant differences occur compared to the ERA5 reanalysis. M1 underrepresents strong SSW events; M2 underestimates overall both dynamical and physical variabilities. On the other hand, M3 reproduces the observed temporal structure of both U_h and T_s metrics. However, specifically in the polar region, it significantly underestimates the magnitudes of both T_amp and U_amp, similarly to M2. The systematic underestimation of SSW intensity in models has direct implications for projected surface weather impacts. Since the strength of the downward coupling is physically linked to the magnitude of the stratospheric warming, an 80% bias in temperature anomalies suggests that models may fail to capture the full severity of mid-latitude cold spells and storm track shifts that typically follow these events. Therefore, although the frequency of SSWs is projected to be stable, their simulated impact on surface weather conditions is likely underestimated due to these persistent dynamical biases. Additionally, the energetic influence of SSWs extends into the upper atmosphere, potentially modulating broader oscillations such as the Semi-Annual Oscillation (SAO) [46]. ERA5 therefore provides a consistent observational benchmark for assessing these differences in both historical and future contexts but better observations covering the upper atmosphere based on in situ measurements, and satellite data can improve the outcome of this work. These results suggest a need to improve the model physics of QBO–SSW interaction, planetary wave propagation dynamics, and mean flow interactions, to increase the reliability of future stratospheric climate projections.

6. Conclusions

This study analyzes the variability of SSWs at 10 hPa in the NH using ERA5 reanalysis and five CMIP5 models, two of which are found to be unsuccessful in capturing SSW dynamics. The findings for the historical and future periods are summarized below.

(1): Frequency: Under the RCP 4.5 scenario, the frequency of SSW events is projected to remain stable throughout the 21st century, showing no significant trend.
(2): Predictability and Impact: The major challenge for future projections lies not in the number of events, but in their simulated magnitude. The systematic variability identified in SSW intensity and its projected changes are closely linked to the specific criteria and metrics employed for detection. This dependency directly influences the accuracy of Arctic vortex intensity forecasting and subsequent mid-latitude weather predictions, particularly where model intensity deficits reach 61–82%. Such deficits indicate a critical weakness in simulating the dynamical energy of polar vortex disruptions, which is essential for accurately forecasting their subsequent impacts on surface weather patterns [5,6].

The systematic failure of M4 and M5 to capture SSW events is attributed to their low-top configurations and limited vertical resolution [34,35].
M1 captures the seasonal cycle well but cannot resolve extreme SSW events in detail. While both M2 and M3 show better temporal agreement with ERA5 analysis, they exhibit structural deviations in the amplitude of both T_s and U_h parameters.
At 10° N, the models exhibit a clear discrepancy in thermal variability. M2 underestimates both T_amp and T_rng at about 21% and 22% compared to ERA5 analysis, respectively. Similarly, M3 underestimates these metrics by approximately 9% and 8%, indicating a slight damping of tropical thermal variability compared to ERA5. These results indicate a clear failure of the models to capture tropical thermal variability. Under the RCP 4.5 scenario, these internal biases persist; specifically, M3 maintains T_s amplitudes consistently higher than its own historical mean, suggesting a sustained overestimation of tropical dynamics throughout the 21st century.
At 60° N, ERA5 reveals strong variability in T_amp and T_rng values, with 3.16 K and 0.73 K, respectively. The M2 and M3 models consistently fail to capture the full magnitude of these metrics. M2 and M3 underestimate these values by 80–82% and 74–75% compared to ERA5 analysis, respectively. While M2 and M3 underestimate the historical magnitudes observed in ERA5, their future projections indicate that T_amp remains nearly constant in M2 (1.4% change relative to its historical mean), while M3 shows a modest increase of 5.7%. This suggests that the systematic suppression of polar vortex dynamics is projected to continue under future forcing.
At 60° N, the ERA5 U_amp and U_rng values are found to be 11.39 m.s⁻¹ and 2.73 m.s⁻¹, respectively. M2 and M3 underestimate these wind speeds of about 61–64% and 61–63%, respectively.
Scatter plot diagnostics at 60° N show that ERA5 exhibits a strengthened negative relationship between T_s and U_h after 1990 with a correlation of R ≈ –0.8. This relationship is partially captured by M2 but it is not consistently reproduced by M3.
Regarding to polar variability at 60° N, ERA5 shows a strengthening of polar westerlies with a trend of –0.87 m.s⁻¹ per decade. The M1, M2, and M3 models fail to capture this trend and simulate spurious positive trends between 0.34 and 0.80 m.s⁻¹ per decade. Under RCP 4.5 scenario, these trends remain near zero, which suggests a lack of dynamic evolution in future simulations.
In historical evaluations, the sensitivity of the T_s–U_h coupling in M3 weakens significantly toward the end of the century, which means there is a decreasing relationship between them. The regression slope drops from −1.42 m.s⁻¹/10 yr to −0.63 m.s⁻¹/10 yr between 2076 and 2100. This change indicates that both M2 and M3 lack the physical processes necessary to accurately capture how the vortex responds to thermal anomalies.

These conclusions suggest that CMIP5 models need to be further improved with respect to their vertical and horizontal resolutions, and the representation of physical and dynamical processes in the upper troposphere, which are more susceptible to climate change processes. It is also suggested that ERA5 analysis results should be further refined using remote sensing observations of stratospheric conditions. This work’s results can also be basis for improving CMIP6 simulations to study a similar content. In addition to these points, several limitations of the current study should be acknowledged in future studies. Firstly, the analysis focuses exclusively on the NH; therefore, the findings may not be directly applicable to the SH, where planetary wave activity and vortex dynamics follow significantly different patterns. Secondly, while the 10 hPa pressure level is a standard reference for identifying SSW events, incorporating a multi-level vertical analysis in future studies could provide a more comprehensive view of the downward propagation of warming signals. Thirdly, the investigation is based on the RCP 4.5 scenario, which represents a moderate future forcing; evaluating these dynamics under more extreme scenarios like RCP 8.5 could further test the sensitivity of the polar vortex to higher radiative forcing. Lastly, although the selected primary models (M1–M3) represent stratospheric processes in detail, persistent structural biases in vertical resolution across climate models remain a factor that requires continued refinement in future studies to better represent fine-scale wave–mean flow interactions.

Author Contributions

The authors contributed the manuscript in following ways: Conceptualization, S.D., I.G. and D.D.; methodology, S.D., I.G. and D.D.; software, S.D.; validation, S.D. and I.G.; formal analysis, S.D.; investigation, S.D., I.G. and D.D.; resources, ECMWF; data curation, S.D.; writing—original draft preparation, S.D. and I.G.; writing—review and editing, O.D.; visualization, S.D. and O.D.; supervision, I.G. and D.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially supported by the Fog and Turbulence in Marine Atmosphere (FATIMA) project, which is funded by the U.S. Office of Naval Research (ONR), USA, with Grant No. N00014-21-1-2296 and by the University of Notre Dame, IN, USA. In addition, this work was also partially completed during Dr. Ismail Gultepe’s visit to the University of Notre Dame, IN, USA.

Data Availability Statement

The datasets analyzed in this study were retrieved from the Copernicus Climate Change Service (C3S) Climate Data Store, which provides access to reanalysis and climate datasets (https://climate.copernicus.eu/ (accesed on 15 January 2024). Software Availability: All data processing was performed using MATLAB R2016b (The MathWorks Inc., Natick, MA, USA). Codes are available in request from the first author.

Acknowledgments

The authors thank the European Centre for Medium-Range Weather Forecasts (ECMWF) for making the ERA5 reanalysis data available. They are also grateful to the Joint Modelling Working Group of the World Climate Research Program, responsible for CMIP, and to the climate modeling groups that produced and made the model outputs available. The authors gratefully acknowledge the computing resources and technical support provided by the National Center for High Performance Computing (UHEM) at Istanbul Technical University (ITU) under grant number 4019242024. The computational analyses presented in this paper were performed using UHEM’s high-performance computing infrastructure through remote access services.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

CAS	Commission for Atmospheric Sciences
CMIP5	Coupled Model Intercomparison Project Phase 5
CMIP6	Coupled Model Intercomparison Project Phase 6
ECMWF	European Centre for Medium-Range Weather Forecasts
ENSO	El Niño–Southern Oscillation
ERA5	Fifth Generation ECMWF Atmospheric Reanalysis
hPa	Hectopascal (pressure level)
IQSY	International Quiet Sun Year
MAX	Maximum
MIN	Minimum
MJO	Madden–Julian Oscillation
MPA	Main Phase Area
MPD	Main Phase Duration
MPS	Main Phase Strength
NH	Northern Hemisphere
OLS	Ordinary Least-Squares
P_r	95th–5th variability metric
QBO	Quasi-Biennial Oscillation
QBO−	Easterly phase of QBO
QBO+	Westerly phase of QBO
R²	Coefficient of Determination
RCP 4.5	Representative Concentration Pathway 4.5 Scenario
SAO	Semiannual Oscillation
sd	Standard Deviation
SH	Southern Hemisphere
SSW	Sudden Stratospheric Warming
S_Ts	Linear trend of stratospheric temperature (K.yr⁻¹)
S_Uh	Linear trend of zonal wind speed (m.s⁻¹ yr⁻¹)
T_amp	Amplitude of variability for surface temperature (Standard Deviation)
T_rng	12-month variability range metric (max–min)/12 stratospheric temperature
TEA	Threshold Exceedance Area (diagnostic metric)
T_s	Stratospheric temperature (K)
T_sa	Long-term climatic average of stratospheric temperature (K)
U_amp	Amplitude of variability for zonal wind (Standard Deviation).
U_h	Zonal-mean wind speed (m.s⁻¹)
VAR	Variance
U_rng	12-month variability range metric (max–min)/12 for zonal wind
WMO	World Meteorological Organization
ΔT_s	Change in temperature anomaly

References

Butchart, N. The Stratosphere: A Review of The Dynamics and Variability. Weather Clim. Dyn. 2022, 3, 1237–1272. [Google Scholar] [CrossRef]
Li, Y.; Kirchengast, G.; Schwaerz, M.; Yuan, Y. Monitoring Sudden Stratospheric Warmings under Climate Change since 1980 Based on Reanalysis Data Verified by Radio Occultation. Atmos. Chem. Phys. 2023, 23, 1259–1284. [Google Scholar] [CrossRef]
Butler, A.H.; Seidel, D.J.; Hardiman, S.C.; Butchart, N.; Birner, T.; Match, A. Defining Sudden Stratospheric Warmings. Bull. Am. Meteorol. Soc. 2015, 96, 1913–1928. [Google Scholar] [CrossRef]
Baldwin, M.P.; Dunkerton, T.J. Stratospheric Harbingers of Anomalous Weather Regimes. Science 2001, 294, 581–584. [Google Scholar] [CrossRef]
Ma, X.; Zhao, F.; Yue, B.; Liu, X. DSMF-Net: A Spatiotemporal Memory Flow Network for Long-Range Prediction of Stratospheric Sudden Warming Events. Atmosphere 2025, 16, 1316. [Google Scholar] [CrossRef]
Xie, J.; Hu, J.; Li, X.; Luo, J.-J.; Xu, H.; Jia, Y. Two Major Sudden Warming Events in the Unprecedentedly Active Stratosphere during the Boreal Winter of 2023/2024 and Their Distinct Surface Impacts over China. Atmos. Res. 2025, 319, 108032. [Google Scholar] [CrossRef]
Van Loon, H.; Jenne, R.L.; Labitzke, K. Zonal Harmonic Standing Waves. J. Geophys. Res. 1973, 78, 4463–4471. [Google Scholar] [CrossRef]
Scherhag, R. Die Explosinonsartigen Stratospharen Warmungen Des Stratintere 1951/52. Deutsch. Wetterdienstes US-Zone Berl. 1952, 6, 51–63. [Google Scholar]
McInturff, R.M. Stratospheric Warmings: Synoptic, Dynamic and General-Circulation Aspects; National Aeronautics and Space Administration, Scientific and Technical Information Office: Hampton, VI, USA, 1978; ISBN 978-0-8156-2252-9.
Baldwin, M.P.; Ayarzagüena, B.; Birner, T.; Butchart, N.; Butler, A.H.; Charlton-Perez, A.J.; Domeisen, D.I.V.; Garfinkel, C.I.; Garny, H.; Gerber, E.P.; et al. Sudden Stratospheric Warmings. Rev. Geophys. 2021, 59, e2020RG000708. [Google Scholar] [CrossRef]
Thompson, D.W.J.; Baldwin, M.P.; Wallace, J.M. Stratospheric Connection to Northern Hemisphere Wintertime Weather: Implications for Prediction. J. Clim. 2002, 15, 1421–1428. [Google Scholar] [CrossRef]
Labitzke, K.; Kunze, M. On the Remarkable Arctic Winter in 2008/2009. J. Geophys. Res. 2009, 114, 2009JD012273. [Google Scholar] [CrossRef]
World Meteorological Organization. International Years of the Quiet Sun (IQSY) 1964–1965: Alert Messages with Special References to Stratwarms; WMO/IQSY Report No 6; Secretariat of the World Meteorological Organization: Geneva, Switzerland, 1964. [Google Scholar]
Johnson, K.W.; Miller, A.J.; Gelman, M.E. Proposed Indices Characterizing Stratospheric Circulation and Temperature Fields. Mon. Weather Rev. 1969, 97, 565–570. [Google Scholar] [CrossRef][Green Version]
Charlton, A.J.; Polvani, L.M. A New Look at Stratospheric Sudden Warmings. Part I: Climatology and Modeling Benchmarks. J. Clim. 2007, 20, 449–469. [Google Scholar] [CrossRef]
Schoeberl, M.R.; Strobel, D.F. The Response of the Zonally Averaged Circulation to Stratospheric Ozone Reductions. J. Atmos. Sci. 1978, 35, 1751–1757. [Google Scholar] [CrossRef][Green Version]
Labitzke, K. Stratospheric-mesospheric Midwinter Disturbances: A Summary of Observed Characteristics. J. Geophys. Res. 1981, 86, 9665–9678. [Google Scholar] [CrossRef]
Baldwin, M.P.; Gray, L.J.; Dunkerton, T.J.; Hamilton, K.; Haynes, P.H.; Randel, W.J.; Holton, J.R.; Alexander, M.J.; Hirota, I.; Horinouchi, T.; et al. The Quasi-biennial Oscillation. Rev. Geophys. 2001, 39, 179–229. [Google Scholar] [CrossRef]
Holton, J.R.; Tan, H.-C. The Influence of the Equatorial Quasi-Biennial Oscillation on the Global Circulation at 50 Mb. J. Atmos. Sci. 1980, 37, 2200–2208. [Google Scholar] [CrossRef]
Toms, B.A.; Barnes, E.A.; Maloney, E.D.; Van Den Heever, S.C. The Global Teleconnection Signature of the Madden-Julian Oscillation and Its Modulation by the Quasi-Biennial Oscillation. J. Geophys. Res. Atmos. 2020, 125, e2020JD032653. [Google Scholar] [CrossRef]
Ermakova, T.; Koval, A.; Didenko, K.; Fadeev, A.; Sokolov, A. Analyzing Stratospheric Polar Vortex Strength and Persistence Under Different QBO and ENSO Phases: Insights from the Model Study. Atmosphere 2025, 16, 1371. [Google Scholar] [CrossRef]
Sunkara, E.; Seo, K.-H.; Mengist, C.K.; Ratnam, M.V.; Niranjan Kumar, K.; Venkata Chalapathi, G. Role of QBO and MJO in Sudden Stratospheric Warmings: A Case Study. Atmosphere 2024, 15, 1458. [Google Scholar] [CrossRef]
Butler, A.H.; Gerber, E.P. Optimizing the Definition of a Sudden Stratospheric Warming. J. Clim. 2018, 31, 2337–2344. [Google Scholar] [CrossRef]
Ayarzagüena, B.; Polvani, L.M.; Langematz, U.; Akiyoshi, H.; Bekki, S.; Butchart, N.; Dameris, M.; Deushi, M.; Hardiman, S.C.; Jöckel, P.; et al. No Robust Evidence of Future Changes in Major Stratospheric Sudden Warmings: A Multi-Model Assessment from CCMI. Atmos. Chem. Phys. 2018, 18, 11277–11287. [Google Scholar] [CrossRef]
Rao, J.; Garfinkel, C.I. CMIP5/6 Models Project Little Change in the Statistical Characteristics of Sudden Stratospheric Warmings in the 21st Century. Environ. Res. Lett. 2021, 16, 034024. [Google Scholar] [CrossRef]
Chávez-Pérez, V.M.; Añel, J.A.; Almaguer-Gómez, C.; De La Torre, L. Temporal Variability of Major Stratospheric Sudden Warmings in CMIP5 Climate Change Scenarios. Climate 2025, 13, 207. [Google Scholar] [CrossRef]
Martínez-Andradas, V.; De La Cámara, A.; Zurita-Gotor, P.; Lott, F.; Serva, F. Quantifying the Spread in Sudden Stratospheric Warming Wave Forcing in CMIP6. Weather Clim. Dyn. 2025, 6, 329–343. [Google Scholar] [CrossRef]
Copernicus Climate Change Service; Climate Data Store. ERA5 Hourly Data on Single Levels from 1940 to Present; Copernicus Climate Change Service (C3S): Bonn, Germany; Climate Data Store (CDS): Reading, UK, 2023. [Google Scholar] [CrossRef]
Li, J.; Wang, Y.; Liu, L.; Yao, Y.; Huang, L.; Li, F. A Grid Model for Vertical Correction of Precipitable Water Vapor over the Chinese Mainland and Surrounding Areas Using Random Forest. Geosci. Model Dev. 2024, 17, 2569–2581. [Google Scholar] [CrossRef]
Li, J.; Li, F.; Liu, L.; Yao, Y.; Huang, L.; Wang, Y. A Weighted Mean Temperature Forecast Model Based on Fused Data and Generalized Regression Neural Network and Its Impact on GNSS-Based Precipitable Water Vapor Estimation. IEEE Trans. Geosci. Remote Sens. 2024, 62, 4101914. [Google Scholar] [CrossRef]
Yang, Y.; Li, H. Planetary Wave Activity During 2019 Sudden Stratospheric Warming Event Revealed by ERA5 Reanalysis Data. Remote Sens. 2024, 16, 4739. [Google Scholar] [CrossRef]
Kim, J.; Son, S.-W.; Gerber, E.P.; Park, H.-S. Defining Sudden Stratospheric Warming in Climate Models: Accounting for Biases in Model Climatologies. J. Clim. 2017, 30, 5529–5546. [Google Scholar] [CrossRef]
Copernicus Climate Change Service; Climate Data Store. CMIP5 Daily Data on Pressure Levels; Copernicus Climate Change Service (C3S): Bonn, Germany; Climate Data Store (CDS): Reading, UK, 2018. [Google Scholar] [CrossRef]
Hall, R.J.; Mitchell, D.M.; Seviour, W.J.M.; Wright, C.J. Persistent Model Biases in the CMIP6 Representation of Stratospheric Polar Vortex Variability. J. Geophys. Res. Atmos. 2021, 126, e2021JD034759. [Google Scholar] [CrossRef]
Karpechko, A.Y.; Wu, Z.; Simpson, I.R.; Kretschmer, M.; Afargan-Gerstman, H.; Butler, A.H.; Domeisen, D.I.V.; Garny, H.; Lawrence, Z.; Manzini, E.; et al. Northern Hemisphere Stratosphere-Troposphere Circulation Change in CMIP6 Models: 2. Mechanisms and Sources of the Spread. J. Geophys. Res. Atmos. 2024, 129, e2024JD040823. [Google Scholar] [CrossRef]
Butchart, N.; Charlton-Perez, A.J.; Cionni, I.; Hardiman, S.C.; Haynes, P.H.; Krüger, K.; Kushner, P.J.; Newman, P.A.; Osprey, S.M.; Perlwitz, J.; et al. Multimodel Climate and Variability of the Stratosphere. J. Geophys. Res. 2011, 116, D05102. [Google Scholar] [CrossRef]
Charlton-Perez, A.J.; Baldwin, M.P.; Birner, T.; Black, R.X.; Butler, A.H.; Calvo, N.; Davis, N.A.; Gerber, E.P.; Gillett, N.; Hardiman, S.; et al. On the Lack of Stratospheric Dynamical Variability in Low-top Versions of the CMIP5 Models. J. Geophys. Res. Atmos. 2013, 118, 2494–2505. [Google Scholar] [CrossRef]
Palmeiro, F.M.; Barriopedro, D.; García-Herrera, R.; Calvo, N. Comparing Sudden Stratospheric Warming Definitions in Reanalysis Data. J. Clim. 2015, 28, 6823–6840. [Google Scholar] [CrossRef]
Garfinkel, C.I.; White, I.; Gerber, E.P.; Son, S.-W.; Jucker, M. Stationary Waves Weaken and Delay the Near-Surface Response to Stratospheric Ozone Depletion. J. Clim. 2023, 36, 565–583. [Google Scholar] [CrossRef]
Baldwin, M.P.; Stephenson, D.B.; Thompson, D.W.J.; Dunkerton, T.J.; Charlton, A.J.; O’Neill, A. Stratospheric Memory and Skill of Extended-Range Weather Forecasts. Science 2003, 301, 636–640. [Google Scholar] [CrossRef]
Baldwin, M.P.; Thompson, D.W.J. A Critical Comparison of Stratosphere–Troposphere Coupling Indices. Q. J. R. Meteorol. Soc. 2009, 135, 1661–1672. [Google Scholar] [CrossRef]
Blume, C.; Matthes, K.; Horenko, I. Supervised Learning Approaches to Classify Sudden Stratospheric Warming Events. J. Atmos. Sci. 2012, 69, 1824–1840. [Google Scholar] [CrossRef]
Serva, F.; Christiansen, B.; Davini, P.; Von Hardenberg, J.; Van Den Oord, G.; Reerink, T.J.; Wyser, K.; Yang, S. Changes in Stratospheric Dynamics Simulated by the EC-Earth Model From CMIP5 to CMIP6. J. Adv. Model. Earth Syst. 2024, 16, e2023MS003756. [Google Scholar] [CrossRef]
Anstey, J.A.; Shepherd, T.G. High-latitude Influence of the Quasi-biennial Oscillation. Quart. J. R. Meteorol. Soc. 2014, 140, 1–21. [Google Scholar] [CrossRef]
Garfinkel, C.I.; Shaw, T.A.; Hartmann, D.L.; Waugh, D.W. Does the Holton–Tan Mechanism Explain How the Quasi-Biennial Oscillation Modulates the Arctic Polar Vortex? J. Atmos. Sci. 2012, 69, 1713–1733. [Google Scholar] [CrossRef]
Zhang, J.; Orsolini, Y.; Sato, K. Modulation of the Semi-Annual Oscillation by Stratospheric Sudden Warmings as Seen in the High-Altitude JAWARA Re-Analyses. Atmosphere 2025, 16, 1320. [Google Scholar] [CrossRef]

Figure 1. Distribution and duration of SSW events from 1980 to 2005 based on ERA5 reanalysis and selected CMIP5 models. Panel (a) shows the number of SSW events categorized as minor, major, and extreme events, along with the total count of data points. Panel (b) presents the cumulative number of SSW days, which reflects both the frequency and persistence of events. The colors correspond to each dataset as follows: ERA5 (green), M1 (purple), M2 (light blue), M3 (dark green), M4 (orange), and M5 (navy blue).

Figure 2. Classification of SSW events at 10 hPa between 1980 and 2005 based on ERA5 reanalysis and CMIP5 historical model data. Panels show the timing and intensity of events across different datasets: (a) ERA5 reanalysis, (b) M1, (c) M2, and (d) M3. Event categories are marked as minor (blue), major (green), and extreme (red), plotted from November to April to reflect the typical seasonal window for the occurrence of SSWs.

Figure 3. Classification of SSW events at 10 hPa between 2006 and 2100 based on CMIP5 RCP 4.5 data. The panels show the timing and intensity of events across different datasets: (a) M1, (b) M2, and (c) M3. Event categories are marked as minor (blue), major (green), and extreme (red), plotted from November to April to reflect the typical seasonal window for the occurrence of SSWs.

Figure 4. Stratospheric temperature (T_s) time series at 10 hPa and 60° N for the period 1980–2005 based on (a) ERA5 reanalysis and CMIP5 historical simulations from (b) M1, (c) M2, and (d) M3. Daily T_s are overlaid with SSW classifications: minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.

Figure 5. Stratospheric temperature (T_s) time series at 10 hPa and 60° N for the period 2006–2100 based on the RCP 4.5 scenario from CMIP5 models: (a) M1, (b) M2, and (c) M3. Daily T_s are overlaid with SSW classifications: minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.

Figure 6. Zonal wind (U_h) time series at 10 hPa and 60° N for the period 1980–2005 based on (a) ERA5 reanalysis and CMIP5 historical simulations from (b) M1, (c) M2, and (d) M3. Daily U_h are overlaid with SSW classifications: minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.

Figure 7. Zonal wind (U_h) time series at 10 hPa and 60° N for the period 2006 –2100 based on CMIP5 RCP 4.5 simulations for (a) M1, (b) M2, and (c) M3. Daily U_h overlaid with SSW classifications are shown as minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.

Figure 8. Scatter plots of daily mean zonal wind (U_h) versus stratospheric temperature (T_s) at 10 hPa and 60° N from CMIP5 models: (a) ERA5 (b) M2 and (c) M3 for the historical period (1980–2005). Orange dots represent data after 1990 and green dots represent data before 1990. Dashed black and blue lines show linear trends for pre-1990 and post-1990 periods.

Figure 9. Scatter plots of daily mean zonal wind (U_h) versus stratospheric temperature (T_s) at 10 hPa and 60° N from CMIP5 models: (a) M2 and (b) M3 for the RCP 4.5 scenario (2006–2100). Colored dots represent different future sub-periods: magenta (2006–2025), blue (2026–2050), orange (2051–2075), and green (2076–2100). Solid lines show linear trends for each sub-period: brown (2006–2025), purple (2026–2050), blue (2051–2075), and black (2076–2100).

Table 1. Datasets and numerical weather models used in the analysis [28,33].

Model	Abbreviation	Institution/Country	Spatial Resolution	Temporal Resolution	Periods	Group
ERA5 Reanalysis	ERA5	ECMWF, Europe	0.25° × 0.25°	Hourly	1980–2005	Reference
Australian Community Climate and Earth-System Simulator v1.3	ACCESS1-3 (M1)	Bureau of Meteorology & CSIRO, Melbourne, VIC, Australia	1.875° × 1.25°	Daily	1980–2005 (Historical), 2006–2100 (RCP4.5)	Primary
Hadley Centre Global Environmental Model v2—Carbon Cycle	HadGEM2-CC (M2)	UK Met Office, Exeter, United Kingdom	1.25° × 1.25°	Daily	1980–2005 (Historical), 2006–2100 (RCP4.5)	Primary
Max Planck Institute Earth System Model—Medium Resolution	MPI-ESM-MR (M3)	Max Planck Institute for Meteorology, Hamburg, Germany	1.875° × 1.875°	Daily	1980–2005 (Historical), 2006–2100 (RCP4.5)	Primary
Geophysical Fluid Dynamics Laboratory Earth System Model v2G	GFDL-ESM2G (M4)	NOAA Geophysical Fluid Dynamics Laboratory, Princeton, NJ, USA	2.5° × 2.5°	Daily	1980–2005 (Historical), 2006–2100 (RCP4.5)	Secondary
Flexible Global Ocean–Atmosphere–Land System Model, Grid-Point v2	FGOALS-g2 (M5)	LASG/IAP, Beijing, China	2.8125° × 2.8125°	Daily	1980–2005 (Historical), 2006–2100 (RCP4.5)	Secondary

Table 2. Threshold Exceedance Area Temperature (TEA) stratospheric temperature (T_s) criteria used in the SSW detection [2].

Criteria Used in Temperature Anomaly Calculation in TEA Metrics
Step 1	T_s: Stratospheric air temperature obtained from either CMIP5 or ERA5 analysis.
Step 2	T_sa: Long-term climatic average of the stratospheric air temperature (T_s)
Step 3	ΔT_s = T_s − T_sa (Anomaly)

Table 3. SSW metrics: definitions, calculation methods, and classification thresholds [2].

Metric/Class	Definition/Calculation Method	Physical Interpretation	Example Value (1980–2005)
TEA	Geographical area where the Daily temperature anomaly exceeds a defined threshold.	Horizontal extent of warming on a given day	12 January 1987: >30 K ≈ 9.1 × 10⁶ km²
MPD	Number of consecutive days with TEA above threshold.	Persistence of warming (temporal duration)	22 days
MPA	Average of Daily TEA during the main phase (∑TEA/MPD)	Typical or average horizontal coverage of warming	8.52 × 10⁶ km²
MPS	Product of MPD and MPA (area × duration)	Integrated magnitude of warming event	187.4 × 10⁶ km².days
SSW Classification Minor	MPS < 90 × 10⁶ km².days	Small-scale or low-impact SSW event	GFDL-ESM2G—1984 (MPS ≈ 77 × 10⁶ km².days)
SSW Classification Major	90–180 × 10⁶ km².days	Large-scale impact SSW event	HadGEM2CC—1999 (MPS ≈ 132 × 10⁶ km².days)
SSW Classification Extreme	MPS > 180 × 10⁶ km².days	Exceptionally strong/extensive SSW	ACCESS1-3—1987 (MPS ≈ 520 × 10⁶ km².days)

Table 4. Statistical comparison of daily stratospheric temperature (T_s) series at 10 hPa 60° N for ERA5 and CMIP5 models (M1–M3) over the historical period (1980–2005). The metrics include mean T_s, standard deviation (sd), percentile range (P_r), linear trends (S_Ts), and variance reduction during SSW events.

Metric	ERA5	M1	M2	M3
Mean T_s (K)	225.94	227.9	227.13	227.76
Std. Deviation (sd) (K)	9.55	11.69	7.68	9.45
Percentile Range (P_r) (K)	29.37	35.18	23.83	28.43
Trend (S_Ts) (K.decade⁻¹)	–0.298	–0.85	–0.69	–0.316
SSW Days Mean T_s (K)	226.19	224.96	229.22	228.81
Non-SSW Days Mean T_s (K)	225.93	~227.96	~227.07	~227.72
SSW Variance Reduction	46%	54%	50%	44%

Table 5. Statistical comparison of daily stratospheric temperature (T_s) series at 10 hPa 60° N for CMIP5 models (M1–M3) under the RCP 4.5 future climate scenario. The diagnostics include mean T_s, standard deviation (sd), percentile range (P_r), linear trends (S_Ts), and variance reduction during SSW events compared to the historical baseline.

Metric	M1	M2	M3
Mean T_s (K)	225.85	224.69	225.6
Std. Deviation (sd) (K)	11.99	7.38	9.04
Percentile Range (P_r) (K)	35.87	22.77	27.16
Trend (S_Ts) (K.decade⁻¹)	–0.21	–0.26	–0.24
SSW Days Mean T_s (K)	222.95	228.68	225.75
Non-SSW Days Mean T_s (K)	~225.95	~224.69	225.93
SSW Variance Reduction	49%	52%	49%

Table 6. Statistical comparison of daily zonal wind (U_h) series at 10 hPa 60° N for ERA5 and CMIP5 models (M1–M3) over the historical period (1980–2005). The metrics include mean U_h, standard deviation (sd), percentile range (P_r), linear trends (S_Ts), and variance reduction during SSW events.

Metric	ERA5	M1	M2	M3
Mean U_h (m.s⁻¹)	10.62	16.22	9.34	10.83
Std. Deviation (sd) (m.s⁻¹)	19.19	21.66	18.17	19.19
Percentile Range (P_r) (m.s⁻¹)	58.33	65.72	57.6	59.71
Trend S_Uh (m.s⁻¹ decade⁻¹)	–0.87	+0.64	+0.80	+0.34
SSW Days Mean U_h (m.s⁻¹)	14.5	17.61	12.78	11.9
Non-SSW Days Mean U_h (m.s⁻¹)	10.45	16.18	9.26	10.8
SSW Variance Reduction	24%	37%	54%	13%

Table 7. Statistical comparison of daily zonal wind (U_h) series at 10 hPa 60° N for CMIP5 models (M1–M3) under the RCP 4.5 future climate scenario. The diagnostics include mean U_h, standard deviation (sd), percentile range (P_r), linear trends (S_Ts), and variance reduction during SSW events compared to the historical baseline.

Metric	M1	M2	M3
Mean U_h (m.s⁻¹)	17.6	9.73	10.8
Std. Deviation (sd) (m.s⁻¹)	23.51	18.13	18.88
Percentile Range (P_r) (m.s⁻¹)	70.44	55.95	57.64
Trend S_Uh (m.s⁻¹ decade⁻¹)	+0.11	–0.04	+0.17
SSW Days Mean U_h (m.s⁻¹)	19.66	13.1	31.75
Non-SSW Days Mean U_h (m.s⁻¹)	17.53	9.69	10.11
SSW Variance Reduction	43%	31%	4%

Table 8. Statistical analysis of stratospheric temperature amplitude (T_amp) and range (T_rng) for the ERA5 reanalysis and CMIP5 model simulations (M2 and M3) at 10° N and 60° N for the historical (1980–2005) and future (2006–2100, RCP 4.5) periods. The metrics include the mean, median, standard deviation (sd), variance (Var), minimum (Min), maximum (Max), and linear trend (Slope, in K per decade).

Period	Metric	Lat	Model	Mean	Median	Sd	Var	Min	Max	Slope
1980–2005	T_amp	10° N	ERA5	2.071	2.034	0.379	0.143	1.333	3.397	−0.001236
			M2	1.637	1.596	0.349	0.122	0.848	2.663	0.000487
			M3	1.884	1.82	0.415	0.172	0.836	3.117	0.001729
		60° N	ERA5	3.159	3.186	0.311	0.097	2.334	3.898	0.001618
			M2	0.585	0.59	0.094	0.009	0.373	0.874	−0.000605
			M3	0.785	0.772	0.108	0.012	0.499	1.213	0.000621
	T_rng	10° N	ERA5	0.553	0.547	0.109	0.012	0.3	0.835	−0.00075
			M2	0.434	0.429	0.098	0.01	0.218	0.675	−0.000182
			M3	0.507	0.5	0.118	0.014	0.211	0.805	0.000407
		60° N	ERA5	0.73	0.727	0.086	0.007	0.526	0.947	0.000186
			M2	0.144	0.144	0.023	0.001	0.096	0.205	−0.000136
			M3	0.193	0.189	0.025	0.001	0.145	0.264	0.000077
2006–2100	T_amp	10° N	M2	1.706	1.7	0.337	0.114	0.84	2.938	0.000661
		10° N	M3	1.962	1.935	0.369	0.136	1.017	3.231	−0.00019
		60° N	M2	0.574	0.571	0.086	0.007	0.391	0.855	−0.000254
		60° N	M3	0.758	0.753	0.107	0.011	0.456	1.098	−0.000106
	T_rng	10° N	M2	0.456	0.445	0.099	0.01	0.218	0.827	0.000166
		10° N	M3	0.522	0.508	0.106	0.011	0.259	0.943	−0.000086
		60° N	M2	0.15	0.149	0.021	0	0.098	0.202	−0.000051
		60° N	M3	0.191	0.189	0.025	0.001	0.117	0.245	−0.000023

Table 9. Statistical analysis of zonal wind amplitude (U_amp) and range (U_rng) for ERA5 reanalysis and CMIP5 model simulations (M2 and M3) at 10° N and 60° N for the historical (1980–2005) and future (2006–2100, RCP 4.5) periods. Metrics include the mean, median, standard deviation (sd), variance (Var), minimum (Min), maximum (Max), and linear trend slope (m.s⁻¹ per decade).

Period	Metric	Lat	Model	Mean	Median	Std	Var	n	Max	Slope
1980–2005	U_amp	10° N	ERA5	12.513	12.656	4.56	20.793	3.033	22.602	0.01084
			M2	13.352	14.483	5.497	30.212	1.376	23.14	0.000349
			M3	13.214	14.309	7.622	58.088	1.066	25.571	0.005
		60° N	ERA5	11.385	11.347	1.754	3.075	7.047	15.436	0.01092
			M2	4.087	4.003	1.138	1.295	1.657	6.865	−0.002389
			M3	4.168	3.971	1.474	2.173	1.215	8.212	−0.00104
	U_rng	10° N	ERA5	3.021	3.142	0.926	0.858	0.853	4.838	0.002139
			M2	3.06	3.432	1.014	1.028	0.398	4.764	−0.000255
			M3	3.068	3.615	1.526	2.327	0.302	5.06	0.001179
		60° N	ERA5	2.728	2.742	0.392	0.154	1.696	3.691	0.001098
			M2	1.072	1.022	0.29	0.084	0.489	1.788	−0.000499
			M3	1.059	1.019	0.333	0.111	0.386	1.91	−0.000604
2006–2100	U_amp	10° N	M2	13.251	13.982	4.306	18.546	2.635	21.302	−0.004462
		10° N	M3	14.458	15.551	7.133	50.886	0.793	26.944	0.0048
		60° N	M2	4.133	4.053	1.237	1.531	1.263	7.554	−0.000401
		60° N	M3	4.578	4.432	1.486	2.21	1.012	9.142	0.000261
	U_rng	10° N	M2	3.094	3.243	0.768	0.589	0.678	4.419	−0.000877
		10° N	M3	3.315	3.76	1.365	1.863	0.218	5.092	0.000826
		60° N	M2	1.071	1.078	0.29	0.084	0.338	1.761	−0.00014
		60° N	M3	1.174	1.146	0.354	0.126	0.303	2.184	−0.000025

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Durmus, S.; Demirhan, D.; Gultepe, I.; Durmus, O. Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100. Forecasting 2026, 8, 13. https://doi.org/10.3390/forecast8010013

AMA Style

Durmus S, Demirhan D, Gultepe I, Durmus O. Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100. Forecasting. 2026; 8(1):13. https://doi.org/10.3390/forecast8010013

Chicago/Turabian Style

Durmus, Simla, Deniz Demirhan, Ismail Gultepe, and Onur Durmus. 2026. "Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100" Forecasting 8, no. 1: 13. https://doi.org/10.3390/forecast8010013

APA Style

Durmus, S., Demirhan, D., Gultepe, I., & Durmus, O. (2026). Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100. Forecasting, 8(1), 13. https://doi.org/10.3390/forecast8010013

Article Menu

Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100

Highlights

Abstract

1. Introduction

2. Data

3. Method