Next Article in Journal
The Impact of ESG Performance on Financial Performance: Evidence from Listed Companies in Thailand
Previous Article in Journal
Projection of Changes in Coastal Water Temperature of the Baltic Sea up to 2100
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100

1
Department of Flight Training, University of Turkish Aeronautical Association, Ankara 06790, Türkiye
2
Department of Meteorological Engineering, Istanbul Technical University, Istanbul 34467, Türkiye
3
Faculty of Engineering and Applied Sciences, Ontario Technical University, Oshawa, ON L1G 0C5, Canada
4
Department of Aeronautical Engineering, University of Turkish Aeronautical Association, Ankara 06790, Türkiye
5
Department of Civil and Environmental Engineering and Earth Science, University of Notre Dame, Notre Dame, IN 46556, USA
6
Ankara Aviation Vocational School, University of Turkish Aeronautical Association, Ankara 06790, Türkiye
*
Author to whom correspondence should be addressed.
Forecasting 2026, 8(1), 13; https://doi.org/10.3390/forecast8010013
Submission received: 26 December 2025 / Revised: 3 February 2026 / Accepted: 6 February 2026 / Published: 10 February 2026
(This article belongs to the Section Weather and Forecasting)

Highlights

What are the main findings?
  • High-top CMIP5 models systematically underestimate the energetic intensity MPS (Main Phase Strength) and spatial extent TEA (Threshold Exceedance Area) of SSW events by 61% to 82% compared to ERA5 reanalysis.
  • A ‘teleconnection gap’ is identified where models successfully represent tropical stratospheric variability (10° N) but fail to effectively transmit this momentum forcing to the polar vortex (60° N).
What are the implications of the main findings?
  • The CMIP5 models fail to accurately represent the sensitivity of the Ts-Uh relationship and polar vortex responses. This suggests that current climate projections may be insufficient to predict anomalies of the stratospheric thermal budget and climate change.
  • The urgent improvement of the CMIP models’ vertical and horizontal resolutions for future climate studies is found to be critical for accurately prediction of SSW events and Arctic vortex development. Furthermore, the re-analyzed datasets should be enhanced using remote sensing observations to better represent wave–mean flow interactions in the upper troposphere and stratosphere.

Abstract

The main objective of this work is to characterize Sudden Stratospheric Warming (SSW) conditions and their impact on local weather forecasting and climate change, using SSW definition criteria. The SSWs strongly affect Arctic vortex structure and midlatitude weather conditions. This work evaluates the frequency, amplitude, and dynamical–thermal characteristics of SSWs under historical and Representative Concentration Pathway (RCP) 4.5 scenarios, focusing on stratospheric air temperature (Ts) and zonal wind speed (Uh) at the 10° N and 60° N latitudes. The fifth-generation ECMWF atmospheric reanalysis (ERA5) is employed as the reference dataset. Simulations of five Coupled Model Intercomparison Project Phase 5 (CMIP5) models, represented by M1 to M5, are analyzed. The primary group of models included 1) the Australian Community Climate and Earth-System Simulator, version 1.3 (ACCESS1-3, M1), 2) the Hadley Center Global Environmental Model, version 2—Carbon Cycle (HadGEM2-CC, M2), and 3) the Max Planck Institute Earth System Model—Medium Resolution (MPI-ESM-MR, M3). The analysis period covers SSW events related to the Quasi-Biennial Oscillation (QBO) in the Northern Hemisphere (NH) from 1980 to 2100. The key findings indicate that while M1, M2, and M3 simulate SSW occurrence correctly for the 21st century, they exhibit significant systematic deficiencies in capturing the structural dynamics of SSW events. Specifically, the M1, M2, and M3 models underestimate the polar stratospheric temperature amplitude (Tamp) by approximately 75–80% and zonal wind amplitude (Uamp) by more than 60% compared to the ERA5 analysis. Furthermore, ERA5 exhibits a strong negative correlation (R ≈ −0.8) between Uh and Ts that is not estimated accurately using the present models. The importance of the horizontal resolution of the models and wave–mean flow interactions in determining SSW intensity and occurrence is also found to be a critical metric. Results suggest that SSW definition criteria affect Arctic and midlatitude weather system prediction at a rate of 61–82%. It is concluded that the primary configurations of CMIP5 models for accurately capturing the dynamical structure and evolution of QBO–SSW interactions are needed, and that they affect future projections of SSW events.

1. Introduction

The stratosphere begins at approximately 8–10 km altitude near the polar regions and 15–20 km in the tropical regions, reaching an average height of up to 50 km [1]. Sudden stratospheric warming (SSW) is defined as a sudden increase in stratospheric polar air temperature (Ts), mainly during winter months [2]. The Arctic tropopause height is at its lowest level over the poles compared to other regions due to the cold and denser airmass, especially in winter. In the Northern Hemisphere (NH), approximately every 28 months, extremely cold air temperatures and strong westerly zonal winds (Uh) suddenly weaken or reverse, causing the stagnant polar vortex to warm abruptly within a few weeks. During these events, the winds significantly weaken or switch to easterly Uh. In the literature, these events are known as SSWs [3].
SSW events were first observed in the early 1950s [4] and can be monitored in detail using satellite observations [3]. Over time, significant progress has been made in understanding the dynamic effects of SSWs through detailed observations and simulations. However, analyzing how SSW affects both the surface air temperature and upper atmosphere dynamics remains a challenge. Recent studies on stratospheric processes affecting operational forecasting have been conducted by [5,6]. These works, correspondingly, indicated that utilizing spatiotemporal memory flow networks for long-range SSW prediction and analyzing unprecedented winter extremes with direct surface impacts underscore the critical role of stratospheric diagnostics in operational forecasting. SSWs are generally observed in the polar regions (poleward of 60° N) in the NH, but their rarity in the Southern Hemisphere (SH) is due to weaker tropospheric planetary wave activity [7].
The first recorded SSW event was discovered by Richard Scherhag in January 1952 [8], using radiosonde measurements collected in Berlin, Germany. SSWs are atmospheric events that usually occur irregularly in the atmosphere. Using both observations and simulations, SSWs can be detected by a sudden increase in the average Ts (exceeding 30–40 K) in the mid-stratosphere at a height of approximately 30 km (10 hPa) [3,9,10]. However, these warmings sometimes exceed 70 K, creating extreme SSW events. Such warmings in the stratosphere result from the rapid intensification of tropospheric planetary waves propagating upward, which disrupt the stratospheric general circulation [9,11,12]. Tropospheric planetary waves provide the requisite dynamical forcing for SSWs, while the Quasi-Biennial Oscillation (QBO) effectively governs the stratospheric waveguide [13,14]. This indicates that while upward-propagating waves are the primary drivers, the QBO phase regulates the background conditions under which these waves interact with the polar vortex.
The World Meteorological Organization (WMO) established the Commission for Atmospheric Sciences (CAS) and launched the STRATALERT program in 1964 to monitor SSWs, following the January–February 1952 event. In this program, SSW phenomena were studied using radiosonde and rocketsonde observations [13]. The results of this field campaign have shown that SSWs are related to polar vortex disruptions attributed to several factors, such as wind reversals during QBO events [14,15]. The QBO is a periodic shift between easterly and westerly winds in the tropical stratosphere every 28 to 30 months near the equator [4]. These results suggested that SSW events play an important role in polar vortex development and their effect on mid-latitude weather systems.
Major studies since the 1970s analyzing Ts increase and Uh reversal pointed out that variable definitions and thresholds for SSWs exist [16,17]. Mclnturff et al. [9] suggest that a temperature anomaly (ΔTs) of >25 K can be used as a criterion for SSW detection and monitoring. These studies suggested that major category SSWs were clearly associated with the QBO phase slowdown or reversal [9]. Charlton and Polvani [15] refined the definition provided by McInturff et al. [9]. This framework identifies major SSW events based on the reversal of the Uh at the 10 hPa level over 60° N latitude during winter months. While these events are fundamentally driven by the upward propagation of tropospheric planetary waves, the QBO acts as a key modulator that influences the stratospheric waveguide and the conditions under which these wave-driven reversals occur.
The relationship between the QBO and SSWs was widely used later to evaluate the dynamic consequences of SSWs and their interaction with the adjacent atmospheric layers. The SSW events are strongly related to the phases of the QBO [18]. Dependent on the QBO phase, the stability of the polar vortex and the likelihood of SSW events are modified. Typically, the westerly phase of QBO (QBO+) keeps the vortex stable, preventing the development of SSWs. On the other hand, the easterly phase of QBO (QBO−) likely increases wave activity, and the probability of SSW occurrence [4,19]. Such interactions highlight a robust coupling between the tropical stratosphere and polar vortex disruptions. Beyond the modulation of the stratospheric waveguide, the persistence of the polar vortex is further influenced by teleconnections from the Madden–Julian Oscillation (MJO) and El Niño–Southern Oscillation (ENSO), which exhibit varying signatures across climate models [20,21]. Integrating these diverse drivers into stratospheric diagnostics is essential for reducing uncertainties in seasonal forecasts [22]. Although the precise dynamical pathways of this tropical–polar teleconnection remain a subject of active research, the statistical relationship between the two is well-documented.
While several well-established definitions for SSW events exist, most notably the Charlton and Polvani criteria [15] based on Uh reversals, the historical absence of a single, universally accepted classification has led to varying results regarding SSW frequency and intensity in the literature. This diversity in criteria necessitates the use of more physically meaningful and integrated metrics, such as the Threshold Exceedance Area (TEA) and Main Phase Strength (MPS) metrics employed in this study, to capture the full structural and energetic evolution of the events. For this reason, the variable criteria based on wind reversals and Ts anomalies have led to conflicting results of SSW frequency and intensity [2]. Butler and Gerber [23] found that nine different major SSW definitions exist, and based on these definitions, annual SSW frequency varied between 40% and 90%. These large differences indicate that standardization and physically meaningful metrics are needed. Butler and Gerber [23] stated that the SSW definition should be free of the specific dataset, and its definition must capture the physical and dynamical characteristics of SSWs.
The newest approaches to the definition of SSWs have been provided lately. Li et al. [2] introduced the TEA concept, the Main Phase Duration (MPD), and Main Phase Area (MPA) metrics, and these are directly related to SSW intensity for quantitative assessment. Their approaches are adapted in this study. These concepts were applied for a specific time period (historical and future) to study SSW events of both Ts and Uh, together with reference data from Fifth Generation European Centre for Medium-Range Weather Forecast (ECMWF) Reanalysis (ERA5) and five large-scale model simulations.
The main objective of this study, using SSW definition criteria, is to investigate the relationship between QBO variability and SSW events during the historical (1980–2005) and future (2006–2100) periods. To better situate this study within the context of the existing literature, it is important to note that while recent multi-model assessments [24,25,26] suggest that the frequency of major SSWs will remain relatively stable in the 21st century, the physical and energetic fidelity of these events remains a critical research gap. Specifically, Ayarzagüena et al. [24] reported no robust evidence of future changes in the frequency or timing of major SSWs across different climate models. Similarly, Rao and Garfinkel [25] concluded that the statistical characteristics and the strength of stratospheric-tropospheric coupling of SSWs indicated negligible changes in the 21st century. Additionally, a study by Chávez-Pérez et al. [26] highlighted that while long-term averages appear to be stable, there is significant temporal variability in the distribution of events, and this suggests that long-term aggregated assessments might mask changes in sub-periods. Our aim is to move beyond simple frequency counts, a technical aspect also addressed in the context of wave forcing spread by [27], to evaluate the evolving spatial and energetic intensity of SSWs using TEA and MPS metrics while using various threshold criteria for SSW definition. Furthermore, in the content of our main objective, the quantitative assessment of the QBO–SSW coupling through a comparison of stratospheric variability at 10° N (serving as a tropical QBO proxy) and 60° N (polar response) is performed to identify whether model biases are rooted in tropical forcing or its high-latitude transmission. In this regard, the current work is significantly different than the studies of [24,25,26] and places more emphasis on the structural and energetic sensitivity analysis of threshold criteria effects on SSW analysis.
The dataset obtained using the Coupled Model Intercomparison Project Phase 5 (CMIP5) model simulations and high-resolution ERA5 reanalysis is detailed in Section 2. Section 3 presents the methodology, including statistical diagnostics and the TEA metric for SSW detection. Section 4 analyzes the comparative outcomes of past and future SSW characteristics while evaluating the quantitative link between tropical forcing and the polar vortex. Lastly, Section 5 and Section 6 offer a discussion of the findings and a conclusion.

2. Data

The datasets used in this study are obtained from ERA5 and from the CMIP5 under the Representative Concentration Pathway 4.5 (RCP4.5) scenario analysis. The RCP4.5 scenario assumes that greenhouse gas emissions stabilize in the second half of the 21st century. ERA5 provides relatively high spatial (0.25° × 0.25°) and temporal (hourly) resolution data that are used as the reference for the historical period (1980–2005) and as the verification reference dataset [28]. Recent studies have validated the utility of ERA5 reanalysis data for developing atmospheric models with parameter correction methodologies [29,30]. While ERA5 is recognized for reliably capturing planetary wave activity [31], addressing model-specific biases remains essential for a rigorous evaluation of simulated stratospheric variability [32].
The CMIP5 models provide daily outputs for both the historical (1980–2005) and future (2006–2100, RCP4.5) periods. Table 1 provides a technical comparison of the five models initially screened for this study. While all five models were considered, our methodology prioritizes models based on their vertical architecture, as detailed below. Models described as follows: M1 (ACCESS1-3) developed by the Australian Bureau of Meteorology and CSIRO, M2 (HadGEM2-CC) from the UK Met Office, M3 (MPI-ESM-MR) from the Max Planck Institute for Meteorology in Germany, M4 (GFDL-ESM2G) from NOAA/Geophysical Fluid Dynamics Laboratory in the United States, and M5 (FGOALS-g2) from LASG/IAP in China [33].
Despite the emergence of CMIP6, the CMIP5 model ensemble remains a suitable framework for this study as it provides a reliable baseline for diagnosing deep-seated structural and energetic biases. This selection is supported by recent literature, such as Rao and Garfinkel (2021) [25], which demonstrates no statistically significant differences in SSW frequency or basic characteristics between CMIP5 and the Coupled Model Inter-comparison Project Phase 6 (CMIP6). Most importantly, studies by Hall et al. (2021) [34] and Karpechko et al. (2024) [35] confirm that systemic biases in stratospheric polar vortex variability have remained essentially unchanged across model generations. Furthermore, current findings by Martínez-Andradas et al. (2025) [27] highlight that CMIP6 models continue to exhibit large spreads and persistent biases in SSW wave forcing and intensity. Therefore, utilizing CMIP5 allows for an objective comparison with established literature while addressing fundamental challenges common to both ensembles. Future work will extend this diagnostic framework to include the CMIP6 ensemble.
M1, M2, and M3 are designated as the primary group due to their extended vertical domains and enhanced resolution in the middle-to-upper stratosphere. These structural advantages facilitate a more realistic simulation of vertical wave–mean flow coupling, which is fundamental for capturing both the life cycle of SSW events and the periodic variability of the QBO. In contrast, M4 and M5 are assigned to the secondary group because their shallower model tops restrict their capacity to consistently replicate QBO–SSW interactions (Table 1). This methodological focus prevents potential bias in the analysis stemming from ‘low-top’ models, which are known to systematically underestimate stratospheric variability because of their inability to resolve vertical wave–mean flow coupling. Accordingly, results from M1, M2, and M3 constitute the basis of the main interpretation, whereas M4 and M5 remain beyond the scope of the present analysis.
The classification of models follows earlier evaluations of CMIP5 models [36,37] and emphasizes the importance of model top height in the simulation of the stratospheric variability. The models are selected based on their temporal coverage, data availability, and ability to resolve pressure levels up to 10 hPa, providing high-resolution daily data for the detection of SSW events [18].

3. Method

To investigate SSW events in the stratosphere, daily averages, variability, and long-term trends of Ts (STs) and Uh (SUh) at the 10 hPa pressure level for the NH are examined. The data consists of ERA5 reanalysis and simulations from five CMIP5 models (M1–M5), but only the first three are utilized here due to the selection of their capability for detecting SSWs. The ERA5 dataset (originally at hourly resolution) is averaged to daily means to match the model time resolutions.
The current study covers two time periods: (1) the historical time- period from 1980 to 2005, and (2) the future time period from 2006 to 2100 under the RCP4.5 emission scenario. SSW detection relies on the TEA method, where a 30 K anomaly threshold should be exceeded for at least six consecutive days. Detected events are classified as minor, major, or extreme events. MPS metrics, which are related to both MPD and MPA, are used. More than 60% of SSWs typically occur during January–February; therefore, the analysis focuses on the November–April window to capture the full seasonal range of occurrences. The SSW characteristics related to different QBO phases are also emphasized in the analysis. A description of the TEA method and its metrics is provided in the following section.
Following earlier classifications [23,37], the models used in this work are grouped as higher-top or lower-top models according to the vertical extent of the pressure height. Higher-top models (M1, M2, and M3) extend into the upper stratosphere or lower mesosphere (above ~1 hPa), allowing a more realistic representation of planetary wave propagation, vortex breakdown, and associated SSW dynamics. Lower-top models (M4 and M5) terminate at 10–30 hPa, which limits vertical wave–mean flow coupling and leads to a systematic underestimation of stratospheric variability. Although M1 is included in the analysis, it should be noted that, in comparison to higher-top models, its lower top may result in an underestimation of major and extreme events.
On the other hand, due to their limited capacity in resolving stratospheric wave–mean flow interactions and the resulting insufficient number of detected SSW events for reliable statistical evaluation, M4 and M5 are removed from the analysis. Consequently, while M1 is retained for general frequency and time-series comparisons (Section 4.1, Section 4.2 and Section 4.3), its limited capacity to resolve coherent thermal–dynamical feedback necessitates its exclusion from the detailed amplitude and coupling diagnostics presented in Section 4.4, Section 4.5 and Section 4.6. This selective approach ensures that the high-fidelity assessment of tropical–polar coupling is based on models with adequate vertical resolution, prioritizing physical consistency over model quantity.

3.1. Profiling Temperature Anomalies for SSW Detection Using the TEA Method

To obtain SSW events, the analysis used the TEA technique [2], where ΔTs must remain above 30 K for at least six consecutive days. This approach was originally introduced by [23] and then was subsequently adapted by the studies of Palmeiro et al. [38] and Garfinkel et al. [39]. In the analysis, the 30 K threshold at 10 hPa is utilized to identify large-scale stratospheric warmings events, and that criteria were consistent with earlier studies [9,17,40].
Following established definitions based on McInturff [9], Labitzke [17], Baldwin et al. [40], and Baldwin & Thompson [41], SSWs are identified when Ts anomalies at 10 hPa exceed 30 K. This threshold ensures consistency with prior studies while isolating large-scale warming episodes. As outlined in Table 2, Ts anomalies are derived by subtracting the long-term climatic average of Ts (Tsa) values from the daily mean Ts.
To account for model-specific mean-state biases, the baseline climatology is established independently for ERA5 and each CMIP5 simulation, a procedure essential for evaluating SSW events in climate models [32]. For the historical period (1980–2005), anomalies are calculated relative to the daily mean of that specific timeframe. Similarly, future anomalies (2006–2100) are derived from a distinct baseline established over the entire scenario window. For instance, the anomaly for 1 January 2050 is obtained by subtracting the climatological mean of all January 1st data points within the 2006–2100 period. This period-specific baseline approach isolates episodic dynamical warmings from their respective ‘climate normals’. By treating the historical and future eras independently, we ensure that anomalies represent episodic dynamical forcing rather than the background signal of long-term greenhouse-induced stratospheric cooling. This use of fixed, period-specific baselines aligns with standard diagnostic choices in major multi-model assessments [24], providing a consistent reference level to evaluate changes in dynamical variability relative to the projected mean state of the 21st century stratosphere.
The calculated anomalies that are used to detect SSW events are provided in Table 2. The TEA method distinguishes between persistent SSW events and fluctuations that do not meet the duration criteria. Then, the temporal length and geographic extent of each event are assessed separately. When ΔTs < 30 K, the event is marked as not being an SSW event. This aligns with previous studies of Charlton & Polvani [15] and Blume et al. [42]. The start and end dates are assigned for all SSW events in the analyzed period. After SSW detection, the latitudinal and longitudinal coordinates are used to calculate the surface area influenced by anomalies exceeding the threshold values.
The spatial coverage of each SSW event is determined using surface areas of grid cells with a 30 K threshold, and then they are summed. The sum of these areas provides the daily TEA criteria value. These computations are used for the quantification of the daily spatial extent of the SSW.

3.2. MPD, MPA, and MPS Metrics

Standardized metrics of MPD, MPA, and MPS are used to evaluate SSW events in both observations (ERA5) and numerical model simulations (M1–M3). Table 3 presents the frequency, magnitude, and spatial distribution of SSWs. These metrics are derived using the TEA method and jointly account for the duration and spatial coverage of each event. MPD refers to the number of consecutive days with ΔTs > 30 K, while MPA represents the average daily surface area affected by the ΔTs > 30 K threshold [2].
MPS, representing the magnitude of each warming event in space and time, is calculated from the product of MPD and MPA. This metric allows us to discriminate between events with different durations and spatial extents. SSW episodes are then categorized as minor (<90 × 106 km2.day), major (90–180 × 106 km2.day), or extreme (>180 × 106 km2.day) events according to the MPS values [2]. Table 3 shows the sampling days from the model simulations and ERA5 analysis that cover the 1980–2005 time period. This classification provides frequency and intensity of SSWs as well as stratospheric variability and model accuracy. Trend analyses on the Ts and Uh are also performed to support the areal coverage statistics for (i) all days, (ii) SSW days, and (iii) each intensity classification level (minor, major, and extreme events).
The TEA method is used to analyze the relationship between Uh variability and Ts-based SSW types. The last phase of the analysis used is the Uh data averaged between ±10° N latitude at the 10 hPa level to focus on the QBO. This analysis considers the QBO features, including seasonal patterns, amplitude variations, phase transitions, positive and negative phases, and structural variations between models.
To investigate oscillation behavior and possible connections between QBO and SSW events, time series analyses are utilized. In addition, to quantify intra-annual variability and long-term changes in Ts and Uh, three statistical diagnostics were derived from the monthly mean series. First, range-based variability for Ts (Trng) and Uh (Urng) was computed using a 12-month moving averaging window as the monthly maximum–minimum spread divided by 12. Second, the amplitude of variability for Ts (Tamp) and Uh (Uamp) was defined as the standard deviation (sd) of the same 12-month windowed values. Third, long-term tendencies were expressed through trend slopes, defined in their basic finite-difference form as
S = Δ X Δ t = X t 2 X t 1 t 2 t 1 ,
where X denotes the respective variable (Ts or Uh). For robustness, we also estimated slopes using ordinary least-squares (OLS) regression on monthly means; these trends are denoted as STs and SUh, reported in K.yr−1 and m.s−1 yr−1, respectively. These specific notations (Trng, Urng, Tamp, Uamp, etc.) are used consistently throughout the analysis.

4. Results

This section evaluates SSW events with respect to their frequency, energy intensity, and temporal evolution, while examining their relationship with the QBO. The results are provided for two distinct periods: (1) the historical era (1980–2005) and (2) the future climate (2006–2100) under the RCP 4.5 emission scenarios. The occurrence of SSWs is related to QBO phase transitions, seasonal patterns, and long-term variability, and enables a comprehensive characterization of stratospheric dynamics. The results are prepared to show the differences between the ERA5 reanalysis (as reference) and CMIP5 model simulations.
The following subsections present detailed results from the selected model outputs and time series analysis, focusing on the Uh and Ts parameters.

4.1. Selection of Numerical Models

To evaluate the CMIP5 models’ response to SSWs, the frequency and total duration of events are compared against ERA5 (Figure 1). In the historical period (1980–2005), ERA5 identified 29 SSW events lasting a total of 337 days. Of these, 37% were classified as minor events (126 days), 17% as major events (57 days), and 46% as extreme events (154 days). The mean and sd of event duration were 11.6 ± 4.1 days, highlighting significant variability in event persistence. As clearly illustrated in Figure 1, the M4 and M5 models (secondary group) demonstrate a systematic failure to capture SSW dynamics, identifying fewer than three events throughout the historical period. This deficiency is primarily attributed to their low-top configurations, which limit their ability to resolve vertical wave–mean flow coupling and stratospheric variability. Due to this limited capacity and the resulting insufficient number of events for robust statistical evaluation, M4 and M5 were excluded from the detailed analytical diagnostics. Consequently, the primary analysis focuses on the high-top models M1, M2, and M3, which better simulate the dynamical and thermodynamical processes of the stratosphere. The November–April window was selected for the primary analysis as it encompasses more than 90% of total SSW days in both ERA5 and the models, providing a statistically representative seasonal range.
Figure 2 shows the temporal and seasonal distribution of SSWs during the historical period (1980–2005) for ERA5 and models (M1–M3), while Figure 3 shows the corresponding distributions under the RCP 4.5 scenario (2006–2100). In the analysis, M1, M2, and M3 models are selected for SSW analysis because of their better handling of the dynamical and thermodynamical processes that are provided below. This time frame was chosen to provide a consistent physical and statistical framework for evaluating how well the models capture mid-winter warming events.
The 62% of the SSWs based on ERA5 analysis occurred in January–February. Of these, 31% occurred in November–December and 7% in March–April. M1 closely reproduced ERA5’s pattern: corresponding values are found to be 58%, 34%, and 8%. M2 generated these corresponding values as 49%, 39%, and 12%. M3 produced 64% for January–February, 28% for November–December, and 8% for March–April, matching the observed seasonal phases most closely. Results suggested the mean event duration for M1, M2, and M3 were 9.2 ± 3.3, 8.7 ± 3.7, and 11.0 ± 3.9 days, respectively. Overall, compared to ERA5 analysis, it is found that M1 and M3 simulations reproduce the observed seasonality within ± 5% accuracy, whereas M2 exhibits a systematic bias toward short-lived and early winter warmings.
In summary, the comparison between historical (1980–2005) and future (2006–2100) periods under the RCP 4.5 scenario indicates that the frequency of SSW events remains relatively stable. No statistically significant increasing or decreasing trend was observed across the M1, M2, and M3 ensembles, suggesting that while the climate warms, the occurrence rate of these events does not deviate substantially from historical averages. This finding is in line with the statistical projections provided by Rao and Garfinkel (2021) [25].
However, while the recent literature indicates that SSW frequency shows little robust change under future scenarios, our results provide evidence that future climate change is closely associated with significant systematic biases in the simulated physical and dynamical intensity of these events. These biases, characterized by a 61% to 82% underestimation of warming magnitude, demonstrate that the primary impact of climate change in models is the misrepresentation of the energetic structure of SSWs, rather than a shift in their occurrence rate.
Future projections under RCP 4.5 (Figure 3) show how the simulated SSW seasonality evolves through the 21st century. Based on the same identification and classification criteria applied throughout history, it appears that the events are limited to the November–April period. M1 shows 53.4% of events in January–February months, 27.2% in November–December months, and 18.4% in March–April months, which is similar to the historical period, with a mean duration of 11 ± 5.1 days. M2 simulates 67.1% for January–February, 0% for November–December, and 32.9% for March–April, showing an early winter shift of approximately 10% and producing the shortest mean event duration of 10.7 ± 3.7 days. M3 maintains 65.8% for January–February, 26.1% for November–December, and 8.1% for March–April, with a mean event duration of 9.8 ± 3.4 days.

4.2. Temperature Time Series

In this section, Ts at 10 hPa 60° N latitude is plotted and interpreted using ERA5 reanalysis and CMIP5 models (M1–M3), along with historical period and RCP 4.5 projection time series.

4.2.1. Historical Period of Ts

Daily temperature series at 10 hPa at 60° N latitude for the period 1980–2005 are analyzed to assess how effectively three CMIP5 models (M1–M3) reproduce the observed stratospheric thermal variability (Figure 4). Statistical assessments of Ts, sd, percentile ranges (Pr), and STs are summarized in Table 4.
According to ERA5 (Figure 4a), the polar stratosphere exhibits a weak but persistent cooling trend throughout the historical period. However, it displays significant seasonal and interannual thermal variability. As shown in Table 4, temperature variance decreases by approximately 46% during SSW events compared to non-SSW periods. This decrease confirms that temperature variability forms during midwinter warmings, indicating a significant thermal response.
Numerical simulations are applied using M1, M2, and M3. M1 (Figure 4b) overestimates the thermal range, producing a higher sd with a wider Pr than ERA5. It also overestimates the long-term cooling trend by about 2.9 times. M2 (Figure 4c) suppresses thermal variability with a narrower Pr. However, it overestimates the cooling rate with an inconsistent representation of stratospheric dynamics. M3 (Figure 4d) results are more comparable with ERA5 results and closely reproduce the magnitude of the mean Ts, sd, and the long-term cooling trend (Table 4).
In summary, all models successfully capture the thermodynamic signature of SSWs. The reduction in Ts and sd for all datasets ranges from 44% to 54% (Table 4). This confirms that SSW events play an important role in deriving polar cooling processes.

4.2.2. RCP 4.5 Future Climate Scenarios of Ts

In this section, the simulations are analyzed to determine how RCP 4.5 forcing modifies the characteristics of SSWs that include Ts, sd, Pr, and STs (Table 5). As shown in Table 5, long-term cooling trends remain weak and negative, suggesting that polar stratospheric cooling will continue moderately throughout the 21st century.
M1 (Figure 5a) closely maintains its historical thermal structure but suggests significant stability in the long-term trend. The rate of cooling is weakened by about 76% compared to the historical baseline. In contrast, M2 (Figure 5b) shows slightly more variability with a narrower Pr. Similar to M1, the cooling rate is 60% weaker than the historical prediction. The M3 (Figure 5c) simulations for the climate scenarios are found to be similar to the historical ERA5 climate. The reduction in Ts variance during SSW events is estimated at about 49–52% for all models (Table 5), and the metrics are found to be comparable to each other. This consistency reveals that the polar vortex continues to experience events of warming with a consistent thermal structure of the historical period.

4.3. Zonal Wind Time Series

In this section, Uh at 10 hPa 60° N latitude are plotted and interpreted using ERA5 reanalysis and CMIP5 models (M1–M3), along with historical period and RCP 4.5 projection time series. SSW characteristics will be discussed using Uh characteristics similar to the structure given for Ts in the previous section.

4.3.1. Historical Period of Uh

In this subsection, daily Uh time series at the 10 hPa level above 60° N latitude for the period 1980–2005 were analyzed to assess the effectiveness of CMIP5 models (M1–M3) to produce SSW dynamical variability and evaluate the polar night jet (Figure 6). The assessment, using mean Uh, sd, SUh, and Pr representing dynamic amplitude, is presented in Table 6.
According to ERA5 (Figure 6a), polar stratospheric winds exhibit significant seasonal and interannual variability throughout the historical period, accompanied by significant slowing of westerly winds. The observed dynamics are a direct response to SSW activity. As shown in Table 6, Uh variance during SSW events decreases by 24% compared to non-SSW periods.
When the M1 model is examined (Figure 6b), the polar night jet’s intensity is overestimated, producing an amplified range of variability compared to ERA5 results. It also simulates a spurious strengthening of westerly Uh which is contrary to the observed deceleration (Table 6). M2 (Figure 6c) reproduces the dynamic amplitude and mean state reasonably well but also fails to capture the long-term trend direction as indicated by M1. M2 shows a systematic trend reversal, resulting in an artificial westerly Uh acceleration of the polar night jet. M3 (Figure 6d) provides the results closest in agreement with ERA5 simulations, reproducing the magnitude and variability structure of the mean Uh with the highest success among all models (Table 6). However, M3 shows a positive trend of Uh variation, but its value is weaker than that of M1 and M2.
Despite the inconsistencies in long-term trends, all three models capture the dynamic signature of SSWs. The reduction in Uh variance is found to be between 13 and 54% (Table 6). This confirms that vortex breakdowns lead the stratosphere into a state of reduced dynamic variability. However, the magnitude of this clustering varies significantly depending on the model.

4.3.2. RCP 4.5 Future Climate Scenarios of Uh

This subsection presents future climate scenarios based on SSW characteristics. Under the RCP 4.5 scenario, daily Uh time series at 10 hPa and 60° N for the period 2006–2099 are analyzed to assess how the predicted circulation changes are comparable to historical polar night jet stream intensity (Figure 7). The metrics of Uh, sd, Pr, and SUh are summarized in Table 7.
In M1 simulations (Figure 7a), the mean Uh shows increasing variability, with a slightly wider Pr (70.44 m.s−1), at 8.5% compared to the historical average value. However, in the long-term trend, SUh weakens at about 82% compared to the historical period, becoming almost zero (Table 7). In M2 simulations (Figure 7b), the average Uh shows a modest 4.2% increase compared to its historical value, but the Pr range decreases. SUh shifts from a historically positive value to a weakly negative value (−0.004 m.s−1.yr−1). This represents a large relative decrease in the trend value that indicates the complete disappearance of the historical strengthening trend (Table 7). In M3 simulations (Figure 7c), an average circulation is simulated with a slightly narrow amplitude range, which remains unchanged from its historical value. The SUh trend weakens at about 48%, but it is not statistically significant (Table 7). M3 continues to exhibit distinct behavior during extreme events, such as winds, during which SSW days increase significantly when they are compared to the historical baseline value during SSW days.
Overall, regarding the dynamic response to SSWs, variability continues to narrow across the models compared to non-SSW conditions. As shown in Table 7, the variance reduction ranges from 4% to 43%, indicating that mid-winter warming remains abrupt and dynamically consistent under moderate future forcing. Under RCP 4.5 scenarios, each model preserves the underlying Uh climatology while clearly reducing the historical strengthening trends of the polar night jet. Specifically, none of the RCP 4.5 slopes deviates significantly from zero, implying a statistically stationary mean flow behavior throughout the 21st century. M3 provides the most consistent statistical representation of polar night jet variability as suggested by ERA5, while M1 provides a stronger mean flow. On the other hand, M2 provides weaker but more stable circulation.

4.4. Stratospheric Temperature Amplitude and Range

As justified in Section 3, while models M1, M4, and M5 were utilized for general frequency comparisons, the following high-fidelity assessments of stratospheric amplitude and thermal–dynamical coupling (Section 4.4, Section 4.5 and Section 4.6) focus exclusively on models M2 and M3 to ensure physical consistency in capturing the magnitude of stratospheric variability. The Tamp and Trng are evaluated to highlight the SSW-related thermal variability based on daily temperature values shown in Figure 4 and Figure 5, respectively. Table 8 summarizes the mean values of Tamp and Trng, their sd, and STs for the ERA5 reanalysis and CMIP5 models (M2 and M3) at 10° N and 60° N during the historical and future periods. The daily values of the above parameters are averaged over the targeted years to obtain monthly averages.
In the tropical stratosphere (10° N), ERA5 analysis effectively characterizes thermal variability as stable, with low sd and a negligible long-term of Ts; however, CMIP5 models exhibit deviations. M2 underestimates Trng by approximately 21% (Table 8). M3, contrary to previous evaluations, also underestimates these metrics, capturing about 91–92% of the observed variability, showing a much closer agreement with ERA5 than M2 but with a slight negative bias.
In the polar stratosphere, the observed Tamp variability at 60° N increases significantly, enhancing wave activity and polar vortex dynamics associated with SSWs. ERA5 exhibits the strongest variability at Trng of 0.730 K and Tamp of 3.159 K. However, all models struggle to reproduce this polar Tamp magnitude. Table 8 shows that M2 captures only ~20% of the observed Trng and underestimates Tamp by ~81%. This suggests that polar Tamp variability is reduced significantly. M3, similarly to M2, struggles to reproduce the polar magnitude, resolving only ~26% of Trng and underestimating Tamp by ~75%. Based on the long-term evolution of Tamp, linear trends remain small for all data, reflecting model-dependent suppression of the processes rather than the real climate change signal. Under the RCP 4.5 future climate scenario, the Tamp variability structure is usually conserved for all latitudes. Deviations in projected Tamp and Trng are found to be minimal for both models, varying by less than 4% relative to historical baselines. Furthermore, future linear trends with STs < |0.001| are found to be negligible.
In summary, ERA5 exhibits the strongest Ts variability across all latitudes (Table 8). In contrast, M2 systematically underestimates both Tamp and Trng at about 20% and 80%, respectively, whereas M3 underrepresents polar amplitude by approximately 50%. Long-term STs values are found to be insignificant, indicating that there is no significant change in the SSW occurrence conditions. These finding suggest that while the tropical–polar contrast is preserved under RCP 4.5, CMIP5 models’ simulations cannot produce the magnitude of Ts variability, indicating limitations of model vertical resolution and wave–mean-flow coupling.

4.5. Zonal Wind Amplitude and Range

The Uamp and Urng variables for Uh are analyzed at 10° N and 60° N latitudes at 10 hPa to evaluate the dynamical variability associated with SSW events, based on the daily evolution shown in Figure 6 and Figure 7. This comparison between tropical and polar dynamics provides a quantitative basis for evaluating the QBO–SSW coupling. Table 9 provides statistical measures, including the mean, median, sd, Urng, Uamp, and long-term tendencies for the ERA5 reanalysis and CMIP5 model simulations (M2 and M3) during the historical and future periods.
In the tropical stratosphere (10° N), which acts as a proxy for the tropical QBO signal, ERA5 show an average Urng of 3.021 m.s−1 with a weak positive multi-decadal SUh. The models show contrasting abilities in reproducing these dynamics (Table 9). M2 follows up the ERA5 trend closely, with a ~2% and 8% difference in Urng and Uamp, respectively. In contrast to previous expectations, M3 also aligns reasonably well with ERA5 in the tropics, showing only a slight overestimation of ~1.5% in Urng and ~5.6% in Uamp. M2 has stable long-term trends found in ERA5 analysis, while M3 exhibits a clear increase in SUh which is consistent with its pronounced westerly bias.
In the polar stratosphere (60° N), ERA5 analysis shows a stronger dynamic variability, and that reflects the active wave–mean flow interactions related to mid-latitude westerly winds. Neither the M2 nor the M3 models can capture the full magnitude of this polar variability (Table 9). M2 simulations underestimate the Urng value at about 60% and captures only 36% of the observed Uamp, and this indicates a significant suppression of polar jet dynamics. The discrepancy between the relatively lower bias in tropical amplitudes and the significant underestimation at 60° N suggests that the model deficiency is primarily associated with the representation of dynamic teleconnections linking tropical forcing to the polar stratosphere. M3 simulation performs similarly to M2 in this updated analysis, capturing only ~39% and ~37% of Urng and Uamp, respectively. ERA5 shows a slight strengthening of SUh (positive) in polar westerly Uh. Both M2 and M3 models simulate negative trends at −0.0018 and −0.0080 m.s−1 decade−1), respectively, but fail to reproduce the observed strengthening of the polar Uh.
Under the RCP 4.5 future climate scenarios, the dynamic structure, represented with Uh metrics, remains generally stable across all latitudes. The changes in Uamp and Urng (Table 9) are less than 4% of the historical baselines. In the tropics, M2 shows a slight increase in variability (~1%) but M3 shows a small decrease (~6%). At 60° N, both models retain their historical biases while M2 continues to significantly underestimate Uamp (~4.1 m.s−1) compared to ERA5 (~11.4 m.s−1). The model-based projected long-term trends become statistically insignificant because SUh becomes close to zero. This suggests a lack of robust dynamic evolution in Uh variability throughout the 21st century.
In summary, ERA5 analysis exhibits a stronger dynamic variability compared to CMIP5 models, particularly in the polar region (Table 9). M2 systematically underestimates both Urng and Uamp by at about 40–70%, but M3 overestimates the tropical variability of Uamp while underrepresenting polar Uamp by about 25%. Discrepancies in the sign and magnitude of long-term Uamp trends indicate that the models inadequately capture the dynamic response of the polar night jet to interannual dynamical forcing. These persistent deviations in both historical and future simulations likely reflect structural limitations in vertical resolution as well as wave-flow coupling conditions, but this needs to be verified in future studies. Overall, these results imply that while models generate tropical forcing, the primary systematic error lies in the dynamic coupling necessary to propagate these signals poleward.

4.6. Thermal–Dynamical Coupling Analysis

To investigate the thermal–dynamical coupling in the stratosphere, scatter plots between daily mean Ts and Uh at 10 hPa are examined at 60° N latitude for both historical and future periods. Figure 8 corresponds to the historical period, and Figure 9 depicts projections under the RCP 4.5 scenario.
At 60° N latitude, ERA5 analysis (Figure 8a) shows that Uh increases with decreasing Ts, suggesting that warmer Ts is associated with weaker westerlies (or easterlies), whereas decreasing Ts is related to stronger westerlies. Both fits have similar slopes, separated at lower temperatures. The fit for the pre-1990 data had a slope of −1.62 m.s−1 K−1, r = −0.59; R2 = 0.35. After 1990, the slope weakens to −1.13 m.s−1 K−1 (r = −0.49; R2 = 0.24), corresponding to a roughly 30% reduction in magnitude. Overall, ERA5 shows a persistent inverse coupling that weakens in the later decades.
At 60° N latitude, M2 (Figure 8b) exhibits a much stronger and more linear inverse Ts–Uh relationship than ERA5. Both the pre- and post-1990 subsets yield regression slopes close to −2.0 m.s−1 K−1 (r ≈ −0.82; R2 ≈ 0.65–0.68), with only minor differences between the two periods, indicating that the model maintains a persistently intense anticorrelation rather than the weakening seen in ERA5. The scatter of data points is tightly clustered along the regression axis, especially in the 220–235 K Ts range, where many SSW days occur. In addition, M2 produces a substantial number of cases with Uh < −20 m.s−1, which are rare in ERA5, suggesting that the model overproduces very weak or reversed vortex states and exaggerates the dynamical response to warming anomalies.
In M3 simulations (Figure 8c), the overall Ts–Uh coupling is found to be negative with a weaker coherence than that of both ERA5 and M2. Regression slopes remain almost unchanged between the two periods (−1.40 to −1.42 m.s−1 K−1), implying a stable yet moderately underestimated inverse coupling with no clear post-1990 weakening. The scatter shows a broader spread in Uh (approximately −10 to 40 m.s−1), indicating that M3 simulates a wider range of vortex states but with reduced sensitivity of Uh anomalies to Ts variations. Clustering in the 220–235 K band is less pronounced than in ERA5, and the model underestimates the amplitude of the thermal–dynamical link at about 15–20%.
For the RCP 4.5 projections, results of Ts–Uh coupling related to four successive time segments (i) 2006–2025, (ii) 2026–2050, (iii) 2051–2075, and (iv) 2076–2100 are shown using distinct color coding, with solid trend lines representing the least-squares fit for each period (Figure 9). This consistent visualization enables a direct comparison of reanalysis-based and model-simulated coupling characteristics across historical and future conditions. Results for each time period are provided below.
For time periods i, ii, and iv, M2 exhibits a clear inverse Ts–Uh relationship, with a slope of −1.84 m.s−1.K−1 (r = −0.42; R2 = 0.18) (Figure 9a). The mean Ts is 224.8 ± 6.3 K and mean Uh is 17.4 ± 19.2 m.s−1, indicating a conical warm–weak jet pattern. The correlation shows that the slope becomes steeper to −2.65 m.s−1.K−1 (r = −0.56; R2 = 0.31) between 2026–2050, representing the period when the vortex is most susceptible to thermal anomalies. In 2051–2075, the relationship weakens substantially, with the slope flattening to −1.32 m.s−1 K−1 (r = −0.27; R2 = 0.07), and by 2076–2100, it decreases further to −0.74 m.s−1 K−1 (r = −0.17; R2 = 0.03). Although the average Ts rose by about 1 K over the time period, the Uh pattern persisted with a predominantly westward orientation. Overall, M2 maintained the correct sign of the coupling. However, towards the end of the century, it appears to indicate a gradual decrease in dynamic strength and consistency.
For time periods ii, iii, and iv, M3 shows a moderate inverse coupling with a slope of −1.21 m.s−1 K−1 (r = −0.36; R2 = 0.13), mean values of Ts = 225.1 ± 6.5 K, and Uh = 20.6 ± 18.7 m.s−1 (Figure 9b). The coupling strengthens during 2026–2050, as the slope steepens to −2.08 m.s−1 K−1 (r = −0.47; R2 = 0.22) and reaches maximum intensity in 2051–2075 (slope = −2.42 m.s−1 K−1; r = −0.52; R2 = 0.27), reflecting the period of strongest polarity between warming anomalies and response. In the final segment (2076–2100), the slope weakens sharply to −0.63 m.s−1 K−1 (r = −0.19; R2 = 0.04), accompanied by increased scatter and higher mean Ts (~227 K). Thus, M3 shows a transient mid-century strengthening followed by a late century weakening, indicating a non-monotonic dynamical response to RCP 4.5 forcing.
Collectively, Figure 8 and Figure 9 demonstrate a robust and persistent inverse coupling between Ts and Uh in the polar stratosphere at 60° N across both the historical and RCP 4.5 periods. In historical records, ERA5 exhibits strong negative slopes and high correlation magnitudes, with a clear weakening after 1990, indicating reduced dynamical coherence of the polar vortex in recent decades. Both CMIP5 M2 and M3 models reproduce the correct sign of the coupling, albeit with distinct biases: M2 consistently amplifies the slope and correlation strength relative to ERA5, whereas M3 captures the overall structure but underestimates the dynamical amplitude.

5. Discussion

This study investigated the relationship between SSWs and QBOs using ERA5 reanalysis and CMIP5 model simulations, both in historical and future periods. Comparison of SSW frequency and intensity between the ERA5 and CMIP5 models revealed important differences, and here they are briefly discussed.
In the RCP 4.5 scenario, M1 and M3 maintained almost constant event frequencies (±5%) but M2 showed a tendency to occur in the early winter months with a decrease of approximately 10%. These differences are consistent with the outputs of CMIP5 models, but vertical resolution and planetary wave propagation characteristics directly affect eddy distortion frequency and intensity. In this respect, CMIP5 simulations need to focus more on turbulent heat fluxes in high resolution mode. Note that both M4 and M5 are not analyzed due to their inability to generate realistic SSWs (# < 3 events).
Analysis of Ts and Uh variability revealed structural biases in how the models preserved the polar vortex rather than the use of event frequency. Historically, ERA5 analysis exhibits a sustained cooling trend with a westward slowing trend. In contrast, M1 misleadingly simulates a strengthening of the polar night jet during which M2 suppresses dynamic variability, systematically reducing Tamp and Urng magnitudes in the polar region at about 60−80%. M3 more accurately captures the variability in the structure of system dynamics and thermodynamics, reproducing approximately 75% of the observed amplitude of Tamp. However, M2 significantly overestimates tropical variability. Furthermore, the reduction in variance during SSW events acts as a key indicator of vortex breaking across the models (~24−54%). This confirms that the thermodynamic response of the SSW event remains robust despite mean-state deviations. These shortcomings in reproducing the magnitude of polar variability point to limitations of the models representing intrinsic dynamic forcing. These persistent biases reflect broader challenges identified in both CMIP5 and CMIP6 ensembles regarding stationary wave driving [34,35]. While high-top configurations in newer model generations show improvements in stratospheric dynamics [43], low-top models like M4 and M5 remain essential for demonstrating the structural sensitivity of the polar vortex to model vertical extent.
The QBO processes are analyzed in determining the frequency and intensity of SSWs due to their effect on planetary wave propagation and background horizontal wind shear. Both QBO− and QBO+ phases affect the upward transmission of Rossby waves into the polar vortex, altering the vertical refractive index of the tropical stratosphere [4,44]. Previous studies using reanalysis have shown that SSWs occur twice as often in QBO− phases when compared to QBO+ phases [10,15,45]. This reflects that when upward wave flow increases, the polar vortex is further disrupted. In the CMIP5 models, QBO dynamics are not explicitly resolved because tropical momentum forcing is parameterized rather than dynamically generated. Of the three selected models, only M2 and M3 exhibit intermittent equatorial wind direction changes resembling semi-biennial behavior. However, these oscillations are weak and irregular. M1 shows a limited semi-annual oscillation in the lower stratosphere. This is a known limitation of the upper–lower CMIP5 configurations [36]. Because this study focuses on only the 10 hPa level, direct QBO–SSW coupling cannot be retrieved because the QBO is not dynamically expressed. Therefore, the interpretation relies on established dynamical pathways and previous intercomparisons.
M2 predicts fewer SSW days and a narrower Urng, while M3 shows a modest upward trend with slightly stronger event classification. The contrast between the monotonic attenuation in M2 and the non-monotonic transient peak in M3 highlights significant structural ambiguities in the representation of future wave–mean flow feedback under radiative forcing. Phase-space analyses of Ts–Uh relationships further emphasize these contrasts, showing a weakening of the post-1990 inverse correlation in ERA5 (r ≈ −0.8). This is partly captured by M2 simulations but remains inconsistent with M3. This inconsistency suggests that CMIP5 models may have overestimated the dynamic stability of the polar vortex, as they generally support a strong linear response, in contrast to the recent ‘decoupling’ seen in the reanalysis data.
The representation of stratospheric variability in CMIP5 models remains persistent, and significant differences occur compared to the ERA5 reanalysis. M1 underrepresents strong SSW events; M2 underestimates overall both dynamical and physical variabilities. On the other hand, M3 reproduces the observed temporal structure of both Uh and Ts metrics. However, specifically in the polar region, it significantly underestimates the magnitudes of both Tamp and Uamp, similarly to M2. The systematic underestimation of SSW intensity in models has direct implications for projected surface weather impacts. Since the strength of the downward coupling is physically linked to the magnitude of the stratospheric warming, an 80% bias in temperature anomalies suggests that models may fail to capture the full severity of mid-latitude cold spells and storm track shifts that typically follow these events. Therefore, although the frequency of SSWs is projected to be stable, their simulated impact on surface weather conditions is likely underestimated due to these persistent dynamical biases. Additionally, the energetic influence of SSWs extends into the upper atmosphere, potentially modulating broader oscillations such as the Semi-Annual Oscillation (SAO) [46]. ERA5 therefore provides a consistent observational benchmark for assessing these differences in both historical and future contexts but better observations covering the upper atmosphere based on in situ measurements, and satellite data can improve the outcome of this work. These results suggest a need to improve the model physics of QBO–SSW interaction, planetary wave propagation dynamics, and mean flow interactions, to increase the reliability of future stratospheric climate projections.

6. Conclusions

This study analyzes the variability of SSWs at 10 hPa in the NH using ERA5 reanalysis and five CMIP5 models, two of which are found to be unsuccessful in capturing SSW dynamics. The findings for the historical and future periods are summarized below.
(1)
Frequency: Under the RCP 4.5 scenario, the frequency of SSW events is projected to remain stable throughout the 21st century, showing no significant trend.
(2)
Predictability and Impact: The major challenge for future projections lies not in the number of events, but in their simulated magnitude. The systematic variability identified in SSW intensity and its projected changes are closely linked to the specific criteria and metrics employed for detection. This dependency directly influences the accuracy of Arctic vortex intensity forecasting and subsequent mid-latitude weather predictions, particularly where model intensity deficits reach 61–82%. Such deficits indicate a critical weakness in simulating the dynamical energy of polar vortex disruptions, which is essential for accurately forecasting their subsequent impacts on surface weather patterns [5,6].
  • The systematic failure of M4 and M5 to capture SSW events is attributed to their low-top configurations and limited vertical resolution [34,35].
  • M1 captures the seasonal cycle well but cannot resolve extreme SSW events in detail. While both M2 and M3 show better temporal agreement with ERA5 analysis, they exhibit structural deviations in the amplitude of both Ts and Uh parameters.
  • At 10° N, the models exhibit a clear discrepancy in thermal variability. M2 underestimates both Tamp and Trng at about 21% and 22% compared to ERA5 analysis, respectively. Similarly, M3 underestimates these metrics by approximately 9% and 8%, indicating a slight damping of tropical thermal variability compared to ERA5. These results indicate a clear failure of the models to capture tropical thermal variability. Under the RCP 4.5 scenario, these internal biases persist; specifically, M3 maintains Ts amplitudes consistently higher than its own historical mean, suggesting a sustained overestimation of tropical dynamics throughout the 21st century.
  • At 60° N, ERA5 reveals strong variability in Tamp and Trng values, with 3.16 K and 0.73 K, respectively. The M2 and M3 models consistently fail to capture the full magnitude of these metrics. M2 and M3 underestimate these values by 80–82% and 74–75% compared to ERA5 analysis, respectively. While M2 and M3 underestimate the historical magnitudes observed in ERA5, their future projections indicate that Tamp remains nearly constant in M2 (1.4% change relative to its historical mean), while M3 shows a modest increase of 5.7%. This suggests that the systematic suppression of polar vortex dynamics is projected to continue under future forcing.
  • At 60° N, the ERA5 Uamp and Urng values are found to be 11.39 m.s−1 and 2.73 m.s−1, respectively. M2 and M3 underestimate these wind speeds of about 61–64% and 61–63%, respectively.
  • Scatter plot diagnostics at 60° N show that ERA5 exhibits a strengthened negative relationship between Ts and Uh after 1990 with a correlation of R ≈ –0.8. This relationship is partially captured by M2 but it is not consistently reproduced by M3.
  • Regarding to polar variability at 60° N, ERA5 shows a strengthening of polar westerlies with a trend of –0.87 m.s−1 per decade. The M1, M2, and M3 models fail to capture this trend and simulate spurious positive trends between 0.34 and 0.80 m.s−1 per decade. Under RCP 4.5 scenario, these trends remain near zero, which suggests a lack of dynamic evolution in future simulations.
  • In historical evaluations, the sensitivity of the Ts–Uh coupling in M3 weakens significantly toward the end of the century, which means there is a decreasing relationship between them. The regression slope drops from −1.42 m.s−1/10 yr to −0.63 m.s−1/10 yr between 2076 and 2100. This change indicates that both M2 and M3 lack the physical processes necessary to accurately capture how the vortex responds to thermal anomalies.
These conclusions suggest that CMIP5 models need to be further improved with respect to their vertical and horizontal resolutions, and the representation of physical and dynamical processes in the upper troposphere, which are more susceptible to climate change processes. It is also suggested that ERA5 analysis results should be further refined using remote sensing observations of stratospheric conditions. This work’s results can also be basis for improving CMIP6 simulations to study a similar content. In addition to these points, several limitations of the current study should be acknowledged in future studies. Firstly, the analysis focuses exclusively on the NH; therefore, the findings may not be directly applicable to the SH, where planetary wave activity and vortex dynamics follow significantly different patterns. Secondly, while the 10 hPa pressure level is a standard reference for identifying SSW events, incorporating a multi-level vertical analysis in future studies could provide a more comprehensive view of the downward propagation of warming signals. Thirdly, the investigation is based on the RCP 4.5 scenario, which represents a moderate future forcing; evaluating these dynamics under more extreme scenarios like RCP 8.5 could further test the sensitivity of the polar vortex to higher radiative forcing. Lastly, although the selected primary models (M1–M3) represent stratospheric processes in detail, persistent structural biases in vertical resolution across climate models remain a factor that requires continued refinement in future studies to better represent fine-scale wave–mean flow interactions.

Author Contributions

The authors contributed the manuscript in following ways: Conceptualization, S.D., I.G. and D.D.; methodology, S.D., I.G. and D.D.; software, S.D.; validation, S.D. and I.G.; formal analysis, S.D.; investigation, S.D., I.G. and D.D.; resources, ECMWF; data curation, S.D.; writing—original draft preparation, S.D. and I.G.; writing—review and editing, O.D.; visualization, S.D. and O.D.; supervision, I.G. and D.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially supported by the Fog and Turbulence in Marine Atmosphere (FATIMA) project, which is funded by the U.S. Office of Naval Research (ONR), USA, with Grant No. N00014-21-1-2296 and by the University of Notre Dame, IN, USA. In addition, this work was also partially completed during Dr. Ismail Gultepe’s visit to the University of Notre Dame, IN, USA.

Data Availability Statement

The datasets analyzed in this study were retrieved from the Copernicus Climate Change Service (C3S) Climate Data Store, which provides access to reanalysis and climate datasets (https://climate.copernicus.eu/ (accesed on 15 January 2024). Software Availability: All data processing was performed using MATLAB R2016b (The MathWorks Inc., Natick, MA, USA). Codes are available in request from the first author.

Acknowledgments

The authors thank the European Centre for Medium-Range Weather Forecasts (ECMWF) for making the ERA5 reanalysis data available. They are also grateful to the Joint Modelling Working Group of the World Climate Research Program, responsible for CMIP, and to the climate modeling groups that produced and made the model outputs available. The authors gratefully acknowledge the computing resources and technical support provided by the National Center for High Performance Computing (UHEM) at Istanbul Technical University (ITU) under grant number 4019242024. The computational analyses presented in this paper were performed using UHEM’s high-performance computing infrastructure through remote access services.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
CASCommission for Atmospheric Sciences
CMIP5Coupled Model Intercomparison Project Phase 5
CMIP6Coupled Model Intercomparison Project Phase 6
ECMWFEuropean Centre for Medium-Range Weather Forecasts
ENSOEl Niño–Southern Oscillation
ERA5Fifth Generation ECMWF Atmospheric Reanalysis
hPaHectopascal (pressure level)
IQSYInternational Quiet Sun Year
MAXMaximum
MINMinimum
MJOMadden–Julian Oscillation
MPAMain Phase Area
MPDMain Phase Duration
MPSMain Phase Strength
NHNorthern Hemisphere
OLSOrdinary Least-Squares
Pr95th–5th variability metric
QBOQuasi-Biennial Oscillation
QBO−Easterly phase of QBO
QBO+Westerly phase of QBO
R2Coefficient of Determination
RCP 4.5Representative Concentration Pathway 4.5 Scenario
SAOSemiannual Oscillation
sdStandard Deviation
SHSouthern Hemisphere
SSWSudden Stratospheric Warming
STsLinear trend of stratospheric temperature (K.yr−1)
SUhLinear trend of zonal wind speed (m.s−1 yr−1)
TampAmplitude of variability for surface temperature (Standard Deviation)
Trng12-month variability range metric (max–min)/12 stratospheric temperature
TEAThreshold Exceedance Area (diagnostic metric)
TsStratospheric temperature (K)
TsaLong-term climatic average of stratospheric temperature (K)
UampAmplitude of variability for zonal wind (Standard Deviation).
UhZonal-mean wind speed (m.s−1)
VARVariance
Urng12-month variability range metric (max–min)/12 for zonal wind
WMOWorld Meteorological Organization
ΔTsChange in temperature anomaly

References

  1. Butchart, N. The Stratosphere: A Review of The Dynamics and Variability. Weather Clim. Dyn. 2022, 3, 1237–1272. [Google Scholar] [CrossRef]
  2. Li, Y.; Kirchengast, G.; Schwaerz, M.; Yuan, Y. Monitoring Sudden Stratospheric Warmings under Climate Change since 1980 Based on Reanalysis Data Verified by Radio Occultation. Atmos. Chem. Phys. 2023, 23, 1259–1284. [Google Scholar] [CrossRef]
  3. Butler, A.H.; Seidel, D.J.; Hardiman, S.C.; Butchart, N.; Birner, T.; Match, A. Defining Sudden Stratospheric Warmings. Bull. Am. Meteorol. Soc. 2015, 96, 1913–1928. [Google Scholar] [CrossRef]
  4. Baldwin, M.P.; Dunkerton, T.J. Stratospheric Harbingers of Anomalous Weather Regimes. Science 2001, 294, 581–584. [Google Scholar] [CrossRef]
  5. Ma, X.; Zhao, F.; Yue, B.; Liu, X. DSMF-Net: A Spatiotemporal Memory Flow Network for Long-Range Prediction of Stratospheric Sudden Warming Events. Atmosphere 2025, 16, 1316. [Google Scholar] [CrossRef]
  6. Xie, J.; Hu, J.; Li, X.; Luo, J.-J.; Xu, H.; Jia, Y. Two Major Sudden Warming Events in the Unprecedentedly Active Stratosphere during the Boreal Winter of 2023/2024 and Their Distinct Surface Impacts over China. Atmos. Res. 2025, 319, 108032. [Google Scholar] [CrossRef]
  7. Van Loon, H.; Jenne, R.L.; Labitzke, K. Zonal Harmonic Standing Waves. J. Geophys. Res. 1973, 78, 4463–4471. [Google Scholar] [CrossRef]
  8. Scherhag, R. Die Explosinonsartigen Stratospharen Warmungen Des Stratintere 1951/52. Deutsch. Wetterdienstes US-Zone Berl. 1952, 6, 51–63. [Google Scholar]
  9. McInturff, R.M. Stratospheric Warmings: Synoptic, Dynamic and General-Circulation Aspects; National Aeronautics and Space Administration, Scientific and Technical Information Office: Hampton, VI, USA, 1978; ISBN 978-0-8156-2252-9.
  10. Baldwin, M.P.; Ayarzagüena, B.; Birner, T.; Butchart, N.; Butler, A.H.; Charlton-Perez, A.J.; Domeisen, D.I.V.; Garfinkel, C.I.; Garny, H.; Gerber, E.P.; et al. Sudden Stratospheric Warmings. Rev. Geophys. 2021, 59, e2020RG000708. [Google Scholar] [CrossRef]
  11. Thompson, D.W.J.; Baldwin, M.P.; Wallace, J.M. Stratospheric Connection to Northern Hemisphere Wintertime Weather: Implications for Prediction. J. Clim. 2002, 15, 1421–1428. [Google Scholar] [CrossRef]
  12. Labitzke, K.; Kunze, M. On the Remarkable Arctic Winter in 2008/2009. J. Geophys. Res. 2009, 114, 2009JD012273. [Google Scholar] [CrossRef]
  13. World Meteorological Organization. International Years of the Quiet Sun (IQSY) 1964–1965: Alert Messages with Special References to Stratwarms; WMO/IQSY Report No 6; Secretariat of the World Meteorological Organization: Geneva, Switzerland, 1964. [Google Scholar]
  14. Johnson, K.W.; Miller, A.J.; Gelman, M.E. Proposed Indices Characterizing Stratospheric Circulation and Temperature Fields. Mon. Weather Rev. 1969, 97, 565–570. [Google Scholar] [CrossRef][Green Version]
  15. Charlton, A.J.; Polvani, L.M. A New Look at Stratospheric Sudden Warmings. Part I: Climatology and Modeling Benchmarks. J. Clim. 2007, 20, 449–469. [Google Scholar] [CrossRef]
  16. Schoeberl, M.R.; Strobel, D.F. The Response of the Zonally Averaged Circulation to Stratospheric Ozone Reductions. J. Atmos. Sci. 1978, 35, 1751–1757. [Google Scholar] [CrossRef][Green Version]
  17. Labitzke, K. Stratospheric-mesospheric Midwinter Disturbances: A Summary of Observed Characteristics. J. Geophys. Res. 1981, 86, 9665–9678. [Google Scholar] [CrossRef]
  18. Baldwin, M.P.; Gray, L.J.; Dunkerton, T.J.; Hamilton, K.; Haynes, P.H.; Randel, W.J.; Holton, J.R.; Alexander, M.J.; Hirota, I.; Horinouchi, T.; et al. The Quasi-biennial Oscillation. Rev. Geophys. 2001, 39, 179–229. [Google Scholar] [CrossRef]
  19. Holton, J.R.; Tan, H.-C. The Influence of the Equatorial Quasi-Biennial Oscillation on the Global Circulation at 50 Mb. J. Atmos. Sci. 1980, 37, 2200–2208. [Google Scholar] [CrossRef]
  20. Toms, B.A.; Barnes, E.A.; Maloney, E.D.; Van Den Heever, S.C. The Global Teleconnection Signature of the Madden-Julian Oscillation and Its Modulation by the Quasi-Biennial Oscillation. J. Geophys. Res. Atmos. 2020, 125, e2020JD032653. [Google Scholar] [CrossRef]
  21. Ermakova, T.; Koval, A.; Didenko, K.; Fadeev, A.; Sokolov, A. Analyzing Stratospheric Polar Vortex Strength and Persistence Under Different QBO and ENSO Phases: Insights from the Model Study. Atmosphere 2025, 16, 1371. [Google Scholar] [CrossRef]
  22. Sunkara, E.; Seo, K.-H.; Mengist, C.K.; Ratnam, M.V.; Niranjan Kumar, K.; Venkata Chalapathi, G. Role of QBO and MJO in Sudden Stratospheric Warmings: A Case Study. Atmosphere 2024, 15, 1458. [Google Scholar] [CrossRef]
  23. Butler, A.H.; Gerber, E.P. Optimizing the Definition of a Sudden Stratospheric Warming. J. Clim. 2018, 31, 2337–2344. [Google Scholar] [CrossRef]
  24. Ayarzagüena, B.; Polvani, L.M.; Langematz, U.; Akiyoshi, H.; Bekki, S.; Butchart, N.; Dameris, M.; Deushi, M.; Hardiman, S.C.; Jöckel, P.; et al. No Robust Evidence of Future Changes in Major Stratospheric Sudden Warmings: A Multi-Model Assessment from CCMI. Atmos. Chem. Phys. 2018, 18, 11277–11287. [Google Scholar] [CrossRef]
  25. Rao, J.; Garfinkel, C.I. CMIP5/6 Models Project Little Change in the Statistical Characteristics of Sudden Stratospheric Warmings in the 21st Century. Environ. Res. Lett. 2021, 16, 034024. [Google Scholar] [CrossRef]
  26. Chávez-Pérez, V.M.; Añel, J.A.; Almaguer-Gómez, C.; De La Torre, L. Temporal Variability of Major Stratospheric Sudden Warmings in CMIP5 Climate Change Scenarios. Climate 2025, 13, 207. [Google Scholar] [CrossRef]
  27. Martínez-Andradas, V.; De La Cámara, A.; Zurita-Gotor, P.; Lott, F.; Serva, F. Quantifying the Spread in Sudden Stratospheric Warming Wave Forcing in CMIP6. Weather Clim. Dyn. 2025, 6, 329–343. [Google Scholar] [CrossRef]
  28. Copernicus Climate Change Service; Climate Data Store. ERA5 Hourly Data on Single Levels from 1940 to Present; Copernicus Climate Change Service (C3S): Bonn, Germany; Climate Data Store (CDS): Reading, UK, 2023. [Google Scholar] [CrossRef]
  29. Li, J.; Wang, Y.; Liu, L.; Yao, Y.; Huang, L.; Li, F. A Grid Model for Vertical Correction of Precipitable Water Vapor over the Chinese Mainland and Surrounding Areas Using Random Forest. Geosci. Model Dev. 2024, 17, 2569–2581. [Google Scholar] [CrossRef]
  30. Li, J.; Li, F.; Liu, L.; Yao, Y.; Huang, L.; Wang, Y. A Weighted Mean Temperature Forecast Model Based on Fused Data and Generalized Regression Neural Network and Its Impact on GNSS-Based Precipitable Water Vapor Estimation. IEEE Trans. Geosci. Remote Sens. 2024, 62, 4101914. [Google Scholar] [CrossRef]
  31. Yang, Y.; Li, H. Planetary Wave Activity During 2019 Sudden Stratospheric Warming Event Revealed by ERA5 Reanalysis Data. Remote Sens. 2024, 16, 4739. [Google Scholar] [CrossRef]
  32. Kim, J.; Son, S.-W.; Gerber, E.P.; Park, H.-S. Defining Sudden Stratospheric Warming in Climate Models: Accounting for Biases in Model Climatologies. J. Clim. 2017, 30, 5529–5546. [Google Scholar] [CrossRef]
  33. Copernicus Climate Change Service; Climate Data Store. CMIP5 Daily Data on Pressure Levels; Copernicus Climate Change Service (C3S): Bonn, Germany; Climate Data Store (CDS): Reading, UK, 2018. [Google Scholar] [CrossRef]
  34. Hall, R.J.; Mitchell, D.M.; Seviour, W.J.M.; Wright, C.J. Persistent Model Biases in the CMIP6 Representation of Stratospheric Polar Vortex Variability. J. Geophys. Res. Atmos. 2021, 126, e2021JD034759. [Google Scholar] [CrossRef]
  35. Karpechko, A.Y.; Wu, Z.; Simpson, I.R.; Kretschmer, M.; Afargan-Gerstman, H.; Butler, A.H.; Domeisen, D.I.V.; Garny, H.; Lawrence, Z.; Manzini, E.; et al. Northern Hemisphere Stratosphere-Troposphere Circulation Change in CMIP6 Models: 2. Mechanisms and Sources of the Spread. J. Geophys. Res. Atmos. 2024, 129, e2024JD040823. [Google Scholar] [CrossRef]
  36. Butchart, N.; Charlton-Perez, A.J.; Cionni, I.; Hardiman, S.C.; Haynes, P.H.; Krüger, K.; Kushner, P.J.; Newman, P.A.; Osprey, S.M.; Perlwitz, J.; et al. Multimodel Climate and Variability of the Stratosphere. J. Geophys. Res. 2011, 116, D05102. [Google Scholar] [CrossRef]
  37. Charlton-Perez, A.J.; Baldwin, M.P.; Birner, T.; Black, R.X.; Butler, A.H.; Calvo, N.; Davis, N.A.; Gerber, E.P.; Gillett, N.; Hardiman, S.; et al. On the Lack of Stratospheric Dynamical Variability in Low-top Versions of the CMIP5 Models. J. Geophys. Res. Atmos. 2013, 118, 2494–2505. [Google Scholar] [CrossRef]
  38. Palmeiro, F.M.; Barriopedro, D.; García-Herrera, R.; Calvo, N. Comparing Sudden Stratospheric Warming Definitions in Reanalysis Data. J. Clim. 2015, 28, 6823–6840. [Google Scholar] [CrossRef]
  39. Garfinkel, C.I.; White, I.; Gerber, E.P.; Son, S.-W.; Jucker, M. Stationary Waves Weaken and Delay the Near-Surface Response to Stratospheric Ozone Depletion. J. Clim. 2023, 36, 565–583. [Google Scholar] [CrossRef]
  40. Baldwin, M.P.; Stephenson, D.B.; Thompson, D.W.J.; Dunkerton, T.J.; Charlton, A.J.; O’Neill, A. Stratospheric Memory and Skill of Extended-Range Weather Forecasts. Science 2003, 301, 636–640. [Google Scholar] [CrossRef]
  41. Baldwin, M.P.; Thompson, D.W.J. A Critical Comparison of Stratosphere–Troposphere Coupling Indices. Q. J. R. Meteorol. Soc. 2009, 135, 1661–1672. [Google Scholar] [CrossRef]
  42. Blume, C.; Matthes, K.; Horenko, I. Supervised Learning Approaches to Classify Sudden Stratospheric Warming Events. J. Atmos. Sci. 2012, 69, 1824–1840. [Google Scholar] [CrossRef]
  43. Serva, F.; Christiansen, B.; Davini, P.; Von Hardenberg, J.; Van Den Oord, G.; Reerink, T.J.; Wyser, K.; Yang, S. Changes in Stratospheric Dynamics Simulated by the EC-Earth Model From CMIP5 to CMIP6. J. Adv. Model. Earth Syst. 2024, 16, e2023MS003756. [Google Scholar] [CrossRef]
  44. Anstey, J.A.; Shepherd, T.G. High-latitude Influence of the Quasi-biennial Oscillation. Quart. J. R. Meteorol. Soc. 2014, 140, 1–21. [Google Scholar] [CrossRef]
  45. Garfinkel, C.I.; Shaw, T.A.; Hartmann, D.L.; Waugh, D.W. Does the Holton–Tan Mechanism Explain How the Quasi-Biennial Oscillation Modulates the Arctic Polar Vortex? J. Atmos. Sci. 2012, 69, 1713–1733. [Google Scholar] [CrossRef]
  46. Zhang, J.; Orsolini, Y.; Sato, K. Modulation of the Semi-Annual Oscillation by Stratospheric Sudden Warmings as Seen in the High-Altitude JAWARA Re-Analyses. Atmosphere 2025, 16, 1320. [Google Scholar] [CrossRef]
Figure 1. Distribution and duration of SSW events from 1980 to 2005 based on ERA5 reanalysis and selected CMIP5 models. Panel (a) shows the number of SSW events categorized as minor, major, and extreme events, along with the total count of data points. Panel (b) presents the cumulative number of SSW days, which reflects both the frequency and persistence of events. The colors correspond to each dataset as follows: ERA5 (green), M1 (purple), M2 (light blue), M3 (dark green), M4 (orange), and M5 (navy blue).
Figure 1. Distribution and duration of SSW events from 1980 to 2005 based on ERA5 reanalysis and selected CMIP5 models. Panel (a) shows the number of SSW events categorized as minor, major, and extreme events, along with the total count of data points. Panel (b) presents the cumulative number of SSW days, which reflects both the frequency and persistence of events. The colors correspond to each dataset as follows: ERA5 (green), M1 (purple), M2 (light blue), M3 (dark green), M4 (orange), and M5 (navy blue).
Forecasting 08 00013 g001
Figure 2. Classification of SSW events at 10 hPa between 1980 and 2005 based on ERA5 reanalysis and CMIP5 historical model data. Panels show the timing and intensity of events across different datasets: (a) ERA5 reanalysis, (b) M1, (c) M2, and (d) M3. Event categories are marked as minor (blue), major (green), and extreme (red), plotted from November to April to reflect the typical seasonal window for the occurrence of SSWs.
Figure 2. Classification of SSW events at 10 hPa between 1980 and 2005 based on ERA5 reanalysis and CMIP5 historical model data. Panels show the timing and intensity of events across different datasets: (a) ERA5 reanalysis, (b) M1, (c) M2, and (d) M3. Event categories are marked as minor (blue), major (green), and extreme (red), plotted from November to April to reflect the typical seasonal window for the occurrence of SSWs.
Forecasting 08 00013 g002
Figure 3. Classification of SSW events at 10 hPa between 2006 and 2100 based on CMIP5 RCP 4.5 data. The panels show the timing and intensity of events across different datasets: (a) M1, (b) M2, and (c) M3. Event categories are marked as minor (blue), major (green), and extreme (red), plotted from November to April to reflect the typical seasonal window for the occurrence of SSWs.
Figure 3. Classification of SSW events at 10 hPa between 2006 and 2100 based on CMIP5 RCP 4.5 data. The panels show the timing and intensity of events across different datasets: (a) M1, (b) M2, and (c) M3. Event categories are marked as minor (blue), major (green), and extreme (red), plotted from November to April to reflect the typical seasonal window for the occurrence of SSWs.
Forecasting 08 00013 g003
Figure 4. Stratospheric temperature (Ts) time series at 10 hPa and 60° N for the period 1980–2005 based on (a) ERA5 reanalysis and CMIP5 historical simulations from (b) M1, (c) M2, and (d) M3. Daily Ts are overlaid with SSW classifications: minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.
Figure 4. Stratospheric temperature (Ts) time series at 10 hPa and 60° N for the period 1980–2005 based on (a) ERA5 reanalysis and CMIP5 historical simulations from (b) M1, (c) M2, and (d) M3. Daily Ts are overlaid with SSW classifications: minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.
Forecasting 08 00013 g004
Figure 5. Stratospheric temperature (Ts) time series at 10 hPa and 60° N for the period 2006–2100 based on the RCP 4.5 scenario from CMIP5 models: (a) M1, (b) M2, and (c) M3. Daily Ts are overlaid with SSW classifications: minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.
Figure 5. Stratospheric temperature (Ts) time series at 10 hPa and 60° N for the period 2006–2100 based on the RCP 4.5 scenario from CMIP5 models: (a) M1, (b) M2, and (c) M3. Daily Ts are overlaid with SSW classifications: minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.
Forecasting 08 00013 g005
Figure 6. Zonal wind (Uh) time series at 10 hPa and 60° N for the period 1980–2005 based on (a) ERA5 reanalysis and CMIP5 historical simulations from (b) M1, (c) M2, and (d) M3. Daily Uh are overlaid with SSW classifications: minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.
Figure 6. Zonal wind (Uh) time series at 10 hPa and 60° N for the period 1980–2005 based on (a) ERA5 reanalysis and CMIP5 historical simulations from (b) M1, (c) M2, and (d) M3. Daily Uh are overlaid with SSW classifications: minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.
Forecasting 08 00013 g006
Figure 7. Zonal wind (Uh) time series at 10 hPa and 60° N for the period 2006 –2100 based on CMIP5 RCP 4.5 simulations for (a) M1, (b) M2, and (c) M3. Daily Uh overlaid with SSW classifications are shown as minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.
Figure 7. Zonal wind (Uh) time series at 10 hPa and 60° N for the period 2006 –2100 based on CMIP5 RCP 4.5 simulations for (a) M1, (b) M2, and (c) M3. Daily Uh overlaid with SSW classifications are shown as minor (yellow circles), major (green triangles), and extreme (red squares). The dashed line indicates the trend in SSW days over time.
Forecasting 08 00013 g007
Figure 8. Scatter plots of daily mean zonal wind (Uh) versus stratospheric temperature (Ts) at 10 hPa and 60° N from CMIP5 models: (a) ERA5 (b) M2 and (c) M3 for the historical period (1980–2005). Orange dots represent data after 1990 and green dots represent data before 1990. Dashed black and blue lines show linear trends for pre-1990 and post-1990 periods.
Figure 8. Scatter plots of daily mean zonal wind (Uh) versus stratospheric temperature (Ts) at 10 hPa and 60° N from CMIP5 models: (a) ERA5 (b) M2 and (c) M3 for the historical period (1980–2005). Orange dots represent data after 1990 and green dots represent data before 1990. Dashed black and blue lines show linear trends for pre-1990 and post-1990 periods.
Forecasting 08 00013 g008
Figure 9. Scatter plots of daily mean zonal wind (Uh) versus stratospheric temperature (Ts) at 10 hPa and 60° N from CMIP5 models: (a) M2 and (b) M3 for the RCP 4.5 scenario (2006–2100). Colored dots represent different future sub-periods: magenta (2006–2025), blue (2026–2050), orange (2051–2075), and green (2076–2100). Solid lines show linear trends for each sub-period: brown (2006–2025), purple (2026–2050), blue (2051–2075), and black (2076–2100).
Figure 9. Scatter plots of daily mean zonal wind (Uh) versus stratospheric temperature (Ts) at 10 hPa and 60° N from CMIP5 models: (a) M2 and (b) M3 for the RCP 4.5 scenario (2006–2100). Colored dots represent different future sub-periods: magenta (2006–2025), blue (2026–2050), orange (2051–2075), and green (2076–2100). Solid lines show linear trends for each sub-period: brown (2006–2025), purple (2026–2050), blue (2051–2075), and black (2076–2100).
Forecasting 08 00013 g009
Table 1. Datasets and numerical weather models used in the analysis [28,33].
Table 1. Datasets and numerical weather models used in the analysis [28,33].
ModelAbbreviationInstitution/CountrySpatial ResolutionTemporal ResolutionPeriodsGroup
ERA5 ReanalysisERA5ECMWF, Europe0.25° × 0.25°Hourly1980–2005Reference
Australian Community Climate and Earth-System Simulator v1.3ACCESS1-3 (M1)Bureau of Meteorology & CSIRO, Melbourne, VIC, Australia1.875° × 1.25°Daily1980–2005 (Historical), 2006–2100 (RCP4.5)Primary
Hadley Centre Global Environmental Model v2—Carbon CycleHadGEM2-CC (M2)UK Met Office, Exeter, United Kingdom1.25° × 1.25°Daily1980–2005 (Historical), 2006–2100 (RCP4.5)Primary
Max Planck Institute Earth System Model—Medium ResolutionMPI-ESM-MR (M3)Max Planck Institute for Meteorology, Hamburg, Germany1.875° × 1.875°Daily1980–2005 (Historical), 2006–2100 (RCP4.5)Primary
Geophysical Fluid Dynamics Laboratory Earth System Model v2GGFDL-ESM2G
(M4)
NOAA Geophysical Fluid Dynamics Laboratory, Princeton, NJ, USA2.5° × 2.5°Daily1980–2005 (Historical), 2006–2100 (RCP4.5)Secondary
Flexible Global Ocean–Atmosphere–Land System Model, Grid-Point v2FGOALS-g2 (M5)LASG/IAP, Beijing, China2.8125° × 2.8125°Daily1980–2005 (Historical), 2006–2100 (RCP4.5)Secondary
Table 2. Threshold Exceedance Area Temperature (TEA) stratospheric temperature (Ts) criteria used in the SSW detection [2].
Table 2. Threshold Exceedance Area Temperature (TEA) stratospheric temperature (Ts) criteria used in the SSW detection [2].
Criteria Used in Temperature Anomaly Calculation in TEA Metrics
Step 1Ts: Stratospheric air temperature obtained from either CMIP5 or ERA5 analysis.
Step 2Tsa: Long-term climatic average of the stratospheric air temperature (Ts)
Step 3ΔTs = Ts − Tsa (Anomaly)
Table 3. SSW metrics: definitions, calculation methods, and classification thresholds [2].
Table 3. SSW metrics: definitions, calculation methods, and classification thresholds [2].
Metric/ClassDefinition/Calculation MethodPhysical InterpretationExample Value
(1980–2005)
TEAGeographical area where the Daily temperature anomaly exceeds a defined threshold.Horizontal extent of warming on a given day12 January 1987: >30 K ≈ 9.1 × 106 km2
MPDNumber of consecutive days with TEA above threshold.Persistence of warming (temporal duration)22 days
MPAAverage of Daily TEA during the main phase (∑TEA/MPD)Typical or average horizontal coverage of warming8.52 × 106 km2
MPSProduct of MPD and MPA (area × duration)Integrated magnitude of warming event187.4 × 106 km2.days
SSW Classification
Minor
MPS < 90 × 106 km2.daysSmall-scale or low-impact SSW eventGFDL-ESM2G—1984
(MPS ≈ 77 × 106 km2.days)
SSW Classification
Major
90–180 × 106 km2.daysLarge-scale impact SSW event HadGEM2CC—1999
(MPS ≈ 132 × 106 km2.days)
SSW Classification
Extreme
MPS > 180 × 106 km2.daysExceptionally strong/extensive SSWACCESS1-3—1987
(MPS ≈ 520 × 106 km2.days)
Table 4. Statistical comparison of daily stratospheric temperature (Ts) series at 10 hPa 60° N for ERA5 and CMIP5 models (M1–M3) over the historical period (1980–2005). The metrics include mean Ts, standard deviation (sd), percentile range (Pr), linear trends (STs), and variance reduction during SSW events.
Table 4. Statistical comparison of daily stratospheric temperature (Ts) series at 10 hPa 60° N for ERA5 and CMIP5 models (M1–M3) over the historical period (1980–2005). The metrics include mean Ts, standard deviation (sd), percentile range (Pr), linear trends (STs), and variance reduction during SSW events.
MetricERA5M1M2M3
Mean Ts (K)225.94227.9227.13227.76
Std. Deviation (sd) (K)9.5511.697.689.45
Percentile Range (Pr) (K)29.3735.1823.8328.43
Trend (STs) (K.decade−1)–0.298–0.85–0.69–0.316
SSW Days Mean Ts (K)226.19224.96229.22228.81
Non-SSW Days Mean Ts (K)225.93~227.96~227.07~227.72
SSW Variance Reduction46%54%50%44%
Table 5. Statistical comparison of daily stratospheric temperature (Ts) series at 10 hPa 60° N for CMIP5 models (M1–M3) under the RCP 4.5 future climate scenario. The diagnostics include mean Ts, standard deviation (sd), percentile range (Pr), linear trends (STs), and variance reduction during SSW events compared to the historical baseline.
Table 5. Statistical comparison of daily stratospheric temperature (Ts) series at 10 hPa 60° N for CMIP5 models (M1–M3) under the RCP 4.5 future climate scenario. The diagnostics include mean Ts, standard deviation (sd), percentile range (Pr), linear trends (STs), and variance reduction during SSW events compared to the historical baseline.
MetricM1M2M3
Mean Ts (K)225.85224.69225.6
Std. Deviation (sd) (K)11.997.389.04
Percentile Range (Pr) (K)35.8722.7727.16
Trend (STs) (K.decade−1)–0.21–0.26–0.24
SSW Days Mean Ts (K)222.95228.68225.75
Non-SSW Days Mean Ts (K)~225.95~224.69225.93
SSW Variance Reduction49%52%49%
Table 6. Statistical comparison of daily zonal wind (Uh) series at 10 hPa 60° N for ERA5 and CMIP5 models (M1–M3) over the historical period (1980–2005). The metrics include mean Uh, standard deviation (sd), percentile range (Pr), linear trends (STs), and variance reduction during SSW events.
Table 6. Statistical comparison of daily zonal wind (Uh) series at 10 hPa 60° N for ERA5 and CMIP5 models (M1–M3) over the historical period (1980–2005). The metrics include mean Uh, standard deviation (sd), percentile range (Pr), linear trends (STs), and variance reduction during SSW events.
MetricERA5M1M2M3
Mean Uh (m.s−1)10.6216.229.3410.83
Std. Deviation (sd) (m.s−1)19.1921.6618.1719.19
Percentile Range (Pr) (m.s−1)58.3365.7257.659.71
Trend SUh (m.s−1 decade−1)–0.87+0.64+0.80+0.34
SSW Days Mean Uh (m.s−1)14.517.6112.7811.9
Non-SSW Days Mean Uh (m.s−1)10.4516.189.2610.8
SSW Variance Reduction24%37%54%13%
Table 7. Statistical comparison of daily zonal wind (Uh) series at 10 hPa 60° N for CMIP5 models (M1–M3) under the RCP 4.5 future climate scenario. The diagnostics include mean Uh, standard deviation (sd), percentile range (Pr), linear trends (STs), and variance reduction during SSW events compared to the historical baseline.
Table 7. Statistical comparison of daily zonal wind (Uh) series at 10 hPa 60° N for CMIP5 models (M1–M3) under the RCP 4.5 future climate scenario. The diagnostics include mean Uh, standard deviation (sd), percentile range (Pr), linear trends (STs), and variance reduction during SSW events compared to the historical baseline.
MetricM1M2M3
Mean Uh (m.s−1)17.69.7310.8
Std. Deviation (sd) (m.s−1)23.5118.1318.88
Percentile Range (Pr) (m.s−1)70.4455.9557.64
Trend SUh (m.s−1 decade−1)+0.11–0.04+0.17
SSW Days Mean Uh (m.s−1)19.6613.131.75
Non-SSW Days Mean Uh (m.s−1)17.539.6910.11
SSW Variance Reduction43%31%4%
Table 8. Statistical analysis of stratospheric temperature amplitude (Tamp) and range (Trng) for the ERA5 reanalysis and CMIP5 model simulations (M2 and M3) at 10° N and 60° N for the historical (1980–2005) and future (2006–2100, RCP 4.5) periods. The metrics include the mean, median, standard deviation (sd), variance (Var), minimum (Min), maximum (Max), and linear trend (Slope, in K per decade).
Table 8. Statistical analysis of stratospheric temperature amplitude (Tamp) and range (Trng) for the ERA5 reanalysis and CMIP5 model simulations (M2 and M3) at 10° N and 60° N for the historical (1980–2005) and future (2006–2100, RCP 4.5) periods. The metrics include the mean, median, standard deviation (sd), variance (Var), minimum (Min), maximum (Max), and linear trend (Slope, in K per decade).
PeriodMetricLatModelMeanMedianSdVarMinMaxSlope
1980–2005Tamp10° NERA52.0712.0340.3790.1431.3333.397−0.001236
M21.6371.5960.3490.1220.8482.6630.000487
M31.8841.820.4150.1720.8363.1170.001729
60° NERA53.1593.1860.3110.0972.3343.8980.001618
M20.5850.590.0940.0090.3730.874−0.000605
M30.7850.7720.1080.0120.4991.2130.000621
Trng10° NERA50.5530.5470.1090.0120.30.835−0.00075
M20.4340.4290.0980.010.2180.675−0.000182
M30.5070.50.1180.0140.2110.8050.000407
60° NERA50.730.7270.0860.0070.5260.9470.000186
M20.1440.1440.0230.0010.0960.205−0.000136
M30.1930.1890.0250.0010.1450.2640.000077
2006–2100Tamp10° NM21.7061.70.3370.1140.842.9380.000661
M31.9621.9350.3690.1361.0173.231−0.00019
60° NM20.5740.5710.0860.0070.3910.855−0.000254
M30.7580.7530.1070.0110.4561.098−0.000106
Trng10° NM20.4560.4450.0990.010.2180.8270.000166
M30.5220.5080.1060.0110.2590.943−0.000086
60° NM20.150.1490.02100.0980.202−0.000051
M30.1910.1890.0250.0010.1170.245−0.000023
Table 9. Statistical analysis of zonal wind amplitude (Uamp) and range (Urng) for ERA5 reanalysis and CMIP5 model simulations (M2 and M3) at 10° N and 60° N for the historical (1980–2005) and future (2006–2100, RCP 4.5) periods. Metrics include the mean, median, standard deviation (sd), variance (Var), minimum (Min), maximum (Max), and linear trend slope (m.s−1 per decade).
Table 9. Statistical analysis of zonal wind amplitude (Uamp) and range (Urng) for ERA5 reanalysis and CMIP5 model simulations (M2 and M3) at 10° N and 60° N for the historical (1980–2005) and future (2006–2100, RCP 4.5) periods. Metrics include the mean, median, standard deviation (sd), variance (Var), minimum (Min), maximum (Max), and linear trend slope (m.s−1 per decade).
PeriodMetricLatModelMeanMedianStdVarnMaxSlope
1980–2005Uamp10° NERA512.51312.6564.5620.7933.03322.6020.01084
M213.35214.4835.49730.2121.37623.140.000349
M313.21414.3097.62258.0881.06625.5710.005
60° NERA511.38511.3471.7543.0757.04715.4360.01092
M24.0874.0031.1381.2951.6576.865−0.002389
M34.1683.9711.4742.1731.2158.212−0.00104
Urng10° NERA53.0213.1420.9260.8580.8534.8380.002139
M23.063.4321.0141.0280.3984.764−0.000255
M33.0683.6151.5262.3270.3025.060.001179
60° NERA52.7282.7420.3920.1541.6963.6910.001098
M21.0721.0220.290.0840.4891.788−0.000499
M31.0591.0190.3330.1110.3861.91−0.000604
2006–2100Uamp10° NM213.25113.9824.30618.5462.63521.302−0.004462
M314.45815.5517.13350.8860.79326.9440.0048
60° NM24.1334.0531.2371.5311.2637.554−0.000401
M34.5784.4321.4862.211.0129.1420.000261
Urng10° NM23.0943.2430.7680.5890.6784.419−0.000877
M33.3153.761.3651.8630.2185.0920.000826
60° NM21.0711.0780.290.0840.3381.761−0.00014
M31.1741.1460.3540.1260.3032.184−0.000025
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Durmus, S.; Demirhan, D.; Gultepe, I.; Durmus, O. Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100. Forecasting 2026, 8, 13. https://doi.org/10.3390/forecast8010013

AMA Style

Durmus S, Demirhan D, Gultepe I, Durmus O. Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100. Forecasting. 2026; 8(1):13. https://doi.org/10.3390/forecast8010013

Chicago/Turabian Style

Durmus, Simla, Deniz Demirhan, Ismail Gultepe, and Onur Durmus. 2026. "Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100" Forecasting 8, no. 1: 13. https://doi.org/10.3390/forecast8010013

APA Style

Durmus, S., Demirhan, D., Gultepe, I., & Durmus, O. (2026). Investigation of Sudden Stratospheric Warming (SSW) Events Between 1980 and 2100. Forecasting, 8(1), 13. https://doi.org/10.3390/forecast8010013

Article Metrics

Back to TopTop