Characteristics of Subseasonal Winter Prediction Skill Assessment of GloSea5 for East Asia

: In this study, the characteristics of systematic errors in subseasonal prediction for East Asia are investigated using an ensemble hindcast (1991 – 2010) produced by the Global Seasonal Forecasting System version 5 (GloSea5). GloSea5 is a global prediction system for the subseason-al-to-seasonal time scale, based on a fully coupled atmosphere, land, ocean, and sea ice model. To examine the fidelity of the system with respect to reproducing and forecasting phenomena, this study assesses the systematic biases in the global prediction model focusing on the prediction skill for the East Asian winter monsoon (EAWM), which is a major driver of weather and climate variability in East Asia. To investigate the error characteristics of GloSea5, the hindcast period is analyzed by dividing it into two periods: 1991 – 2000 and 2001 – 2010. The main results show that the prediction skill for the EAWM with a lead time of 3 weeks is significantly decreased in the 2000s compared to the 1990s. To investigate the reason for the reduced EAWM prediction performance in the 2000s, the characteristics of the teleconnections relating to the polar and equatorial regions are examined. It is found that the simulated excessive weakening of the East Asian jet relating to the tropics and a failure in representing the Siberian high pressure relating to the Arctic are mainly responsible for the decreased EAWM prediction skill. This study evaluates the basic performance of the EAWM prediction using an ensemble hindcast produced by the KMA subseasonal prediction system. The study includes an identification and evaluation of systematic biases in the global prediction model, focusing on the prediction skill for the EAWM, with its interaction between the tropics and the Arctic climate. It is found that the main issue relating to the decreased prediction skill for the EAWM in the 2000s is the failure of the simulation of the East Asia upper-level circulation in the GloSea5 subseasonal prediction. Our study has a limitation in that there are recent studies for the S2S prediction skills of various modes of atmospheric variabilities (e.g., AO, ENSO, MJO, and QBO) [36 – 38]. In addition, the phase or amplitude of the AO and the ENSO can affect the various conditions with mid-latitude teleconnections. Nevertheless, our study suggests that the recent decrease in EAWM skill is significantly related to changes in the Arctic and the tropics, which is the most relevant mode of East Asia winter variability. However, the specific causes of the low prediction skill with regard to changes in equatorial surface temperature variability or Arc-tic-related circulation, need to be further investigated using a more comprehensive analysis. Consistently with this study, Hsu et al. [39] recently revealed that tropical sea surface temperature biases are related to the EAWM variability in the subseasonal simulation for 2011 – 2020. In this study, although the hindcast period is limited to 1991 – 2010 due to data availability, it will be extended to more recent periods (e.g., 2011 – 2020) after updating the model version.


Introduction
Subseasonal to seasonal (S2S) prediction is becoming increasingly important in reducing the damage caused by extreme weather and climate events, since it bridges the gap between the weather and the seasonal climate forecast [1,2]. Since the subseasonal time scale (about 30-60 days) has long been considered a "predictability desert", there is growing interest within the scientific, operational, and applications communities in developing forecasts for subseasonal prediction [3,4]. Following the launch of various S2S prediction projects, substantial effort and progress has been made in understanding the S2S predictions [5]. At present, the vast majority of global producing centers (GPCs) produce operational subseasonal forecasts. These dynamical prediction models in operational centers are fully coupled climate system models that include the comprehensive dynamics and physics of the atmosphere, land surface, ocean, and sea ice interactions. To improve subseasonal prediction skills, GPCs such as the National Centers for Environmental Prediction (NCEP CFS) [6], the European Centre for Medium-Range Weather Forecasts (ECMWF), the United Kingdom Meteorological Office (UKMO) [7,8], and many other institutes and research groups, continuously update their operational coupled forecast systems with improved physics and increased resolution.
The sources of subseasonal predictability are various processes in the atmosphere, ocean, sea ice, and land (such as the Madden-Julian oscillation, the El Nino-Southern Oscillation, the Arctic Oscillation, tropical convection, tropical cyclones, sea ice cover, soil moisture, snow cover, stratospheric conditions, ocean statuses, and various teleconnections), although these are not yet fully understood [9][10][11][12]. Although subseasonal prediction is known for its low predictability, many studies have attempted to reduce the gap between weather forecasting (up to 2 weeks) and long-range or seasonal climate prediction (3-6 months) [1][2][3][4]. Operational prediction systems are divided into two types: extended-range weather systems and seasonal climate forecast systems. For example, the ECMWF operational system provides an extended-range forecast for the upcoming 46 days focusing mainly on week-to-week changes in the weather. The forecasts are derived from the ensemble system, which is extended twice per week (on Thursday and Monday) to run for 46 days ahead. Meanwhile, the UKMO provides four ensemble members initialized every day: two runs for 60 days and two runs for 195 days, using a seasonal prediction system. The Korea Meteorological Administration (KMA) also provides subseasonal and seasonal forecasts using a global prediction system based on a fully coupled atmosphere, land, ocean, and sea ice model.
The East Asian winter monsoon (EAWM) is one of the most active atmospheric circulation systems during the boreal winter [13,14]. According to previous studies, the EAWM has significant impacts on the weather and climate over the East Asian region, including extreme cold events related to subsystems such as the Siberian high, Aleutian low, East Asian trough, low-level northerly wind, and high-level jet stream [15][16][17][18]. Many of the climate forecast models have shown reasonable skill in predicting the East Asian summer monsoon. However, the EAWM is still difficult to predict in a climate forecast model because the subsystems have the characteristic of complex seasonality, exhibiting a distinct subseasonal phase [18][19][20]. In this study, the characteristics of systematic errors in the subseasonal predictions for East Asia are evaluated using an ensemble hindcast produced by the KMA subseasonal prediction system and run for 20 years (1991-2010). This study includes the identification and evaluation of systematic biases in the global prediction model and focuses on the prediction skill for the EAWM, with its interaction between the tropic and Arctic climates, which are major drivers of weather and climate variability in East Asia. This paper is divided into the following sections: a brief description of the KMA operational model is provided in Section 2; Section 3 examines the hindcast climatology and the characteristics of the prediction skill that are closely related to the East Asian climate; Section 4 summarizes the results and provides the major conclusions.

Model and Data Description
The S2S project provides a publicly available database of historical reforecasts and real-time forecasts from several operational forecast groups, including KMA. This study used a subseasonal dataset in the S2S project databases, consisting of ensemble hindcast runs for 20 years (1991-2010) produced by the KMA prediction system. The KMA has had an operational subseasonal to seasonal forecast system, which is a joint system between the KMA and the UK Met Office, since 2014. Currently, the operational subseasonal forecast system of the KMA is the Global Seasonal Forecasting System version 5 (GloSea5). GloSea5 is a fully coupled global climate model that consists of atmosphere (Unified Model; UM), ocean (Nucleus for European Modeling of the Ocean; NEMO), land-surface (Joint UK Land Environment Simulator; JULES) and sea−ice (Los Alamos sea ice model; CICE) components, combined through the OASIS coupler [7]. The resolution is N216N85 (approximately 60 km) for the atmosphere and ORCA0.25L75 (0.25° on a tri-polar grid) for the ocean. A stochastic kinetic energy backscatter (SKEB) scheme is used to generate the spread between members initialized from the same analysis. The hindcast covers the period 1991-2010 with three members initialized on fixed calendar dates: 1st, 9th, 17th, and 25th. The initial condition for the hindcast uses the reanalysis from the European Centre for Medium-Range Weather Forecasts (ECMWF) ERA-Interim project for the atmosphere and the land surface. The assimilation system used to create the hindcast (based on the NEMOVAR scheme) has the same ocean and sea ice model as the coupled model used in GloSea5. Further detailed information on GloSea5 is provided in MacLachlan et al. [7].
In this study, we applied the dynamical EAWM index proposed by Li and Yang [17] to measure the interannual variability of the EAWM. Because this index takes into account several factors influencing the monsoon, such as the Arctic Oscillation (AO) and the El Niño-Southern Oscillation (ENSO), and better elucidates the physical processes associated with the EAWM [17,18,21,22], it is considered suitable for the evaluation of the EAWM prediction skill in this study. The index is defined as the mean horizontal shear of 200 hPa zonal wind: EAWM = {(U200(30-35° N, 90-160° E) -U200(50-60° N, 70-170° E)) + (U200(30-35° N, 90-160° E) -U200(5° S-10° N, 90-160° E))}/2. Following the definition of the AO by Thompson and Wallace [23], the AO index is calculated as the principal component (PC) of the first empirical orthogonal function (EOF) mode for daily mean sea level pressure (SLP) anomalies during the boreal winter. In addition, the Nino 3.4 index is used to represent the ENSO variability in this study.
To validate the various features related to the EAWM in GloSea5, reanalysis from the ECMWF ERA-Interim project (ERA-Interim) [24] and the Global Precipitation Climatology Project (GPCP) version 2.1, combined with the precipitation dataset [25], are used in this study. Figure 1 shows the spatial distribution of the weekly mean forecast biases for surface temperature with lead times of 1-4 weeks obtained from GloSea5 and ERA-Interim for the season comprising December, January and February (DJF). The pattern over East Asia, which has a cold temperature on the continent and a warm temperature in the ocean, is one of the most critical driving mechanisms of the EAWM [18]. GloSea5 generally represents the observed temperature patterns well. However, the temperature from the GloSea5 hindcast shows distinct cold biases over the continent in the northern hemisphere, including the East Asia region. Thus, the temperature contrast between the land and the ocean over East Asia in GloSea5 is weaker than that in the observations. In addition, the cold biases in the hindcast increase with an increasing lead time. Meanwhile, there are slight warm biases over some areas of the western Pacific, South America, and the Arctic. , respectively. Pink, blue, green, and red dots indicate the weekly averaged PCCs for the variables with lead times of 1, 2, 3, and 4 weeks, respectively. Although the PCCs for all the variables indicate the highest prediction skill (approximately 0.8) for a lead time of 1 week, it is relatively low for most of the other lead times. The PCCs of all the variables over the tropics are generally higher than those of other regions, whereas those over East Asia show a low prediction skill for lead times of 3 to 4 weeks. The prediction skill for the 850hPa zonal wind is generally low compared to the other variables (surface temperature, 500hPa geopotential height, and 200hPa zonal wind). To confirm the general prediction skill for the EAWM, the time series of the East Asia winter monsoon index produced by Li and Yang [17] from ERA-Interim and GloSea5 with lead times of 1 and 3 weeks, are shown in Figure 3. This index indicates the interannual variability of the EAWM and is defined as the domain-averaged 200hPa zonal wind shear. GloSea5 represents the observed variation well in most years with a 1week lead. Although GloSea5 with a 3-week lead time shows some skill in realistically representing the variability, the recent prediction skill is quite poor. Averaged correlations for EAWM skills from ERA-Interim and GloSea5 with lead times of 1 to 4 weeks are shown in Figure 4. To confirm that the prediction skill is reduced in the recent period, the correlation is calculated by dividing it into two periods: 1991-1999 (hereafter P1) and 2000-2009 (hereafter P2). In the period from 1991 to 2009, the correlation decreases according to the lead time, but the value is greater than 0.6 up to a lead time of 4 weeks (black line in Figure 4). The correlation in P1 (P2) is higher (lower) than that for the entire period for all lead times. Notably, it is clear that the correlation for the 3-week lead time in P2 is significantly lower than in the other periods, and even lower than the value for the 4-week lead time in the same period.   Figure 5 shows the surface temperature and the 200hPa zonal wind patterns regressed onto the EAWM index from ERA-Interim and GloSea5 with 1-week and 3-week lead times. From observation, a strong monsoon indicates two distinct patterns: one is the intensified East Asian jet in the upper-level wind, with continent-wide cooling on the surface over northeast Asia, and the other is the Pacific-East Asian teleconnection pattern influenced by ENSO during its cold phase (Figure 5a). GloSea5 with a 1-week lead realistically represented the EAWM variability as shown in the reanalysis and realistically captured the surface temperature cooling over northeast Asia (Figure 5b). However, the model with a 3-week lead tends to show a slightly weak warming over the Arctic and cooling over East Asia (Figure 5c). Additionally, it is clear that the weakening of the upper-level wind circulation over East Asia is dominant for the 3-week lead. To confirm the cause of the degradation of the EAWM performance, the effects on high and low latitudes are investigated over the next two sections.

Effect of Arctic-Mid-Latitude Interaction on EAWM
Recently, many studies have discussed the effect of climate variability in the Arctic on the mid-latitudes [26][27][28][29]. These studies indicate that the temperature trends for the mid-latitudes and the Arctic show opposite patterns before and after the late 1990s. In the earlier period, the Arctic shows slight cooling, whereas in the later period, it shows rapid warming. It is well known that the recent cold winters in northern continents are related to Arctic warming; the so-called "warm Arctic-cold continent" pattern enforced via stationary Rossby waves. Labe et al. [29] recently mentioned that these Arctic-mid-latitude linkages show little connection with modeling results, although some model experiments support the observational evidence. To examine whether GloSea5 represents this recent phenomenon well, the linear trends for the surface temperature for the period 2000-2009, from ERA-Interim and GloSea5 with lead times of 1 and 3 weeks, are shown in Figure 6. The Arctic warming and Eurasian cooling trends shown in the reanalysis data are well simulated in the NWP for this period for a lead time of 1 week. The trend is well represented in the subseasonal period with a 3-week lead time, although the cooling strength is slightly weak. To characterize the variability of the surface temperature over East Asia for the winter climate, an EOF analysis using surface temperature north of 20° N was applied. Figures 7 and 8 show the sea level pressure and surface temperature regressed onto the first and second EOF analyses of surface temperature over the Arctic to the mid-latitudes, from the ERA-Interim and GloSea5 hindcasts, respectively. The first-mode eigenvector from ERA-Interim explains approximately 30% of the total variance. The first dominant structure of the EAWM is characterized by the strengthening of the Siberian high ( Figure  7). This shows the cold temperatures in Eurasia to the East Asia continent, including the Korean peninsula, whereas there are the warm temperatures in the high-latitude regions. GloSea5 with a 1-week lead represents the pressure and temperature features from the observations well. The strengthening of the Siberian high and the cold temperature patterns are not captured by the increasing the lead time in GloSea5, although the warm temperature pattern is well represented. Consistently with other studies [29], Arcticmid-latitude linkages tend to be weakened in the model experiment compared to the observational phenomena, especially with increasing lead times. The second dominant structure of the EAWM, which explains approximately 20% of the total variance, shows an intensive pressure gradient forming between the Siberian high and the Aleutian low, associated with the main pattern of the EAWM (Figure 8). Gong et al. [30] found a significant out-of-phase relationship between the AO and the EAWM during the period 1951-1999. The negative phase of the AO is concurrent with a strong EAWM, though it could significantly influence the EAWM strengthening related to the Siberian high. Observation showed weakened sea level pressure and warm temperatures over the East Asia region. These patterns are exactly simulated by GloSea5 with a 1-week lead time. GloSea5 with a 3-week lead time generally captures the out-of-phase AO-EAWM relationship well, although its strength is weak.  Figure 9 shows the principal component (PC) time series of the first leading mode governing the detrended winter sea level pressure, from ERA−Interim and GloSea5 with lead times of 1 and 3 weeks. The PC time series from GloSea5 with a lead time of 1 week captures the interannual variation shown in the reanalysis data. However, the results from GloSea5 with a lead time of 3 weeks show a very different pattern from the observations, especially in the 2000s. The quantitative value of the Siberian high index is defined as the regional mean SLP averaged over the area 40-65° N and 80-120° E in winter, providing a measure of the strength of the Siberian high [31]. The interannual variation of the Siberian high index, representing the expansion or contraction of the Siberian high pressure, which is the characteristic most closely related to the first EOF mode from the surface temperature, is shown with a black dotted line in Figure 9. The time series of the Siberian high index is almost consistent with the observed pattern. In other words, it can be seen that the recent decrease in the EAWM prediction skill is also related to the failure of the simulation of the expansion or reduction of the Siberian high in GloSea5 with a lead time of 3 weeks. The AO is a dominant pattern for the variation of sea level pressure from the Arctic to the mid-latitude region and is described by sea level pressure anomalies of one sign in the Arctic and anomalies of the opposite sign in the mid-latitudes [19]. To confirm the performance for the AO, which is a characteristic related to the second EOF mode from the surface temperature, the prediction skill of the AO index and the correlation between the EAWM and the AO based on the number of lead weeks are shown in Figure 10. The prediction skill of the AO index in the 2000s for GloSea5 is higher than in the 1990s and is especially low for a lead time of 4 weeks in the 1990s. It is a well-known fact that, through influencing large-scale circulations such as the Siberian high, a positive phase of winter AO generally leads to a weaker EAWM and warmer than normal winters. It can be observed that the periods 1991-1999 and 2000-2009 show a negative correlation between the EAWM and the AO. The correlation between the AO and the EAWM in the earlier period tends to slightly decrease in the later period. The relationship in the earlier period is well simulated up to a lead time of 3 weeks; however, the weakened relationship in the later period is more strongly simulated than that observed for lead times of 2 and 3 weeks. Interestingly, for the 4−week lead time, although the correlation in the earlier period shows the opposite result, the correlation in the later period appears similar to the observation. Thus, the effect of the AO simulation on the recent decrease in the EAWM prediction skill in the 2000s, is not significant.

Effect of Tropics-Mid-Latitude Interaction on EAWM
It is well known that current seasonal prediction systems forecast the surface temperature reliably only in lower latitudes in the boreal winter. This is mainly driven by the extended prediction skill of ENSO, although the skill is very limited in the higher latitudes [32,33]. Therefore, the representation of the teleconnection from the tropics should be one of the key uncertainties in prediction systems. In addition, many studies have mentioned that the interannual variations of the EAWM are well correlated with ENSO [18,34,35]. The EAWM and ENSO show a negative correlation, associated with the anticyclonic circulation over the Philippine Sea, which is caused by warming in the central Pacific [35].
To examine the ENSO-related characteristics in the GloSea5 subseasonal prediction, the precipitation, temperature, and wind fields regressed to the Nino 3.4 index are shown in Figures 11 and 12. The results from the ERA-Interim reanalysis are not shown, because the simulated pattern of association with the ENSO from GloSea5 with a lead time of 1 week mostly matched the observed pattern. In the early period, for positive cases of Nino 3.4 corresponding to the 850hPa wind pattern related to the ENSO, precipitation decreases over the Maritime Continent and the western Indian Ocean and increases from southern China to southern Japan, compared to the negative cases. This is associated with weak westerlies over the subtropics and weak easterlies over the tropical Indian Ocean and the western North Pacific (Figure 11a). Even when the lead time increases, the relationship between the EAWM and the ENSO is generally well represented. However, the center of equatorial precipitation is weakened and wind circulation in the western North Pacific is overestimated (Figure 11b). In the later period, the center of the equatorial precipitation related to the ENSO shifts to the left, and the rainband from southern China to the south of Japan is weakened. In addition, the low-level circulation in the northeast Pacific is strengthened (Figure 11c). Changes in low-level circulation and precipitation patterns in the 2000s are well represented by GloSea5 with a lead time of 3 weeks ( Figure  11d). The surface temperature and upper-level circulation regressed onto the Nino 3.4 index are shown in Figure 12. In the earlier period, the high temperature over the central Pacific and the north-south dipole pattern of the upper-level circulation is related to the ENSO (Figure 12a). The temperature and the wind circulation from GloSea5 with a lead time of 3 weeks are generally well represented, although the wind circulation over the northwest Pacific is overestimated (Figure 12b). In the later period, the temperature peak over the central Pacific is shifted to the left and is weakened slightly, similarly to the precipitation pattern (Figure 12c). In addition, the circulation over the northeast Pacific is significantly strengthened and the East Asian jet is slightly weakened. It should be noted that GloSea5 with a lead time of 3 weeks underestimates the weakened circulation over East Asia and overestimates the strengthened circulation over the northeast Pacific (Figure 12d). Interestingly, the surface temperature over the central Pacific and the Korean Peninsula shows the same sign in the earlier period, whereas the sign is reversed in the later period. The correlations between the EAWM and the Nino 3.4 index from observation and GloSea5, with various lead times, are plotted in Figure 13a. In the observation, both periods (1991-1999 and 2000-2009) show a negative correlation between the EAWM and Nino 3.4. In the earlier period, the correlation between the EAWM and the ENSO is generally well captured by GloSea5, even when the lead time is increased. However, the recent relationship between the EAWM and the ENSO simulated by GloSea5 is reduced rapidly as the lead time increases, showing different results from the observations. For GloSea5 with a 3−week lead time, the correlation is completely reversed. To examine the cause of the failure to represent this relationship, the prediction skill for the 200hPa zonal wind over East Asia, which is highly related to the EAWM, is plotted in Figure 13b. As mentioned above, GloSea5 with a 3−week lead time shows a well-simulated 200hPa zonal wind and relationship between the EAWM and Nino3.4 in the earlier period. However, the prediction skill for the 200hPa zonal wind in GloSea5 with a 3−week lead time in the later period is significantly reduced as the lead time increases. In other word, it is clear that the failure to simulate the upper wind field in the later period is strongly related to the reduced prediction skill of the EAWM index.

Concluding Remarks
In this study, the characteristics of systematic errors in subseasonal predictions are investigated using an ensemble hindcast (1991-2010) produced by the KMA GloSea5. The KMA GloSea5 is a global prediction system for the subseasonal-to-seasonal time scale, based on a fully coupled atmosphere, land, ocean, and sea-ice model. To examine the fidelity of the prediction system in reproducing and forecasting phenomena, this study assesses the systematic biases in the global prediction model, focusing on the prediction skill for EAWM, which is a major driver of weather and climate variability in East Asia.
GloSea5 generally represents the observed temperature patterns well; however, there are clear cold biases over the continent in the northern hemisphere, including the East Asia region. In addition, the cold biases increase with increasing lead times. Although the pattern correlation coefficients of large-scale variables indicate great skill for lead times of 1 week, this reduces for lead times of 2 weeks onward. The EAWM index forecast from GloSea5, which is calculated from the upper-level zonal wind shear, represents the observed variation well with a 1−week lead in most years. GloSea5 with a 3−week lead shows a realistic representation of the variation; however, the recent EAWM prediction skill is quite poor.
To investigate the error characteristics of GloSea5, the hindcast period was analyzed by dividing it into two periods: 1991-2000 and 2001-2010. The results show that the prediction skill for the EAWM for a lead time of 3 weeks is significantly decreased in the 2000s compared to the 1990s, which is due to the weakening of the upper-level wind circulation over East Asia. In order to investigate the reason for the reduced EAWM prediction performance in the 2000s, the represented characteristics of the teleconnection relating to the polar and equatorial regions were examined. Although the prediction skill of GloSea5 for the AO index in the 2000s is higher than in the 1990s, the simulated excessive weakening of the East Asian jet related to the tropics and the failure to represent the Siberian high pressure related to the Arctic, are mainly responsible for the decreased EAWM prediction skill in the 2000s. In addition, one of the possible reasons for the low prediction skill of EAWM in the 2000s is that the observed characteristic, the peak of SST distribution in the equatorial Pacific moving from the east to the center, is weakly represented by GloSea5. This can affect the simulation of the large-scale circulation, which eventually leads to the mid-latitude variability.
This study evaluates the basic performance of the EAWM prediction using an ensemble hindcast produced by the KMA subseasonal prediction system. The study includes an identification and evaluation of systematic biases in the global prediction model, focusing on the prediction skill for the EAWM, with its interaction between the tropics and the Arctic climate. It is found that the main issue relating to the decreased prediction skill for the EAWM in the 2000s is the failure of the simulation of the East Asia upper-level circulation in the GloSea5 subseasonal prediction. Our study has a limitation in that there are recent studies for the S2S prediction skills of various modes of atmospheric variabilities (e.g., AO, ENSO, MJO, and QBO) [36][37][38]. In addition, the phase or amplitude of the AO and the ENSO can affect the various conditions with mid-latitude teleconnections. Nevertheless, our study suggests that the recent decrease in EAWM skill is significantly related to changes in the Arctic and the tropics, which is the most relevant mode of East Asia winter variability. However, the specific causes of the low prediction skill with regard to changes in equatorial surface temperature variability or Arctic-related circulation, need to be further investigated using a more comprehensive analysis. Consistently with this study, Hsu et al. [39] recently revealed that tropical sea surface temperature biases are related to the EAWM variability in the subseasonal simulation for 2011-2020. In this study, although the hindcast period is limited to 1991-2010 due to data availability, it will be extended to more recent periods (e.g., 2011-2020) after updating the model version.