1. Introduction
The monsoons in East Asia have complex spatio-temporal variations throughout the subtropical and mid-latitude regions [
1,
2]. The monsoonal belt during summer, which stretches thousands of kilometers, rains in the countries and the adjacent oceans of China, Japan, and North and South Korea, as the regional monsoons, called
Meiyu,
Baiyu,
Jangma and
Changma, respectively. The heavy rain during the summer monsoon period and complex terrain in East Asian countries make water management more difficult in a timely manner. Particularly, floods and droughts caused by the monsoon variations can significantly affect human life and the economy for the more than 1.5 billion people in the East Asian monsoon (EAM) region. As the EAM affects one-fifth of the world’s population and interacts with the global climate systems, the EAM study has been one of the major areas in climate science.
To understand the EAM and its variability, spatially and temporally reliable climate data over a long period of time are required. Observational station data are not continuous throughout space and time [
3,
4,
5,
6] and, thus, they are not sufficient to provide consistent atmospheric states for a given location, which results in limitations to study the EAM as well as other climatic phenomena. The climate reanalysis data can replace observational data by compensating the shortcomings of weather station data, which can provide continuous atmospheric variables through data assimilation processes and numerical models. Since the first climate reanalysis data of National Centers for Environmental Prediction/National Center for Atmospheric Research (NCEP/NCAR, hereinafter NCEP1) [
7] were released in the mid-1990s, reanalysis data have been widely applied to three-dimensional analysis of various weather and climate phenomena, climate variability, and model validation studies. The performance of reanalysis data has been continuously improved in terms of spatial resolution, atmospheric-land–ocean coupled model and numerical forecasts. Recently, the fifth-generation climate reanalysis of European Centre for Medium-Range Weather Forecasts (ECMWF), ERA5, was released [
8].
However, due to the uncertainty of input data, errors in numerical models, and limitations in spatial resolution, there are still differences between reanalysis data and observed atmospheric variables. Therefore, recognizing the differences and errors in the reanalysis data from the observations and considering their effects on the research outputs are essential to apply the reanalysis data to climate research [
9]. The studies of comparing differences of reanalyses with observations and validating errors to evaluate the performance of reanalysis have been conducted on global e.g., [
10], continental e.g., [
11,
12], and country scale e.g., [
3,
13]. As for studies on a global scale, plenty of research compared station observations with global gridded observations, such as the Climate Research Unit (CRU) and the Global Precipitation Climate Center (GPCC), the merged data of satellite estimates and gauge data, such as CPC Merged Analysis of Precipitation (CMAP), and climate reanalysis data, such as NCEP, ECMWF, and Japanese Reanalysis (JRA) series. For example, Donat et al., (2014) evaluated the consistency of the reanalyses and observations in terms of extreme temperature and precipitation and revealed that the ECMWF reanalysis showed spatially higher correlations with the observations compared to the Japanese 25-year Reanalysis (JRA25) and NCEP reanalyses [
10]. Sun et al., (2018) compared various gridded global precipitation data, including gauge-based, satellite-related, and reanalysis products [
14]. There were deviations up to 300 mm among the products in the estimated annual precipitation over the land and the reanalysis datasets generally had the largest discrepancies compared to gauge-based and satellite-related datasets.
The comparison and validation studies of reanalysis have been conducted on a spatial scale of continents, including eastern Eurasia, southern South America, and Antarctica e.g., [
11,
12,
15]. Several studies have verified the uncertainty of the reanalysis data in polar regions, where it is difficult to obtain observational data due to a lack of data accessibility. For example, Zhang et al., (2021) divided Greenland into sedimentary and erosion areas to compare surface temperature, relative humidity, and wind speed of observations with Japanese 55-year Reanalysis (JRA55), ERA5, Climate Forecast System Reanalysis (CFSR), and Modern-Era Retrospective Observations (MERRA2) [
16]. Based on Root Mean Square Errors (RMSEs) and biases, MERRA2 showed the highest performances in surface temperature in the sedimentary area and JRA55 had the best in the erosion area. MERRA2 was the best reanalysis for relative humidity as it captured the seasonal cycles well. For wind speed, JRA55 and ERA5 were the best in the erosion and the sedimentary areas, respectively. Huai et al., (2019) evaluated surface temperatures in Antarctica obtained on the eight reanalysis datasets, including ERA-Interim, CFSR, JRA55, MERRA2, ECMWF 20th century atmospheric model ensemble (ERA-20cm), ECMWF 20th century reanalysis (ERA-20c), NOAA-CIRES-DOE Twentieth Century Reanalysis (20CR), and Coupled Reanalysis of the Twentieth Century (CERA-20C) [
15]. All the reanalysis data generally represented well the seasonal cycles of temperature. In particular, MERRA2 showed the best performance with mean absolute error (MAE) of 2 °C or less for all twelve months. For annual variations in the monthly average temperature, the accuracy was in the order of ERA-Interim, CFSR, and MERRA2 and the other reanalysis data showed relatively lower performance.
In the mid-latitude regions, it can be challenging to simulate true atmospheric conditions using a climate forecast model because of the various factors associated with complex interactions of high- and low-latitude climate systems [
17] and also due to the effects of human-induced land cover and land use changes on the atmosphere [
18]. Therefore, research on comparison and validation of reanalysis data have been actively conducted in the mid-latitudes. Mooney et al., (2011) compared reanalysis data (ERA40, ERA-Interim, and NCEP1) with station observations for surface temperature over Ireland during 1989–2001 and found that ERA-Interim showed higher linear correlation coefficients than the other two datasets [
13]. Balmaceda-Huarte et al., (2021) divided southern South America into five sub-regions and compared the reanalysis data of ERA-Interim, ERA5, NCEP1, NCEP-Department of Energy (DOE) reanalysis (hereinafter NCEP2), and JRA55 for temperature and precipitation [
12]. NCEP1 and NCEP2 showed poor illustrations for overall trend, but good for interannual variations. ERA5 was better in capturing the spatio-temporal variations in temperature and precipitation, compared to ERA-Interim, because of the advanced numerical models and higher spatial resolution. de Lima and Alcântara (2019) validated the reanalysis data of ERA-Interim, CFSR, and NCEP1 with station observations based on five extreme climate indices by the Expert Team on Climate Change Detection and Indices (ETCCDI) for the eastern region of northeast Brazil [
19]. ERA-Interim showed lower RMSEs, compared to CFSR, for precipitation, monthly maximum of daily minimum temperature (TXn), and monthly minimum of daily minimum temperature (TNn). RMSEs of monthly maximum of daily maximum temperature (TXx) and monthly maximum of daily minimum temperature (TNx) were relatively lower in CFSR. Pearson correlations of precipitation, TXn, and TNn were higher in the order of ERA-Interim, CFSR, and NCEP1. For TXx and TNx, the correlations of NCEP1 were higher than those of CFSR. Tang et al., (2017) evaluated the reanalyses of NCEP1, NCEP2, and CRU-NCEP with NOAA long-term weather data on monthly and annual scales in 90 metropolitan areas in the United States [
20]. On the annual, CRUNCEP showed the best performance for average and minimum temperatures and NCEP1 and NCEP2 for maximum temperature and precipitation. The least bias with observational data was in NCEP1 for maximum temperature and CRUNCEP for the precipitation, average temperature, and minimum temperature.
East Asia is one of the world’s most densely populated areas in the mid-latitude, where the East Asian summer and winter monsoons have a dominant influence on the climate of the region. For an example of a comparison study in East Asia, Inoue and Matsumoto (2004) compared sea-level pressure of NCEP1 and ERA40 with observations during the summer of 1960–1999 in eastern Eurasia to evaluate their reliabilities [
11]. The annual and seasonal averages of reanalysis data showed an increase in sea-level pressure in Mongolia between 1960–1979 and 1980–1999 in NCEP1. These changes in sea-level pressure observed in NCEP1 were not found in ERA40 and also observations, indicating poor performance of NCEP1 compared to the ERA40. Many studies that compare reanalysis data with observational station data in East Asia have been conducted in China, with its large area and various climates. Zhao and Fu (2006) evaluated the reliability of reanalysis data by comparing the spatio-temporal distribution of summer precipitation obtained from ERA40, NCEP2, and CRU and their differences from observational data during 1979 to 2001 in mainland China [
21]. According to the results of spatial distribution, interannual variations, and empirical orthogonal function (EOF) analysis, CRU showed high consistency with observation data. Gao et al., (2008) compared surface temperatures of NCEP1 and ERA40 with station observations in mainland China from 1979 to 2001 and estimated the effect of elevation on temperature biases in the reanalyses [
22]. Although the reanalysis data represented clear cold biases, ERA40 was relatively similar to the observations compared to NCEP1. Ma et al., (2008) compared the performance of reanalysis data of ERA40, NCEP1, and NCEP2 in China from 1979 to 2001 in terms of spatio-temporal aspects [
23]. The results showed that ERA40 was better at reproducing surface temperature than NCEP series and NCEP2 was more consistent with the observations than NCEP1. NCEP2 had a smaller MAE, a higher standard deviation, and a slightly higher correlation coefficient than NCEP1. You et al., (2012) compared precipitation of NCEP1 and ERA40 with adjusted observations from 1961 to 2007 over the Tibetan Plateau [
24]. The temporal variations in the two reanalyses were different from the observations. Further, dry biases were observed in both reanalyses and NCEP1 showed a greater dry bias compared to ERA40, which supported the better performance of ERA40 than NCEP1 in reproducing the interannual variations in precipitation in the Tibetan Plateau. He and Zhao (2017) compared NCEP2 and CFSR with observations by daily temperature indices in China from 1979 to 2010 [
3]. For the observation data of the Beijing stations, both reanalysis data reproduced well the long-range, also called long-term memory or long-range persistence, correlation. However, there are some overestimations in daily temperature along the Tibetan Plateau. NCEP2 performed better than CFSR in relatively colder regions, including Inner Mongolia and the central, eastern, and northwestern regions of China, while CFSR was superior in the warmer southern China.
There are relatively fewer studies of reanalysis comparison and validation for other countries in East Asia, although South Korea and Japan have been included in the global validation study of reanalysis (e.g., [
4]). For South Korea, Han and Kim (2019) compared the reanalysis data of ERA-20cm, ERA-20c, ERA40, and 20CR with the gridded observational data of CRU and GPCC [
25]. Precipitations of CRU and GPCC showed higher correlations in interannual variations compared to reanalysis data and ERA40 was the highest for temperature. As a local-scale study, Xin et al., (2021) evaluated the reproducing performance of precipitation in ERA5-Land and ERA5 reanalysis in urban areas of Guangdong, Hong Kong, and Macau at different time scales of hourly, daily, and monthly [
5]. The reanalysis data reproduced well the spatial distribution and monthly trends of precipitation. However, for the shorter time scales, at hourly and daily, the frequency of precipitation was overestimated. Despite the higher resolution in the ERA5-Land, ERA5 reanalysis was more accurate in representing precipitation due to the absence of an atmospheric model in generating the ERA5-Land reanalysis.
While the climate reanalysis has evolved by improving the quality of input observational data, by enhancing spatial resolution, and by reducing the systematic model biases over the generations, it is still essential to verify the reanalysis data at a regional scale. ERA5, as the most recently produced reanalysis with the highest spatial resolution, was improved from ERA-Interim in the integration of model processes, tangential approximation, and data assimilation in an Integrated Forecast System (IFS) [
8]. However, several previous studies that investigated the applicability of ERA5 in China showed the performance of ERA5 was relatively degraded at high altitude. Zhang et al., (2021) compared reanalysis data of ERA5, JRA55, NCEP1, NCEP2, MERRA, 20CR, and the 40-year global reanalysis dataset released by China Meteorological Administration (CRA40) with the observations after removing urbanization bias in mainland China during 1979–2015 [
16]. The results showed that the surface temperature from JRA55 was better than that from ERA5 and other reanalysis data. Jiao et al., (2021) evaluated precipitations from ERA5 with observations in mainland China by analyzing the annual and seasonal trends and spatial patterns [
26]. ERA5 had good agreements with observations at elevations below 1000 m, but significant differences at elevations above 4000 m. These previous results suggest that the reliability of climate reanalyses needs to be validated in the regional domain at various time scales (e.g., annual and seasonal averages).
Most previous studies comparing the existing reanalysis data with observation data were conducted for specific countries. In particular, most of the studies in East Asia have focused on China, so, to our knowledge, there are no studies comparing and validating reanalysis data only for the entire EAM region. As mentioned earlier, the EAM is a major source of water for humans, animals, and plants living in the region and, at the same time, the extreme nature of monsoons can cause damage by heavy rain and drought as well as winter storms. As nearly one-fifth of the world’s population lives in the EAM region, the importance of understanding the EAM is paramount. Therefore, for a better understanding of the EAM variability and, thus, improving the monsoon predictability, the use of climate reanalysis evaluated in the EAM region is needed. This study compared and validated representative reanalysis data across the world, which are ERA5 from Europe, NCEP2 from America, and JRA55 from Asia, with station observation data provided by each country, China, North Korea, South Korea, and Japan in the EAM region. By evaluating the performance of the reanalysis data in terms of temporal and spatial associations and differences using the statistical metrics of the parametric and non-parametric correlation coefficients, RMSE, and bias, we suggested the most appropriate reanalysis data for the EAM region. In addition, the potential geographical factors for the systematic biases found in the reanalysis data were discussed, which could be considered in utilizing the reanalysis data for East Asian studies.
4. Discussion
Among ERA5, NCEP2, and JRA55, ERA5 was selected as the most consistent reanalysis data with observational station data of temperature and precipitation in the EAM region for the forty years of 1981 to 2020 (the thirty years of 1981 to 2010 in China). It may be expected that the most recent reanalysis data would be the best due to their high spatio-temporal resolutions, improved assimilations, and an advanced numerical model. However, prior studies comparing and validating reanalyses with observations suggested that ERA5 may have some discrepancies with observations, especially in the high altitudes. For instance, Zhang et al., (2021) compared the various reanalysis data, including ERA5, JRA55, NCEP1, NCEP2, MERRA, 20CR, and CRA40, with urbanization bias-corrected observations in mainland China for the period of 1979–2015 [
6]. The results revealed that Pearson correlation coefficients and standard deviation ratio values of temperature from ERA5 at high altitudes above 1500 m were lower than those from JRA55 and CRA40. The uncertainties in ERA5 were also found in a study comparing ERA5 precipitation data with gridded station observations in mainland China for the period of 1979–2018 [
26]. The results showed good agreement between ERA5 and observations in the elevations below 1000 m, but significant differences in the regions of elevations above 4000 m. These results could be confirmed in our study as well. In
Figure 8a,b, ERA5 best stations dominated in the coastal regions of China, supported by [
8], and the relatively lower elevations in other countries. As our study region does not include the high elevated regions of western China, ERA5 could outperform JRA55 and NCEP2 in the EAM region, in contrast to the previous studies on the entirety of China.
In this study, we also aimed to figure out inherent systematic biases of reanalyses and their potential causes to enhance the performances. The spatial distributions of positive (negative) biases vary by seasons and locations as shown in
Figure 6a,b. There are several reasons for the local, regional, and global uncertainties and biases in the reanalysis datasets. Previous studies found that the bias of temperature in between reanalyses and observations in mainland China was associated with elevations [
6,
21,
23]. Surface warming mainly caused by global warming is amplified in high-altitude regions due to their sensitivity to the warming effect and, thus, its underestimation in the numerical model would be possible [
6,
20,
42]. In our study, cold biases were clearly identified in the mountainous regions of Inner Mongolia, for which the reanalyses need to be improved to estimate the elevation effect on temperature under a warming climate. In the mountainous regions of Inner Mongolia and of the borders between China and North Korea, the cold biases in ERA5 are much less than those in NCEP2 and JRA55. One possible reason could be the better representations of terrain models and higher spatial resolutions used in ERA5, which can be better in estimating the elevation effect on temperature. In addition to the elevation effect, the warming effect of vegetation in the high-altitude regions [
43] might enhance the cold biases in the climate reanalysis. In the mountainous regions of Inner Mongolia, the forests have significantly increased over 1982–2015 since the large-scale afforestation policy was implemented by the Chinese Government [
44]. According to Wang and Zeng (2013), the land surface scheme adopted by the reanalysis and atmospheric boundary layer turbulence can strongly affect the qualities of reanalysis temperature data [
45]. Considering ERA5 with the improved land surface schemes, including soil texture map, seasonally varying monthly vegetation maps specified from a MODIS-based dataset and more satellite imageries, these applications in the model may be attributed to diminish cold biases in the mountainous regions of Inner Mongolia [
8]. As the human-induced land cover and land use (LCLU) changes have been significantly extended and intensified in the EAM region, the LCLU changes can affect not only air temperature (e.g., [
43]) but also the summer monsoon precipitation (e.g., [
18]). Therefore, the advancements in assimilation processes with the higher spatio-temporal resolutions of LCLU maps and numerical modeling for the LCLU effects on climates are important in accurately estimating the climate variables of reanalyses.
As for the systematic biases of precipitation, in ERA5, the significant wet biases appeared in the humid regions of southern China during spring and summer, which were different from JRA55 and CRU with dry biases and wet–dry random biases (see
Figure 6b). The contrasting result is consistent with those of previous studies as Amjad et al., (2020) identified that ERA-Interim overestimated the frequency of precipitation by over-compensating dry biases in humid areas [
46]. Some studies suggested that ERA5 could detect rainfall events and reproduce their spatio-temporal distributions, but generally overestimated hourly and daily precipitation in terms of the frequency (e.g., [
5,
47]). In previous studies, it was also described that ERA5 as well as other reanalyses were useful in representing frontal precipitation in dry seasons, compared to local convective precipitation in the wet season [
5]. The biases of precipitation could be associated with several reasons, such as complex topography, unreliable input data, and the flaws of numerical models in simulating precipitations of various formation types [
48,
49,
50]. Particularly, the monsoons as the regional climate phenomena can induce the precipitation biases in reanalysis data [
31].
The reanalysis biases could also have originated from the uncertainties in observational station data. The spatial inconsistency of stations, the difference in station environments with surrounding areas, mismanaging measurement, and the changes in the number of the observational stations could affect the reliability of observational data [
3,
4,
5,
6]. For instance, the high temperatures due to the urban-island effect can increase uncertainties and biases in the observational data as well as input data for data assimilation and, thus, reanalyses [
6,
20]. In addition, the errors involved in gauge measurement, such as the effect of wind, splashing, evaporations, and condensations, are counted as the global factors in underestimations of precipitation observations [
5]. It is obvious that these observation-related errors need to be considered in evaluating the reanalysis data.
5. Conclusions
This study evaluated the performance of temperature and precipitation from climate reanalysis data, both in spatial and temporal domains. The sets of the reanalysis data (ERA5, JRA55, and NCEP2) and CRU were evaluated in terms of correlation coefficients (Pearson, Spearman, and Kendall) and difference statistics (RMSE and bias), comparing the annual and seasonal temperature and precipitation from the three reanalyses and CRU with those from the total of 537 weather stations in the EAM region over the forty years of 1981–2020 in North Korea, South Korea, and Japan and the thirty years of 1981–2010 in China. Based on the spatio-temporal patterns of correlation coefficients, RMSEs, and biases and also Taylor diagrams, the potential uncertainties in the reanalysis data were identified.
The result of temporal comparisons using Pearson correlation coefficients showed that all the datasets, except for NCEP2, fairly reproduced the temporal variations in temperature from the observations at most of the stations (ratio of stations with 95.5%, 94.4%, and 94.2% on the annual for ERA5, JRA55, and CRU, respectively) with high correlation coefficients (R > 0.8, p < 0.01). However, the higher RMSEs, related with cold biases, in the three reanalyses and CRU appeared along the mountainous regions in central-western China. The spatial patterns of annual and seasonal biases among the reanalysis data were generally similar, but the magnitude of biases was clearly noticeable by the largest biases in NCEP2 and the least in ERA5. For temporal variations in precipitation, CRU had the largest number of stations that represented high correlation coefficients of R > 0.8 for the annual and seasonal means, except for winter. Among the reanalyses, ERA5 was better than the other reanalyses, showing more stations with high correlation coefficients of R > 0.8 for all the annual and seasonal means. The highest RMSEs of precipitation were in NCEP2, followed by ERA5 and then JRA55. While the difference between ERA5 and JRA55 was not identifiable, there was an opposite pattern of biases between ERA5 and JRA55 in southern China, shown as dry biases in JRA55 and wet biases in ERA5. The spatial associations of the reanalysis and observational data at each station also supported that CRU and ERA5 were better in representing the climatologically observed temperature and precipitation. Based on the Pearson correlation coefficients and RMSEs of temperature at each station, annually and in all four seasons, the performance was in the order of ERA5, CRU, JRA55, and NCEP2. The precipitation datasets were better in the order of CRU, ERA5, JRA55, and NCEP2 and CRU was better as it showed the highest Pearson correlations, annually and in all seasons, and the lowest RMSEs, annually and in MAM, JJA, and SON.
We suggested the best reanalysis data among ERA5, JRA55, and NCEP2 in the EAM region, according to the Pearson correlations, biases, and RMSEs by stations and the Taylor diagrams by climatological averages. ERA5 represented the best temporal variations in temperature, having the most stations with the highest Pearson correlations, annually and in JJA and SON, and the smallest RMSEs as well as the smallest biases for all seasons and annually. For precipitation, ERA5 was also the best reanalysis data as they showed the highest Pearson correlations at the most stations in the annual and all seasonal means, the smallest RMSEs, annually and in, MAM, JJA, and SON, and the smallest biases, annually and in, JJA, and SON. Additionally, Taylor diagrams revealed good spatial associations of ERA5 as ERA5 was the closest to the reference point of observed temperature among ERA5, JRA55, NCEP2, and CRU and, for precipitation, it was the second best, after CRU.
The three reanalysis datasets considered in this study generally well reproduced the temperature data by presenting significant correlations and low differences. However, it was detected that the regions with abrupt changes in elevations could have high uncertainties in the temperature from climate reanalysis due to the elevation effect as well as warming effect of vegetation on temperature, as the large biases and RMSEs appeared along the mountainous regions in central-western China. According to the correlation and difference statistics, ERA5 was suggested to be the most consistent reanalysis data with temperature observations. The statistical results for precipitation suggested that CRU has the strongest correlations with observations. Among the reanalyses, ERA5 better reproduced precipitation than the other two for all the annual and seasonal means. Nevertheless, the spatial patterns of the correlations and differences were heterogenous as their seasons and datasets vary, implying that additional validations with more temporally and spatially detailed scales are necessary before selecting the best precipitation reanalysis data in the EAM region.
As mentioned above, it is important to evaluate the performance of reanalysis data before adapting the data for the study. However, the previous studies in East Asia to evaluate reanalysis data focused on the country or the sub-country scales, mostly in China. In this paper, we focused on the entire EAM region to provide the varying performance of the reanalysis data across the monsoon region. Additionally, we offered insight into the regional-specific biases in reanalysis datasets, associated with the geographical factors of elevation and vegetation, which could be considered in improving the reanalysis data. Thus, the results of this study could be useful for EAM studies in selecting reliable reanalysis data for the entire, as well as regional, monsoon systems in East Asia.