1. Introduction
The climate of the United Arab Emirates (UAE) is characterized by a trio of distinct atmospheric phenomena: pervasive dust storms that sweep across the landscape, dense fog that envelops the region, and the infrequent yet intense convective systems unleashing torrential downpours [
1,
2,
3,
4,
5,
6,
7,
8,
9]. Recent studies underscore a worrying trend of increased frequencies and intensities of these phenomena over the Arabian Peninsula, signaling a shift in regional climatic behavior [
10,
11,
12]. Understanding the full lifecycle and the consequent impacts of these meteorological phenomena is imperative for precise weather forecasting. However, the limited number of ground-based observations across the Arabian Peninsula [
13] presents a significant challenge. While satellite observations offer some insights, they fall short of providing a complete picture, primarily due to limitations in their temporal and spatial resolution, as well as challenges posed by cloud cover, aerosols, and humidity. Within this context, reanalysis products, which assimilate available observations with numerical models, are invaluable. They offer expansive datasets of surface meteorological elements, including air temperature, relative humidity, wind speed, atmospheric pressure, radiation fluxes, and soil properties, with adequate spatial and temporal resolution on a global scale. These datasets are essential for elucidating the processes that trigger and maintain these weather events on both regional and local scales.
Six major reanalysis products are currently available to the research community: the European Centre for Medium-Range Weather Forecasts (ECMWF) ERA-Interim, Climate Forecast System Reanalysis (CFSR), National Centers for Environmental Prediction/National Center for Atmospheric Research Reanalysis 1 (NNR1), Japanese 55-year Reanalysis (JRA55), Modern-Era Analysis for Research and Applications version 2 (MERRA2), and the fifth generation of the European Reanalysis (ERA5). Among these, ERA5 is particularly notable for its high spatial (0.25 degrees or ~27 km) and temporal (hourly) resolution [
14,
15]. Moreover, the recently introduced ERA5-Land (hereafter ERA5L) offers an even finer spatial resolution of 0.1 degrees (or ~11 km), albeit exclusively for terrestrial regions [
16]. Nonetheless, it is crucial to also assess the accuracy of the reanalysis data against ground-based observations, especially in desert regions like the Arabian Peninsula where long-term observational data are sparse [
13,
17]. The literature indicates that the newer ECMWF releases—namely, ERA5 and ERA5L—outperform their predecessor, ERA-Interim [
18], as well as other reanalysis products [
18,
19,
20]. It is also noted that the performance of ERA5 and ERA5L can be influenced by factors such as the orographic representation, seasons, and other regional characteristics, including surface albedo, land use and land cover (LULC), soil texture, and vegetation cover [
21,
22]. This highlights the need for a thorough evaluation of these products before their application in further research.
Among the many variables simulated by reanalysis datasets, the Land Surface Temperature (LST), or skin temperature, is one of the most important. It has been recognized as a fundamental parameter in climate change studies, highlighted by its recent designation as an Essential Climate Variable (ECV) by the Global Climate Observing System (GCOS, [
23,
24,
25]). This acknowledgment underscores the LST’s role in maintaining the Earth’s energy balance, influencing ecosystem processes and mediating energy and water exchanges between the surface and the atmosphere. The impact of the LST extends further, influencing climate variability, regulating energy fluxes across the land–atmosphere interface, and affecting the hydrological cycle. These aspects are integral to the complex dynamics of climate change, showcasing the critical nature of the LST in environmental studies [
26,
27,
28]. The LST is not a parameter that is commonly evaluated as it is not measured by standard weather stations. Additionally, satellite-derived products, such as the Moderate Resolution Imaging Spectroradiometer (MODIS), can exhibit substantial biases in arid/semi-arid regions [
29]. The in situ observations collected here will allow for the direct evaluation of the reanalyses’ LSTs in one of the driest and most inhospitable regions on Earth. Furthermore, in order to have a full understanding of the deficiencies in the simulation of the LST, it is important to assess how the different components in the surface energy balance are predicted. While satellite-derived datasets exist for this purpose, such as the Clouds and Earth’s Radiant Energy System (CERES), their performance in this region has not been comprehensively evaluated, and they may exhibit significant biases. For example, Jia, et al. [
30] conducted a comprehensive validation of CERES surface net radiation fluxes across various sites, covering a range of land cover types, including forests, scrublands, grasslands, bare lands, wetlands, tundra, ice, croplands, and urban areas. The authors reported mean biases of around 3 Wm
−2 and Root Mean Square Errors (RMSEs) of 26–34 Wm
−2 for daily and monthly data. Lower scores were found at higher latitudes, probably a reflection of the poorer performance of the CERES inversion process over snowy surfaces, and near coastlines, due to the coarser spatial resolution of the CERES data. Xu, et al. [
31] evaluated the long-wave fluxes in CERES and ERA-5, amongst other products, against in situ ground measurements at 288 sites around the world. For arid and semi-arid regions, the biases in the net longwave flux are typically negative and can exceed −20 Wm
−2, with RMSEs that can be higher than 30 Wm
−2. By and large, the performance of the CERES fluxes is poorer compared to temperate regions, for which the absolute biases are mostly within 10 Wm
−2. In this work, in situ measurements of the surface radiation fluxes are used to assess both ERA5/ERA5L and CERES, which will ultimately allow for an improvement of the model physics for the reanalysis products and a fine tuning of the retrieval algorithm for the satellite-derived dataset.
Prior studies have validated ERA5 and ERA5L data against various observations globally and regionally. Wang, et al. [
32] validated the ERA5L LST against the MODIS-derived LST over global and regional scale for the period of 2001 to 2020. They found a global mean bias of 1.8 K, with regional biases of 1.5 K for the Arctic, 0.7 K for North America, 0.4 K for Europe, 1.7 K for Russia, 0.1 K for Asia, 0.8 K for Africa, 1.9 K for South America, 0.1 K for Oceania, and 3.9 K for Antarctica. Dai [
33] extended the validation to the global scale, focusing on variables such as the 10 m wind speed, 2 m dew point temperature, and surface pressure from 1976 to 2005 using only in situ observations. The study found that ERA5 surface parameters are able to reproduce the observed diurnal variations over land but fail to do so over the ocean due to the lack of diurnal variations in sea surface temperature (SST) in the ERA5 data. Babar, et al. [
34] focused on solar radiation estimations in Norway from 2000 to 2015, comparing ERA5 with in situ and satellite data. They found that ERA5 provides reliable estimates for global horizontal irradiance (GHI) at high latitudes, with a mean absolute deviation (MAD) of 6.8 W m
−2 for monthly averages. Their analysis revealed that ERA5 exhibits a higher bias in solar radiation on shorter time scales due to cloud representation inaccuracies, but these errors decrease significantly over longer time scales, such as months and years. Yilmaz [
17] assessed temperature trends from both ERA5 and ERA5L against in situ observations in Turkey from 1951 to 2020, finding strong agreement. ERA5 slightly outperformed ERA5-Land, particularly in recent decades (2001–2020), demonstrating both datasets’ reliability for capturing temperature trends, especially in climate change studies. Tarek, et al. [
35] conducted an evaluation of the ERA5 reanalysis as a potential reference dataset for hydrological modeling across the continent, covering the period from 1979 to 2018. They utilized in situ observations to validate the reanalysis data, contributing to the understanding of ERA5’s applicability and reliability for hydrological studies in North America. The study found that ERA5 provided a significant improvement over its predecessor, ERA-Interim, in terms of accuracy for key hydrometeorological variables, making it a valuable dataset for hydrological modeling and water resource management across diverse climates. Concurrently, Bonshoms, et al. [
36] focused on the validation of ERA5L 2 m temperature and relative humidity on four Peruvian glaciers using observations from 2017 to 2019. They report that ERA5L showed strong performance for T2m with correlations above 0.80 and an RMSE around ±0.2 °C across sites. Rh2m had lower correlations (~0.4 to ~0.6), with an RMSE ranging from 3% to 7%. These results suggest that ERA5L can be reliably used to characterize T2m and Rh2m, particularly in wet outer tropical glacier regions.
With the advent of machine learning (ML) techniques, the utilization of ERA5 data has grown significantly. ERA5’s high spatial and temporal resolution datasets provide an invaluable resource for training ML models aimed at various applications, such as weather forecasting, climate trend analysis, and anomaly detection [
37,
38]. These models have shown great promise in enhancing the predictive capabilities and understanding of complex atmospheric phenomena. For instance, ML algorithms have been successfully employed to downscale ERA5 data, thereby improving the local weather prediction accuracy [
39]. Additionally, deep learning methods leveraging ERA5 datasets have facilitated the development of sophisticated models for precipitation prediction and extreme weather event forecasting [
40,
41,
42]. Despite these advancements, it is imperative to validate the ERA5 data against ground-based observations to ensure their reliability and accuracy before using them to train ML models. The inherent biases and uncertainties in reanalysis products necessitate this validation, particularly in regions with sparse observational data and an extreme environment, like the Arabian Peninsula. Rigorous cross-validation with in situ measurements can help identify discrepancies and refine the reanalysis datasets, thereby enhancing their applicability and trustworthiness in atmospheric research and operational meteorology [
42].
To the best of the authors’ knowledge, there are no comprehensive studies that have directly compared the ERA5/ERA5L products with surface observations over the Southeast Arabian Peninsula region, except for the work by Fonseca, et al. [
43], who compared the ERA5 2 m temperature and mean sea-level pressure at three airport stations for a one-week period in July 2018. Therefore, there is a dire need for thorough validation of ERA5 and ERA5L surface products over the Arabian Peninsula region.
In this study, we evaluate the ERA5 and ERA5L products against in situ measurements obtained during the Wind-Blown Sand Experiment–United Arab Emirates (WISE-UAE, [
44,
45]) in the Empty Quarter desert, also known as the Rub’ al Khali. The WISE-UAE experiment started on 25 July 2022 at Madinat Zayed (23.5761°N, 53.7242°E; elevation: 119 m) located 120 km southwest of Abu Dhabi, UAE. The ERA5 and ERA5L parameters evaluated in this study include the 2 m air temperature (T2m), 2 m relative humidity (RH2m), 10 m wind speed (WS10m), skin temperature (T
skin), and surface pressure. Additionally, net shortwave radiation (SW
net), and net longwave radiation (LW
net) fluxes from ERA5, ERA5L, and CERES are evaluated.
This paper is structured as follows:
Section 2 provides details about the observational and reanalysis datasets, along with the methodology employed in this study.
Section 3 presents intercomparison results at diurnal and daily scales across different seasons. The discussion and summary are presented in
Section 4.
3. Results
The evaluation of diurnal and seasonal variations in the variables targeted for validation was carried out distinctively across boreal seasons: winter (December to February, DJF), spring (March to May, MAM), summer (June to August, JJA), and autumn (September to November, SON). The findings are illustrated in
Figure 1,
Figure 2,
Figure 3,
Figure 4,
Figure 5 and
Figure 6. The diurnal cycle is shown in local time (LT).
Figure 1 show the diurnal cycles of T2m, RH2m, WS10m, and surface pressure from observations, ERA5, and ERA5L products for four different seasons. Similarly,
Figure 3 and
Figure 5 illustrate the diurnal cycles of skin temperature, soil temperature, and radiation fluxes (net shortwave and net longwave radiation), respectively. The biases in ERA5 and ERA5L parameters that we targeted in this study, with respect to observations for each hour of the day and during different seasons, are illustrated with box plots in
Figure 2,
Figure 4,
Figure 6 and
Figure 7. The results from
Figure 1,
Figure 2,
Figure 3,
Figure 4,
Figure 5 and
Figure 6 are discussed in
Section 3.1,
Section 3.2 and
Section 3.3.
3.1. Evaluations of Temperature, RH, Wind Speed, and Surface Pressure
The observed T2m exhibits a significant diurnal range, peaking in the afternoon and reaching a minimum in the early morning, which aligns with typical patterns in hyper- arid climates [
53,
55]. In comparison, both ERA5 and ERA5 Land demonstrate strong agreement with the observed diurnal cycle of T2m throughout all seasons, with ERA5 Land showing a marginally better fit to the observations (
Figure 1a–d). Each of the reanalysis products displays a warm bias during the night and a cold bias during the daytime hours (
Figure 2a). The range of median bias in T2m is between 0.2 and 2.9 K for ERA5 and −0.4 and 1.5 K for ERA5L during nighttime hours (
Figure 2a), and between −0.3 and −2.5 K for ERA5 and 0.2 and −2.5 K for ERA5L during daytime hours (
Figure 2a).
Remarkably, irrespective of the season, both ERA5 and ERA5L consistently exhibit a one-hour delay in capturing the daily maxima and minima of T2m (
Figure 1a–d). The bias in daily maximum and minimum T2m in ERA5 and ERA5L for the four seasons is shown in
Figure 8a. Irrespective of the season, ERA5L is in closer agreement with the observations, and the bias is positive (warm) with a higher median of 0.7 K during the winter season for the daily minimum T2m. The bias is negative (cooler) with a higher median of −0.55 K during the autumn season for the daily maximum T2m.
The bias in daily maximum and minimum T2m values in ERA5 and ERA5L with respect to the seven airport stations in the UAE for the four seasons is shown in
Figure S1 and Table S1. The range of median bias in the daily maximum T2m for ERA5 is −0.1 to 0.2 K, −0.8 to 0.0 K, −2.0 to −0.9 K, −0.2 to 0.1 K, −1.2 to 0 K, −3.3 to −1.9 K, and 0.3 to 3.9 K for OMAA, OMDW, OMDB, OMSJ, OMAL, OMRK, and OMFJ, respectively. Similarly, for ERA5L, the range of median bias in the daily maximum T2m is 0.3 to 1.6 K, −0.1 to 0.7 K, −0.8 to −0.5 K, −0.6 to 0 K, −0.9 to −0.5 K, −3.2 to −1.8 K, and −1.1 to 2.6 K for OMAA, OMDW, OMDB, OMSJ, OMAL, OMRK, and OMFJ, respectively.
Similar to findings at the WISE-UAE location, both reanalysis datasets (ERA5 and ERA5L) generally show a negative (cooler) bias in daily maximum T2m. The exception is OMFJ (Fujairah), in particular in ERA5, where a positive bias (warmer) is observed. An inspection of
Figure S1 reveals that it occurs predominantly during spring and summer, with the highest magnitudes in the summer months. The weather conditions at Fujairah in the summer are markedly different from those in other parts of the UAE to the west of the Al Hajar mountains: fog occurs predominantly during this season [
56], and low-level clouds are a regular occurrence in the morning hours [
57]. As the moisture-laden air from the Arabian Sea enters the Sea of Oman and encounters the Al Hajar mountains, it condenses and leads to the development of mesoscale clouds. At ~27 km resolution, ERA-5 may not correctly simulate the daytime cloud cover, resulting in warmer maximum air temperatures. There are other factors that may account for this bias. An incorrect representation of the sea surface temperatures, which exhibit a reduced seasonal cycle in the Sea of Oman compared to the Arabian Gulf [
58], is damped in ERA-5 [
59]. In addition, the topography of the nearby Hajar Mountains may contribute to localized warming through downslope winds (foehn effect, [
60]), where the air heats up as it descends into the coastal region. For the daily minimum T2m, most of the stations exhibit a positive (warmer) bias, which is consistent with findings from the WISE-UAE location.
Both reanalysis products, ERA5 and ERA5L, compare RH2m well with observations during daytime hours (
Figure 1e–h). During the remaining hours, ERA5 consistently exhibits a dry bias. ERA5L aligns closely with observations in the early morning hours (0–6 LT) across all seasons but displays a dry bias during the nighttime hours, similar to ERA5. The box plot of hourly bias in RH from ERA5 and ERA5L aligns with the diurnal composites for the four seasons (
Figure 2b). The range of median bias in RH2m is between −3 and −12% for ERA5 and 2 and −12% for ERA5L during nighttime hours, and between −1 and 6% for ERA5 and −4 and 5% for ERA5L during daytime hours.
The wind speed at the site typically remains below 6 m s
−1 and demonstrates a bimodal distribution, peaking around 16–18 LT due to the influence of daytime sea-breeze circulation (
Figure 1i–l). A secondary peak, occurring between 8 and 10 LT, is attributed to the downward mixing of momentum from the nighttime low-level jet to the surface [
53,
61,
62,
63]. This secondary peak tends to be less pronounced during the winter season due to smoother temperature inversion near the surface compared to summer [
8]. The ERA5 and ERA5 Land reanalysis products accurately replicate the diurnal wind speed pattern, aligning closely with daytime observations. However, during nighttime, the reanalysis products tend to overestimate wind speeds by approximately 2 m s
−1 across all seasons.
Both reanalysis products tend to underestimate the daily maximum wind speed at 17:00 LT, except during the winter season, as depicted in
Figure 1i–l. The maximum median bias in WS10m reaches 1.9 m s
−1 for ERA5 and 1.8 m s
−1 for ERA5L in winter during nighttime hours. Conversely, the bias is −1.2 m s
−1 for ERA5 and 1.0 m s
−1 for ERA5L during summer daytime hours, as illustrated in
Figure 2c. The seasonal variability in surface pressure closely follows that of air temperature, exhibiting lower pressure during the summer months, in line with the higher temperatures at the location, and the reverse during colder periods (
Figure 1m–p).
To assess the overall performance of the reanalysis products with respect to hourly averaged observations in the hyper-arid region, we calculated the mean bias and RMSE over the analysis period.
Table S2 provides an intercomparison of these metrics for ERA5 and ERA5L reanalysis products, focusing on T2m, RH2m, and WS10m, against observations from the WISE-UAE site and seven airport stations across the UAE.
For T2m, both ERA5 and ERA5L exhibit relatively small biases, with ERA5L generally improving upon ERA5. At the WISE-UAE site, ERA5 shows a slight positive bias (0.35 K), which is nearly corrected in ERA5L (−0.07 K). Across most airport stations, RMSE values for T2m are low, ranging from 1.22 K to 2.66 K, indicating good overall performance in capturing the diurnal temperature variations. However, ERA5L has a slightly increased RMSE at OMAA and OMDB. This increase in RMSE may be attributed to the complex urban environments surrounding these stations, where local effects such as urban heat island dynamics and land-use variability can introduce discrepancies between reanalysis products and actual observations. The importance of correctly representing the land use land cover and soil texture at these locations in models for the air temperature predictions has been highlighted by Temimi, Fonseca, Nelli, Weston, Thota, Valappil, Branch, Wizemann, Kondapalli, Wehbe, Al Hosary, Shalaby, Al Shamsi and Al Naqbi [
22]. The higher resolution, improved land surface parameterizations, and lack of data assimilation are responsible for the greater variability and slightly higher RMSEs in ERA5L, despite the more skillful bias predictions.
For RH2m, ERA5 tends to overestimate relative humidity at several stations, with particularly high positive biases observed at OMFJ (8.08%) and OMDB (6.32%). ERA5L significantly reduces this bias in most cases; for example, at OMFJ, the bias decreases from −7.6% to −0.8%, which seems to be mostly linked to more accurate temperature forecasts. Despite these improvements in bias, RMSE remains high across all stations, ranging from 8.22% to 18.01%, indicating substantial variability in relative humidity. This suggests that, while ERA5L performs better overall at least partially due to the improved air temperature forecasts, both reanalysis products struggle to fully capture the complexity of humidity dynamics, likely due to factors such as sea-/land-breeze and topography-driven circulations that are difficult to represent accurately at ~27 km resolution in hyper-arid regions. This is also the case for the biases in the representation of the variability of the sea surface temperatures in ERA5 [
59].
For WS10m, both ERA5 and ERA5L generally exhibit small negative biases, indicating a slight underestimation of wind speed at most stations, with RMSE values in the range of 1.32 m s
−1 to 1.84 m s
−1. The mean bias and RMSE values are relatively consistent between the two products, with ERA5L showing slight improvements at stations like OMFJ and WISE-UAE, probably due to improved physics and a higher spatial resolution, which are known to be important for wind forecasts in the region [
45]. This suggests that both reanalysis products perform reasonably well in representing wind dynamics in the region, with ERA5L offering a marginal enhancement in accuracy.
It is important to emphasize that the WISE-UAE site, located inland away from the coast and high terrain, is highly representative of the desert environment. The wind speed sensor at this site is precisely positioned at a height of 10 m, with no obstacles obstructing the prevailing wind directions. This setup ensures that the bias and RMSE values at WISE-UAE more accurately reflect true wind conditions in hyper-arid regions, in contrast to airport stations, where obstacles or environmental factors may affect the observations. As a result, the WISE-UAE site serves as a key reference point for evaluating the performance of reanalysis products in this region.
On a diurnal cycle, pressure peaks are observed at 10 LT and 22 LT, while the lowest pressures are seen near 4 LT and 16 LT. These fluctuations are associated with the semi-diurnal tide in this area, showing a notable amplitude of about 1 hPa [
53,
64]. In winter, the combination of decreased surface pressures in the pre-dawn and early morning hours, together with elevated relative humidity and reduced ground temperatures, promotes the formation of condensation and fog, phenomena that are notably frequent in this region [
1,
65].
Both reanalysis products exhibit semi-diurnal tide variability in surface pressure, but ERA5 underestimates the surface pressure, whereas ERA5L overestimates it, regardless of the season (
Figure 1m–p and
Figure 2d). The range of median bias in surface pressure is between −0.24 and −1.34 hPa for ERA5 and 0.68 and 2.07 hPa for ERA5L (
Figure 2d).
3.2. Evaluation of Skin Temperature
Within ERA5 and ERA5L, skin temperature is estimated using a sophisticated land surface model that incorporates variables such as air temperature, soil moisture, vegetation cover, and cloud cover [
15,
16,
66]. This model effectively simulates the energy and water fluxes at the land surface, facilitating the estimation of skin temperature across the globe. Despite the comprehensive coverage and advanced modeling techniques employed in ERA5 and ERA5L, the validation of these reanalysis skin temperatures with in situ measurements remains indispensable. In situ validation, involving the comparison of reanalysis data with ground-based observations, serves as a benchmark for accuracy, particularly in hyper-arid regions like the UAE. These regions present unique challenges for both remote sensing and reanalysis models due to extreme temperatures and scarce vegetation.
In situ measurements in such environments are crucial for identifying biases and uncertainties in reanalysis datasets, ensuring they accurately represent the unique land–atmosphere interactions occurring in arid landscapes. The validation of ERA5 and ERA5L skin temperatures in hyper-arid regions is not only essential for improving dataset accuracy but also for enhancing our understanding of how these regions respond to and impact climate change, reinforcing the global importance of precise LST data in climate science. To our knowledge, this is the first study in which the bias in ERA5 and ERA5L skin temperature has been analyzed for a hyper-arid region such as the UAE.
Figure 3 shows the diurnal cycles of skin temperature from observations and ERA5 and ERA5L products. The biases in ERA5 and ERA5L skin temperature, with respect to observations for each hour of the day and during different seasons, are illustrated in box plots in
Figure 4.
The peak in skin temperature occurs 1–2 h earlier than the peak in air temperature, with the highest daily minimum and maximum temperatures noted during the summer season (
Figure 4). This aligns with the soil temperature variability documented at the Al Ain location by [
53]. Regardless of the season, the reanalysis skin temperatures from both ERA5 and ERA5L demonstrate a small bias (<1 K) during the early morning hours. However, during daytime hours, both reanalysis products exhibit a pronounced warm bias, accompanied by a 1 h delay in the daily maximum skin temperature (
Figure 3). The bias in daily maximum and minimum skin temperature in ERA5 and ERA5L for the four seasons is shown in
Figure 7b. Irrespective of the season, ERA5 shows a positive (warm) bias. The bias has a higher median of 4.0 K during the spring season in daily maximum Tskin, while the median bias ranges from −1.5 K in summer to 1.0 K in winter for daily minimum T
skin (
Figure 8b).
3.3. Evaluation of Surface Radiation Fluxes
Net surface radiation (Rn) is critical in the Earth’s energy budget, and is calculated by the algebraic sum of net shortwave (SWnet = SWdown − SWup) and net longwave radiation (LWnet = LWdown − LWup). As a fundamental component in determining the Earth’s surface energy balance, Rn is pivotal in driving soil and atmospheric heat exchanges, evapotranspiration, and plant photosynthetic processes. Therefore, an accurate assessment of Rn is fundamental for models that simulate land processes, water cycles, and ecological systems. Measuring Rn is accomplished through a variety of methods, including ground-based observations, model simulations, remote sensing techniques, and reanalysis data assimilation. Ground observations, while highly accurate, are geographically limited, making it difficult to meet the demands of comprehensive regional or global analysis. Basic models can provide localized Rn estimates from standard meteorological data, but these lack broader applicability. Conversely, advanced global climate models offer Rn predictions on a global scale, though they often have to contend with limited spatial resolution.
Remote sensing offers a means to obtain high-resolution Rn data, with several algorithms designed to retrieve Rn using data from instruments such as MODIS, SEVIRI, and Landsat. However, the temporal resolution of remote sensing data is sometimes insufficient for certain meteorological and hydrological studies due to limitations imposed by satellite pass times and cloud cover. Reanalysis datasets fill this gap by combining observational data with atmospheric models to furnish Rn information with both high spatial and temporal resolution on a global scale. Nevertheless, the validity of these reanalysis datasets must be confirmed through a comprehensive comparison with ground-based measurements to ensure their accuracy and usefulness. The precision of Rn data is critical for deepening our understanding of climate change, the water cycle, and the degradation of the cryosphere. Inaccuracies in Rn data can lead to significant errors in energy balance studies and affect forecasts and evaluations related to weather, agriculture, and environmental preservation. Consequently, it is crucial to validate the components of Rn, particularly net shortwave and net longwave radiation fluxes, to maintain the integrity of climate models and the strength of related climatological research. This section aims to meticulously validate the net shortwave and net longwave radiation fluxes from ERA5, ERA5L, and CERES using in situ measurements from the WISE-UAE station, located in a hyper-arid region.
Figure 5 shows the diurnal cycles of net shortwave and net longwave radiation fluxes from observations and ERA5, ERA5L, and CERES products. The biases in ERA5, ERA5L, and CERES radiation fluxes, with respect to observations for each hour of the day and during different seasons, are illustrated in box plots in
Figure 6 and
Figure 7.
Regardless of the season, both in situ and satellite-based (CERES) observations exhibit similar diurnal patterns in net shortwave and net longwave radiation, with the peak in net shortwave radiation typically occurring at 12 LT and the lowest point of net longwave radiation at around 13 LT (
Figure 5). Nonetheless, there are notable discrepancies between CERES-derived net surface fluxes and those measured at the ground, as depicted in
Figure 5. The median bias for the daily maximum net shortwave radiation observed by CERES ranges between 24.5 W m
−2 and 150.9 W m
−2, as shown in
Figure 6c. A similar variation in median bias is observed for the daily maximum net longwave radiation flux by CERES, with values ranging from −10.3 W m
−2 to −37.6 W m
−2 (
Figure 8c). We found the net shortwave radiation flux bias of 24 Wm
−2 reported in this study to be within the −5 to +42 Wm
−2 range estimated by Inamdar and Guillevic [
67] at eight stations in the United States, including at an arid site in the Sonoran Desert. The scores for the net longwave radiation flux, on the other hand, are lower than those reported by Xu, Liang, He, Ma, Zhang, Zhang and Liang [
31] for 288 sites over the world, for which the seasonal mean biases are up to +5 Wm
−2 and the RMSE values are up to 15 Wm
−2. In desert sites, such as in the Australian Desert and Sonoran Desert, the biases in the surface net longwave radiation flux are in the range of −10 Wm
−2 to −20 Wm
−2, with RMSEs typically between 20 Wm
−2 and 30 Wm
−2, comparable to those reported here. It is important to note that deficiencies in the representation of the atmospheric composition and/or the surface properties will impact the performance of the CERES product for both shortwave and longwave fluxes.
These discrepancies may be due to several factors. The overestimation of net shortwave radiation by CERES could stem from inaccuracies in representing surface albedo in hyper-arid regions, as small errors in surface reflectivity can significantly affect absorbed solar radiation [
68,
69]. Additionally, CERES may not fully capture the effects of dust, aerosols, or minimal cloud cover, which scatter and absorb sunlight, leading to an overestimation of the net shortwave flux [
68,
69]. Model predictions of the surface fluxes in this region are highly sensitive to the amount and optical properties of aerosols [
70]. For longwave radiation, the underestimation observed may result from difficulties in accurately capturing surface emissivity and temperature (as noted in
Figure 4 and
Figure 8b, both reanalysis products have considerable surface temperature biases), particularly in regions with rapid nighttime cooling and heterogeneous surfaces. Moreover, atmospheric water vapor and aerosols, both influential in outgoing longwave radiation, may not be properly represented by CERES in arid environments [
68,
69].
In line with CERES surface fluxes, the reanalysis of net radiation fluxes, encompassing both shortwave and longwave radiation, exhibits a significant positive bias when compared to ground-based observations. Moreover, the reanalysis data for net shortwave radiation fluxes tends to display a 1 h delay in the peak of daily net shortwave radiation. Specifically, the ERA5 and ERA5L data for net longwave radiation flux show a lag of one and two hours, respectively, in reaching the daily minimum (
Figure 5). The biases in daily maximum of net shortwave and net longwave radiation fluxes in ERA5 and ERA5L across the four seasons are depicted in
Figure 8c. The median bias for net shortwave radiation flux at its daily maximum varies from 87.5 W m
−2 in summer for ERA5 to 97.8 W m
−2 in summer for ERA5L. Meanwhile, the median bias for net longwave radiation flux at its daily minimum ranges from −24.4 W m
−2 in autumn for ERA5 to −32.7 W m
−2 in autumn for ERA5L, as shown in
Figure 8c.
3.4. Intercomparison of Reanalysis/Satellite Products with Observations on a Daily Time Scale
In
Figure 9, a scatter plot analysis is shown and represents the correlation between the daily averaged in situ measurements and two reanalysis products, ERA5 and ERA5 Land, for four meteorological parameters: 2 m temperature (T2m), 2 m relative humidity (RH2m), 10 m wind speed (WS10m), and surface pressure. The plots are organized into two rows, one for each reanalysis product, and four columns, each corresponding to a different parameter.
Figure 10a,b, similar to
Figure 9a,e, focuses on T
skin.
Figure 11a–f depicts a scatter plot analysis of the correlation between daily averaged in situ measurements of net shortwave and net longwave radiation and the two reanalysis products, ERA5 and ERA5 Land, as well as CERES, across four seasons.
Temperature (T2m): The scatter plots for T2m indicate an exceptional correlation with the in situ measurements for both ERA5 and ERA5 Land, as denoted by the correlation coefficients (R) of 0.99 (
Figure 9a–e,
Table 3). This near-perfect correlation suggests that the reanalysis temperatures are highly representative of the observed temperatures, making them reliable for applications that require accurate temperature data, such as climate modeling and agricultural planning.
Relative Humidity (RH2m): The RH2m shows a very strong correlation with both reanalysis products, with an R-value of 0.95 (
Figure 9b–f,
Table 3). While not as high as the temperature correlation, this is still indicative of a very good agreement between the reanalysis humidity data and the in situ observations. The slight spread away from the 1:1 line, especially at higher humidity values, could suggest a tendency of the reanalysis to either slightly overestimate or underestimate extreme humidity levels.
Wind Speed (WS10m): Wind speed correlations are robust, with R-values of 0.88 for both ERA5 and ERA5 Land (
Figure 9c,g,
Table 3). The plots exhibit a consistent linear trend, although there is a visible spread, suggesting variability in the accuracy of wind speed predictions. This might be attributed to the complexity of capturing wind dynamics in hyper-arid regions, where local topography and thermal gradients can significantly influence wind patterns.
The overall performance metrics of ERA5 and ERA5L with daily averaged data for WISE-UAE and seven airport stations in the UAE are given in
Table S3. Overall, there are noticeable improvements in bias and RMSE for both temperature and wind speed when compared to the hourly averages. This suggests that the reanalysis datasets fail to simulate the diurnal cycle of the considered weather variables in the region, which may arise due to the deficiencies in the physics schemes, coarse spatial resolution, and incorrect representation of the diurnal variability in the sea surface temperatures [
59]. For T2m, ERA5L continues to outperform ERA5, with RMSE values being significantly lower than in the daily data (e.g., at WISE-UAE, RMSE drops from 1.94 K to 0.74 K). Similarly, for RH2m, the biases are smaller in the daily data, and ERA5L shows a clear improvement, particularly at stations like OMFJ. The RMSE for RH2m is also notably lower, reflecting less variability in the daily averages. WS10m shows consistent minor biases across both timescales, but the RMSE decreases in the daily data, indicating that daily averaging smooths out short-term variations and provides more stable performance metrics. Overall, the daily averaging of observations highlights ERA5L’s improved accuracy over ERA5, particularly in reducing biases and RMSEs for temperature and humidity.
Surface Pressure: Both reanalysis datasets show a perfect correlation (R = 1.00,
Figure 9d,h,
Table 3) with in situ surface pressure measurements. The data points tightly cluster around the 1:1 line, indicating that the reanalysis surface pressure is virtually indistinguishable from the tower measurements. This level of accuracy is crucial for a range of applications, including weather forecasting and climate research.
Skin temperature (T
skin): In both panels in
Figure 10, (a) ERA5 and (b) ERA5 Land exhibit a strong correlation (R = 0.98) with the in situ measurements, as indicated by the proximity of the data points to the solid black line, which denotes a perfect 1:1 match. This high degree of correlation suggests that both ERA5 and ERA5L reanalyses reliably represent the actual surface temperatures observed in the hyper-arid environment of the UAE across different seasons. It is also noteworthy that the distribution of data points across all seasons suggests a consistent performance of the reanalysis products throughout the year. However, slight deviations from the 1:1 correspondence line could indicate potential biases under specific seasonal conditions. Furthermore, a lead–lag relationship between the two time series is particularly evident during the winter season, as depicted in
Figure S2. The results presented here validate the reanalysis products as accurate tools for representing surface temperature in this region on daily time scale, which is crucial for climate studies and meteorological research in hyper-arid zones.
Radiation fluxes: The R-values indicate a strong agreement between in situ measurements and reanalysis/satellite products, with R-values of 0.86 for both ERA5 and ERA5L for net shortwave radiation, and a slightly lower correlation of R = 0.83 for CERES (
Figure 11a–c). For net longwave radiation, the correlations remain robust with R = 0.82 for ERA5, R = 0.84 for ERA5 Land, and R = 0.73 for CERES (
Figure 11d–f). These good correlations suggest that the reanalysis products and CERES satellite data generally capture the variability and magnitude of the surface radiation fluxes well. However, the CERES data show a marginally lower correlation, particularly for net longwave radiation, which may warrant further investigation into sensor discrepancies or algorithm differences. Deviations from the 1:1 line, particularly in certain seasons, may suggest seasonal biases or systematic errors in the datasets. Such biases are crucial for understanding model limitations and improving the accuracy of climate models in hyper-arid regions. Overall, the strong performance of ERA5 and ERA5 Land reanalysis radiation products on daily timescale across all seasons bolsters their utility for climate research, while the slightly lower performance of CERES highlights areas for potential refinement.
4. Discussion and Summary
In this study, we conducted, for the first time, a comprehensive validation of ERA5 and ERA5L reanalysis products, along with CERES radiative fluxes, using measurements acquired at a site in the Empty Quarter Desert in the UAE during the WISE-UAE field experiment. The period of data considered for this analysis spans from 15 July 2022 to December 2023. The parameters validated cover a broad spectrum of meteorological variables, including air temperature (T2m), relative humidity (RH2m), wind speed (WS10m), surface pressure, skin temperature (Tskin), and net radiation fluxes. The validation of reanalysis and CERES products was performed on diurnal and daily timescales across four seasons. To provide a broader spatial context for evaluating the performance of ERA5 and ERA5L across the UAE, we extended the validation to include seven UAE airport stations—Abu Dhabi (OMAA), Al Maktoum (OMDW), Dubai (OMDB), Sharjah (OMSJ), Al Ain (OMAL), Ras Al Khaimah (OMRK), and Fujairah (OMFJ)—for key parameters such as the 2 m air temperature, relative humidity, and 10 m wind speed.
Both ERA5 and ERA5L demonstrate strong agreement with the observed diurnal cycle of T2m across all seasons, with ERA5L showing a marginally better fit with the observations. Each of the reanalysis products exhibits a warm bias during the night and a cold bias during the daytime, with magnitudes generally within 2 K. The warm night biases and cold day biases in temperature are likely influenced by the unique thermal properties of desert surfaces, which have a low heat capacity and high thermal inertia. During the night, the rapid radiative cooling of the surface can lead to a decoupling of the boundary layer, reducing mixing and trapping heat closer to the surface, which may not be fully captured by reanalysis models. Similarly, during the day, the intense surface heating can lead to deeper boundary layers, which can result in cold biases if the models underestimate the depth of mixing or surface energy fluxes. An incorrect representation of the aerosol loading and the aerosol optical properties may also contribute to these discrepancies. ERA5L reduces biases in T2m across most locations, particularly at the WISE-UAE site, where the bias drops from 0.35 K (ERA5) to −0.07 K (ERA5L). Similar improvements are observed in the daily averaged statistics, where ERA5L further reduces T2m biases. Our analysis reveals that the wind speed at the site typically does not exceed 6 m s
−1 and exhibits a distinctive bimodal distribution, underscored by the dynamics of the sea-breeze circulation and the nocturnal low-level jet. The first peak in wind speed, ranging from 4–6 m s
−1 observed between 16 and 18 LT, is attributed to the daily advancement of the sea breeze inland. The secondary peak of 3–4 m s
−1 between 8 and 10 LT is associated with the downward mixing of momentum from the nocturnal low-level jet which starts after sunrise. This finding aligns with previous studies in desert environments that have demonstrated the downward mixing of momentum from elevated jets to the surface, a process that effectively enhances surface wind speeds in the early hours [
53,
61,
62,
63]. Notably, the prominence of this peak varies seasonally, with a diminished intensity observed during the winter months, suggesting a possible modulation by the seasonal variation in temperature gradients and the strength of the low-level jet.
While the ERA5 and ERA5L reanalysis products demonstrate a commendable accuracy in replicating the diurnal wind speed pattern, especially during the daytime, a notable discrepancy emerges during the nighttime. The reanalysis products tend to overestimate wind speeds by approximately 2 m s
−1, a bias that persists across all seasons. This overestimation may be related to the stable boundary layer conditions that typically form in desert environments after sunset. Under such conditions, reanalysis models may overestimate the strength of turbulent mixing and wind strength near the surface, leading to higher wind speed estimates than those observed. This is also evident in hourly WS10m, where mean biases range from −0.7 m s
−1 to 0.29 m s
−1 across different locations, with ERA5L showing slight improvements. An incorrect representation of the wind’s subgrid-scale variability and of the surface drag parameterization scheme are also factors that may explain wind biases in numerical models (e.g., [
22,
71,
72]). In the daily averaged data, ERA5L shows marginal improvements in RMSE for WS10m, with values ranging from 0.57 m s
−1 to 1.15 m s
−1. This overestimation may reflect limitations in the reanalysis models’ representations of nocturnal boundary layer processes, particularly the stabilization of the boundary layer and the attenuation of surface wind speeds under stable conditions [
9,
52].
At night, the Earth’s surface cools due to the emission of longwave radiation back into the atmosphere, leading to the formation of a stable boundary layer characterized by a temperature inversion. Under such conditions, the cooler, denser air near the ground reduces vertical mixing. The reanalysis products’ overestimations of 10 m wind speeds during nighttime could artificially enhance the mixing within the boundary layer, homogenizing the temperature profile near the surface by bringing warmer air from above down to the 2 m level. This mechanism could contribute to the warm bias in 2 m air temperature during nighttime, as the models may over represent the downward mixing of warmer air due to overestimated wind speeds.
During the day, we observed a 1–3 K cold bias in T2m within the reanalysis datasets. This can be linked to biases in (i) T
skin, which is also underestimated by up to 10 K; (ii) surface net shortwave radiation flux, which is underpredicted by up to 100 W m
−2; and (iii) surface net longwave radiation, which is underestimated by up to 50 W m
−2. The positive biases in Tskin and net shortwave radiation indicate that the reanalysis models might overestimate the solar radiation absorbed by the Earth’s surface. However, this does not translate into a higher air T2m, as evidenced by the cold bias in this field, suggesting an interference from other factors. The cold bias in T2m may arise from an underestimation of the surface sensible heat flux, possibly due to incorrect representations of surface properties, such as surface albedo, emissivity, and aerodynamic roughness length [
72]. Other factors include deficiencies in the representation of the observed aerosol loading and optical properties, cloud cover, topography, as well as deficiencies in the soil model. The latter may be more prominent in arid regions where the water table can be very shallow and, at some sites, may even reach the surface [
29]. The negative bias in net longwave radiation is an indication that the reanalyses emit more longwave radiation than observed, which can be attributed to the higher-than-observed surface skin temperature.
The observed time-lagged biases in capturing daily temperature extremes, as well as the delays in peak radiation times, suggest deficiencies in the models’ data assimilation processes, the physical representation of diurnal cycles, and the handling of surface and atmospheric boundary layer processes. Improving the representation of these processes, especially in arid environments, will be critical for enhancing the accuracy and reliability of reanalysis products. In addition, incorporating improved dust and aerosol representations into the models could address biases in radiation fluxes, as these particles strongly influence both shortwave and longwave radiation in arid regions. Continuous validation against high-quality, high-resolution observational data is crucial for reducing these biases, thereby enhancing the utility of reanalysis products for a wide range of applications in climate research and operational meteorology. This validation work contributes significantly to the ongoing evaluation and improvement of global reanalysis products, offering valuable insights for researchers utilizing these datasets for meteorological and climatological studies.
For future work, we recommend prioritizing enhanced field campaigns in desert regions to gather higher-resolution in situ data, including turbulence flux measurements, for the more accurate validation and refinement of model parameterizations. Additionally, incorporating machine learning techniques could help fine-tune model parameters in reanalysis products, offering a more adaptive approach to better capture region-specific processes.