1. Introduction
The power of the electromagnetic radiation from the sun that reaches the surface of the Earth is the main driver behind the weather and climate systems on the planet. The density of power received from the sun on a horizontal surface at ground level per unit surface is called the solar irradiance at the surface (SIS) and is expressed in W m−2. It is hereafter abbreviated as SIS and noted E. Equivalent terms may be found in the literature, such as solar flux, surface solar irradiance, downwelling solar irradiance, downwelling shortwave flux, downwelling surface shortwave flux, surface incoming solar radiation, surface incoming shortwave radiation, surface net downward shortwave flux, or global horizontal irradiance. The SIS is usually integrated over the solar spectrum (total SIS).
The Global Climate Observing System (GCOS) program has identified the SIS as an essential climate variable that helps us understand climate evolution and guides adaptation and mitigation efforts [
1,
2]. The SIS is also one of the essential variables in air quality [
3], the terrestrial and marine environment [
4,
5], and renewable energies [
6]. In addition to these domains, it impacts on human health [
7] and many other aspects of our daily lives and our activities (see many examples in [
8] or [
9] (pp. xxii–xxviii)). As a result, there is a significant and rising demand for precise and trustworthy estimates and measurements of the SIS, anytime, anywhere.
Numerical weather prediction (NWP) models, whether used for forecasting or reanalysis, are one of the methods used to estimate SIS. They can offer a global coverage and span of several years. This study focuses on three widely used atmospheric reanalysis datasets: the ECMWF Reanalysis version 5 (ERA5) produced by the European Centre for Medium-Range Weather Forecasts (ECMWF), the Modern-Era Retrospective analysis for Research and Applications version 2 (MERRA-2) developed by NASA’s Global Modeling and Assimilation Office (GMAO), and the Japanese 55-year Reanalysis with Quarterly updates (JRA-3Q) provided by the Japan Meteorological Agency (JMA). Reanalyses derive from weather forecast models used in a reanalysis mode to replicate observed conditions. A wide range of observations—including weather station measurements, satellite data, and GPS signals—are assimilated to estimate atmospheric state variables such as temperature, humidity, and wind speed. A description of the data assimilated in the three reanalysis is given in [
10,
11,
12,
13]. In contrast, measurements of the SIS are not assimilated in the three considered reanalyses and the SIS is a diagnostic quantity, derived from all related variables and calculated using a radiative transfer model. It depends on the complete set of radiatively active variables in the atmospheric column above the specific location. Therefore, reanalysis estimates of the SIS should not be confused with direct measurements as they carry the uncertainty inherent in the model simulations such as limited resolution of the model, uncertainties in the parameterizations used, and individual uncertainty of observations.
The SIS
E is often written as the product of the SIS received in cloud-free conditions, noted as
, and a cloud modification factor, also known as the clear-sky index, noted as
Kc.
A clear-sky index close to 1 would imply a sky without clouds, and a value close to 0 would imply a sky covered by optically thick clouds. Over short time periods, especially in broken-cloud conditions, might be greater than 1, resulting from a so-called cloud enhancement effect. Situations may also arise in which exceeds 1 over extended periods, notably due to the combination of high surface reflectivity and low solar elevation.
Oumbe et al. [
14] showed that at first approximation,
does not depend on the properties of the cloud-free atmosphere and depends only on the cloud properties and the reflective properties of the ground. This approximation yields errors comparable to the typical uncertainties of pyranometers, except when the ground albedo exceeds 0.7. This result shows that the effects of the cloudless atmosphere are entirely conveyed in the modelling of
and highlights the importance of accurate assessment of
. Clear-sky models aim at estimating
. They provide realistic upper limits of the SIS and contribute indirectly to quantifying the radiative effects of clouds.
varies with the date, time of day, geographic location, and gas concentrations. For total SIS, aerosol, water vapor, and ozone are the most influent variables, while other gases can be assigned standard concentrations. Extinction also depends on local temperature, density, pressure, and volume mixing ratio for the non-ozone, non-water vapor gases, necessitating knowledge of the vertical profiles of these variables, which is crucial for calculating the scattering effects from air molecules. Additionally,
is influenced by the elevation and reflective properties of the ground.
The present paper builds upon the result from [
14] and focuses on the estimates of the hourly means of
, made by each of the three reanalyses considered in the present work. More precisely, it aims at assessing the quality of these estimates by a comparison against coincident high-quality ground-based measurements of
. It is an attempt to identify potential drawbacks in the modelling of
and may help the agencies in charge of the reanalyses in further developments as uncertainties in
affect the estimates of the SIS in all conditions.
There are many scientific articles relating to the comparison between estimates of
from reanalyses and ground-based measurements on hourly, daily, monthly, or yearly basis. To the best of our knowledge, based on a comprehensive survey of the literature, none of them was dedicated to clear-sky conditions, except [
15], and none of them explored the clear-sky products of these reanalyses except [
15,
16].
Wang et al. [
15] compared the monthly mean of the clear-sky product of MERRA-2
to ground-based measurements of
under cloudless skies in Asia obtained by thresholding the monthly clearness index
, where the clearness index is the ratio of
to the corresponding extraterrestrial irradiance received on a horizontal surface located at the top of the atmosphere, noted as
. They found that MERRA-2 overestimates
and
significantly. Working with hourly measurements in the tropical Atlantic Ocean, Trolliet et al. [
16] did the sole study comparing the hourly clear-sky product from MERRA-2,
, and the corresponding SIS
from the McClear service of the Copernicus Atmosphere Monitoring Service (CAMS) [
17,
18], whose quality is well documented [
19]. Without providing any number, they reported that these datasets are similar and concluded that the MERRA-2 cloud-free SIS are likely accurate. As clear-sky products of ERA5 were unavailable at that time, the authors performed a visual inspection of daily profiles of
for randomly selected cloud-free days on which daily profiles of
were superimposed and found that
was underestimated by ERA5 in these cases.
In addition to [
15,
16], several studies have partially investigated clear-sky conditions. They dealt with ground-based measurements in several places: Scandinavia [
20], South Africa [
21], West Africa [
22], Pakistan [
23,
24], Indonesia [
25], or worldwide [
26]. Clear-sky conditions were determined using various approaches. Some works select clear sky instants by thresholding the daily clearness index
[
22,
23,
25,
26], or modified clearness index [
24], and others use the daily clear-sky index when an external clear-sky model is used [
20]. Mabasa et al. [
21] did not detail how the clear-sky days were determined, but from [
19], it appears that they used ERA5 in a first step and then a combination of a modified version of the algorithm of Reno and Hansen [
27] and McClear outputs.
This survey of the literature reveals several discrepancies in results. Several of them may be partly explained by discrepancies in the studied timescale, from hour to month. For example, Wang et al. [
15] report that MERRA-2 overestimates
and
significantly while others [
23,
24,
25,
26] wrote that MERRA-2 slightly underestimates the daily means
, and Trolliet et al. [
16] concluded that MERRA-2 cloud-free hourly means,
, are likely accurate. Other partial explanations of the discrepancies may be due to differences in the studied areas and periods as the quality of the estimates of the cloudless SIS may depend on the climate and changes in aerosol properties and water vapor among others. Discrepancies are also partly due in the manner of determining clear-sky conditions, as discussed above, as it has a great influence on results [
19,
28].
From this survey of the literature, we observed several discrepancies in results that may be partly explained by discrepancies in the studied timescale, discrepancies in the geographical area, and by discrepancies in determining clear-sky conditions, the latter having a great influence on results [
19,
28].
Sianturi et al. [
25] reported that ERA5 estimates
fairly correctly, in contradiction with other studies [
23,
26] that found a slight underestimation of
, or an overestimation [
20]. ERA5 was found to report cloudy conditions while the actual conditions are cloud-free [
16,
21,
26], and in another study ground-based measurements and ERA5 agree on the clear-sky category 77% of the time [
20]. As for JRA-55, the predecessor of JRA-3Q, the sole study reported that JRA-55 tends to underestimate
[
23]. As for MERRA-2, it slightly underestimates
[
24,
25,
26] but Wang et al. found that it overestimates
significantly.
These works give some indications of the ability of ERA5, JRA-3Q, and MERRA-2 to reproduce
on hourly, daily, and monthly bases though they provide little quantitative measurement of it, except for [
15]. Our study focuses on hourly means of the clear-sky SIS
and aims at extending the current knowledge as we believe that it would be fruitful to better know the strengths and weaknesses of the reanalyses in clear-sky conditions for further improvements in the diagnostic of
in all conditions. The present paper does not intend to rank reanalyses. There are many reasons for choosing one reanalysis over others; a ranking based solely on comparisons with in situ measurements at a limited number of stations during a limited period is unlikely to cover all needs and constraints.
Figure 1 exhibits an overview of the methodology adopted for this paper, with details given later for each step. The ground-based measurements collected every 1 min in the BSRN network [
29], noted as
, are checked for their plausibility. Detection of clear-sky instants is made on the retained 1 min data, yielding the time series
. From this series, hourly means of SIS in cloudless conditions, noted as
, are created and are compared to the coincident estimates of
given by each reanalysis. The comparisons are also performed on the clearness index
. As the estimates of the SIS depend on the calculation of the extraterrestrial irradiance
made by each reanalysis, a comparison is performed between the various
to check their consistency. This is justified in the case of MERRA-2 as it was reported in [
16] that MERRA-2 is using the mean solar time and does not account for the difference between the true solar time and the mean solar time, which is a function of the day in the year as a first approximation, thus bringing additional uncertainty in the estimates of
.
The data and models are briefly presented in
Section 2. The protocol for validation is described in
Section 3. The verification of the extraterrestrial irradiance
is discussed in
Section 4. There are several ways of discussing results when several models are dealt with. Here, the selected approach is to present and discuss the results of each reanalysis separately in
Section 5,
Section 6, and
Section 7, respectively, so that each Section provides the reader with a full view of the performances of a given reanalysis.
Section 8 concludes the paper. How to access the data is described in the “Data availability” section.
3. Protocol for Comparison
There are two kinds of comparison, one deals with the extraterrestrial irradiance, the other with the SIS. The protocol is fairly similar in both cases and applies to each reanalysis in turn. The comparison for the SIS was performed at each BSRN site and for all sites merged, for each trimester and for the ensemble of trimesters.
For each reanalysis, the comparison was made on the estimates provided by this reanalysis and either the extraterrestrial irradiance from the CAMS McClear service
, or reference values
and
for coincident instants only. The Pearson correlation coefficients, slopes, and offsets of the least-squares fitting lines were computed as well as the differences between coincident estimates and reference values, which were summarized by their mean, known as the bias or mean bias error, their standard deviation, and their root mean square error (RMSE). Relative biases, relative standard deviations, and relative RMSEs were expressed with respect to the reference means. Two-dimensional histograms of reference values and estimates were drawn. In addition, results were analysed as a function of the station, latitude, climate, elevation, year,
, reference means, and variances to evidence possible trends. All produced graphs are available as
Supplementary Materials.
The present protocol is not new and is similar to many used in comparable studies. It presents several drawbacks that may affect the comparison.
First of all, the reanalyses provide total irradiance, i.e., integrated over the whole spectrum, while BSRN measurements taken by pyranometers are based on a more limited range, as shown in
Table 1. The often-used BSRN spectral range [285, 2800] nm accounts for about 99% of the total irradiance with slight dependence on the optical properties of the clear atmosphere constituents and the air mass. The 1% irradiance difference resulting from the limited spectral sensitivity of the pyranometer is partly compensated for during the assessment of the sensitivity coefficient of the instrument, which is the conversion factor between measured thermopile voltage and solar irradiance. The sensitivity coefficient is determined during the calibration of the instrument by comparing readings of the pyranometer and the World Radiometric Reference given by cryogenic solar absolute radiometers that measure total irradiance with a very high accuracy (see [
39,
40] for details). Consequently, the sensitivity coefficient implicitly accounts for the average difference between the irradiance in the interval [285, 2800] nm and the total irradiance. This suggests that a slight difference, with a zero mean and dependence on air mass and atmospheric content, may exist between the reanalyses and measurements due to the difference in spectral ranges (see [
19] for further discussion on the difference in spectral ranges between measurements and models).
A second drawback is due to the mismatch in spatial representation of the field of
by the reanalyses and the BSRN pyranometers. The reanalyses provide
on cells several kilometers in size and, thus, do not contain the sub-cell variability of
that is partly perceived by the pyranometers. This adds to the discrepancy expected between the reanalyses’ outputs and
, whose magnitude cannot be predicted as it depends on the atmospheric conditions experienced by each station every 1 h. This is a well-known issue that encompasses the present case and applies to many domains in Earth sciences and others. Several examples are provided in [
19] showing the large spatial variability of the SIS.
Also to be taken into account is the site elevation as well as the presence of seacoasts or mountains and, more generally, of orographic discontinuities. Urraca et al. [
26] reported an increased positive bias in
due to the large size of the grid cells of ERA5 or MERRA-2 over the coasts where the spatial variability in
may be large. This effect may be minimized by a proper choice of BSRN stations located in homogeneous areas with respect to orography, vegetation, or the presence of large water bodies. The limited horizontal and vertical resolutions of the elevation model used in each reanalysis can also bring about a systematic error when the assumed elevation is less than the actual elevation of the station as it may imply, for example, a greater aerosol optical depth or water vapor content.
A final point is the uncertainty affecting the references and , which should not be neglected. This uncertainty is a combination of the uncertainty of the measurements themselves and of the construction of the hourly values with gaps.
4. Verification of the Extraterrestrial Irradiance Received on a Horizontal Surface
There is no measurement of the extraterrestrial irradiance at hand. This section aims at verifying the consistency over a year between the extraterrestrial irradiance received on a horizontal surface from the CAMS McClear service, , and that from each reanalysis, , , and . Differences (–), (–), and (–) were computed and analyzed. Null values were excluded from the analysis.
The nominal total solar irradiances of the CAMS McClear service and each reanalysis are very close: 1361 W m
−2, 1361.6 W m
−2, 1365 W m
−2, and 1365.2 W m
−2, respectively, for McClear service [
18], ERA5 [
10], JRA-3Q (according to ancillary metadata), and MERRA-2 [
41]. This would be reflected in a small bias in the extraterrestrial irradiance. The standard deviation of the differences should be close to 0 and the correlation coefficients should be close to 1, meaning that
and reanalyses vary similarly in time with similar amplitudes of the variations. As the extraterrestrial irradiance received on a horizontal surface depends on the latitude, the bias and standard deviations of differences depend on latitude too and exhibit symmetry in latitude, i.e., they do not depend on the hemisphere. They are greatest in the tropical belt and tend to decrease as the latitude increases northward or decreases southward. There is very little variation with the year.
Table 2 provides results at latitudes 0° and 45° and year 2017 for illustration. Although the actual numbers depend on latitude, conclusions do not.
The values for ERA5 and JRA-3Q are close to expectations. It is concluded that , , and are very similar, except for the nominal total solar irradiance. Note that this difference in the nominal total solar irradiance does not affect the computation of the clearness index (Equations (2) and (3)).
As for MERRA-2, the bias is close to 0 and the correlation coefficient is close to 1, although a little less. However, the standard deviations of the differences are much greater than expected: 34 W m
−2 and 21 W m
−2 (
Table 2), instead of 0 W m
−2. To further investigate this discrepancy between
and
,
Figure 7 displays the differences (
–
) at 09:00, 12:00, and 15:00 UT at longitude 0° and two latitudes, 0° (top) and 45° (bottom), across the year. During a given day, the differences (
–
) can vary from one hour to the next in a non-negligible way. They are close to 0 W m
−2 at 12:00 UT and increase in absolute value with
for a given day, although they decrease as latitudes increase or decrease towards the poles. They also vary throughout the year for a given hour.
Such variations are not expected and are reminiscent of those between the mean solar time (MST) and the true solar time (TST), which are drawn with the secondary axis in
Figure 7, by means of equations taken from [
9]. It is visible that (
–
) vary over the year in correlation or anticorrelation with the difference (MST–TST). The MST is defined in such a way that it is equal to 12:00 on average over the year when the sun reaches its highest during a day. It is equivalent to the UT time at longitude 0°. Due to slight changes in the angular speed of the Earth along its elliptical orbit, the sun does not reach its highest at 12:00 MST every day. The peak time defines the TST, which may differ from 12:00 MST by up to ±15 min approximately. This difference is often termed “equation of time” though it is not an equation in the usual sense. It depends on the eccentricity and obliquity of the orbit. Using the MST instead of the TST has an impact on the calculation of the solar zenithal angle
as the latter depends on the TST, and, therefore, it impacts the calculation of the extraterrestrial irradiance, and further the SIS.
,
, and
are calculated with the TST and do not exhibit such behavior.
Despite their efforts, the present authors found no explicit mention of MST or TST in the calculation of radiation in MERRA-2 in publicly available documentation. Trolliet et al. [
16] have suspected that the MST is used in MERRA-2 instead of the TST for computing
and
. This hypothesis, already partially confirmed by
Figure 7, is studied in more detail in the following. The authors acknowledge that this development is somewhat lengthy but emphasize the need to understand the origin of the error in MERRA-2, which deeply influences the MERRA-2 SIS under clear-sky conditions and, moreover, under all sky conditions at intraday timescales.
The hourly extraterrestrial irradiance in MST
and in TST
were calculated over a year using equations in [
9]. These equations are not very accurate but were used here for illustration purposes only. The differences (
) were superimposed onto those (
–
) in
Figure 7. It is clear that, at a given hour and latitude, the differences are similar. They have the same shape and the same amplitude; their averages differ by −1 W m
−2 at both latitudes. The standard deviations of their differences are 2 W m
−2 at 0° and 3 W m
−2 at 45° and the correlation coefficient is equal to 0.997 and 0.988, respectively. This gives credence to the use of the MST instead of the TST in MERRA-2 and to a miscalculation of the solar zenithal angle, which implies subsequent errors in
and SIS
.
A cross-correlation analysis between and , , and , respectively, was conducted to further verify that the lower correlation and higher bias observed for MERRA-2 is related to the omission of the equation of time. Indeed, if the time reference is correct, a maximum correlation will be obtained for a time lag equal to zero and only for this time lag. Otherwise, if the equation of time is omitted, the error in time is diagnosable by a maximum correlation for a non-zero lag evolving over the year. , , , and over the year 2021 at latitude 0° and longitude 0° were collected from the McClear service and the three reanalyses. They were resampled at 1 min resolution by linearly interpolating between two adjacent hourly values, yielding , , , and . More advanced interpolation methods would require accurate knowledge of time reference. The cross-correlation, i.e., the correlation between and temporally lagged , and , has been calculated on a moving window of five days and the temporal lag ranges from −30 to 30 min. In contrast to the standard cross-correlation analysis, our approach, relying on a moving window, captures the temporal evolution of the results.
Figure 8 exhibits the calculated cross-correlation as a function of time (horizontal axis) and time lag (vertical axis), where yellow colours indicate correlations less than or equal to 0.995, while blue colours indicate correlations close to unity. Maximal cross-correlation is obtained for a zero lag throughout the year for ERA5 and JRA-3Q, indicating that the temporal reference is correct. In contrast, the maximal cross-correlation of MERRA-2 is observed for a time lag different from zero and varying over the year 2021. The shape of the domain corresponding to the maximal cross-correlation is very similar to the yearly evolution of the equation of time (
Figure 7), supporting the hypothesis that the equation of time is missing in MERRA-2.
5. Results for Clear-Sky Irradiance from ERA5 and Discussion
Figure 9 exhibits the 2D histograms between
and
for each trimester and all stations merged. A good overall fit is observed between the estimates and the reference: the points are well aligned along the 1:1 line with limited dispersion, except during SON2017. The correlation coefficients are greater than 0.979 at any station, for any trimester, and the slope of the fitting line ranges between 0.93 and 1.08 (as detailed in the
Supplementary Materials). One may note in
Figure 9 that there is a tendency to underestimation of
by
: the bias for all data merged is −11 W m
−2.
Figure 10 exhibits the bias and its relative value with respect to the reference means in
(
Figure 5) at each station and all stations merged for each trimester and all trimesters merged. The bias ranges between −42 W m
−2 and 27 W m
−2. It is most often negative: among the 333 cases, 70% underestimate with a bias less than −5 W m
−2, 17% have a bias in the interval [−5, 5] W m
−2, and 13% a bias greater than 5 W m
−2. Thirteen stations have a bias per trimester always less than −5 W m
−2: TOR, LIN, CAB, PAL, PAY, CNR, LRC, MNM, ASP, FLO, DAA, GVN, and DOM. The largest negative biases for all trimesters merged are found at LIN (−28 W m
−2), CAB, and ASP (both −25 W m
−2). The bias has low variations from year to year or from season to season, except at FPE, SAP, and ASP. Though there are large variations in bias among stations, there is no visible relationship between the bias in
and the latitude, or with the effective solar zenithal angle
. Notable changes in bias are observed with Köppen–Geiger climatic classes (
Table 1), but no trend is visible.
Figure 11 exhibits the standard deviation of the differences (
) and its relative value with respect to the reference means in
(
Figure 5) at each station and all stations merged for each trimester and all trimesters merged. The standard deviation per trimester ranges between 2 W m
−2 and 44 W m
−2. In 57% of the 333 cases, the standard deviation is less than 10 W m
−2, and less than 20 W m
−2 in 94%. The standard deviations at the two southernmost sites GVN and DOM are always less than 10 W m
−2, while those at NYA, TOR, CAB, PAL, PAY, CAR, MNM, PTR, FLO, LAU, and SYO are always less than 15 W m
−2. Standard deviations per trimester greater than 25 W m
−2 are found at SAP, BOS, and GOB. In relative value, the standard deviation is less than 2% in 65% of the cases, and less than 3% in 90% of the cases. During a given season, the standard deviation has low variations from year to year, except at SAP and GOB. The contrast between seasons is weak in the southern hemisphere and more marked in the northern hemisphere, where values are among the greatest during MAM and JJA.
Figure 12 shows the 2D histograms between
and
for each trimester and all stations merged. As expected, the fit along the 1:1 line is not as good as for
but still it is quite good and the scatter is limited, except during SON2017, for which the bias is 0.01 and the standard deviation of differences is 0.03. The correlation coefficient between
and
per trimester is greater than 0.85 in most of the cases, with several lower values at SAP and DOM (see
Supplementary Materials). One may note in
Figure 12 that there is a tendency to the underestimation of
by
: the bias for all data merged is −0.016. Only 17% of the 333 cases exhibit a positive bias per trimester (see
Supplementary Materials). The features of the biases and standard deviations per trimester per station for
are similar to those for
(
Figure 10 and
Figure 11).
Such 2D histograms were drawn for each station and offer a great number of similarities (see
Supplementary Materials).
Figure 13 exhibits the 2D histograms between
and
at Gobabeb for each trimester as an example. Of particular interest here is the 2D histograms for JJA and SON. One may observe sorts of yellow horizontal lines that indicate that while
varies,
remains constant. This could reflect the influence of using a climatology that cannot reproduce the actual fluctuations in aerosol properties during this trimester, resulting in fluctuations in
, which are not captured by ERA5.
Figure 14 exhibits boxplots of the differences (
–
) for all stations merged for several classes of
for each trimester. The bias in
is always negative in this graph. The bias and the standard deviation of the differences vary with
and the trimester. The bias is more and more negative and the standard deviation increases as
increases. This effect is most pronounced during JJA. Changes from year to year are limited and less marked than those from season to season.
No trend with climate can be detected visually on the bias and standard deviation. Whatever the climate, the differences are most often negative; the standard deviation remains fairly constant, and the bias exhibits more fluctuations. Boxplots for each climate, each season, and each year for the classes exhibit complex patterns with no obvious trend. A thorough analysis is necessary and is beyond our investigative means.
Table 3 summarizes the bias, standard deviation of differences, and root mean square of differences (RMSD) in
and
at each station and all stations merged for all trimesters merged. The relative bias in
and
, respectively, is most often negative and ranges from −5% to 1% and −6% to 1%. The relative standard deviation in
or
has low variations as it is most often in the range [2, 3]%.
6. Results for Clear-Sky Irradiance from JRA-3Q and Discussion
Figure 15 exhibits the 2D histograms between
and
for each trimester and all stations merged. A good overall fit is observed between the estimates and the reference: the points are well aligned along the 1:1 line with limited dispersion. The correlation coefficients between
and
are greater than 0.980 at any station for any trimester, and the slope of the fitting line ranges between 0.95 and 1.09, with a mean equal to 1.01 (see
Supplementary Materials).
Figure 16 exhibits the bias and its relative value with respect to the reference means in
(
Figure 5) at each station and all stations merged for each trimester and all trimesters merged. The bias at Desert Rock (DRA) is always negative and very low, being between −13 W m
−2 and −63 W m
−2. Otherwise, the bias ranges between −36 W m
−2 and 33 W m
−2. The bias is in the range [−5, 5] W m
−2 in 114 cases (34%), in the range [5, 33] W m
−2 in 84 cases (25%), and less than −5 W m
−2 in 135 cases (41%). BIL, E13, and PTR exhibit the greatest biases. At a given station and during a given trimester, the bias has low variations from year to year except at the stations in Northern America (FPE, LRC, DRA, BIL, and E13) and TAT. The biases exhibit large variations from trimester to trimester, and from station to station. There is no visible relationship between the bias in
and the latitude, or with the effective solar zenithal angle
. Significant changes with Köppen–Geiger climatic classes (
Table 1) are observed but no trend has been identified.
Figure 17 exhibits the standard deviation of the differences (
−
) and its relative value with respect to the reference means in
(
Figure 5) at each station and all stations merged for each trimester and all trimesters merged. The standard deviation per trimester ranges between 3 W m
−2 and 46 W m
−2. In 54% of the 333 cases, the standard deviation is less than 10 W m
−2, and less than 20 W m
−2 in 96% of cases. MNM and the two southernmost sites GVN and DOM have standard deviations always less than 10 W m
−2. Those at NYA, TOR, LIN, CAB, PAL, PAY, CAR, PTR, FLO, and LAU, are always less than 15 W m
−2. Values greater than 25 W m
−2 are found at SAP and GOB. In relative value, the standard deviation is less than 2% in 75% of the cases and less than 3% in 96% of the cases. During a given season, the standard deviation has low variations from year to year, except at SAP and GOB. The contrast between seasons is weak in the southern hemisphere and more marked in the northern hemisphere, where values are among the greatest during MAM and JJA.
Figure 18 shows the 2D histograms between
and
for each trimester and all stations merged. The fit along the 1:1 line is quite good though not as good as for
. The scatter varies from trimester to trimester and is systematically high during JJA. The correlation coefficient between
and
per trimester is greater than 0.85 in most of the cases. Very low values are found at CNR during DJF2017 (0.672) and DOM during MAM2017 (0.481) (see
Supplementary Materials). The slope per trimester (see
Supplementary Materials) is comprised between 0.95 and 1.05 in 24% of the cases. It is greater than 1.05 in 33% of the cases and less than 0.95 in 43% of the cases.
The bias in
per trimester (see
Supplementary Materials) is close to 0 overall: 41% of the cases are in the range [−0.01, 0.01], 18% are greater than 0.01, 18% in the range [−0.02, −0.01], and 23% are less than −0.02. The bias is always less than −0.01 at CAB, DRA, and SYO and always greater than 0.01 at BIL and PTR. It varies from station to station, trimester to trimester, and year to year. The features of the biases and standard deviations per trimester per station for
are similar to those for
(
Figure 16 and
Figure 17).
Two-dimensional histograms were drawn at each station and offer a great number of similarities (see
Supplementary Materials).
Figure 19 exhibits the 2D histograms between
and
at Gobabeb for each trimester as an example. During JJA and SON, one may observe sorts of yellow quasi-horizontal lines (see e.g., JJA2015 for
) that indicate that while
varies,
remains constant. This could reflect the influence of using a climatology for aerosols, which cannot reproduce the actual fluctuations in aerosol optical properties during this trimester, resulting in fluctuations in
, which are not captured by JRA-3Q.
Figure 20 exhibits boxplots of the differences (
–
) for all stations merged for several classes of
for each trimester. During a given year and a given season, changes in bias with
are limited while the standard deviation increases as
increases. During DJF and MAM, differences are most often negative whatever the value of
, meaning an underestimation of
by
. Changes from year to year are limited during these seasons. During JJA and SON, differences are more centered on 0 for all years but 2016, which tends to be more negative. Other graphs show that the bias and standard deviation for each class of
fluctuate between stations; no trend with latitude or climate can be detected visually.
Boxplots for each climate, each season, and each year for the classes exhibit complex patterns with no obvious trend. For each possible class, the differences may be low or high, negative or positive. Biases fluctuate significantly; standard deviations tend to increase as increases.
Table 4 summarizes the bias, standard deviation of differences, and root mean square of differences (RMSD) in
and
at each station and all stations merged for all trimesters merged. The relative bias in
and
, respectively, ranges from −6% to 4% and −7% to 4%. The relative standard deviation in
or
has low variations as it is often in the range [2, 3]%.
7. Results for Clear-Sky Irradiance from MERRA-2 and Discussion
The results for MERRA-2 regarding
are affected by the mismatch in time system at intraday scales and, therefore, by a miscalculation of the solar zenithal angle
. This miscalculation is propagated to the irradiance at the top of the atmosphere,
, and also to the clear-sky SIS
and clearness index
due to the dependence of absorption and scattering by atmospheric constituents on
.
Figure 21 exhibits the 2D histograms between
and
for each trimester and all stations merged. Overall, the points are well aligned along the 1:1 line. The slope of the fitting line per trimester per station ranges between 0.91 and 1.11 but ranges between 0.95 and 1.05 in most cases (see
Supplementary Materials). The dispersion of the most populated classes (green and blue) along the 1:1 line is noticeable during DJF and SON and is limited during MAM and JJA. This is only a general feature when all stations are merged and the actual situation varies from station to station. Similar 2D histograms at each station are given in the
Supplementary Materials. As an example,
Figure 22 exhibits the 2D histograms between
and
for each trimester at Gobabeb. One may observe during DJF that the dots are not well distributed along the 1:1 line and form horseshoe-shaped spots along this line that are typical of the time system mismatch.
Figure 23 displays the correlation coefficients per trimester per station. They are greater than 0.967 but many of them (14%) are less than 0.985. The coefficients exhibit a pattern that has seasonal and latitudinal dependency. The smallest are observed in the northern hemisphere during DJF and SON to a lesser extent and during MAM at the South Pole. This pattern in time and space originates from the mismatch in time system and can affect intraday calculations.
The bias and its relative value with respect to the reference means in
(
Figure 5) at each station and all stations merged for each trimester and all trimesters merged are displayed in
Figure 24. The bias ranges between −39 W m
−2 and 34 W m
−2. It is in the interval [−5, 5] W m
−2 in 35% of the cases; 16% have a bias less than −5 W m
−2, 49% a bias greater than 5 W m
−2, and 17% a bias greater than 15 W m
−2. As a whole, the bias at a given station during a given trimester does not vary much with the year. It varies more from trimester to trimester and from station to station. The two southernmost stations have large negative biases, less than −15 W m
−2, during all trimesters. Except for these stations, there is no clear relationship between the bias in
and the latitude, though one may note that the stations between 49° and 58° (TOR, LIN, CAB, and PAL) and between −23° and −45° (GOB, ASP, FLO, DAA, and LAU) have low relative biases, mostly comprised between −2% and 2%, with no clear link with Köppen–Geiger climatic classes (
Table 1).
Figure 25 exhibits the standard deviation of the differences (
−
) and its relative value with respect to the reference means in
(
Figure 5) at each station and all stations merged for each trimester and all trimesters merged. The standard deviation per trimester ranges between 5 W m
−2 and 43 W m
−2. It is large as a whole: it is less than 10 W m
−2 in only 5% of the 333 cases. It lies in the range [10, 20] W m
−2 in 57% of the cases, in the range [20, 30] W m
−2 in 24% of the cases, and is greater than 30 W m
−2 in 14% of the cases. In relative value, the standard deviation is less than 2% in only 9% of the cases. It is in the range [2, 3]% in 34% of the cases, in the range [3, 4]% in 20% of the cases, and greater than 4% in 23% of the cases. There is no clear picture between the standard deviation and latitude or station or climate. What is clear is the increase in the standard deviation during DJF and SON compared to MAM and JJA. This is to be related to the similar pattern discussed in
Figure 23, which is attributed to the mismatch in time system.
Figure 26 shows the 2D histograms between
and
for each trimester and all stations merged. As expected, the fit along the 1:1 line is not as good as for
but it is still quite good and the scatter is limited. One may note during DJF, JJA, and SON, a group of dots of high
that lies well below the 1:1: line. This group is due to the southernmost stations: SYO, GVN, and DOM, for which the bias is largely negative, being, respectively, −0.02, −0.05, and −0.06 when averaged over all trimesters. The northernmost station NYA, also in a polar climate, also has a negative bias of −0.01 on average. These four stations have the most noticeable negative biases averaged over all trimesters; the other stations exhibit biases close to 0 or with positive values of up to 0.02 (
Table 5), as illustrated in
Figure 26. It may be hypothesized that these negative biases are partly due to incomplete consideration of the high albedo surfaces surrounding the sites, though the authors do not have the details of the calculation of
, and other reasons are possible. Among them, one can think of measurement issues due to frozen pyranometers [
42] not detected by quality control, although measurement issues cannot be the major reason as they should also affect the results for ERA5 and JRA-3Q, which do not exhibit behaviours at polar stations similar to those for MERRA-2.
The correlation coefficient between
and
per trimester is greater than 0.85 in most of the cases (see
Supplementary Materials). This coefficient does not exhibit the same pattern as that between
and
(
Figure 23) as the effect of the diurnal cycle of
is removed in
and this mitigates the effects of the mismatch in time system.
Similar 2D histograms were drawn at each station and offer a great number of similarities (see
Supplementary Materials). As an example,
Figure 27 exhibits the 2D histograms between
and
at Gobabeb for each trimester. Of particular interest here is the 2D histogram for SON2017, which can be compared to those obtained for ERA5 (
Figure 13) and JRA-3Q (
Figure 19) on which sorts of yellow quasi-horizontal lines indicate that while
varies,
and
, respectively, remain constant, reflecting the influence of using a climatology. MERRA-2 does not exhibit such a pattern and captures the actual fluctuations in
, though the scatter cannot be neglected. This could be attributed to the assimilation of aerosols’ properties in MERRA-2.
Figure 28 exhibits boxplots of the differences (
–
) for all stations for several classes of
for each trimester. Changes from year to year are limited for a given trimester. The bias and the standard deviation of the differences vary with
and the trimester. During DJF, the bias is negative and decreases as
increases. The differences are most often negative, meaning a very frequent underestimation of
by
. The pattern is fairly similar during SON, and, to a lesser extent, MAM, though the bias is positive, except at the greatest
, and differences are more balanced between negative and positive values. During JJA, the bias is always positive with no influence of
;
very frequently overestimates
. During each trimester, the standard deviation increases with
.
Boxplots for each climate, each season, and each year for the classes exhibit complex patterns; no trend can be visually detected, except at the polar stations, as already reported. Whatever the climate, the differences are frequently positive. They sometimes fluctuate with the year or with . Standard deviations tend to increase with . A deeper analysis of the links between differences, climate, and is beyond our investigative means.
Table 5 summarizes the bias, standard deviation of differences, and root mean square of differences (RMSD) in
and
at each station and all stations merged for all trimesters merged. The relative biases in
and
are often positive and range from −7% to 4%. The relative standard deviation in
or
has fairly low variations as it is often in the range [3, 4]%.
8. Conclusions
The present work has compared estimates of the hourly means in clear-sky conditions provided by the ERA5, JRA-3Q, and MERRA-2 reanalyses against coincident ground-based measurements from 28 BSRN stations covering four years (2015–2018) in an attempt to identify strengths and potential drawbacks in the modelling of and to inform the agencies in charge of the reanalyses of further developments as uncertainties in affect the estimates of the SIS in all conditions. It is stressed here that the presented work deals with hourly values , , and , thus complementing the studies already mentioned, dedicated to the analysis of daily and monthly means of SIS under clear-sky conditions.
The 28 stations are located worldwide but still cover a limited number of climates and a limited number of geographical and meteorological conditions. The BSRN measurements are 1-min measurements and were processed to identify clear-sky conditions. The selected measurements were then aggregated into hourly clear-sky irradiance . These processes decrease the information content of the initial 1-min BSRN measurements to a limited extent, but they are necessary to match the temporal resolution of the reanalysis.
It was found that ERA5 very frequently underestimates . The bias per trimester per station ranges from −7% to 5%, relative to the corresponding average in . The relative biases are most often negative: 70% of them are less than −1% and 52% are less than −2%. The bias varies little from one year to another and from one trimester to another at a given station, but, significantly, from one station to another. As a whole, the differences (−) tend to be more and more negative as the effective solar zenithal angle increases. The fidelity or consistency to is satisfactory as fluctuations in time are well captured by ERA5 with little dispersion: the correlation coefficients are very high and the standard deviations of differences per trimester relative to the corresponding average in are fairly constant in time and across stations and are small: they are less than 2% in 60% of the cases, and less than 3% in 90% of the cases. From our results, we conclude that ERA5 can be considered as fairly reliable in time, despite its use of an aerosol climatology, i.e., it offers the same level of uncertainty because the statistical indicators do not fluctuate significantly with time at a given station. However, we observed that this is not true at certain locations, such as Gobabeb, where the use of a climatology leads to notable fluctuations in statistical indicators over time. In contrast, ERA5 is not reliable in space because of the large fluctuations in bias from station to station.
Our results suggest some areas for improving estimates of the hourly means in clear-sky conditions provided by ERA5. Further analyses are necessary to understand why so frequently underestimates and the reasons for the spatial variability of the bias. In particular, experiments could be conducted to assess whether the aerosol climatology alone explains the underestimation and its fluctuations in space. Another hypothesis to be tested is whether the underestimation is partly due to a systematic underestimation of the ground albedo.
As for JRA-3Q, the bias per trimester per station ranges from −9% to 6%, relative to the corresponding average in . Values are fairly well balanced between negative and positive values: 28% of the cases are in the range [−3, −1]%, 34% in [−1, 1]%, and 19% in [1, 3]%. The bias most often exhibits low variations from year to year but large variations across trimesters and stations are observed. Fluctuations in time are well captured by JRA-3Q: the correlation coefficients are very high. The standard deviations of differences per trimester relative to the corresponding average in are small: they are less than 2% in 75% of the cases and less than 3% in 96% of the cases. Most often, the standard deviation has low variations across stations and years during a given season. In contrast, changes across seasons are weak in the southern hemisphere and significant in the northern hemisphere. Despite the low values of the standard deviations, these possible changes with the season do not guarantee a good fidelity to . No significant overall trend with was observed. From our results, we conclude that JRA-3Q is not reliable in time or space because of the large changes in bias and, sometimes, standard deviation from one station to another and from one trimester to another.
Our results suggest, as for ERA5, that additional analyses should be conducted to understand the origin of spatial and seasonal variations in bias and potentially identify ways to improve the model. One could also assess the impact of using aerosol climatology on the uncertainty of clear-sky estimates, especially to evaluate whether the observed differences in standard deviation and bias between regions and seasons can be explained by the use of a climatology.
The results for MERRA-2 regarding are strongly affected by the use of the mean solar time instead of the true solar time, which leads to a miscalculation of the solar zenithal angle . This miscalculation is propagated to the irradiance at the top of the atmosphere, , and also to the clear-sky SIS and clearness index due to the dependence of absorption and scattering by atmospheric constituents on . Among many examples of this influence on the statistical indicators, the correlation coefficients exhibit a pattern that has seasonal and latitudinal dependency that can affect intraday calculations. At a given station during a given trimester, the bias does not vary much with the year. It varies more across trimesters and stations. Only 43% of the cases have a relative standard deviation less than 3% and 23% of them have values greater than 4%. The standard deviation exhibits a marked seasonal pattern with an overall increase during DJF and SON compared to MAM and JJA. The standard deviation increases with while the link between the bias and is more complex and depends on the season. From our results, we conclude that MERRA-2 is not reliable in time or space because of the large changes in statistical indicators from one station to another and from one trimester to another.
Our recommendations for improving estimates of the hourly means in clear-sky conditions provided by MERRA-2 are as follows. Correct the time system. This should improve all statistical indicators. A study similar to this one may then reveal some features whose analysis can lead to further improvements. Attention should be paid to the extreme latitudes as the four polar stations exhibit the most noticeable negative biases compared to the other stations.
This work exposes the strengths and weaknesses of each reanalysis in clear-sky conditions and formulates suggestions for further improvements in the estimates of the hourly means of irradiance
in such conditions. Currently, none of the three studied reanalyses is satisfactory in cloud-free conditions and all are far from matching the quality of advanced models dedicated to the estimates of
such as CAMS-McClear and others, as reported in recent papers [
19,
28].
In addition to the assessment of the hourly means in clear-sky conditions provided by the ERA5, JRA-3Q, and MERRA-2 reanalyses, this paper has presented a new algorithm for detecting cloud-free instants that produces similar results to two other equivalent methods. This new algorithm requires further analysis to rigorously assess its strengths and limitations. This will be addressed in a separate article specifically dedicated to this investigation.