Validating Hourly Satellite Based and Reanalysis Based Global Horizontal Irradiance Datasets over South Africa

This study validates the hourly satellite based and reanalysis based global horizontal irradiance (GHI) for sites in South Africa. Hourly GHI satellite based namely: SOLCAST, Copernicus Atmosphere Monitoring Service (CAMS), and Satellite Application Facility on Climate Monitoring (CMSAF SARAH) and two reanalysis based, namely, fifth generation European Center for Medium-Range Weather Forecasts atmospheric reanalysis (ERA5) and Modern-Era Retrospective Analysis for Research and Applications (MERRA2) were assessed by comparing in situ measured data from 13 South African Weather Service radiometric stations, located in the country’s six macro climatological regions, for the period 2013–2019. The in situ data were first quality controlled using the Baseline Surface Radiation Network methodology. Data visualization and statistical metrics relative mean bias error (rMBE), relative root mean square error (rRMSE), relative mean absolute error (rMAE), and the coefficient of determination (R2) were used to evaluate the performance of the datasets. There was very good correlation against in situ GHI for the satellite based GHI, all with R2 above 0.95. The R2 correlations for the reanalysis based GHI were less than 0.95 (0.931 for ERA5 and 0.888 for MERRA2). The satellite and reanalysis based GHI showed a positive rMBE (SOLCAST 0.81%, CAMS 2.14%, CMSAF 2.13%, ERA5 1.7%, and MERRA2 11%), suggesting consistent overestimation over the country. SOLCAST satellite based GHI showed the best rRMSE (14%) and rMAE (9%) combinations. MERRA2 reanalysis based GHI showed the weakest rRMSE (37%) and rMAE (22%) combinations. SOLCAST satellite based GHI showed the best overall performance. When considering only the freely available datasets, CAMS and CMSAF performed better with the same overall rMBE (2%), however, CAMS showed slightly better rRMSE (16%), rMAE(10%), and R2 (0.98) combinations than CMSAF rRMSE (17%), rMAE (11%), and R2 (0.97). CAMS and CMSAF are viable freely available data sources for South African locations.


Introduction
Solar radiation is the electromagnetic radiation or energy emitted from the surface of the Sun because of the fusion of atoms inside the sun [1]. Global horizontal Irradiance (GHI) is the electromagnetic radiation that reaches the Earth's horizontal surface after passing through the atmosphere and is the sum of direct normal irradiance (DNI), which is the incident radiative flux on the surface without interacting with the atmosphere and diffuse horizontal irradiance (DIF), which is because of the scattering of radiation by the atmospheric constituents [2]. Accurate knowledge of GHI is important for the technical and economic evaluation of solar energy technologies [3][4][5][6][7][8] and in the development and validation of empirical models [6]. Amongst the myriad of applications, GHI is important in climate change and environmental studies, agricultural sciences, hydrology, atmospheric research [6,7], and in astronomy [6]. GHI is also important in assessing ultraviolet effects on health as well as in material science [6], and in the development of a typical meteorological year of a country. Therefore, obtaining true solar measurements at a location is important.
GHI measurements taken from radiometric stations using at least a good quality (broadband) operational pyranometer remain the most accurate way to collect GHI data. However, GHI monitoring stations are sparse and expensive to install and maintain. As a result, data are only available for a limited number of locations [6,[8][9][10][11][12][13][14].
Alternative sources of GHI data include models such as the Ångström-Prescott model. Though the model with calibrated coefficients can be used to accurately estimate GHI data [13][14][15], the drawback might be the unavailability of sunshine duration data that are needed as an input at some areas. Another limitation of the Ångström-Prescott model is that the highest possible temporal resolution of estimated GHI is the daily average, so the model is not capable of estimating hourly averages. According to Žák et al. [16], GHI data can also be generated by interpolation of measured in situ GHI. The drawback of the interpolation method is the biases that are introduced by interpolation, and the additional errors introduced by using sparsely distributed in situ stations.
Given that GHI datasets are critical for better understanding of wider coverage of solar radiation [10], satellite and reanalysis based GHI datasets can be used to provide reliable alternative GHI data and compensate for the scarcity of monitoring stations by increasing the density of GHI data. The satellite or reanalysis-based datasets must first be validated by using GHI data from a good quality pyranometer [9,11,12,[16][17][18][19][20] to obtain proof of their reliability before they are used in different applications.
The satellite based, reanalysis based, and in situ measurements differ in spatial and temporal resolutions. This creates challenges when using satellite based and reanalysis based GHI datasets as alternative GHI sources [21]. To address these challenges, there has been an improvement in the spatio-temporal resolution of satellites and reanalysis-based datasets in the past few years [21,22]. According to Slater (2016) [22], the improvement was due to the advances in modeling and data assimilation systems. However, there are still challenges due to limited spatio-temporal coverage of observation data in some areas as required by the models, for example, in South Africa, there is only one Baseline Surface Radiation Network (BSRN) station. Baseline Surface Radiation Network (BSRN) was established in 1992, and is a centralized database that archives one minute temporalresolution in situ radiation measurements from 59 stations worldwide. The archived data are used for the validation of satellite data and improvement in radiative transfer calculations in climate models [23].
The algorithms that are used to convert satellite images to estimate GHI data depend on inputs of meteorological parameters (albedo, cloud thickness, aerosols, water vapor, and ozone content). When the parameters have not been measured in some areas, estimated or monthly climatologies are used. Climatological values might not fully represent changes in atmospheric constituents, and as a result, introduce errors in the estimated GHI data when used as inputs.
This study aims to contribute to the reviewed literature by quantifying the errors between the in situ measured GHI and estimated gridded datasets such as to validate satellite-based datasets (SOLCAST, CAMS, and CMSAF SARAH) and reanalysis-based datasets (ERA5 and MERRA2) relative to quality controlled in situ data from 13 reference stations managed by the South African Weather Services (SAWS). The validation was conducted on an hourly temporal scale over all six macro climatic zones of South Africa. This can be applied in studies for overall local accuracy of the datasets to be evaluated.
Validated datasets could be used to estimate GHI in the long-term and over a wide spatial resolution in South Africa. This will enable climate studies, which is generally not possible with ground observed data because there are no continuous long-term records covering decades and covering all areas of the country. The validated datasets could also be used as an additional quality control parameter of the measured data. The bias information from different sources of GHI in different areas of the country can be used as a basis for bias correction. The bias corrected data sources could also be merged with measured in situ data by applying interpolation with external drift kriging to produce most accurate GHI maps than when using an individual data source, as shown by [24,25].

Literature Review
GHI varies from point to point in an area and from time to time. Validation studies have been carried out worldwide to quantify the errors between in situ measured GHI and gridded data sources and is an ongoing process. This is because the algorithms and inputs that are used to generate the gridded data sources are continuously evolving. The findings, the challenges, and recommendations of some of the studies that have been carried out are summarized in Table 1.
The study by Bright [26] validated hourly SOLCAST GHI data, which is satellite based, against 48 BSRN stations by considering individual stations and grouping stations into global climates. There was a good agreement across all climates ( Table 1). The author emphasized that a comparative study against freely available alternative satellite based and reanalysis GHI datasets is necessary to gauge its performance. Yang and Bright [17] validated six satellite based and two reanalysis based GHI data sources using hourly data from 57 BSRN stations spread across all continents. They found that reanalysis products did not perform well compared to satellite-based products, since they overestimated irradiance in most sites ( Table 1). The authors found that each gridded product had a site where it performed better, so testing different available gridded datasets at different sites was emphasized. It was also shown that SOLCAST, which is a commercial satellite-based product, did not outperform the freely available products at all sites, but overall, it was the best performing product.
Merchand et al. [27] validated the CAMS satellite-based dataset against hourly GHI from 16 stations from the Royal Meteorological Institute KNMI in the Netherlands, a temperature climate without a dry season and warm winter. The reference stations were located inland and along the coast. They found that CAMS satellite-based datasets could very well estimate the hourly-to-hourly variation in GHI ( Table 1). The biases reported were suspected to be due to the McClear model, described by Lefèvre et al. in [28], and an input to the Heliosat-4 method, described by Qu et al. in [29], but failed to identify actual cloud free conditions in some regions due to errors in aerosols used as inputs to the McClear model. Negative rMBEs were reported from stations located along the southern coast and positive rMBE were reported from inland stations ( Table 1). The authors concluded that the data were of low moderate quality and disagreed with some studies, for example, in [9], which found the data to be of moderate quality. The disagreement was suspected to be due to the low number of cloud free days in the Netherlands.
Thomas et al. [30] used the CAMS satellite based against hourly GHI data from 42 stations in Brazil to assess the quality of the satellite datasets in Brazil. Prior to their validation exercise, the datasets were only validated in Europe and North Africa. They found that the CAMS satellite-based dataset could estimate hourly GHI ( Table 1). The biases increased with the viewing angle of the satellite. Additionally, the biases were high in tropical climates because of high humidity. CAMS overestimated GHI, but it was deemed suitable for solar energy applications.
Ameen et al. [20] validated CAMS satellite-based dataset in Northeast Iraq, which has a complex topography, using hourly observation data from nine stations. It was found that the CAMS satellite-based dataset captured the spatio-temporal trends of the measured data in clear sky, cloudy sky, and all sky conditions (combination of clear sky and cloud conditions). The performance was found to be better for clear sky and all sky conditions, but worse in cloudy sky conditions. For all sky, the results were as shown in Table 1. The dataset was recommended for use in solar resource applications. Further validation of the CAMS satellite-based data in other areas was recommended. Marchand et al. [9] validated the CAMS satellite-based dataset against hourly observed GHI from five stations located in northern and central parts of Morocco to investigate how the bias of the stations located in the same climate varied. They found that the dataset was capable of estimating GHI (Table 1) and CAMS slightly overestimated GHI. Overall, there was a variation from site to site, but the dataset was recommended as a reliable source of estimated GHI data.
Trolliet et al. [31] validated satellite-based datasets (CMSAF SARAH and CAMS) and reanalysis based (ERA5 and MERRA2) against hourly observed GHI data in the tropical Atlantic Ocean collected by five buoys of the Prediction and Research Moored Array in the Tropical Atlantic (PIRATA) network. The PIRATA network was established in the mid-1990s to study ocean-atmosphere interactions in the tropical Atlantic that affect regional weather and climate variability [32]. It was found that satellite-based datasets performed well (Table 1). Reanalysis datasets were found to report clear sky conditions while the actual conditions were cloudy and vice-versa. The reanalysis cloud parameterization scheme was suspected to be weak and contributed to large biases reported. ERA5 and MERRA2 were not recommended to estimate hourly temporal variability of GHI in the tropical Atlantic Ocean, but that they could be used in annual variability studies. The limitations of the study were that some buoys gathered African dust and the pyranometers may have been affected by the motion of the buoys; these factors might have contributed to the biases reported.
Since the satellite and reanalysis gridded datasets considered were generated by applying different algorithms, a variation in performance was expected in different stations, hence the need for quantifying errors in different locations.

Observation or Reference GHI Data
The South African Weather Service (SAWS) manages a radiometric network of 13 stations that are distributed in all six macro-climate regions in South Africa (with location and climate as shown in Figure 1 and given in Table 2). The color shaded climate zones shown in Figure 1 are based on SAWS macro classification and the climate codes, which are indicated in square brackets, are the "micro" zones based on the Council for Scientific and Industrial Research (CSIR) Köppen-Geiger climate classification (KGCC), as given by Conradie in [33].  Table 2. South African Weather Services radiometric stations with Köppen-Geiger climate classification (KGCC), altitude, latitude, climatic zones, average number of clear sky days per year (percentage of clear sky days per year), annual aggregated diffuse fraction, humidity, and the percentage of data outliers removed per station (colors are described in Table 3).

Parameter/Colour Humidity (H) Clear Sky Days (CL) Diffuse Fraction (DF)
Green Information on each of the stations (including site parameters, instrumentation, datalogging, and quality control) is given in [15,34,35]. The area of study included all 13 South African Weather Service radiometric sites.
In Table 2, the cells for clear sky days, diffuse fraction, and humidity are color coded. The coding represents different levels of clear sky days, diffuse fraction, and humidity. The meteorological information is used to explain the reason behind the performance of satellite-based and reanalysis datasets in different stations. The color-coding limits and distributions are summarized in Table 3.
ERA5 hourly cloud data from 2013-2019 were used to calculate the average clear sky days per year for each station, and this information was used to evaluate the performance of datasets based on the frequency of cloud occurrences. Diffuse fraction and relative humidity information for the study site were from Mabasa et al. (2018) [35].

Reanalysis Data
The choice of reanalysis datasets was based on the general free availability. The spatial and temporal characteristics as well as the level of accessibility and applicable regions of the reanalysis datasets are given in Table 4. Reanalysis datasets are generated by assimilating historical observation data from various platforms (ground observation, satellites, ships, and aircrafts) and numerical weather prediction models using a consistent algorithm. Reanalysis datasets have an advantage of global spatial coverage and long-time series [36]. ERA5, as described by Hersbach et al. (2020) [37], is the fifth generation European Centre for Medium-Range Weather Forecasts (ECMWF, Reading, United Kingdom) atmospheric reanalysis of the global climate. ERA5 has a spatial resolution of 0.25 • × 0.25 • and has an hourly time resolution. It uses a new advanced model cycle for Integrated Forecasting System (IFS Cycle 41r2) for data assimilation, which increased the computational efficiency and forecast accuracy. ERA5 uses climatological aerosol information, and the Global Ozone Chemistry Aerosol Radiation and Transport (GEOCART) stratospheric sulfate aerosol from volcanic eruptions is also included [37]. The dataset covers a period from 1979 to the present, hourly data are available in Joule per square meter (J/m 2 ), so to obtain the hourly values in watt per square meter (W/m 2 ), values should be divided by 3600 (number of seconds in one hour).
The Modern-Era Retrospective Analysis for Research and Applications, version 2 (MERRA2), which was introduced by Gelaro et al. (2017) in [38], assimilates space-based observations of aerosols and represents their interactions with other physical processes in the climate system. MERRA2 products are generated using the GEOS 5.12.4 model, which uses real time aerosols as inputs. MERRA2 data are available from 1980 to the present with two months delay, has a spatial resolution of 0.5 • × 0.625 • , and has an hourly resolution for surface irradiance variables [38].

Satellite-Based Datasets
The choice of satellite-based datasets was based on the general free availability and availability for research purposes. Information on the satellite datasets for this study is given in Table 4. The relationship between the satellite images and the actual ground GHI was established with an algorithm that combines cloud information, which is generated from a clear sky model and cloud index. Aerosol, water vapor, linke turbidity, and ozone information were also added as inputs to an algorithm, which will then generate estimated GHI at the surface. The technique is also described in the literature including Ameen et al. (2018) [20].
The Surface Solar Radiation Data Record-Heliosat Edition 2 (SARAH) dataset, as described by Pfeifroth et al. (2019) [39] is provided by the EUMETSAT Satellite Application Facility on Climate Monitoring (CMSAF) and covers the time period from 1983 to present, with the temporal resolution ranging from 30 min instantaneous values. Amongst the other products, the dataset provides GHI. CMSAF SARAH datasets are derived from the geostationary METEOSAT satellite service of the first and second generation, which are geostationary over Europe, Africa, and a small part of South America. In this case, the data were retrieved by using the Heliosat method to estimate the cloud index, clear sky radiative transfer model, and several climatological parameters (precipitable water vapor, monthly AOD climatology, monthly ozone climatology, and ground albedo). CMSAF SARAH data have a higher stability in early years due to the removal of erroneous satellite images during the transition from the first to the second generation METEOSAT satellite. CMSAF, as also given in Table 4, has a spatial resolution of 0.05 • × 0.05 • [39]. CMSAF SARAH was validated using twelve BSRN stations from three continents, eight stations from Europe, one station from South America, and three from Africa (South Africa, Algeria, and Namibia). The validating irradiance datasets were first quality controlled using BSRN methodology and outliers were discarded [40].
The Copernicus Atmosphere Monitoring Service (CAMS) radiation service is part of the Copernicus Program, an Earth observation program coordinated and managed by the European Commission in partnership with the European Space Agency. The CAMS radiation service is available for free via the CAMS [41] and solar radiation data (SoDa) [42]. It makes use of the Heliosat-4 method [29], which models the radiative transfer in the atmosphere to compute the solar radiation parameters. The McClear model [28] is used to estimate clear sky irradiance. The AVHRR Processing scheme Over cLouds, Land and Ocean (APPOLLO) method is used to process the satellite images from the German Aerospace Center database to yield information (cloud coverage, cloud level, and cloud type) for each pixel (3 km at nadir) every 15 min [29]. Ground albedo from Moderate Resolution Imaging Spectroradiometer (MODIS) is used, as given in Qu et al. (2017) [29]. The CAMS radiation service provides a time series of GHI with a spatial coverage of −66 • to 66 • in both latitudes and longitudes (i.e., North to South and East to West directions). The data are interpolated to the point of the user's interest. The time coverage of data was from 1 February 2004 to date with a two day delay and with a temporal resolution that ranged from 1-min average, 15-min averages, hourly averages, daily averages, and monthly averages [29,41,42].
The CAMS satellite dataset is regularly validated using global BSRN and non BSRN stations. The validating in situ irradiation datasets were first quality controlled using the BSRN methodology and outliers were discarded. The recent validation process in 2020 used 32 in situ stations and only one station from Southern Africa, Namibia (Gobabeb), was used in the recent validation process [43].
SOLCAST is a commercial company [17,26,44]. The SOLCAST method estimates solar irradiance from satellites by detecting cloud cover and characterizing cloud cover in terms of its impact on solar radiance, modeling the available solar irradiance under clear skies and then combining the estimate of the amount of solar irradiance reaching the Earth's surface after it passes through the clouds. In Africa, the EUMETSAT Meteosat satellite and REST2v5 clear sky model [45] with MERRA2 reanalysis inputs (for metadata) were used in SOLCAST GHI data estimation. The temporal resolution ranged from basic time series (hourly averages) and alternative time series (5 min, 10 min, 15 min, and 30 min). One-minute data are also available on request. The data are available from January 2007 to date with a seven day delay, through the SOLCAST website [44]. SOLCAST provides global coverage of the data, except for ocean and polar regions, with a spatial resolution of 1-2 km [17,26,44]. According to the website on SOLCAST validation and accuracy [46], SOLCAST has considered 46 of the BSRN sites for validation of GHI and reports a maximum bias deviation of 2.01%. The BSRN sites for steep mountain areas, oceanic, and polar sites were excluded in the SOLCAST validation.

Methodology
The methodology used in this study is summarized in the flow chart in Figure 2. The methodology consists of the following steps: (1) Preprocessing and quality control of in-situ GHI data; (2) Averaging one minute to 15 min; (3) Averaging four slots of 15 min to obtain hourly averages; (4) Gathering and preprocessing of satellite-based and reanalysis datasets; (5) Matching common time steps of in situ GHI and satellite/reanalysis-based datasets; (6) Calculation of hourly zenith angles and removing datasets that fall on points where the zenith angle is greater than 90 degrees; and (7) Calculation of statistical metrics.

Pre-Processing of Observation Data and Validation Process
One-minute average GHI data recorded from each of the 13 stations using a CMP11 pyranometer from Kipp and Zonen were pre-processed, subjected to "physical possible" limit check, which is aimed at detecting extremely large errors in the radiation data and "extremely rare" values checks of the BSRN QC test [23,47,48], and the outliers were removed. The minute data that passed the BSRN QC test were converted to 15 min averages; then hourly averages were calculated from four slots of 15-min averages. This methodology was used in [5,15,30,34,35,48,49]. All the datasets were synchronized to South African standard time (SAST) to avoid the misalignment of time series datasets.
Hourly average GHI observation data were also further processed by removing all the data points recorded on hours when the solar zenith angle was greater than 90 • . Hourly averages were then averaged to daily average values. Daily averages were then subjected to HelioClim model QC, described by Geiger et al. in [50]. Hourly data on days that failed the HelioClim QC test were removed before any further analysis. The percentage of outliers removed from each station is given in the last column in Table 2.
Solar zenith angles were calculated using the solar position algorithm (SPA) on Python PV_LIB [51,52]. Hourly average values were then compared to corresponding hourly average CAMS, CMSAF SARAH, SOLCAST, ERA5, and MERRA2.
SOLCAST, ERA5, MERRA2, and CAMS were sourced as hourly averages while CMSAF SARAH was sourced as instantaneous 30 min averages and then 2-time steps were averaged to obtain hourly averages.

Statistical Metrics
The statistical metrics that were used to quantify the difference between hourly estimated and hourly measured GHI are relative mean bias error (rMBE), relative root mean square error (rRMSE), relative mean absolute error (rMAE), and the coefficient of determination (R 2 ). These statistical metrics are given in the literature [53,54] and have also been described and applied in the authors' previous studies [15,34].

Most Feasible Gridded Dataset
The most feasible option from the satellite-based and reanalysis datasets at each of the 13 stations is determined by first considering the best performing model for each metric. The most feasible option at a station is then the model with the maximum count across all the metrics. A numerical relative rating of the optimal metric is calculated as the maximum count relative to the four metrics (rMBE, rRMSE, rMAE, and R 2 ).

Results
The threshold or the range of the statistical metrics that are used to benchmark the optimal and/or the best applicable hourly gridded products based on their performance when compared to measured hourly data in the study area are summarized in (Table 5). The interval metrics were based on the distribution of the results. Table 5. Range of the statistical metrics used to benchmark the applicability of the gridded datasets. Green represents excellent performance, blue good performance, and orange poor performance).

CMSAF
From Figure 3, CMSAF underestimated GHI in Upington (−2%) and De Aar (−1%) and overestimated GHI at the remaining 11 stations; rMBE ranged from 0-7%. In Figure 4, when considering rRMSE, CMSAF showed a very good performance at Upington (10%), the worst performance at Nelspruit (33%), and the rest of the stations had a rRMSE between 10% and 20%. From Table 6, when considering the hourly correlation R 2 results, CMSAF had R 2 > 0.96 at all stations except in Nelspruit (0.908), which shows a very good correlation between the in situ observed and estimated data. This is also demonstrated by scatterplots in Figures S1-S13

SOLCAST
From Table 7, which gives the most feasible hourly dataset per station by combining all the hourly metrices, SOLCAST was most prominent in 10 out of 13 stations for the hourly aggregates. From Figure 3, the SOLCAST dataset slightly underestimated hourly GHI at De Aar (−0.1%), Mthatha (−2%), George (−3%), and Durban (−2%) and it overestimated hourly GHI at the rest of the stations; rMBE ranged from 1% to 4%. From Table 6, SOLCAST had the correlation R 2 > 0.96 in all 13 stations, showing a very good agreement between the measured and estimated data. This was also demonstrated by the scatterplots in Figures S1-S13, given in the Supplementary Materials. SOLCAST hourly rRMSE was less than 15% at eight stations and more than 15% in five stations George (17%), Cape Point (19%), Durban (17%), Mthatha (17%), and Nelspruit (19%).
From Figures 3-5 and Table 6, the hourly metrics for SOLCAST varied as given below: • −3% ≤ hourly rMBE ≤ 4%; • 8% ≤ hourly rRMSE ≤ 19%; • 5% ≤ hourly rMAE ≤ 12%; • 0.969 ≤ hourly R 2 ≤ 0.996; Table 7. Best performing gridded dataset per hourly metric, most feasible model, and level of performance (rating). The colors were used to show the best performing gridded dataset based on the hourly statical metric per station and the most feasible dataset per station based level of rating out of 4 (the number of statistical metrics used). Green represents SOLCAST, yellow CAMS, blue CMSAF, orange ERA5, and red MERRA2.
From Figures 3-5 and Table 6, the hourly metrics for ERA5 varied as given below:
From Figures 3-5 and Table 6, the hourly metrics for MERRA2 varied as given below: • −1% ≤ hourly rMBE ≤ 23%; • 16% ≤ hourly rRMSE ≤ 50%; • 9% ≤ hourly rMAE ≤ 32%; and • 0.823 ≤ hourly R 2 ≤ 0.985. Figures 6-9, which shows the aggregated hourly averages, demonstrates that all the gridded datasets could capture the temporal variability of GHI in different sites nonetheless, with varying accuracy. The MERRA2 reanalysis dataset overestimated GHI and it did not perform well as its line diverged from the reference measured dataset as well as the other datasets, in almost all the stations. ERA5 reanalysis also overestimated, and it diverged from observation and satellite datasets lines in most stations. SOLCAST, CMSAF, and CAMS satellite-based datasets lines were adjacent to the observation in most stations, which showed good performance.     Table 2.

CAMS
The above statistical metrics suggest that the CAMS dataset has good performance in South Africa. The findings are similar to the studies by Marchand et al. (2018) [9] in Morocco; Ameen et al. (2018) [20] in Iraq; Yang and Bright [17] for 57 BSRN stations; Thomas et al. [30] in Brazil; and Trolliet et al. [31] in the tropical Atlantic Ocean, which found that CAMS could accurately estimate the hourly GHI and can therefore be used with quantitative confidence as a reliable alternative source of estimated GHI data. The relatively high rRMSE observed for Nelspruit, Thohoyandou, Mthatha, Durban, Cape Point, and George may be due to the characteristic high annual humidity (greater than 60% in Tables 2 and 3). The same tendency was also observed by Thomas et al. [30]. George and Nelspruit stations also had a high diffuse fraction (DF > 0.32). This implies that the performance of CAMS is affected by high aerosols, high humidity, and hence, many days with diffuse skies. When considering areas where there is infrequent cloud occurrence, there was no significance difference in bias, meaning that McClear can accurately estimate clear sky conditions in the study area. The previous study by Mabasa et al. [32] showed that the McClear clear sky model had a good performance in South Africa. For CAMS, cloud properties were derived from Meteosat satellites with 15-min temporal resolution.
For CAMS, the main inputs to Heliosat-4 are aerosol properties, total column water vapor, and ozone content, as provided by the CAMS global services every three hours. The lower temporal resolution in aerosol, total column vapor, and ozone content may account for the observed biases between ground truth and CAMS estimated GHI. The overall good performance of CAMS datasets might be attributed to having a high spatial resolution 3-5 km in Southern Africa.

CMSAF
The overall results show that there is relatively good performance by the CMSAF satellite-based dataset, which suggests that CMSAF is a viable tool to estimate GHI for sites such as the 13 stations in this study. CMSAF satellite-based dataset showed a relatively poor performance at Nelspruit Station. Nelspruit is a station with the highest diffuse fraction (Tables 2 and 3). CMSAF satellite-based dataset uses aerosol climatology as input to satellite retrieval algorithms as given by Riihelä et al. [49], however, aerosol climatology might not capture the aerosol climate variability. Mueller et al. [55] showed that aerosol climatologies used in CMSAF satellite retrievals algorithms were underestimated when compared to real aerosol measurements. The poorer metrics for Nelspruit might be due to the use of aerosol climatology information in the CMSAF satellite retrieval algorithm. No significant bias has been observed for the stations with high humidity and more frequent cloud occurrence compared to areas with low humidity and less frequent cloud occurrence. This means that the CMSAF cloud and water vapor parameterization scheme is effective in South Africa.
The CMSAF satellite-based dataset outperformed CAMS, ERA5, and MERRA2 at all 13 stations under this study (Table 7). CMSAF was outperformed by SOLCAST in 11 stations, this might be due to the use of hourly average from only two intervals (half hour and hour) instead of four intervals (15 min, 30 min, 45 min, and hour). The good performance of CMSAF datasets might also be attributed to having a high spatial resolution 5 km.

SOLCAST
These results show an overall very good performance of SOLCAST satellite-based dataset in all 13 stations in this study. The results agreed with the study by Yang and Bright [17] and Bright [26], who found that the SOLCAST satellite-based dataset performed well. Overall SOLCAST was outperformed by CMSAF (from Table 7) at Mahikeng and Cape Point Stations; this was similar to the findings by Yang and Bright [17], where SOLCAST did not outperform some freely available products at all sites. The stations where rRMSE was greater than 15% when referencing from Tables 2 and 3. They all had a low number of clear sky days (less than 5%), high humidity (greater than 60%), and high diffuse fraction (greater than 0.33), except Nelspruit Station with (more than 10%) number of clear sky days. This implies that frequent cloud occurrence, higher humidity, and higher diffuse fraction slightly affected the performance of the SOLCAST satellite-based dataset.
The excellent performance of the SOLCAST dataset might be due to the use of the REST2v5 [45] clear sky model to calculate the clear sky index when converting the satellite image to GHI. Sun X et al. [56] found that the REST2v5 clear sky model had an excellent worldwide performance. The use of very high spatial resolution satellite images 1-2 km enabled almost all features (e.g., terrain difference) in an area of interest or a grid to be properly identified and properly interpolated.

ERA5
Overall, the ERA5 reanalysis dataset showed a poor performance in estimating GHI in South Africa. The results were similar to Yang and Bright [17] and Trolliet et al. [31], who found that ERA5 datasets had poor performance, overestimated GHI for most sites, and were outperformed by the satellite-based dataset; rMBE was 72% (Table 1). ERA5 reanalysis data showed a very poor performance in areas with frequent cloud occurrences, high humidity, and high diffuse fraction. From Supplementary Materials Figures S1-S13, ERA5 estimates tended to estimate cloud conditions while observations showing actual conditions as clear conditions was shown by irradiance values, this might be contributing to the higher biases. Basically, ERA5 cloud models struggle to differentiate non cloud and cloud conditions. ERA5 uses climatological aerosol information [37] instead of measured aerosols which captures changes in atmospheric constituents. This might also be one of the reasons for poor performance. The low spatial resolution (0.25 • × 0.25 • ) of the ERA5 reanalysis data might also be a contributing factor to poor performance of the ERA5 datasets in South Africa.

MERRA2
Overall, MERRA2 reanalysis showed a very poor performance and it overestimated hourly GHI in all 13 sites under study. The performance of MERRA2 in South Africa was similar to the findings by Yang and Bright [17] for 57 BSRN stations and Trolliet et al. [31] in the tropical Atlantic Ocean (i.e., poor performance, overestimating GHI, and being outperformed by satellite-based datasets; rMBE was 76%) ( Table 1). MERRA2 data showed a very poor performance in areas with frequent cloud occurrences, high humidity, and high diffuse fraction. The very low spatial resolution (0.625 • × 0.5 • ) of MERRA2 reanalysis data might also be a contributing factor to poor performance of MERRA2, as it was the overall worst performing dataset in the study.

Conclusions
The study validated hourly global horizontal irradiance (GHI) from three satellitebased GHI datasets (namely SOLCAST, CAMS, and CMSAF SARAH) and two reanalysis based GHI datasets (namely ERA5 and MERRA2) against quality-controlled hourly in situ GHI recorded at 13 radiometric stations in South Africa. The study demonstrated that GHI from the satellite-based datasets had better performance than reanalysis-based datasets in South Africa. The overall statistical metrics used to gauge the performance of the datasets varied, as tabulated below (Table 8). SOLCAST was the best performing overall, while MERRA2 was the overall worst performing dataset. Freely available satellite-based datasets (CAMS and CMSAF) are recommended for use with quantitative confidence in diverse solar energy applications that require GHI data. Reanalysis based GHI datasets (ERA5 and MERRA2) are not good enough to be used in South Africa. Low spatial resolution, weak cloud parameterization schemes, and the use of climatological inputs instead of real in situ measurement in reanalysis GHI deriving algorithms might be some of the reasons behind the poor performance of reanalysis based GHI estimates in the study.