Next Article in Journal
From Exposure to Equity: Understanding Air Quality Impacts on Environment and Human Health
Previous Article in Journal
Characterization of VOCs at Shaoxing in the Winter Campaign: Sources and Chemical Reactivity
Previous Article in Special Issue
Influence of Clouds and Aerosols on Solar Irradiance and Application of Climate Indices in Its Monthly Forecast over China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evaluation of Cloud Fraction Data for Modelling Daily Surface Solar Radiation: Application to the Lake Baikal Region

1
Institute of Monitoring of Climatic and Ecological Systems, Siberian Branch of the Russian Academy of Sciences, 634055 Tomsk, Russia
2
V.B. Sochava Institute of Geography, Siberian Branch of the Russian Academy of Sciences, 664033 Irkutsk, Russia
3
Federal Research Centre for Information and Computational Technologies, 634055 Novosibirsk, Russia
4
Matrosov Institute for System Dynamics and Control Theory, Siberian Branch of the Russian Academy of Sciences, 664033 Irkutsk, Russia
5
Scientific Education Center “Environmental Dynamic and Global Climate Changes”, Yugra State University, 628012 Khanty-Mansiysk, Russia
6
Limnological Institute, Siberian Branch of the Russian Academy of Sciences, 664033 Irkutsk, Russia
*
Author to whom correspondence should be addressed.
Atmosphere 2025, 16(12), 1405; https://doi.org/10.3390/atmos16121405
Submission received: 14 November 2025 / Revised: 12 December 2025 / Accepted: 13 December 2025 / Published: 16 December 2025

Abstract

Accurately modelling surface solar radiation (SSR) is essential for environmental research but remains a significant challenge in topographically complex regions like Lake Baikal, where ground measurements are sparse. This study evaluates the performance of various open-access cloud cover products—from satellite sensors (AVHRR, MODIS) and ground-based observations—for modelling daily SSR totals, using a physical radiation model validated against in-situ measurements from 10 coastal stations. The results demonstrate that the choice of cloud data critically impacts model accuracy. The AVHRR satellite product yields the most reliable estimates (R2 = 0.54, RMSE = 4.538 MJ/m2), significantly outperforming both ground-based cloudiness observations and the ERA5 reanalysis dataset. This finding underscores that spatially continuous satellite data provide a superior representation of cloud attenuation for regional modelling than point-based ground observations or reanalysis. Consequently, a physical model driven by high-quality satellite cloud masks is recommended as an effective methodology for generating reliable SSR fields.

1. Introduction

Solar radiation exerts a direct and governing influence on the planetary energy equilibrium [1]. Diurnal irradiance fundamentally dictates the thermal dynamics of the terrestrial surface [2,3], drives key biogeochemical processes including the hydrological cycle, and serves as the fundamental input to both photovoltaic and thermal solar energy conversion systems [4,5]. Furthermore, through its synergistic interactions with atmospheric variables, solar radiation emerges as a pivotal parameter in climate modelling and formulation of strategic policies for agricultural and hydrological resource management [6]. Consequently, it constitutes a critical, indispensable variable for sustainable land-use planning, particularly within regions exhibiting high susceptibility to climatic changes [7,8]. Changes in surface solar radiation (SSR) directly affect climate data, the hydrologic cycle, sensible heat, latent heat, evaporation, photosynthesis, ecological life, migration, and many other important parameters.
The significant spatiotemporal variability of SSR makes estimation of total radiation challenging [9], especially in regions with complex atmospheric dynamics and sparse meteorological stations [8,10]. The SSR value depends on many parameters such as solar angle, hour angle, topography, air composition, surface morphology, and conditions [11]. All these parameters generate great uncertainties in obtaining SSR, especially for short temporal partitions such as instantaneous, hourly, or daily values.
Despite its paramount importance, establishing comprehensive and dense observational networks for SSR is often not feasible. The high cost of precision radiometric equipment, coupled with significant demands for ongoing maintenance and calibration, means that meteorological stations equipped with radiation sensors are scarce, especially in topographically complex or remote regions [12]. This spatial sparsity of direct measurements creates a critical data gap, necessitating a heavy reliance on modelled estimates to obtain a continuous understanding of solar radiation patterns across the landscape [13].
In response to this need, a wide spectrum of modelling approaches has been developed, ranging from traditional empirical and regression-based methods to sophisticated statistical, time-series, artificial intelligence, and satellite imagery-based models [14,15,16,17,18]. However, extensive reviews of the literature consistently reveal that no single model has emerged as universally superior [19,20,21]. The performance of any given model is highly contingent on local atmospheric conditions, geographic specifics, and quality of the input data [15]. This inherent regional dependency underscores the necessity for localised model development and validation, as an approach that performs well in one climatic or geographic context may prove inadequate in another [22]. Consequently, the initial aim of this paper is to develop and validate a simulation of SSR for a specific, environmentally significant region—the Lake Baikal.
Machine learning methods are widely applied for precise solar radiation forecasting. Evaluation of the performance of different machine learning methods for solar radiation prediction in Konya, Turkey [23] showed that the Long Short-Term Memory model performed best. A combined artificial neural network approach was successfully applied for forecasting solar radiation in Iran [24]. A novel aggregated model of outputs from different machine learning models demonstrates better results than a single model output [25]. A stacking model approach with categorical boosting feature selection for China revealed that the sunshine duration and ozone mass concentration significantly affect solar radiation prediction [26].
Some authors compare global radiation products and suggest correction methods to improve and reduce biases in radiation characteristics [27,28]. Saint-Drenan and Wald [29] compared ERA5, JRA-3Q, and MERRA-2 reanalyses with ground-based measurements from 28 stations located worldwide, and noted that none of the three reanalyses is reliable in space. Xian et al. [30] reaffirm that the Random Forest model is a highly effective tool for correcting ERA5 ECMWF (European Centre for Medium-Range Weather Forecasts) shortwave radiation estimates over Hubei Province, China.
Many studies rely on satellite remote sensing data from platforms such as Terra/Aqua MODIS [13,31,32], and Landsat-8 [6,33], as well as ASTER, AVHRR, ETM+/TM, GOES, MISR, POLDER, and VEGETATION [34]. Furthermore, various meteorological and solar radiation databases are commonly used, including NASA SSE, NASA POWER [35], ERA5 [27,30], SoDa [36], NOAA CLASS [34], the Baseline Surface Radiation Network [29,37], the World Bank’s ESMAP Solar Resource Mapping, and MERRA-2 [27,38].
Multiple studies focused on solar radiation using Geographic Information Systems (GIS) [39,40,41,42]. This research aims to validate its results against existing geoinformation modules [36] and apply them to specific cases. These applications include modelling snowmelt dynamics [43], optimising agricultural ecosystems [31,33,44], identifying optimal sites for solar power plants [45,46,47,48,49,50], and assessing forest fire risk [36,51]. The spatial resolution of the developed models varies from tens of metres to half a metre [51].
Among different tools, which are useful for integrating layers with different spatial resolutions, the r.sun function embedded in GRASS GIS has been selected as the most suitable to capture complex variations resulting from weather and geographical parameters [52]. The r.sun module is integrated within QGIS software version 3.36.2 [36,41,42,53]. The Area Solar Radiation tool within the ArcGIS version 10.4 Spatial Analyst toolbox is also popular [43,44,45,46,47,48]. This tool was integrated into major GIS platforms and its consequent widespread adoption, despite its reliance on 2.5D surface models for calculations.
It is widely recognised that the choice of both the modelling algorithm and, critically, the input data sources leads to substantial variations in the final radiation estimates. This study employs a well-established physical modelling approach, which explicitly and accurately accounts for extraterrestrial solar radiation and its interaction with the Earth’s movement and atmosphere. This deterministic foundation provides a robust framework for calculating potential clear-sky radiation. Within this framework, the attenuation of radiation by the atmosphere, particularly by cloud cover, becomes the most significant and variable factor determining the actual radiation that reaches the Earth’s surface. Therefore, the accuracy of the cloudiness data used as input is the single most critical element for generating valid modelling results.
The central novelty of this research is its systematic evaluation of multiple, diverse cloudiness data products within a physically coherent modelling scheme for the Lake Baikal region. This area, characterised by its immense lake, complex mountainous terrain, and unique microclimates, represents a significant gap in the existing literature, as it has not been the subject of a study that concurrently utilises in-situ observations of incoming radiation and compares the performance of various satellite-derived and reanalysis cloudiness products. Our work directly addresses this gap by rigorously testing which cloudiness data source yields the most reliable daily radiation totals when validated against a dedicated network of ground-based measurements. The aim of this paper is therefore to analyse the quality of various cloudiness data products for modelling daily SSR totals in the Lake Baikal surroundings, using a unique set of in-situ observations from 10 automated weather stations as a benchmark for validation.
Our research is positioned to address a specific literature gap. There have been no previous works that systematically combine remote sensing cloudiness data, physical radiation modelling, and a dedicated network of in-situ observations specifically for the Lake Baikal shoreline. This integrated approach is crucial for this complex environment, where the interplay between the deep lake and the surrounding topography creates highly localised microclimates that are poorly captured by generic models.

2. Materials and Methods

2.1. Materials

The study area (Figure 1) is located within the Central Ecological Zone of the Baikal Natural Territory [54]. Lake Baikal is a UNESCO World Heritage Site [55], renowned as the world’s oldest, deepest, and most voluminous freshwater lake. Its unique geomorphology, characterised by a massive water body surrounded by complex mountainous terrain, fosters highly localised and dynamic microclimates that are crucial for the region’s numerous endemic species. Solar radiation is a primary climate-forming factor. Given the sparse network of actinometric stations in the area and the complete absence of assessments of the surface radiation and heat budget under complex orographic conditions, this research is highly relevant. Data on radiation levels under natural conditions is essential for assessing changes in the coastal biodiversity of Lake Baikal amidst contemporary climate change. The territorial accessibility and the installation of automatic weather stations equipped with radiation sensors make this chosen area an ideal natural laboratory for studying solar radiation modelling.
The specific focus on the lakeshore is deliberate. The land-water interface is a critical and highly sensitive zone where sharp gradients in surface-atmosphere energy exchange occur [56]. Studying sites along this transition allows us to directly evaluate how well different cloudiness products can capture the pronounced spatial variability in solar radiation caused by the lake’s moderating influence. While the sites may be difficult to interpret, this complexity is precisely the point—it provides a rigorous test for the models.
The input data used in this study for modelling daily incoming solar radiation was: a digital elevation model (DEM) and its derivatives (slope and aspect), cloud cover, and aerosol optical thickness (AOT), which was converted into the Linke turbidity coefficient. All datasets were reprojected into the EPSG:32648 coordinate system.
Digital Elevation Model (DEM). The Shuttle Radar Topography Mission (SRTM) data [57] with a spatial resolution of 30 × 30 m was selected as the DEM. This choice was based on the findings of [10], which reported higher reliability of SRTM data due to lower levels of random errors compared to alternative DEMs.
Linke Turbidity Coefficient or AOT. Currently, instead of the dimensionless Linke turbidity coefficient (TL), the aerosol optical thickness (AOT) is widely used in meteorological and climatic calculations. For this study, the TERRA/MODIS satellite system was selected. The data is provided by NASA Earth Observations [58] and is based on the MODIS/Aqua Collection 6.1 products [59]. This data is represented as a raster layer with a spatial resolution of 0.1° by latitude and longitude, containing monthly mean AOT values per pixel in the WGS84 coordinate system. Since the solar radiation model requires TL. AOT values were converted to TL using the Ineichen–Perez model [60,61].
Cloud Fraction. Four cloud cover datasets were used as input, each representing as a raster layer of daily mean cloud fraction cover per pixel:
  • TERRA\MODIS and AQUA\MODIS. Two datasets of the MOD06 product (MODIS Cloud Product) were used, namely the Cloud_Fraction datasubset, based on measurements from the Terra and Aqua satellites equipped with MODIS (Moderate Resolution Imaging Spectroradiometer) scanners. Both data sets have a spatial resolution of 0.1° by latitude and longitude in the WGS84 geographic coordinate system. The temporal resolution of the imagery for each satellite is one pass per day. The data was provided by NASA Earth Observations [47,62]. Cloud cover values are encoded as integers in the range 0 to 255, where 0 typically represents no clouds and 255 represents full cloud cover or no valid data. In the following text, data from the Terra satellite is referred to as “TERRA” and data from the Aqua satellite is referred to as “AQUA”.
  • MetOp\AVHRR. The CLARA-A3 CFC (Cloud Fractional Cover, version 3.0) product, based on observations from the AVHRR (Advanced Very High-Resolution Radiometer) sensor onboard the MetOp satellite series (https://wui.cmsaf.eu/safira/action/viewProduktDetails?fid=40&eid=22257_22484 (accessed on 16 November 2024)), provides cloud cover estimates on a regular latitude–longitude grid with a spatial resolution of 0.25° in the WGS84 geographic coordinate system. Cloud fractional cover values range from 0 to 100, representing the percentage of cloud cover within each grid cell. The MetOp satellite performs, on average, two overpasses per day, yielding a temporal sampling frequency of approximately two observations per day. The data is provided by the EUMETSAT Satellite Application Facility on Climate Monitoring (CM SAF) [63,64]. Hereafter, this dataset is referred to as “AVHRR”.
The MODIS data used is the standard Level-3 daily global product (MOD06/MYD06) ‘Cloud_Fraction’ field. Similarly, the AVHRR data is the CLARA-A3 ‘Cloud Fractional Cover’ (CFC) product. Both are used as provided, without further algorithmic modification, to ensure the evaluation reflects the utility of these operational datasets.
  • Weather station data. Cloud cover observations recorded every 3 h at six meteorological stations (Yelokhin, Uzur, Khujir, B. Goloustnoe, Ushkaniy, and Sarma) (Figure 1), were provided in CSV format by the Russian Federal Service for Hydrometeorology and Environmental Monitoring (Roshydromet) [65]. For consistency in modelling, daily mean cloud cover values were calculated and converted into raster format, assigning a uniform value to all pixels within each modelling site.
The spatial resolution of both the cloud cover and TL datasets was upscaled to match that of the DEM. Each cloud cover dataset was normalised to a scale where 0 corresponds to a completely clear sky and 1 represents maximum (10-point) daily mean cloud cover. Hereafter, this dataset is referred to as “Meteo”.
ERA5 reanalysis. For comparison with the modelling results, surface incoming solar radiation (ISR) data from the ERA5 reanalysis with a spatial resolution of 0.25° was also used. This data was obtained from the Copernicus Climate Data Store (CDS) [66,67].
In-situ observations. Ten automatic monitoring stations have been deployed along the Lake Baikal coast [68,69] (Figure 1).
These stations are designed to record meteorological variables and are equipped with sensors measuring air temperature and humidity, wind speed and direction, atmospheric pressure, liquid precipitation totals, snow depth, soil temperature and moisture, as well as SSR [70]. Observation points are located in open areas (deforested) to reduce the effect of the Earth’s surface on pyranometers. The lakeshore sites were deliberately selected within this heterogeneous transition zone with complex terrain. The expected scale mismatch between point measurements and satellite pixels (10–25 km) presents a rigorous, real-world test for evaluating whether the spatial average cloudiness from a product can effectively represent the cloud attenuation experienced at a point within a complex scene. The measurement range for global solar radiation is 0–1800 W/m2, with an accuracy of ±5%. The spectral range of the solar radiation sensor is 400–1100 nm. The in-situ data used in this study covers the period from 1 January 2023 to 31 December 2023, with hourly temporal resolution. Daily SSR totals were calculated by integrating the hourly SSR intensity values in W/m2 and converting them to MJ/m2.

2.2. Methods

Physically based modelling of SSR was employed to evaluate cloud fraction data. Modelling of incoming solar radiation was accounted for the annual and diurnal variation of the Sun’s position, atmospheric extinction, terrain topography (used only for spatial comparisons), and cloud cover. The primary calculations were performed for a horizontal surface, as the pyranometers at actinometric stations are also levelled horizontally.
All computations were carried out using Python 3.11. The Rasterio version 1.4.4, PyProj version 3.7.2 and NumPy version 2.2.3 libraries were employed for processing raster geospatial data.

2.2.1. Modelling the Annual Cycle of SSR

In this study, a mathematical model described in detail in [71] was employed to calculate the SSR, accounting for astronomical, atmospheric, and topographic factors. Key astronomical parameters were computed to determine the Sun’s position on the celestial sphere:
Solar declination (δ): This is the angle between the Earth’s equatorial plane and the line connecting the centers of the Earth and the Sun. It was calculated using the empirical formula provided in [72]:
δ   =   23.45 ° · s i n 360 ° 365 · n 81 ,
where:
  • δ—solar declination (in degrees),
  • n—day of the year (ranging from 1 to 365).
Hour angle (ω) [72], which reflects the Sun’s position relative to local solar noon:
ω = 15 ° · t 12 ,
where:
  • t—local solar time (in hours).
Using these, the solar zenith angle (θz) [73] was calculated, defining sun elevation above the horizon:
c o s θ z   =   s i n φ · s i n δ   +   c o s φ · c o s δ · c o s ω ,
where:
  • θz—solar zenith angle (in degrees),
  • φ—latitude of the calculation point.
Daylight duration (ωs) is determined by the noon-centered hour angle of sunrise and sunset:
ω s   =   a r c c o s t a n ϕ · t a n δ ,
Local solar times of sunrise and sunset are calculated using the formulas:
s u n r i s e   =   12     ω s 15 ° ,   s u n s e t   =   12   +   ω s 15 ° ,
where ωs—the noon-centered hour angle of sunrise/sunset.
Temporal values were converted from degrees to hours using the Earth’s rotation rate (15°/h), yielding results in the range of 0 to 23 h. All calculations were performed on an hourly basis and subsequently summed up to obtain the daily total solar radiation. The standard solar constant used in the model is I0 = 1367 W/m2 [71], while ref. [74] report on I0 = 1361.1 W/m2.
To account for atmospheric attenuation of solar radiation, the concept of Air Mass (AM) is used—a dimensionless quantity representing the relative path length of sunlight through the atmosphere, particularly significant at low solar elevation angles [75]:
A M   =   1 c o s θ z   +   0.50572 · 96.07995 θ z 1.6364 ,
The transmittance coefficient (τ), which represents the fraction of radiation that passes through the atmosphere, is calculated using the TL, proportional to AOT, along with a set of regional empirical coefficients a, c, d, f calibrated for specific meteorological stations [76]:
τ   =   e x p a · T L · A M · c · A M d     f ,
where AM—air mass.
The parameters of the equation a, b, c, d, f were determined by successively searching through the values of the coefficients in such a way that the curve of the annual distribution of the values of the modeled radiation under cloudless skies coincided with the values of the positive extremes of the curve of the annual distribution of natural observations (Table 1).
The influence of elevation above sea level (h in km) is accounted for through the reduced atmospheric density, which decreases radiation absorption; this effect is modelled using the atmospheric scale height H = 8.4 km [77,78]. This correction is applied to the transmittance coefficient (τh).

2.2.2. Cloud Cover

The effect of cloud cover on incoming solar radiation was accounted for using three models [79].
Linear model:
F c l o u d   =   1     0.9 · C + 0.1 · C 2 ,
Exponential model:
F c l o u d   =   e x p 1 · C ,
Clear-sky model:
F c l o u d   =   1.0 ,
where C is the cloud cover fraction, ranging from 0 to 1.
The cloud fraction C is a daily mean value for each pixel. This single daily value is used in F c l o u d for all hourly radiation calculations within that day at the corresponding location.
The cloud attenuation factor F c l o u d is applied to the clear-sky irradiance (for the relevant surface orientation) to estimate the actual global irradiance under cloudy skies. This approach implicitly incorporates the cloud-modulated diffuse component within the empirical factor F c l o u d .

2.2.3. Terrain

Shadow masking. In the current implementation, explicit shading was not modelled; however, the following rules were applied: If cosθ < 0, the radiation flux on the surface was set to zero (the ray strikes the back side of the surface); if θz > 90°, the radiation was set to zero.
In its complete form, the model appears as follows:
R t o t a l φ , λ   =   t = t s u n r i s e t s u n s e t I 0 · τ h t · c o s θ z t · F c l o u d C · Δ t ,
where φ, λ are the latitude and longitude of the modelled area, respectively; t is local solar time (in hours); Δt is time interval.
To refine the calculations, a correction factor b is introduced to adjust SSR: Itotal(t) = Itotal(t)b.
To account for topography, instead of cosθz, the cosine of the angle between the solar beam direction and the normal to the surface (cosθ) was computed, allowing the radiation intensity to be adjusted based on the slope of the surface [80].

2.2.4. Model Accuracy Assessment

The following metrics were used to evaluate the accuracy of reproducing the spatiotemporal characteristics of the modelled radiation:
The coefficient of determination (R-squared, R2) indicates the proportion of the variance in the target variable explained by the model:
R 2   =   1     y i     y ^ i 2 y i     y ¯ 2 ,
where yi are observed (true) values, ŷ are predicted values, y ¯ is mean of the observed values.
In-situ SSR observation data was used as true values.
The Root Mean Squared Error (RMSE)—a measure of the root mean square deviation of predictions from true values:
R M S E   =   1 n i = 1 n y i     y ^ i 2 ,
where n is number of values in time series.
The Mean Absolute Relative Error (MARE)—the mean of absolute relative errors with respect to true values:
M A R E   =   1 n i = 1 n y ^ i     y i y i ,
The Mean Relative Error (MRE)—the mean relative error with respect to the true value:
M R E   =   1 n i = 1 n y ^ i     y i y i ,
Equation (16) is used to quantitatively assess the convergence (consistency) between the directions of change in daily cloud cover and surface solar radiation (SSR) over the entire observation period. It is defined by a binary vector V i for each time step i (in our case, for each day). Specifically, V i = 1 indicates that the directions of change in cloud cover and SSR are opposite, which aligns with the expected physical relationship, whereas V i = 0 means that the directions of change are the same—contradicting physical expectations (16). Here, “convergence” refers to the degree of agreement between physical processes captured by different data sources. The higher the mean value of vector V (convergence) (Table 2), the greater the convergence.
This approach helps us preliminarily evaluate which cloud cover datasets most reliably represent real-world conditions:
V i   =   1 , i f s i g n C i     C i 1     s i g n R i     R i 1 0 , i f s i g n C i     C i 1   =   s i g n R i     R i 1 ,
where C i is the cloud cover fraction at the i-th time step, and is R i is SSR at the same time step.
A preliminary conclusion can be drawn: values in the range 0.5–0.7 indicate relatively low convergence between day-to-day variations in cloud cover and in-situ SSR, particularly for cloud cover characteristics derived from meteorological station data.

3. Results and Discussion

3.1. Observations

Annual totals of SSR in 2023 across the observation sites varied widely (Table 3)—from 2641.92 MJ/m2 (Uh) to 4263.58 MJ/m2 (L), corresponding to a relative difference of more than 1.6 times. The mean value across all sites was 3708.47 MJ/m2, with a standard deviation of 642.31 MJ/m2 and a coefficient of variation of 17.3%, indicating significant spatial heterogeneity in radiation conditions.
The highest value was recorded at site L, exceeding the nearest stations (S—4103.04 MJ/m2, U—4098.28 MJ/m2) by 3.9%. This site is characterised by the highest winter insolation (406.01 MJ/m2), indicating favourable insolation conditions even during the period of minimum solar illumination in this region.
Sites X (4051.08 MJ/m2), BG (3959.54 MJ/m2), and BK (3920.92 MJ/m2) form a group with moderate to relatively high SSR values, demonstrating moderate variability and high reproducibility at the seasonal scale. In contrast, stations E (2597.24 MJ/m2), U2 (2698.47 MJ/m2), Uh (2641.92 MJ/m2), and Hak (2906.13 MJ/m2) show lower SSR values, which may be attributed both to regional climatic features and to local factors [81].
Analysis of seasonal SSR variations revealed the following: the summer period (June–August) contributes the largest share to the annual total—on average 40.2% (ranging from 38.1% at Uh to 42.5% at L). Spring (March–May) accounts for 35.6%, autumn (September–November) for 18.7%, and winter (December–February) for 5.5%. The absolute winter maximum SSR was recorded at site L (406.01 MJ/m2), which is nearly 2.3 times higher than at E (179.12 MJ/m2). Sites BG (367.47 MJ/m2) and BK (373.42 MJ/m2) also demonstrate relatively high SSR values despite modest summer totals. In spring, site U leads (1448.71 MJ/m2). In summer, maximal SSR was observed at site L (1700.08 MJ/m2), followed by U (1655.59 MJ/m2) and X (1646.75 MJ/m2). Site X recorded the absolute monthly maximum in the dataset—609.62 MJ/m2 in July, indicating high efficiency during the peak period. In autumn, site L leads again (720.09 MJ/m2), whereas stations with reduced insolation (e.g., Uh, E) exhibited sharp declines in daily SSR totals as early as October.

3.2. Modelling

Eight SSR calculations were performed for all days of 2023 using two cloud parameterisation models and four cloud data sources. Calculations were carried out for daily total SSR values separately for each station and then compared with in-situ observations. The average performance across all stations for all eight calculations is presented in Table 3. It demonstrates significant differences in their ability to predict actual SSR values. The highest accuracy was achieved by the exponential AVHRR_Exp model with R2 = 0.54 (Table 4). This model also showed the lowest absolute error (MARE = 0.33). Moreover, it has the lowest root mean square error (RMSE)—only 4.538 MJ/m2. This makes the AVHRR_Exp model the best approach for describing SSR totals. The second most accurate model is TERRA_Exp, with R2 = 0.48 and RMSE = 4.872 MJ/m2. Third place is occupied by the AQUA_Exp model (R2 = 0.47, RMSE = 4.932 MJ/m2). These three models demonstrated superior performance, outperforming the remaining linear models and the SSR data from the ERA5 reanalysis.
It should be noted that models based on a linear representation of cloud cover yielded the worst results, with R2 ranging from 0.10 to 0.32, the highest errors (5.553–6.352 MJ/m2), and a significant underestimation of mean SSR values (MRE varying from −0.36 to −0.31) (Table 4). Therefore, subsequent analysis will focus on the exponential cloud cover model.
Particular interest is drawn to the ERA5 reanalysis data. Comparison of ERA5 with the exponential cloud cover model shows that, when evaluated across all observation sites, ERA5 achieves an R2 of 0.37, which is 0.17 lower than that of AVHRR_Exp. Plots of the annual SSR cycle (Figure 2) reveal a systematic overestimation by ERA5 relative to observations, confirmed by an MRE of 0.44 calculated across all observation sites (Table 3). Such errors in ERA5 have been reported previously. Notably, ERA5 systematically underestimates SSR under cloudy conditions and during the cold season due to its inability to accurately parameterise cloud optical properties [27,28]. Additionally, ERA5 poorly captures the actual spatial variability of SSR, as surface conditions vary considerably from station to station [27,29]. With a spatial resolution of approximately 25 km, ERA5 is physically incapable of adequately reproducing local effects such as the influence of large water bodies (Lake Baikal), shading from mountain ranges, and other influences.
This is especially evident for the Khakusy (Hak), Yelokhin (E), and Ushkany (Uh) sites, with R2 values of 0.00, 0.08, and 0.13; MARE values of 0.67, 0.82, and 0.55; and MRE values of 0.67, 0.81, and 0.54, respectively (Table A1 in Appendix A). However, at the remaining sites, ERA5 demonstrated some of the best results in terms of explained variance and modelling error, although its MRE values range from 0.30 to 0.43—higher than those of the exponential models (0.00 to −0.29).
Reanalysis products, such as MERRA-2, demonstrate good reproduction of the seasonal cycle of solar radiation but fail to capture local features [27,28,29,40]. This underscores the necessity of mandatory local calibration of any global products (ERA5, MERRA-2, SARAH-E) prior to their use in regional studies. This was successfully implemented in [82] for MERRA data over the Fiji Islands, where the authors performed detailed validation and correction based on ground-based measurements.
For models with exponential cloud cover parameterisation, using TERRA, AQUA, and AVHRR cloud data, the performance metrics were fairly similar, with slightly better results obtained with the AVHRR sensor (Table 4 and Table A1). The difference in model accuracy between sites using AVHRR data reached up to 0.21, 1.42, 0.09, and 0.16 for R2, RMSE, MARE, and MRE, respectively, indicating spatial heterogeneity.
Considering SSR modelling results based on cloud observations (Meteo), the quality metrics were generally quite low, although at some sites (GB and U) they approach to the ERA5 results and even outperform the models based on other cloud data sources, showing better MRE values: −0.16 and −0.13, respectively.
Summarising the interim findings, it can be noted that the AVHRR daily mean cloud cover data performed better than MODIS and meteorological station data. This somewhat contradicts common perceptions of these products. A partial explanation is provided in studies [19,20]. The authors emphasise that comparisons of different models and data sources conducted in various regions worldwide over recent decades often yield contradictory and even mutually exclusive results. No existing model or sensor is universally “best”—only optimal for a specific region. Several studies note that although AVHRR offers higher observation frequency and longer data records, its cloud detection capability is limited by a smaller number of spectral channels, ultimately resulting in lower contrast between clouds and the surface compared to MODIS [13,83].
However, the observation frequency plays a critical role. AVHRR provides multiple daily overpasses, whereas Terra and Aqua satellites each provide only one. This enables CLARA-A3 to better capture the diurnal dynamics of cloud cover, which is particularly important for calculating integrated daily SSR totals. At the same time, as noted in [13,83], more sophisticated MODIS algorithms may be more sensitive to noise, aerosols, snow cover, and other local features, which under the complex topography of the Lake Baikal coastline can lead to systematic errors. In conclusion, for tasks requiring estimation of integrated daily values, the reliability, stability, and consistency of the data processing algorithm may be more important than spatial detail.
It should be noted that the high spatiotemporal variability of cloud cover in the Lake Baikal region—driven by the interaction of lake–land breeze circulation, orographic lifting, and synoptic-scale processes—is also crucial when calculating daily totals of solar radiation. Unlike monthly averages, daily integrals are highly sensitive to short-lived and localised changes in cloud cover, which can introduce considerable uncertainty when comparing different data sources. Nevertheless, the high observation frequency of AVHRR enables a more accurate capture of intraday cloud dynamics and, consequently, a more precise estimation of daily totals of SSR. Although cloud structures can be extremely heterogeneous on individual days, the use of data with high temporal representativeness reduces the risk of systematic bias typical of satellites with only a single daytime overpass. This underscores the importance of temporal resolution, even in the presence of spatial heterogeneity.
Analysis of the spatial variability of SSR is critically important for a wide range of applications, including environmental state modelling. Although the primary modelling in this study was performed for horizontal surfaces, the provided context includes an important demonstration of how cloud data quality can affect solar radiation estimates on tilted surfaces. Figure 3 shows a map of modelled monthly totals of global horizontal irradiance (GHI) for the Khujir area in July 2023, simulated for a tilted surface using three different cloud data sources: AVHRR, TERRA, and Meteo.
As can be seen from the figure, there is a noticeable discrepancy between the maps generated from different data sources. While the maps based on AVHRR and TERRA data show relatively minor differences across most of the study area, except for its northern part (Figure 3E), the map derived from Meteo data demonstrates substantial differences compared to the aforementioned cloud datasets (Figure 3D,F). This indicates significant uncertainty in the cloud data used for modelling.
Particular attention should be paid to the influence of topography when interpreting daily totals of solar radiation in mountainous regions. In areas with highly dissected terrain—such as the coastline of Lake Baikal—local shading can substantially reduce the duration of direct sunlight during the day, especially in the morning and evening hours. On steep slopes or within narrow valleys, even minor landforms, such as isolated rock outcrops or dense tree cover, can cast persistent shadows that remain unresolved at spatial resolutions of tens to hundreds of meters. Under these conditions, a resolution of 30 × 30 m, despite its apparent detail, may still prove insufficient for adequately capturing microscale variations in insolation. This is especially critical for applied tasks such as siting solar panels or conducting point-based actinometric measurements. Nevertheless, when estimating radiation on a horizontal surface for regional-scale studies, the primary focus should be not so much on achieving maximum spatial detail as on properly accounting for topographic shading in the model, as this factor fundamentally determines the physical representativeness of daily radiation totals in complex terrain.
This conclusion fully aligns with the warning issued by [19] and confirmed by [15] that most existing solar radiation models consist of equations whose coefficients are not universal physical constants but depend on region, climate, calibration period, and even on the specific set of input data. Applying such models without prior calibration inevitably leads to large errors [19].
Although this study employs a physical model, many recent works, e.g., [22,26,84], rely on machine learning (ML) and artificial neural network (ANN) methods. Comparison of our results (R2 = 0.54) with theirs shows that ML models generally achieve significantly higher accuracy (often R2 > 0.8). However, this comparison is not entirely fair, as ML models typically use numerous predictors and sometimes identify spurious correlations. Physical models like ours are irreplaceable in situations where access to comprehensive meteorological data is limited: in historical reconstructions or in mountainous and remote regions. They are transparent, interpretable, physically based, and do not require large training datasets. As noted in [15], it is precisely these qualities that make them “practical” for solving real-world problems, despite their somewhat lower accuracy. Moreover, ref. [82] demonstrated that even when numerous parameters are available, the best-performing models are often hybrid: they combine physical parameters with empirical coefficients calibrated using statistical or ML methods. An example of such an approach is provided in [30], who successfully applied Random Forest to correct ERA5 biases in Hubei Province, China, or [84], who used ML to enhance classical empirical models.

4. Conclusions

This study provided a critical evaluation of cloud data sources for modelling surface solar radiation in the topographically complex and ecologically unique region of Lake Baikal. The findings yielded several key insights with direct implications for future research and applications.
Firstly, the relative performance of the ERA5 reanalysis dataset was found to be relatively low for precise local-scale radiation modelling in this environment. SSR from ERA5 achieved R2 = 0.34, RMSE = 4.724 MJ/m2. While ERA5 remains a valuable resource for global and regional climate studies, its resolution and physical parameterisations appear insufficient to capture the fine-scale spatial variability induced by the lake-land-atmosphere interactions around Lake Baikal. Researchers requiring high-fidelity radiation data for local ecological or hydrological models in similar complex terrains should therefore use ERA5-derived values with caution.
Secondly, the research underscored the profound variability in modelling outcomes that stems directly from the choice of cloud cover input. The superior performance of the AVHRR satellite-based dataset over both reanalysis and, notably, ground-based synoptic cloud observations, highlighted a significant finding. It suggested that for modelling purposes, spatially continuous satellite data can provide a more consistent and reliable representation of cloud attenuation than ground-based observations, which may be affected by observer subjectivity and spatial representativeness errors. This confirms the critical importance of selecting appropriate, high-quality cloud masks as a primary input to physical radiation models.
Finally, despite the challenges posed in data accuracy, the synthesis of a robust physical model with high-quality satellite cloud observations—specifically from AVHRR—proved to be a highly effective methodology. This approach successfully generated detailed, reliable, and spatially continuous fields of incoming solar radiation.
A logical extension of this work would be a two-stage analysis: first, evaluating the cloud products’ accuracy at representing instantaneous cloud conditions at satellite overpass times against sky cameras or ceilometers; second, using those findings to interpret their performance in daily integrated models. Furthermore, comparison with established surface radiation products (e.g., CERES FluxByCldTyp, SARAH) would provide additional context for the model’s performance.
To enhance model accuracy, future work should focus on developing a dynamic fusion approach that intelligently combines the strengths of various cloud products, including satellite data like AVHRR and modern geostationary data, with calibrated ground-based observations where available. The integration of higher temporal resolution (e.g., hourly) cloud data could significantly improve the modelling of diurnal radiation patterns, which is critical for applications in solar energy and evapotranspiration modelling. Future direct applications of the generated high-resolution radiation maps to drive specific process-based models will support a range of applied sciences. They can be powerfully leveraged for high-resolution climate reconstructions, detailed landscape differentiation analyses, and modelling of essential biogeophysical processes, such as evapotranspiration, snowmelt, and primary productivity, across regional, sub-regional, and local scales. This work thus establishes a validated framework for solar radiation mapping that can be applied to other areas with scarce data, and complex terrain, contributing to improved environmental monitoring and resource management.

Author Contributions

Conceptualization: N.V. and E.D.; Data collection: D.G., N.V. and I.A.; formal analysis: D.G.; writing—original draft preparation: D.G.; writing—review and editing: E.D.; funding acquisition: N.V. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Science and Higher Education of the Russian Federation, grant number 075-15-2024-533 for implementation of Major scientific projects on priority areas of scientific and technological development (the project «Fundamental research of the Baikal natural territory based on a system of interconnected basic methods, models, neural networks and a digital platform for environmental monitoring of the environment»).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data on measured and modeled daily SSR data with various initial cloud cover data for 2023 for the Lake Baikal region in tabular form for observation points can be found at the provided link: https://zenodo.org/records/17947896 (accessed on 1 December 2025).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

The Appendix A displays the accuracy of models based on the selected metrics.
Table A1. Model accuracy assessment by observation site.
Table A1. Model accuracy assessment by observation site.
Meteo
_Exp
Meteo
_Line
Era_5TERRA
_Exp
AQUA
_Exp
AVHRR
_Exp
TERRA
_Line
AQUA
_Line
AVHRR
_Line
Site
R2
0.610.360.730.570.580.560.250.260.28BG
0.100.240.080.280.160.300.220.230.11E
0.340.000.810.630.570.780.330.230.64S
0.700.510.780.630.640.630.370.370.39U
0.430.230.450.310.310.380.020.010.09U2
0.320.110.130.460.440.500.100.080.19Uh
0.300.050.780.630.650.680.350.390.49X
--0.780.600.560.710.320.270.57BK
--0.530.400.350.500.070.010.28L
--0.000.300.400.370.200.270.38Hak
RMSE, MJ/m2
4.3285.4963.6004.5324.4444.5885.9485.9055.828BG
4.6715.9727.3924.1834.5054.1185.4355.8605.175E
6.1597.5633.3264.5854.9583.5376.1896.6484.545S
4.1805.3273.5214.6114.5334.6056.0406.0295.950U
5.8466.8265.7636.4646.4746.1407.7057.7187.419U2
4.9915.7405.6764.4424.5234.2735.7735.8245.454Uh
6.1497.5363.4774.4884.3864.1825.9275.7335.272X
--3.2604.4124.6543.7635.7605.9984.607BK
--5.3786.1076.3315.5427.5767.8296.677L
--5.8464.8934.5084.6375.2345.0044.603Hak
MARE
0.300.360.320.340.350.340.440.460.43BG
0.370.500.820.350.360.330.470.510.44E
0.410.500.340.320.350.260.430.490.34S
0.290.340.410.320.320.330.420.430.43U
0.350.420.480.390.400.350.500.500.45U2
0.370.410.550.310.300.300.410.390.38Uh
0.420.500.360.310.300.310.420.400.40X
0.320.340.350.290.420.440.34BK
0.490.480.480.430.550.550.48L
0.670.350.360.310.420.450.35Hak
Note: The best metric values for each site are highlighted in bold.

References

  1. Campbell, G.S.; Norman, J.M. Radiation Basics. In An Introduction to Environmental Biophysics; Springer: New York, NY, USA, 1998; pp. 147–165. [Google Scholar] [CrossRef]
  2. Muneer, T. Solar Radiation and Daylight Models, 2nd ed.; Elsevier Butterworth Heinemann: Amsterdam, The Netherlands, 2004; pp. 13–17. [Google Scholar]
  3. Stocker, T.F.; Qin, D.; Plattner, G.-K.; Tignor, M.; Allen, S.K.; Boschung, J.; Nauels, A.; Xia, Y.; Bex, V.; Midgley, P.M. (Eds.) IPCC, 2013: Climate Change 2013: The Physical Science Basis. Contribution of Working Group I to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change; Cambridge University Press: Cambridge, UK; New York, NY, USA, 2013; p. 1535. [Google Scholar]
  4. Neves, D.; Brito, M.C.; Silva, C.A. Impact of solar and wind forecast uncertainties on demand response of isolated microgrids. Renew. Energy 2016, 87, 1003–1015. [Google Scholar] [CrossRef]
  5. López-Velázquez, J.E.; Velázquez-Limón, N.; Islas-Pereda, S.; Flores-Jiménez, D.E.; Santillan-Soto, N.; Ríos-Arriola, J. An Assessment of the Weather Research and Forecasting Model for Solar Irradiance Forecasting under the Influence of Cold Fronts in a Desert in Northwestern Mexico. Atmosphere 2024, 15, 1300. [Google Scholar] [CrossRef]
  6. Cierniewski, J.; Ceglarek, J. Annual Dynamics of Shortwave Radiation as Consequence of Smoothing Previously Plowed Bare Arable Land Surface in Europe. Remote Sens. 2024, 16, 2476. [Google Scholar] [CrossRef]
  7. WMO. Essential Climate Variables. Global Climate Observing System (World Meteorological Organization). Available online: https://gcos.wmo.int/site/global-climate-observing-system-gcos/essential-climate-variables (accessed on 10 September 2025).
  8. Roca-Fernández, C.; Pons, X.; Ninyerola, M. A Comparison of Different Solar Radiation Models in the Iberian Peninsula. Atmosphere 2025, 16, 590. [Google Scholar] [CrossRef]
  9. Asilevi, P.J.; Quansah, E.; Amekudzi, L.K.; Annor, T.; Klutse, N.A.B. Modeling the spatial distribution of Global Solar Radiation (GSR) over Ghana using the Ångström-Prescott sunshine duration model. Sci. Afr. 2019, 4, e00094. [Google Scholar] [CrossRef]
  10. Yichuan, M.; Tao, H.; Shunlin, L.; Xiongxin, X. Quantifying the impacts of DEM uncertainty on clear-sky surface shortwave radiation estimation in typical mountainous areas. Agric. For. Meteorol. 2022, 327, 109222. [Google Scholar] [CrossRef]
  11. Beguería, S.; Vicente-Serrano, S.M.; Gutiérrez-Llorente, J.M.; Brands, S.; Gil-Guallar, M.; Royo-Aranda, A.; del Mar Rondón-Velasco, M.; Torralba-Gallego, A.; Luna, Y.; Morata, A. A hierarchical Bayesian spatio-temporal model for estimating solar radiation from sunshine duration records. Renew. Energy 2026, 256, 123943. [Google Scholar] [CrossRef]
  12. Mohammadi, B.; Moazenzadeh, R.; Bao Pham, Q.; Al-Ansari, N.; Ur Rahman, K.; Tran Anh, D.; Duan, Z. Application of ERA-Interim, empirical models, and an artificial intelligence-based model for estimating daily solar radiation. Ain Shams Eng. J. 2022, 13, 101498. [Google Scholar] [CrossRef]
  13. Han, J.; Jiang, B.; Zhao, Y.; Peng, J.; Li, S.; Liang, H.; Yin, X.; Chen, Y. A General Model for Converting All-Wave Net Radiation at Instantaneous to Daily Scales Under Clear Sky. Remote Sens. 2025, 17, 2364. [Google Scholar] [CrossRef]
  14. Nwokolo, S.C.; Ogbulezie, J.C. A quantitative review and classification of empirical models for predicting global solar radiation in West Africa. Beni-Suef Univ. J. Basic Appl. Sci. 2018, 7, 367–396. [Google Scholar] [CrossRef]
  15. Babatunde, O.M.; Munda, J.L.; Hamam, Y.; Monyei, C.G. A critical overview of the (Im)practicability of solar radiation forecasting models. E-Prime Adv. Electr. Eng. Electron. Energy 2023, 5, 100213. [Google Scholar] [CrossRef]
  16. Gürel, A.E.; Ağbulut, Ü.; Bakır, H.; Ergün, A.; Yıldız, G. A state of art review on estimation of solar radiation with various models. Heliyon 2023, 9, e13167. [Google Scholar] [CrossRef]
  17. Khatib, T.; Mohamed, A.; Sopian, K. A review of solar energy modeling techniques. Renew. Sustain. Energy Rev. 2012, 16, 2864–2869. [Google Scholar] [CrossRef]
  18. Etuk, S.E.; Nwokolo, S.C.; Okechukwu, E.A. Modelling and estimating photosynthetically active radiation from measured global solar radiation at Calabar, Nigeria. Phys. Sci. Int. J. 2016, 12, 1–12. [Google Scholar] [CrossRef]
  19. Prieto, J.I.; García, D. Global solar radiation models: A critical review from the point of view of homogeneity and case study. Renew. Sustain. Energy Rev. 2022, 155, 111856. [Google Scholar] [CrossRef]
  20. Wane, O.; Ramírez Ceballos, J.A.; Ferrera-Cobos, F.; Navarro, A.A.; Valenzuela, R.X.; Zarzalejo, L.F. Comparative Analysis of Photosynthetically Active Radiation Models Based on Radiometric Attributes in Mainland Spain. Land 2022, 11, 1868. [Google Scholar] [CrossRef]
  21. Martim, C.C.; Paulista, R.S.D.; Castagna, D.; Borella, D.R.; de Almeida, F.T.; Damian, J.G.R.; de Souza, A.P. Daily Estimates of Global Radiation in the Brazilian Amazon from Simplified Models. Atmosphere 2024, 15, 1397. [Google Scholar] [CrossRef]
  22. Kambezidis, H.D.; Patelis, E.; Kavadias, K.A. An in-depth analysis of the Ångström—Prescott-type solar models: Application for Athens, Greece. Acad. Environ. Sci. Sustain. 2025, 2, 1–31. [Google Scholar] [CrossRef]
  23. Demir, V. Evaluation of Solar Radiation Prediction Models Using AI: A Performance Comparison in the High-Potential Region of Konya, Türkiye. Atmosphere 2025, 16, 398. [Google Scholar] [CrossRef]
  24. Mohammadi, M.; Jamshidi, S.; Rezvanian, A.; Gheisari, M.; Kumar, A. Advanced fusion of MTM-LSTM and MLP models for time series forecasting: An application for forecasting the solar radiation. Meas. Sens. 2024, 33, 101179. [Google Scholar] [CrossRef]
  25. Mendyl, A.; Demir, V.; Omar, N.; Orhan, O.; Weidinger, T. Enhancing Solar Radiation Forecasting in Diverse Moroccan Climate Zones: A Comparative Study of Machine Learning Models with Sugeno Integral Aggregation. Atmosphere 2024, 15, 103. [Google Scholar] [CrossRef]
  26. Ding, Y.; Wang, Y.; Li, Z.; Zhao, L.; Shi, Y.; Xing, X.; Chen, S. Improving Solar Radiation Prediction in China: A Stacking Model Approach with Categorical Boosting Feature Selection. Atmosphere 2024, 15, 1436. [Google Scholar] [CrossRef]
  27. Wang, H.; Wang, Y. Evaluation of the Accuracy and Trend Consistency of Hourly Surface Solar Radiation Datasets of ERA5, MERRA-2, SARAH-E, CERES, and Solcast over China. Remote Sens. 2025, 17, 1317. [Google Scholar] [CrossRef]
  28. Jadhav, A.V.; Belange, K.; Gajbhiv, N.; Kumar, V.; Rahul, P.R.C.; Sudeepkumar, B.L.; Bhawar, R.L. Evaluation of the Reanalysis and Satellite Surface Solar Radiation Datasets Using Ground-Based Observations over India. Atmosphere 2025, 16, 957. [Google Scholar] [CrossRef]
  29. Saint-Drenan, Y.M.; Wald, L. On the Assessment of Hourly Means of Solar Irradiance at Ground Level in Clear-Sky Conditions by the ERA5, JRA-3Q, and MERRA-2 Reanalyses. Atmosphere 2025, 16, 949. [Google Scholar] [CrossRef]
  30. Xian, C.; Jin, M.; Wang, M. Evaluation of Correction Methods for ERA5 Shortwave Radiation Biases in China’s Second-Step Topographic Region: A Case Study of Hubei Province. Atmosphere 2025, 16, 1008. [Google Scholar] [CrossRef]
  31. Baranovskiy, N.V.; Yankovich, E.P. Geoinformation system for prediction of forest fire danger caused by solar radiation using remote sensing. In Proceedings of the SPIE Remote Sensing, Toulouse, France, 21 September 2015; SPIE: Bellingham, WA, USA, 2015; Volume 9640, p. 96402B. [Google Scholar] [CrossRef]
  32. Xin, Q.; Gong, P.; Suyker, A.E.; Si, Y. Effects of the partitioning of diffuse and direct solar radiation on satellite-based modeling of crop gross primary production. Int. J. Appl. Earth Obs. Geoinf. 2016, 50, 51–63. [Google Scholar] [CrossRef]
  33. Teixeira, A.H.; Leivas, J.; Andrade, R.; Hernandez, F.; Franco, R. Modelling radiation and energy balances with Landsat 8 images under different thermohydrological conditions in the Brazilian semi-arid region. In Proceedings of the SPIE Remote Sensing for Agriculture, Ecosystems, and Hydrology XVII, Toulouse, France, 21 September 2015; SPIE: Bellingham, WA, USA, 2015; Volume 9637, p. 96370Z. [Google Scholar] [CrossRef]
  34. Dobrohotov, A.V.; Maksenkova, I.L.; Kozyreva, L.V. Avtomatizirovannyj raschet prostranstvennogo raspredeleniya sostavlyayushchih energeticheskogo balansa s ispol’zovaniem dannyh DZZ i nazemnyh meteorologicheskih izmerenij. In Primenenie Sredstv Distancionnogo Zondirovaniya Zemli v Sel’skom Hozyajstve; Agrophysical Research Institute: Saint Petersburg, Russia, 2018; pp. 305–309. (In Russian) [Google Scholar]
  35. Gorbarenko, E.V. A possibility of determination of earth surface radiation budget from calculated and satellite data. Russ. Meteorol. Hydrol. 2017, 42, 745–752. [Google Scholar] [CrossRef]
  36. Nefedova, L.V.; Rafikova, Y.Y. Assessment of the Stability of Solar Energy Resources by Statistical and Geoinformation Methods. Appl. Sol. Energy 2022, 58, 438–443. [Google Scholar] [CrossRef]
  37. Piskunova, D.; Chubarova, N.; Poliukhov, A.; Zhdanova, E. Radiative Regime According to the New RAD-MSU(BSRN) Complex in Moscow: The Roles of Aerosol, Surface Albedo, and Sunshine Duration. Atmosphere 2024, 15, 144. [Google Scholar] [CrossRef]
  38. Stamatis, M.; Hatzianastassiou, N.; Korras-Carraca, M.B.; Matsoukas, C.; Wild, M.; Vardavas, I. An Assessment of Global Dimming and Brightening during 1984–2018 Using the FORTH Radiative Transfer Model and ISCCP Satellite and MERRA-2 Reanalysis Data. Atmosphere 2023, 14, 1258. [Google Scholar] [CrossRef]
  39. Hetrick, W.A.; Rich, P.M.; Barnes, F.J. GIS-based Solar Radiation Flux Models. In GIS, Photogrammetry, and Modeling; American Society for Photogrammetry and Remote Sensing: New York, NY, USA, 1993; Volume 3, pp. 132–143. [Google Scholar]
  40. Pons, X.; Ninyerola, M. Mapping a topographic global solar radiation model implemented in a GIS and refined with ground data. Int. J. Climatol. 2008, 28, 1821–1834. [Google Scholar] [CrossRef]
  41. Anselmo, S.; Safaeianpour, A.; Moghadam, S.T.; Ferrara, M. GIS-based solar radiation modelling for photovoltaic potential in cities: A sensitivity analysis for the evaluation of output variability range. Energy Rep. 2024, 12, 4656–4669. [Google Scholar] [CrossRef]
  42. Li, Q.; Bessafi, M.; Li, P. Mapping Prediction of Surface Solar Radiation with Linear Regression Models: Case Study over Reunion Island. Atmosphere 2023, 14, 1331. [Google Scholar] [CrossRef]
  43. Shajdulina, A.A. Raschety postupleniya solnechnoj radiacii na sklony v period snegotayaniya. Vestn. Voronezh. Gos. Univ. 2022, 1, 50–58. (In Russian) [Google Scholar]
  44. Ďuračiová, R.; Pružinec, F. Effects of Terrain Parameters and Spatial Resolution of a Digital Elevation Model on the Calculation of Potential Solar Radiation in the Mountain Environment: A Case Study of the Tatra Mountains. ISPRS Int. J. Geo-Inf. 2022, 11, 389. [Google Scholar] [CrossRef]
  45. Thomas, C.; Wandji Nyamsi, W.; Arola, A.; Pfeifroth, U.; Trentmann, J.; Dorling, S.; Laguarda, A.; Fischer, M.; Aculinin, A. Smart Approaches for Evaluating Photosynthetically Active Radiation at Various Stations Based on MSG Prime Satellite Imagery. Atmosphere 2023, 14, 1259. [Google Scholar] [CrossRef]
  46. Agugiaro, G.; Remondino, F.; Stevanato, G.; De Filippi, R.; Furlanello, C. Estimation of Solar Radiation on Building Roofs in Mountainous Areas. In Proceedings of the ISPRS Conference PIA 2011: Photogrammetric Image Analysis, Munich, Germany, 5–7 October 2011; International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences; ISPRS: Munich, Germany, 2011; Volume XXXVIII-3/W22, pp. 155–160. [Google Scholar]
  47. Platnick, S.; Meyer, K.G.; King, M.D.; Wind, G.; Amarasinghe, N.; Marchant, B.; Arnold, G.T.; Zhang, Z.; Hubanks, P.A. The MODIS Cloud Optical Property Retrieval: Over Land and Ocean. IEEE Trans. Geosci. Remote Sens. 2017, 55, 51–72. [Google Scholar] [CrossRef]
  48. Akitsu, T.K.; Nasahara, K.N.; Ijima, O.; Hirose, Y.; Ide, R.; Takagi, K.; Kume, A. The variability and seasonality in the ratio of photosynthetically active radiation to solar radiation: A simple empirical model of the ratio. Int. J. Appl. Earth Obs. Geoinf. 2022, 108, 102724. [Google Scholar] [CrossRef]
  49. Kambezidis, H.D.; Psiloglou, B.E. Estimation of the Optimum Energy Received by Solar Energy Flat-Plate Convertors in Greece Using Typical Meteorological Years. Part I: South-Oriented Tilt Angles. Appl. Sci. 2021, 11, 1547. [Google Scholar] [CrossRef]
  50. Farahat, A.; Kambezidis, H.D.; Almazroui, M.; Al Otaibi, M. Solar Potential in Saudi Arabia for Inclined Flat-Plate Surfaces of Constant Tilt Tracking the Sun. Appl. Sci. 2021, 11, 7105. [Google Scholar] [CrossRef]
  51. Liu, W.; Guan, H.; Gutierrez-Jurado, H.A.; Banks, E.W.; He, X.; Zhang, X. Modelling quasi-three-dimensional distribution of solar irradiance on complex terrain. Environ. Model. Softw. 2022, 149, 105293. [Google Scholar] [CrossRef]
  52. Hofierka, J.; Suri, M.; Huld, T.R. sun—GRASS GIS Manual. Available online: https://grass.osgeo.org/grass83/manuals/r.sun.html (accessed on 11 January 2023).
  53. Olpenda, A.S.; Stereńczak, K.; Będkowski, K. Modeling Solar Radiation in the Forest Using Remote Sensing Data: A Review of Approaches and Opportunities. Remote Sens. 2018, 10, 694. [Google Scholar] [CrossRef]
  54. Bychkov, I.V.; Gagarinova, O.V.; Orlova, I.I.; Bogdanov, V.N. Water protection zoning as an instrument of preservation for Lake Baikal. Water 2018, 10, 1474. [Google Scholar] [CrossRef]
  55. UNESCO/ICOMOS Documentation Centre. Properties Included in the World Heritage List. Available online: https://unesdoc.unesco.org/ark:/48223/pf0000105529?posInSet=31&queryId=341e4a81-ca3c-4864-ab02-4a782f8f70d9 (accessed on 27 October 2025).
  56. Barlage, M.; Chen, F. Human Impacts on Land Surface—Atmosphere Interactions. In Fast Processes in Large-Scale Atmospheric Models: Progress, Challenges, and Opportunities; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2023; pp. 213–228. [Google Scholar] [CrossRef]
  57. Farr, T.G.; Rosen, P.A.; Caro, E.; Crippen, R.; Duren, R.; Hensley, S.; Kobrick, M.; Paller, M.; Rodriguez, E.; Roth, L.; et al. The Shuttle Radar Topography Mission. Rev. Geophys. 2007, 45, RG2004. [Google Scholar] [CrossRef]
  58. NASA Earth Observations (NEO). MODIS/Aqua Aerosol Optical Depth (AOD) 550 nm—Monthly Product. Available online: https://neo.gsfc.nasa.gov/view.php?datasetId=MODAL2_M_AER_OD (accessed on 15 June 2025).
  59. Levy, R.C.; Levetin, S.W.; Kleidman, R.; Mattoo, S.; Ichoku, C.; Kahn, R.; Remer, L.A. The Collection 5 MODIS aerosol products over land and ocean. Atmos. Meas. Tech. 2013, 6, 2989–3034. [Google Scholar] [CrossRef]
  60. European Commission; Joint Research Centre (JRC). PVGIS User Manual. Available online: https://joint-research-centre.ec.europa.eu/photovoltaic-geographical-information-system-pvgis/getting-started-pvgis/pvgis-user-manual_en (accessed on 5 July 2025).
  61. Huld, T.; Friesen, G.; Skoczek, A.; Kenny, R.L.; Cebollero, E.; Crítz, D. A power-duration approach to modelling long-term energy yield in heavily clouded locations. Sol. Energy 2016, 137, 42–52. [Google Scholar] [CrossRef]
  62. NASA Earth Observations (NEO). MODIS/Aqua Cloud Fraction Daily L3 Global 1 × 1 deg (MODAL2_D_CLD_FR). Available online: https://neo.gsfc.nasa.gov/view.php?datasetId=MODAL2_D_CLD_FR (accessed on 15 June 2025).
  63. European Organisation for the Exploitation of Meteorological Satellites (EUMETSAT). CM SAF CLARA-SAL Daily and Monthly Cloud, Albedo and Radiation Dataset from AVHRR Data; Product ID: CLA_SAL_ARC_2.0. Available online: https://wui.cmsaf.eu/safira/action/viewProduktDetails?fid=40&eid=22257_22484 (accessed on 6 July 2024).
  64. Karlsson, K.G.; Anttila, K.; Trentmann, J.; Stengel, M.; Fokke Meirink, J.; Devasthale, A.; Hanschmann, T.; Kothe, S.; Jääskeläinen, E.; Sedlar, J.; et al. CLARA-A2: The second edition of the CM SAF cloud and radiation data record from 34 years of global AVHRR data. Atmos. Chem. Phys. 2017, 17, 5809–5828. [Google Scholar] [CrossRef]
  65. Dementeva, T.V. Average Monthly Amount of Total Cloudiness and Low-Level Cloudiness. Database Description. All-Russian Research Institute of Hydrometeorological Information—World Data Center. 2020. Available online: http://meteo.ru/data/monthly-clouds/ (accessed on 6 July 2025).
  66. ECMWF. ERA5: Data Documentation. Available online: https://confluence.ecmwf.int/display/CKB/ERA5%3A+data+documentation (accessed on 18 July 2025).
  67. Hersbach, H.; Bell, B.; Berrisford, P.; Biavati, G.; Horányi, A.; Muñoz Sabater, J.; Nicolas, J.; Peubey, C.; Radu, R.; Rozum, I.; et al. The ERA5 Global Reanalysis. Q. J. R. Meteorol. Soc. 2020, 146, 1999–2049. [Google Scholar] [CrossRef]
  68. Kiselev, M.V.; Voropay, N.N.; Dyukarev, E.A.; Kurakov, S.A.; Kurakova, P.S.; Makeev, E.A. Automatic meteorological measuring systems for microclimate monitoring. IOP Conf. Ser. Earth Environ. Sci. 2018, 190, 012031. [Google Scholar] [CrossRef]
  69. Makarov, M.; Aslamov, I.; Gnatovsky, R. Environmental monitoring of the littoral zone of Lake Baikal using a network of automatic hydro-meteorological stations: Development and trial run. Sensors 2021, 21, 7659. [Google Scholar] [CrossRef]
  70. Dyukarev, E.; Voropay, N.; Vasilenko, O.; Rasputina, E. Validation of remotely sensed land surface temperature at Lake Baikal’s surroundings using in situ observations. Land 2024, 13, 555. [Google Scholar] [CrossRef]
  71. Duffie, J.A.; Beckman, W.A. Solar Engineering of Thermal Processes, 5th ed.; Wiley: Hoboken, NJ, USA, 2020. [Google Scholar]
  72. Gueymard, C.A. A reevaluation of the solar constant based on a 42-year total solar irradiance time series and a reconciliation of spaceborne observations. Sol. Energy 2018, 168, 2–9. [Google Scholar] [CrossRef]
  73. Spencer, J.W. Fourier Series Representation of the Position of the Sun. Search 1971, 2, 172. [Google Scholar]
  74. Iqbal, M. An Introduction to Solar Radiation; Academic Press: New York, NY, USA, 1983. [Google Scholar]
  75. Kasten, F.; Young, A.T. Revised Optical Air Mass Tables and Approximation Formula. Appl. Opt. 1989, 28, 4735–4738. [Google Scholar] [CrossRef] [PubMed]
  76. Kasten, F. A Simple Parameterization of the Pyrheliometric Equation. Meteorol. Z. 1990, 2, 51–54. [Google Scholar]
  77. Wallace, J.M.; Hobbs, P.V. Atmospheric Science: An Introductory Survey, 2nd ed.; Academic Press: Amsterdam, The Netherlands, 2006. [Google Scholar]
  78. Bohren, C.F.; Albrecht, B.A. Atmospheric Thermodynamics; Oxford University Press: New York, NY, USA, 1998. [Google Scholar]
  79. Reindl, D.T.; Beckman, W.A.; Duffie, J.A. Diffuse Fraction Correlations. Sol. Energy 1990, 45, 1–7. [Google Scholar] [CrossRef]
  80. Liu, B.Y.H.; Jordan, R.C. The Interrelationship and Characteristic Distribution of Direct, Diffuse and Total Solar Radiation. Sol. Energy 1960, 4, 1–19. [Google Scholar] [CrossRef]
  81. Tyumentseva, E.M.; Orel, G.F. Atmospheric Processes in the South Baikal Basin and Their Role in Relief Formation. Atmosphere 2018, 9, 176. [Google Scholar] [CrossRef]
  82. Oyewola, O.M.; Ojo, O.S.; Ajayi, V.O.; Akinsanola, A.A.; Abatan, A.A. Global Solar Radiation Predictions in Fiji Islands Based on Empirical Models. Alex. Eng. J. 2022, 61, 8559–8571. [Google Scholar] [CrossRef]
  83. Zhang, Z.; Ao, Z.; Wu, W.; Wang, Y.; Xin, Q. Developing a Multi-Scale Convolutional Neural Network for Spatiotemporal Fusion to Generate MODIS-like Data Using AVHRR and Landsat Images. Remote Sens. 2024, 16, 1086. [Google Scholar] [CrossRef]
  84. Demircan, C.; Bayrakçı, H.C.; Keçebaş, A. Machine Learning-Based Improvement of Empiric Models for an Accurate Estimating Process of Global Solar Radiation. Sustain. Energy Technol. Assess. 2020, 37, 100574. [Google Scholar] [CrossRef]
Figure 1. Localisation of in-situ observation points. Symbols ✸ indicate the location of observation points; symbols ○ indicate cities.
Figure 1. Localisation of in-situ observation points. Symbols ✸ indicate the location of observation points; symbols ○ indicate cities.
Atmosphere 16 01405 g001
Figure 2. Annual course of daily SSR amounts of the observation sites within 2023.
Figure 2. Annual course of daily SSR amounts of the observation sites within 2023.
Atmosphere 16 01405 g002
Figure 3. Demonstration for the Khujir area: 1. Monthly global horizontal irradiance (GHI) for July, modelled using the following cloud data: (A) AVHRR, (B) Meteo, (C) TERRA; 2. Differences between: (D) AVHRR—Meteo, (E) TERRA—AVHRR, (F) TERRA—Meteo; 3. Terrain morphometric characteristics: (G) DEM, (H) Slope.
Figure 3. Demonstration for the Khujir area: 1. Monthly global horizontal irradiance (GHI) for July, modelled using the following cloud data: (A) AVHRR, (B) Meteo, (C) TERRA; 2. Differences between: (D) AVHRR—Meteo, (E) TERRA—AVHRR, (F) TERRA—Meteo; 3. Terrain morphometric characteristics: (G) DEM, (H) Slope.
Atmosphere 16 01405 g003
Table 1. Coefficients for transmittance calculation.
Table 1. Coefficients for transmittance calculation.
abcdfSite
−0.60626404.5352 × 10−3−2.4770BG
−0.60626404.5352 × 10−3−2.4770BK
−0.60626400.524535−2.4770E
8.00 × 1046005 × 10−7−2.4770L
8.00 × 1046205 × 10−7−2.4770S
8.00 × 1046205 × 10−7−2.4770U
−2.062 × 10−37505 × 10−7−0.0770U2
−2.062 × 10−312005 × 10−7−0.0770Uh
−2.062 × 10−36505 × 10−7−0.0770X
−2.062 × 10−36005 × 10−7−0.077−0.2Hak
Table 2. Mean convergence value for 2023 of day-to-day variations in cloud cover and solar radiation from different cloud data sources, expressed as fractions of unity.
Table 2. Mean convergence value for 2023 of day-to-day variations in cloud cover and solar radiation from different cloud data sources, expressed as fractions of unity.
Cloud Cover Data SourceBGSUXBKUhLU2HakEAverage
AQUA0.640.570.60.580.610.540.530.520.570.50.57
TERRA0.610.640.640.620.570.520.520.490.60.520.57
AVHRR0.630.630.610.680.580.540.50.510.580.50.58
MS0.630.510.650.50.50.530.50.470.40.520.52
Note: The best metric values for each site are highlighted in bold. The best metric is the highest convergence value for each station.
Table 3. Seasonal and annual sums of SSR in MJ/m2 at the actinometric stations in 2023.
Table 3. Seasonal and annual sums of SSR in MJ/m2 at the actinometric stations in 2023.
BGSUXBKUhLU2HakESeason
367.47323.99315.06330.54373.42240.46409.64370.65205.90237.65Winter
1365.241418.171448.711395.431413.561373.611594.161721.121097.28980.62Spring
1524.611647.521655.591646.751474.461554.191700.081636.281270.821137.95Summer
702.22713.36678.92678.36659.47555.35720.09674.31489.59556.43Autumn
3959.544103.044098.284051.083920.923723.614423.964402.373063.582912.65Total
Note: Winter season was composed from January 2023, February 2023 and December 2023.
Table 4. Model’s performance characteristics.
Table 4. Model’s performance characteristics.
R2RMSE, MJ/m2MAREMREModel
0.544.5380.33−0.18AVHRR_Exp
0.484.8720.35−0.15TERRA_Exp
0.474.9320.36−0.18AQUA_Exp
0.405.1890.36−0.17Meteo_Exp
0.374.7240.480.44ERA5
0.325.5530.40−0.34AVHRR_Line
0.186.1590.45−0.33TERRA_Line
0.156.2550.46−0.36AQUA_Line
0.106.3520.43−0.31Meteo_Line
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Golubets, D.; Voropay, N.; Dyukarev, E.; Aslamov, I. Evaluation of Cloud Fraction Data for Modelling Daily Surface Solar Radiation: Application to the Lake Baikal Region. Atmosphere 2025, 16, 1405. https://doi.org/10.3390/atmos16121405

AMA Style

Golubets D, Voropay N, Dyukarev E, Aslamov I. Evaluation of Cloud Fraction Data for Modelling Daily Surface Solar Radiation: Application to the Lake Baikal Region. Atmosphere. 2025; 16(12):1405. https://doi.org/10.3390/atmos16121405

Chicago/Turabian Style

Golubets, Dmitry, Nadezhda Voropay, Egor Dyukarev, and Ilya Aslamov. 2025. "Evaluation of Cloud Fraction Data for Modelling Daily Surface Solar Radiation: Application to the Lake Baikal Region" Atmosphere 16, no. 12: 1405. https://doi.org/10.3390/atmos16121405

APA Style

Golubets, D., Voropay, N., Dyukarev, E., & Aslamov, I. (2025). Evaluation of Cloud Fraction Data for Modelling Daily Surface Solar Radiation: Application to the Lake Baikal Region. Atmosphere, 16(12), 1405. https://doi.org/10.3390/atmos16121405

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop