Spatial and Temporal Correlations of COVID-19 Mortality in Europe with Atmospheric Cloudiness and Solar Radiation

Iftime, Adrian; Omer, Secil; Burcea, Victor-Andrei; Călinescu, Octavian; Babeș, Ramona-Madalina

doi:10.3390/ijgi14080283


by Adrian Iftime^1,*,†, Secil Omer^2,†, Victor-Andrei Burcea^1,3, Octavian Călinescu^1 and Ramona-Madalina Babeș^1

^1 Biophysics Department, Carol Davila University of Medicine and Pharmacy, 050474 Bucharest, Romania

^2 MedLife Life Memorial Hospital, Calea Griviței 365, 010719 Bucharest, Romania

^3 Ștefan S. Nicolau Institute of Virology, 030304 Bucharest, Romania

^* Author to whom correspondence should be addressed.

^† These authors contributed equally to this work.

ISPRS Int. J. Geo-Inf. 2025, 14(8), 283; https://doi.org/10.3390/ijgi14080283

Submission received: 22 May 2025 / Revised: 10 July 2025 / Accepted: 18 July 2025 / Published: 22 July 2025

(This article belongs to the Special Issue HealthScape: Intersections of Health, Environment, and GIS&T (2nd Edition))


Abstract

Previous studies reported links between COVID-19 incidence and weather factors, but few investigated their impact and timing on mortality at a continental scale. We systematically investigated the temporal relationship of COVID-19 mortality in the European countries in the first year of the pandemic (March–December 2020) with (i) solar insolation (W/m²) at ground level and (ii) objective sky cloudiness (as decimal cloud fraction), both derived from satellite measurements. We checked the correlations of these factors within a sliding window of two months for the whole period. Linear mixed-effects modeling revealed that overall, for the European countries (adjusted for latitude), COVID-19 mortality was substantially negatively correlated with solar insolation in the previous month (std. beta −0.69). Separately, mortality was significantly correlated with the cloudiness in both the previous month (std. beta +0.14) and the respective month (std. beta +0.32). This time gap of ∼1 month between COVID-19 mortality and the correlated weather factors was previously unreported. The long-term monitoring of these factors might be important for epidemiological policy decisions, especially in the initial period of potential future pandemics when effective medical treatment might not yet be available.

Keywords:

COVID-19; mortality; solar insolation; cloudiness; latitude; temporal correlation; spatial correlation; satellite data; Europe


1. Introduction

The COVID-19 (coronavirus disease 2019) pandemic, caused by the SARS-CoV-2 virus (severe acute respiratory syndrome coronavirus 2), has had a significant impact on global health, with millions of people affected by its quick spread. In the initial year of the pandemic (2019–2020), effective countermeasures (vaccination, antiviral therapy) were not yet available, and the environmental factors that influenced viral transmission and mortality were intensely debated. If unimpeded by countermeasures, the spread of coronaviruses (and other similar respiratory viruses) in the population is influenced by a combination of medical, biological, socio-economic and environmental factors. The environmental factors that might influence the spread and impact of COVID-19 (temperature, humidity, season, wind, atmospheric composition, precipitation, dew point and other climate factors) have since been extensively researched, sometimes with mixed results [1,2,3,4,5,6,7,8]. It has also been observed that latitude is a significant factor that modulates the dynamics of COVID-19 spread in the population [2,9]. This might be due either to the variation of UV intensity with latitude [10] or to the variation of temperature with latitude [11,12]. At the same time, other studies argue against the effect of temperature and focus instead on the variation of vitamin D synthesis with latitude [13].

It is known from previous laboratory measurements that SARS-CoV-2 virions are sensitive to light (proportional to the illuminance intensity of the light [14,15,16,17,18,19]); it seems that SARS-CoV-2 virions aerosolized from infected persons could remain infectious in outdoor settings for prolonged times during low illuminance periods (winter, autumn), posing a risk for re-aerosolization and reinfection [15].

The solar radiation energy reaching earth (UV-B and UV-A at noon time during summers in temperate regions) is highly effective in inactivating SARS-CoV-2 virions [20,21]. The UV component of sunlight could be an additional factor that influences seasonal variations of COVID-19 epidemic (leading to a lower incidence during summertime) [22] among many other investigated meteorological factors (temperature, humidity, wind, precipitation, pollutants) [5,23]. The relationship between meteorological factors and various COVID-19 epidemiological characteristics was extensively investigated [3,5,24,25], but there are still many unknowns about the environmental drivers of transmission and infection [26,27].

These findings indicate a complex relationship between the SARS-CoV-2 virion inactivation by natural sunlight and the factors that modulate the sunlight intensity: latitude, season, and factors that influence optical properties of the atmosphere. Among these, extensive studies were performed, with mixed results, for instance for gases [24,28], natural and artificial pollutants [29], fog, haze and particulate matter [30,31].

There seems to be a mismatch between the laboratory results of sunlight inactivation and theoretical estimates based on photo-chemical reactions (i.e., virions are inactivated several times faster than predicted by theory) [32]. Proposed hypotheses for this finding were: (a) the virions’ sensitivity to broader light spectrum (UV-A and natural blue light might additionally impair the virions) [15,32,33], (b) the presence of naturally occurring photo-sensitizer molecules in the medium (water droplets interacting with humic acids from soil or waste dust) [32,34].

As presented, the meteorological influence on COVID-19 prevalence or incidence (i.e., new cases over a period in a region, or the rate of new cases) was extensively researched; there are comparatively fewer studies and reviews on the meteorological influence on mortality [10,35,36].

From a temporal perspective, it is known that in the natural course of the COVID-19 disease, there is a median lag of 2 weeks between the infection time and development of severe symptoms/hospitalization; and in the fatal cases, there is an additional median time of 2–3 weeks between hospitalization and death [37,38,39,40]. Therefore, in the majority of the fatal cases, the infection occurred in the month prior to the date of death, see Figure 1.

We therefore hypothesize that, because of this temporal lag, there should be a correlation between COVID-19 mortality in a given month and the meteorological factors in the previous month in the same geographical area; more specifically:

(a) given the light sensitivity of the SARS-CoV-2 virions, we hypothesize that if the solar insolation is higher in a month, this should correlate with a decreased mortality in the following month;

(b) however, given that the light–virion interaction is modulated by water droplets and air contaminants [32,34], we hypothesize that there should be linked dynamics between mortality and atmospheric cloudiness, as a proxy for both humidity [41] and atmospheric contaminants [42,43]. To the best of our knowledge, this temporal link between cloudiness and mortality has not been investigated until now.

A complication arises because of cross-interactions: solar irradiance at ground level is heavily influenced by the atmospheric composition, primarily by clouds [44] and only secondarily by other factors such as dust, pollutants, and humidity [45]. Clouds exert a complex influence: by reflection of the sunlight (back into space) they reduce the amount of total energy that reaches the earth but by scattering they can counter-intuitively direct some of the energy back to land, especially in the UV portion of the spectrum, thus modulating the biologically effective radiation dose received by living things [46]. Because of this interaction, we took care to model them separately.

To test these hypotheses, we conducted a retrospective observational longitudinal study at the continental scale, where the units of observation were the European countries (landmass), the time points were the months of the year 2020, the outcome variable was the COVID-19 mortality in each month and the investigated (possible explanatory) variables were: (a) the satellite-measured solar insolation at ground level, (b) the satellite-measured atmospheric cloudiness (cloud cover), (c) latitude and (d) longitude. We investigated these factors dynamically, i.e., both in the month under investigation and with a lag of 1 month relative to it.
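The one-month lag used in this design can be sketched as follows. This is a minimal Python illustration, not the authors' R code; the function name `add_previous_month`, the record layout and the numbers are hypothetical.

```python
def add_previous_month(series):
    """Pair each month's value with the value from the preceding month.

    `series` maps a month number (1-12) to a monthly average for one country.
    Returns a list of (month, current, previous) tuples; months without a
    preceding value in `series` are skipped.
    """
    rows = []
    for month in sorted(series):
        if month - 1 in series:
            rows.append((month, series[month], series[month - 1]))
    return rows


# Hypothetical monthly insolation values (W/m^2) for one country.
insolation = {2: 60.0, 3: 110.0, 4: 180.0, 5: 230.0}
lagged = add_previous_month(insolation)
```

Each resulting row carries both the current and the previous month's value, so the two can be tested as separate explanatory variables for the mortality of the current month.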

2. Materials and Methods

2.1. Geographical Data

Geographical data and the administrative boundaries of the countries were retrieved from the public domain repository Natural Earth [47] and processed within R version 4.5 [48] with the R software packages rnaturalearth [49], sp [50,51], sf [52], stars [53] and raster [54]. For this study we set the units of the analysis to be the countries as a whole (i.e., not smaller administrative units). From this dataset, for each country we extracted: (a) the country border (see the magenta lines in Figure 2) and (b) the location and size of the surface area in a rectangular tiled grid with a fairly detailed resolution of 0.25° latitude × 0.25° longitude. We ensured that the other geographical datasets used in this study matched the same spatial resolution. The marginal tiles (i.e., at the border of the country, on peninsulas or on small islands) were included only if more than 50% of the area of a tile was inside the country border. This integration algorithm ensures that there are neither overlapping nor missing tiles (i.e., each tile can belong to a single country) and results in polygonal surfaces with 0.25° latitude × 0.25° longitude resolution that optimally cover the countries at this resolution (Figure 2).
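The "more than 50% of the tile area" rule can be illustrated with a small Python sketch. It is not the authors' implementation (which relied on the R spatial packages listed above): here the covered fraction is only approximated by sampling sub-cell centres, the coordinates are treated as planar, and all function names and the square "country" are hypothetical.

```python
def point_in_polygon(x, y, polygon):
    """Ray-casting test; `polygon` is a list of (x, y) vertices in order."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Does this edge straddle the horizontal line through y?
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def tile_fraction_inside(lon0, lat0, polygon, size=0.25, samples=10):
    """Estimate the fraction of a tile covered by the polygon.

    The tile spans [lon0, lon0 + size] x [lat0, lat0 + size]; coverage is
    approximated by testing the centres of samples x samples sub-cells.
    """
    step = size / samples
    hits = 0
    for i in range(samples):
        for j in range(samples):
            x = lon0 + (i + 0.5) * step
            y = lat0 + (j + 0.5) * step
            if point_in_polygon(x, y, polygon):
                hits += 1
    return hits / samples ** 2


def tile_belongs(lon0, lat0, polygon, threshold=0.5):
    """Apply the 'more than 50% inside' rule to one tile."""
    return tile_fraction_inside(lon0, lat0, polygon) > threshold


# Hypothetical square "country" covering longitude 0..1 and latitude 0..1.
border = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
```

Because a tile straddling a shared border is more than half inside at most one of the two neighbouring countries, a rule of this kind assigns it to a single country or to none.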

Since latitude is a fixed factor that modulated the COVID-19 pandemic dynamics (see Introduction) we carefully included it in our modeling and adjusted the other factors for it.

In order to analyze the latitude and the longitude at the country level (the “latitude of a country” and the “longitude of a country” variables), we chose to use the country centroid. A centroid (also known as the center of gravity or center of mass) is the arithmetic mean of the positions of all the points in a geometrical object; for irregular objects, it is closest to the center of the biggest part of the object (i.e., it is less influenced by very thin or heavily scattered boundaries). We chose this measure because several countries have highly irregular geometrical boundaries, long thin peninsulas or numerous islands (e.g., Greece, Norway); the advantage of the country centroid is that it is located closer to the widest area of the mainland. We used a standard database of countries’ centroids published by Google Maps developers [55].
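For illustration only (the study took its centroids from a published database rather than computing them), the area-weighted centroid of a simple planar polygon can be obtained with the shoelace formula. The function name and the L-shaped test polygon below are hypothetical.

```python
def polygon_centroid(vertices):
    """Area-weighted centroid of a simple polygon (shoelace formula).

    `vertices` is a list of (x, y) pairs in boundary order; planar
    coordinates are assumed.
    """
    area2 = 0.0  # twice the signed area
    cx = 0.0
    cy = 0.0
    n = len(vertices)
    for i in range(n):
        x1, y1 = vertices[i]
        x2, y2 = vertices[(i + 1) % n]
        cross = x1 * y2 - x2 * y1
        area2 += cross
        cx += (x1 + x2) * cross
        cy += (y1 + y2) * cross
    # Centroid = sum / (6 * area) = sum / (3 * area2)
    return cx / (3.0 * area2), cy / (3.0 * area2)


# Hypothetical L-shaped outline made of three unit squares.
l_shape = [(0, 0), (2, 0), (2, 1), (1, 1), (1, 2), (0, 2)]
centroid = polygon_centroid(l_shape)
```

Unlike the plain mean of the boundary vertices, this measure weights every part of the shape by its area, which is why thin peninsulas or densely digitized coastlines pull it only slightly away from the main landmass.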

As a verification step, the centroids and borders of the countries were also overlaid on the maps of a different digital mapping provider, OpenStreetMap [56], and the overlap was checked visually; no issues were found. The projection used in this study was the Spherical Pseudo-Mercator Projection (also known as Web Mercator; European Petroleum Survey Group (EPSG) identifier EPSG:3857), which is commonly used by Google Maps Developers [55] and by OpenStreetMap [56].

2.2. Atmospheric Cloudiness Data

Cloudiness (also known as cloud fraction, cloud cover, cloud amount or sky cover) refers to the fraction of the sky obscured by clouds (in a particular location). It can be reported in various units; in this study, we used the decimal cloud fraction (as tenths of the entire sky), where 0.0 indicates a clear sky and 1.0 (or 10/10) indicates a completely covered sky.

We used a publicly available dataset of global cloudiness as measured from space by NASA’s Terra and Aqua satellites using the MODIS instrument (Moderate Resolution Imaging Spectroradiometer) [57]. This dataset is collected continuously and presented as values averaged daily, weekly and monthly for the entire globe; for this study we chose the monthly averaged values. In this dataset the entire Earth surface is divided into a rectangular grid; each rectangle of the grid contains the average cloud fraction of the sky covering that grid area. The datasets are available at different resolutions, and we used for this study the grid with 0.25° latitude × 0.25° longitude resolution (as the rest of the datasets used).

For each country and each month of the year 2020 we have extracted the cloudiness values for all tiles within the borders of a given country and averaged them; thus we calculated an “averaged cloudiness for a country” in a given month. For example, see Figure 2 top panel (I) as a visualization of the process for two randomly chosen countries, Germany and Romania. In the month of June 2020 the obtained average cloudiness for Germany was 0.713 (Figure 2a) and for Romania, it was 0.728 (Figure 2b). As a side note, the temporal resolution seems to be detailed enough to observe consistent variations in cloudiness patterns over the surface of the countries—for instance in Figure 2b there is a higher cloudiness that matches the Carpathian mountains arching in the middle of the country.
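The per-country, per-month averaging can be sketched in a few lines of Python. This is a hedged illustration rather than the authors' R pipeline: the input layout, the function name and the choice to skip missing tiles (`None`) are assumptions, and the values are made up.

```python
from collections import defaultdict
from statistics import fmean


def average_by_country_month(tiles):
    """Average tile values per (country, month).

    `tiles` is an iterable of (country, month, value) records, one per grid
    tile; records whose value is None (no measurement) are ignored.
    """
    groups = defaultdict(list)
    for country, month, value in tiles:
        if value is not None:
            groups[(country, month)].append(value)
    return {key: fmean(values) for key, values in groups.items()}


# Hypothetical cloud-fraction tiles for two countries in June.
tiles = [
    ("A", 6, 0.70),
    ("A", 6, 0.72),
    ("B", 6, 0.50),
    ("A", 7, None),
]
averages = average_by_country_month(tiles)
```

The mean here is unweighted, matching the plain averaging described in the text; on a latitude/longitude grid the tiles do not all have the same ground area, so an area-weighted mean would be a different design choice.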

As a convenience for the reader we included a quick visual overview of these aggregated results in Appendix A.1, for all European countries, for each month of the year, Figure A1.

2.3. Solar Insolation Data

The average solar insolation (also known as solar irradiance, solar exposure, incoming sunlight) in W/m² at the Earth’s surface was used in this study. Solar irradiance is one of the main factors that determine the temperature at ground level (with an almost linear dependence [58]), and the climate in general [59]. We used a publicly available dataset inferred from measurements taken by Clouds and Earth’s Radiant Energy System (CERES) instrument flying aboard NASA’s Terra and Aqua satellites [60]. We used the same temporal and spatial sampling as presented above (Section 2.2), i.e., monthly averaged values over a 0.25° latitude × 0.25° longitude grid.

We repeated the same algorithm (as used for cloudiness data): for each country and each month of the year 2020 we extracted the solar insolation values for all tiles within the borders of a given country and averaged them; thus we calculated an “averaged insolation for a country” in a given month. For a visual example of the process see Figure 2, bottom panel (II): for Germany, the average insolation calculated for the month of June 2020 was 255.1 W/m² (Figure 2c) and for Romania it was 275 W/m² (Figure 2d). The overview of these aggregated results for all countries is presented in Appendix A.2, Figure A2.

2.4. Epidemiological Data (COVID-19 Data Sources)

Different epidemiological variables about COVID-19 epidemics were collected and reported around the world and summarized in various ways (academic institutions, national health bodies, media sites, public data repositories, etc.) in an effort to promote effective research and public understanding of the phenomenon. However, despite the best intentions of the authors, errors and inconsistencies appeared in COVID-19 epidemiological data due to the difficulty of aggregating [61], collecting [62,63,64], compiling [65] and interpreting conflicting estimates [66,67].

A further complication was that the publicly reported data were sometimes revised retrospectively by the health authorities, as updated epidemiological procedures and methods were devised [68], and initial studies (in the years 2020–2022) might have inadvertently used out-of-date epidemiological datasets. In order to mitigate these concerns, in this study we used a public data source, COVID-19 Data Hub [69], that provides immutable daily snapshots of the data to ensure reproducible research. This database provides the daily time-series of COVID-19 cases, deaths, recovered people, tests, vaccinations, and hospitalizations, for more than 230 countries and their lower-level administrative divisions [69].

From this database we selected the daily confirmed death counts at the country level, within the limits of the entire year 2020 (start date: 1 January 2020, end date: 31 December 2020). The access time was 13 February 2025 (this is the snapshot date); being a late snapshot, it includes the retrospective corrections of the epidemiological data [68]. From the reported daily death counts we calculated the total monthly deaths for each country via two different methods (to independently cross-check for possible errors): (a) method 1: total monthly deaths as the sum of the deaths reported on each day of a given month; (b) method 2: total monthly deaths as the difference between the cumulative deaths reported on the last day of each month and on the last day of the previous month. The two value sets agreed. We performed this intermediary check because, in a preliminary analysis with the Worldometer [70] public data set, we had spotted inconsistencies between the results of the two methods.
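The two counting methods and their cross-check can be sketched as below. This is a minimal Python illustration with made-up numbers; the function names are hypothetical and the code is not taken from the authors' analysis.

```python
def monthly_deaths_from_daily(daily_new_deaths):
    """Method 1: sum the deaths reported on each day of the month."""
    return sum(daily_new_deaths)


def monthly_deaths_from_cumulative(cum_prev_month_end, cum_this_month_end):
    """Method 2: difference between end-of-month cumulative totals."""
    return cum_this_month_end - cum_prev_month_end


def methods_agree(daily_new_deaths, cum_prev_month_end, cum_this_month_end):
    """Return True when both methods give the same monthly total."""
    return monthly_deaths_from_daily(daily_new_deaths) == \
        monthly_deaths_from_cumulative(cum_prev_month_end, cum_this_month_end)


# Hypothetical four-day "month" for one country.
daily_new_deaths = [3, 0, 5, 7]
cumulative_prev_month_end = 100
cumulative_this_month_end = 115
consistent = methods_agree(
    daily_new_deaths, cumulative_prev_month_end, cumulative_this_month_end
)
```

A disagreement between the two totals would indicate, for example, that the cumulative series had been revised without a matching revision of the daily series.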

We then calculated the monthly mortality per million due to COVID-19 for each country in a month as (total monthly deaths / average country population) $\times 10^{6}$. The average country population number was the average for the entire year 2020.
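As a worked example of this formula, a short Python sketch follows; the function name and the input numbers are hypothetical and chosen only to keep the arithmetic easy to follow.

```python
def mortality_per_million(monthly_deaths, average_population):
    """Monthly deaths scaled to a population of one million."""
    return monthly_deaths * 1_000_000 / average_population


# Hypothetical country: 950 deaths in a month, 19 million inhabitants.
example = mortality_per_million(950, 19_000_000)
```

With these inputs the result is 50 deaths per million, which makes countries of very different sizes directly comparable.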

We chose to use the mortality parameter because it seems to be less variable than the incidence. Mortality is usually recorded using a fixed legal procedure and reported by the same personnel (coroners, hospital doctors, etc.). The incidence reports could include self–reports from self–testing, field reports; or the procedures for testing could not always be reliably repeated (especially in the beginning of the pandemic when some countries experienced shortages in supply with testing kits or dealt with sudden changes of testing policies).

There are two issues with this approach that might impact our study. First, there are legal and practical differences among countries regarding death recording and reporting [71]. The COVID-19 database we used records the deaths as reported by local authorities; we could not quantitatively assess the differences between the countries, which could impact comparison of different countries. Second, the time lag between the actual death and the reported time could be different for each case; we think that the monthly intervals we chose for analysis probably averaged most of the differences.

2.5. Inclusion and Exclusion Criteria

The inclusion criteria were as follows: (a) all countries on the European continent; (b) availability of epidemiological and geographical (cloudiness, insolation) data.

The exclusion criteria were as follows: (a) country population < 0.5 million and (b) a geographical bounding box of the countries (or islands or peninsulas) smaller than 0.25° latitude × 0.25° longitude. We used these criteria because the European micro-states (Monaco, Vatican, San Marino, etc.) were too small to be properly sampled at the available resolution of the geospatial data (insolation, cloudiness). Russia was also excluded from the analysis because the COVID-19 epidemiological data from the country were publicly available only in an aggregate form (i.e., no data were available detailing the epidemiology separately in the European and Asian parts of Russia; in this study we focused on the European landmass).

In this way we obtained a list of 37 European countries (listed alphabetically): Albania, Austria, Belarus, Belgium, Bosnia and Herzegovina (abbreviated as Bosnia_and_H in the graphics), Bulgaria, Croatia, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Ireland, Italy, Latvia, Lithuania, Luxembourg, Malta, Moldova, Montenegro, Netherlands, North Macedonia, Norway, Poland, Portugal, Romania, Serbia, Slovakia, Slovenia, Spain, Sweden, Switzerland, United Kingdom (abbreviated as UK in the graphics) and Ukraine.

2.6. Statistics

2.6.1. Variables

For each one of the 37 countries we included in the analysis the following variables:

(i) calculated COVID-19 monthly mortality (deaths per 1 million population) in each month;

(ii) time, as monthly interval (each one of the 10 months in the studied interval, March–December 2020);

(iii) calculated monthly insolation value (averaged solar radiation for each month across the country);

(iv) calculated monthly cloud fraction (averaged decimal cloud fraction);

(v) the latitude and longitude of the countries (centroid data).

For these variables we thus aggregated a total of 1850 data-points from the datasets; from these, 1110 were independent (latitude and longitude are dependent on the country). In order to ensure reproducible analysis, we have included these data points and the intermediary steps as an open-access dataset (see Data Availability Statement at the end).
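The text does not spell out how these totals arise. One reading that is consistent with them (an assumption, not a statement from the source) is that 37 countries over 10 months give 370 country-months, each carrying five values (mortality, insolation, cloudiness, latitude, longitude), of which three (mortality, insolation, cloudiness) vary independently of the country's position:

```latex
\begin{aligned}
37 \times 10 &= 370 \quad \text{(country-months)} \\
370 \times 5 &= 1850 \quad \text{(all data points)} \\
370 \times 3 &= 1110 \quad \text{(independent data points)}
\end{aligned}
```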

2.6.2. Data Verification and Transformation

For each continuous variable included in the analysis, we calculated the descriptive statistics and checked the data distribution visually, with histogram (density) plots and quantile–quantile (QQ) plots, and formally, with the Lilliefors-corrected Kolmogorov-Smirnov test for larger samples ($n > 30$) and, for the sub-sets, with the Shapiro-Wilk normality test for smaller samples ($n \leq 30$) [72,73].

In the cases of variables where the data distribution was found to be non-normal, we attempted to linearize the data set, with the formal goal of approximating a normal distribution by reducing the skewness of the non-normal data [74,75]. For this dataset we explored several procedures for data transformation: log–transform, log1p–transform, cube–root transform and Box–Cox transform [76,77]. As the mortality data contained a very broad range of values, including very small values and zero, we used the standard log1p function [78] included in the R language to avoid computation singularities or rounding errors; the function log1p(x) computes $\log(1 + x)$ accurately for $|x| \ll 1$. To back-transform we used its inverse function, expm1(x), which computes $\exp(x) - 1$ accurately also for $|x| \ll 1$.
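The behaviour of this transform pair can be illustrated with Python's standard `math.log1p` and `math.expm1`, which mirror the R functions of the same names. The wrapper names and sample values below are hypothetical; the analysis itself was done in R.

```python
import math


def to_log_scale(mortality_per_million):
    """Forward transform: log(1 + x), defined for x = 0 as well."""
    return math.log1p(mortality_per_million)


def from_log_scale(value):
    """Back-transform: exp(value) - 1, the inverse of log1p."""
    return math.expm1(value)


# A zero-mortality month maps to zero instead of minus infinity.
zero_month = to_log_scale(0.0)

# For very small arguments log1p keeps precision that log(1 + x) loses,
# because 1 + x is rounded before the logarithm is taken.
tiny = 1e-12
naive = math.log(1 + tiny)
accurate = math.log1p(tiny)

# Round trip on an arbitrary value.
round_trip = from_log_scale(to_log_scale(123.4))
```

The key practical point is that months with zero deaths remain usable after transformation, which a plain logarithm would not allow.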

Data analysis was performed with R [48] version 4.5, with the additional packages: lme4 (version 1.1-37) [79], ggeffects [80], ggplot2 [81], emmeans [82], effectsize [83], lmtest [84], report [85], rstatix [86], stargazer [87], sjPlot [88], tidyverse [89] and car [90].

2.7. Modeling

The main starting hypothesis of this study is that there might be a temporal correlation between the variation in COVID-19 mortality and the sky cloudiness or insolation (adjusted for latitude), in the first year of the pandemic (before vaccination was available).

For this retrospective longitudinal study we chose two types of models: (1) linear mixed effects (to analyze all longitudinal data series) and (2) classic linear regression (on averaged data set). The former can capture discrete effects but are harder to interpret; the latter are less precise (due to averaging) but were chosen as they are more easily interpreted [91,92]. We also used the two different modeling techniques as a way to independently check the validity of the results.

In both modeling techniques we extensively checked for temporal autocorrelation structures (as there are previous reports of temporal features of other meteorological factors influencing COVID-19 epidemics [31]). As another validity check, in order to avoid possible multicollinearity interactions between insolation and cloudiness, we did not include both variables (solar insolation and cloudiness) in the same model at the same time (i.e., we modeled them separately and compared the results).

2.7.1. Linear Mixed-Effects

We used linear mixed-effects modeling (LME) [93] to explore the association between the monthly COVID-19 mortality (outcome variable) and the rest of variables included in the analysis, and in all models we included Country as the random effect variable.

We took a conservative approach in LME modeling [91] and as such: we kept the models as simple as possible (as few parameters and transformations as possible, to avoid over-fitting); we compared diverse models using the AIC (Akaike information criterion) and BIC (Bayesian information criterion) measures; we report the full model parameters and the tests done (in Appendix A, in order not to clutter the main text). To aid understanding, we depicted the models graphically and reported $R^{2}$. For mixed-effects models, $R^{2}$ can be categorized loosely into two types: marginal $R^{2}$ and conditional $R^{2}$. The marginal $R^{2}$ is concerned with the variance explained by the fixed factors of the model, and the conditional $R^{2}$ is concerned with the variance explained by both fixed and random factors (i.e., by the whole model) [94]. For clarity, in this paper we denote the marginal as $R_{m}^{2}$ and the conditional as $R_{c}^{2}$.

LME models were calculated with lme4 [79], using the default settings proposed by its authors: the fitting was done with the restricted maximum likelihood (REML) option, using the “nloptwrap” (nonlinear optimization) settings. In order to facilitate comparisons of the models and to judge the relative influence of the factors within the same model, we calculated and reported standardized parameters (standardized beta) [91,95]; these were obtained by fitting each model on a standardized version of the dataset. The 95% Confidence Intervals (CIs) and p-values were computed using a Wald t-distribution approximation. The effect sizes were qualitatively judged (as “small”, “moderate”, “substantial”) according to the recommendations summarized in [83,96].
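The distinction between the marginal and the conditional coefficient of determination can be made concrete with a small sketch. It assumes the commonly used variance-partition definition for a Gaussian model with a random intercept (fixed-effect variance, random-intercept variance, residual variance); the function name and the variance values are hypothetical and are not taken from the fitted models.

```python
def r2_marginal_conditional(var_fixed, var_random, var_residual):
    """Return (marginal R2, conditional R2) from variance components.

    marginal    = fixed / (fixed + random + residual)
    conditional = (fixed + random) / (fixed + random + residual)
    """
    total = var_fixed + var_random + var_residual
    marginal = var_fixed / total
    conditional = (var_fixed + var_random) / total
    return marginal, conditional


# Hypothetical variance components, for illustration only.
r2_m, r2_c = r2_marginal_conditional(2.0, 1.0, 1.0)
```

By construction the conditional value can never be smaller than the marginal one; the gap between them shows how much of the variance is attributed to differences between countries.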

2.7.2. Linear Regression

The key advantage of simple linear regression is that its results are easy to understand intuitively.

In order to perform linear regression, we have averaged the data in time (time–averaged models) and separately, in space (space–averaged models). The fitting of the data was performed with the standard lm (linear model) included in R using default ordinary least squares (OLS) method.
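A minimal Python sketch of an ordinary least squares fit with a single predictor is given below, together with the refit on z-scored data that yields a standardized slope (the same idea as the standardized parameters described for the mixed models). It is an illustration with made-up data, not a substitute for R's `lm`; the function names are hypothetical.

```python
from statistics import fmean, stdev


def ols_fit(x, y):
    """Return (intercept, slope) of the least-squares line y = a + b * x."""
    mean_x, mean_y = fmean(x), fmean(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def standardize(values):
    """Z-score a list of values (mean 0, sample standard deviation 1)."""
    m, s = fmean(values), stdev(values)
    return [(v - m) / s for v in values]


# Hypothetical, perfectly linear data with a negative slope.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [10.0 - 0.5 * xi for xi in x]

intercept, slope = ols_fit(x, y)
_, standardized_slope = ols_fit(standardize(x), standardize(y))
```

With one predictor the standardized slope equals the Pearson correlation coefficient, which is why standardized coefficients can be compared across variables measured in different units.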

2.7.3. Model Selection

Selection of the most likely predictor variable was done with bidirectional (forward and backward) step-wise multivariable regression [97]. Briefly, for each situation, we started with the full model (incorporating all included variables as the possible predictors) and then performed an automated stepwise regression eliminating redundant or low-impact variables from the models, thus selecting the most likely predictor variables. We used AIC and BIC as a quality indicator for each step in model selection; we selected the models with the lowest AIC and BIC scores.
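The two information criteria used for ranking can be written down directly from their standard definitions. The Python sketch below shows only the scoring step, not the stepwise search itself; the candidate names, log-likelihoods and parameter counts are invented for illustration.

```python
import math


def aic(log_likelihood, n_params):
    """Akaike information criterion: 2k - 2 ln(L)."""
    return 2 * n_params - 2 * log_likelihood


def bic(log_likelihood, n_params, n_obs):
    """Bayesian information criterion: k ln(n) - 2 ln(L)."""
    return n_params * math.log(n_obs) - 2 * log_likelihood


def best_by_aic(candidates):
    """Return the name of the candidate with the lowest AIC.

    `candidates` maps a model name to (log_likelihood, n_params).
    """
    return min(candidates, key=lambda name: aic(*candidates[name]))


# Hypothetical candidates; the numbers are not from the paper.
candidates = {
    "latitude only": (-520.0, 3),
    "latitude + previous insolation": (-400.0, 4),
}
selected = best_by_aic(candidates)
```

Both criteria reward a higher likelihood and penalize extra parameters; BIC penalizes them more strongly as the number of observations grows, so requiring the lowest score on both is a conservative choice.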

We report here only the significant models that passed the following stringent validity tests: (a) normality of the residuals (checked with QQ plots and Kolmogorov-Smirnov/Shapiro-Wilk tests), (b) presence of outliers, (c) homogeneity of the variance of the residuals (we assumed homoscedasticity if the p-value of the studentized Breusch-Pagan test was greater than 0.05) and (d) autocorrelation of the residuals, checked with a Durbin-Watson test (we considered that there was no autocorrelation in the residuals of a regression if the p-value of this test was > 0.05 and the d-statistic was in the range 1.5 to 2.5). Additionally, we checked the interrelationships between the predictor variables (statistical collinearity) using the Variance Inflation Factor (VIF) statistic [98]. The temporal correlation between the predictor series was checked with the Granger causality test [84,99].
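Two of these diagnostics have simple closed forms, sketched below in Python from their textbook definitions. The sketch covers only the Durbin-Watson d-statistic (not its p-value) and the VIF computed from a given $R^{2}$; the function names and the residual series are hypothetical.

```python
def durbin_watson(residuals):
    """d = sum of squared successive differences / sum of squared residuals."""
    numerator = sum(
        (residuals[i] - residuals[i - 1]) ** 2 for i in range(1, len(residuals))
    )
    denominator = sum(e ** 2 for e in residuals)
    return numerator / denominator


def d_in_accepted_range(d, low=1.5, high=2.5):
    """Range check on the d-statistic, using the bounds stated in the text."""
    return low <= d <= high


def variance_inflation_factor(r_squared):
    """VIF = 1 / (1 - R2), where R2 comes from regressing one predictor
    on all the other predictors."""
    return 1.0 / (1.0 - r_squared)


# Hypothetical residual series.
d_alternating = durbin_watson([1.0, -1.0, 1.0, -1.0])  # strong negative autocorrelation
d_balanced = durbin_watson([1.0, -1.0, -1.0, 1.0])
```

Values of d near 2 indicate no first-order autocorrelation, values near 0 positive autocorrelation and values near 4 negative autocorrelation; a VIF of 1 means the predictor is uncorrelated with the others.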

The significance level of all statistical tests used in this paper was set at the typical 5%.

3. Results

3.1. Aggregated Data

For the described variables (country, time, mortality, insolation, cloudiness, latitude and longitude) we extracted 1850 aggregated data-points from the datasets. From these, 1110 were independent (latitude and longitude are dependent on the country); their statistical summary is shown in Table 1, and a quick visual overview is presented in the Appendix A, see Figure A1, Figure A2 and Figure A3.

The monthly death count and the mortality per million (the outcome variable) show a severely right-skewed distribution (skewness = 4.47 and 2.01, respectively). Linearization was attempted on the data set, with the goal of approaching a normal distribution by reducing the skewness [74,75]; the smallest skewness was achieved with the log1p-transformation of the data (transformed data skewness = −0.032), see Appendix A.7, Figure A6.

The monthly insolation data follows the expected typical annual cycle pattern [100] with a broad maximum during the months of May–July [101], see an overview in Figure A2.

The monthly cloudiness data for the year 2020 is more randomly distributed than the insolation data, with the highest variability in months of July–August, see Figure A1.

3.2. LME Modeling of Monthly Mortality and Insolation

Within the paradigm: “Atmospheric factors might influence the COVID-19 mortality” we tested the hypothesis “There might be a temporal correlation between the insolation and COVID-19 mortality”.

We investigated the response (log1p-transformed mortality in a month) to possible explanatory variables: insolation in the current month, insolation in the previous month and geographical coordinates, using 6 linear mixed models that progressively included the explanatory variables and included Country as a random effect. We selected the final model (Figure 3) based on the lowest AIC and BIC scores.

The R formula used for this model: log1p(deaths/million) ~ Previous_Insolation_Average + Latitude + (1|Country). The model included the Country as random effect (formula: ~ 1|Country). The Latitude is the centroid latitude of the country. The model’s total explanatory power is substantial ($R_{c}^{2}$ = 0.57) and the part related to the fixed effects alone is $R_{m}^{2}$ = 0.45. The model’s intercept, corresponding to Previous_Insolation_Average = 0 and Latitude = 0, is at 11.69 (95% CI [9.89, 13.49], t(337) = 12.74, p < 0.001). Within this model:

- The effect of Previous_Insolation_Average (i.e., insolation in the previous month, averaged for the whole country) is statistically significant and negative (β = −0.01, 95% CI [−0.01, −0.01], t(337) = −17.73, p < 0.001; Std. beta = −0.69, 95% CI [−0.76, −0.61]).

- The effect of Latitude is statistically significant and negative (β = −0.12, 95% CI [−0.15, −0.08], t(337) = −6.68, p < 0.001; Std. beta = −0.47, 95% CI [−0.60, −0.33]).

The full statistical details of the model are reported in Appendix A.4.

Summary: This statistically significant model (p < 0.001) relates the COVID-19 mortality in any given month to the average country insolation in the previous month (adjusted for latitude), showing that the higher the solar radiation in the previous month, the lower the mortality in the following month (Figure 3). The model’s total explanatory power is substantial ($R_{c}^{2}$ = 0.57) and the part related to the fixed effects alone is $R_{m}^{2}$ = 0.45. Both previous insolation and latitude are significant predictors of the variability of the mortality; the previous month’s insolation has the greatest relative influence (standardized beta = −0.69, see Figure 5a).

3.3. LME Modeling of Monthly Mortality and Cloudiness

Within the paradigm: “Atmospheric factors might influence the COVID-19 mortality” we tested the hypothesis “There might be a temporal correlation between the atmospheric cloudiness and COVID-19 mortality”. We investigated the relationship of the response variable (log1p-transformed mortality in a month) to possible explanatory variables: sky cloudiness in the current month, sky cloudiness in the previous month and geographical coordinates, using 6 linear mixed models that progressively included the explanatory variables and included Country as a random effect. We selected the final model (Figure 4) based on the lowest AIC and BIC scores.

The R formula used for this model:

log1p(deaths/million) ∼ Previous_Cloud_Fraction + Cloud_Fraction + Latitude + (1|Country). The model included the Country as random effect (formula: ∼ 1|Country). The Latitude is the centroid latitude of the country.

The model’s total explanatory power is moderate ($R_{c}^{2}$ = 0.24) and the part related to the fixed effects alone is $R_{m}^{2}$ = 0.16. The model’s intercept, corresponding to Previous_Cloud_Fraction = 0, Cloud_Fraction = 0 and Latitude = 0, is at 6.21 (95% CI [4.57, 7.85], t(336) = 7.45, p < 0.001). Within this model:

- The effect of Previous Cloud Fraction is statistically significant and positive (beta = 1.34, 95% CI [0.20, 2.49], t(336) = 2.31, p = 0.022; Std. beta = 0.14, 95% CI [0.02, 0.26])

- The effect of Cloud Fraction is statistically significant and positive (beta = 2.94, 95% CI [1.89, 3.99], t(336) = 5.52, p < 0.001; Std. beta = 0.32, 95% CI [0.21, 0.44])

- The effect of Latitude is statistically significant and negative (beta = −0.11, 95% CI [−0.15, −0.08], t(336) = −5.93, p < 0.001; Std. beta = −0.45, 95% CI [−0.60, −0.30]). The full statistical details of the model are in Appendix A.5.

Summary: The model found a relationship of the variability of the COVID-19 mortality in a given month with the average atmospheric cloudiness in that month, and with the cloudiness in the previous month (adjusted for latitude). The statistically significant model (p < 0.001) has a moderate total explanatory power (conditional $R^{2}$ = 0.24) and the part related to the fixed effects alone (marginal $R^{2}$) is 0.16. A greater cloudiness in the previous month and in the current month correlates with a greater COVID-19 mortality (when adjusted for latitude); the relative influence of the cloudiness in the previous month is about half of the influence of the current month (standardized beta = 0.14 for the previous month and 0.32 for the current month), see Figure 5b.
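Because the response is log1p(deaths/million), an unstandardized coefficient acts multiplicatively on the quantity (1 + deaths/million). The sketch below works this through for the reported current-month coefficient (beta = 2.94) and a 0.10 increase in cloud fraction, holding the other predictors fixed. The baseline of 50 deaths/million and the function names are hypothetical, and the result describes an association in the fitted model, not a causal effect.

```python
import math

BETA_CLOUD_FRACTION = 2.94  # reported coefficient for Cloud_Fraction (current month)


def multiplicative_change(beta, delta_x):
    """Factor applied to (1 + deaths/million) when the predictor rises by delta_x."""
    return math.exp(beta * delta_x)


def shifted_mortality(deaths_per_million, beta, delta_x):
    """Back-transform: log1p(y_new) = log1p(y_old) + beta * delta_x."""
    return math.expm1(math.log1p(deaths_per_million) + beta * delta_x)


# A 0.10 increase in decimal cloud fraction (e.g., from 0.50 to 0.60).
factor = multiplicative_change(BETA_CLOUD_FRACTION, 0.10)

# Hypothetical baseline of 50 deaths/million, chosen only for illustration.
shifted = shifted_mortality(50.0, BETA_CLOUD_FRACTION, 0.10)

print(f"factor on (1 + deaths/million): {factor:.3f}")
print(f"50 deaths/million would correspond to about {shifted:.1f}")
```

Under these assumptions the factor is exp(0.294), roughly 1.34.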

3.4. Time–Averaged Modeling

For each of the 37 countries, over the whole of 2020, we averaged the values:

- of the insolation in all months, thus obtaining an average insolation in 2020 for each country;

- of the cloud fraction in all months, thus obtaining an average cloud fraction in 2020 for each country;

- of the mortality/million values, thus obtaining an average mortality for the year 2020 for each country; for consistency purposes with the rest of models, we log1p transformed this averaged mortality.

We therefore reduced the longitudinal data to a data set that retains the spatial information (i.e., countries, latitude, and longitude) but averages out the temporal variations, as a prerequisite of linear modeling (to avoid including repeated measurement data in a linear model). We then ran a step–wise regression to find the most significant predictors (if any) of averaged mortality (as described in Section 2.7). The most significant model (listed below) found in time–averaged data correlates yearly averaged mortality in a country with the yearly cloudiness values (adjusted for latitude), see Figure 6. The other factors were not significant in time-averaged data.
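The time-averaging step described above can be sketched as follows. This is an illustrative outline only: the record layout, the country labels and the numbers are hypothetical, and the order of operations (average first, then log1p) follows the description in the text.

```python
import math
from collections import defaultdict

# Hypothetical records: (country, month, deaths_per_million, cloud_fraction).
records = [
    ("A", 3, 0.0, 0.70),
    ("A", 4, 12.0, 0.50),
    ("A", 5, 6.0, 0.60),
    ("B", 3, 2.0, 0.40),
    ("B", 4, 30.0, 0.30),
    ("B", 5, 10.0, 0.50),
]


def time_average(rows):
    """Collapse the monthly rows to one row per country."""
    by_country = defaultdict(list)
    for country, _month, deaths, cloud in rows:
        by_country[country].append((deaths, cloud))

    averaged = {}
    for country, values in by_country.items():
        avg_deaths = sum(d for d, _ in values) / len(values)
        avg_cloud = sum(c for _, c in values) / len(values)
        averaged[country] = {
            # Average the mortality first, then apply log1p to the average.
            "avg_log1p_mortality": math.log1p(avg_deaths),
            "avg_cloud_fraction": avg_cloud,
        }
    return averaged


averaged = time_average(records)
for country, row in sorted(averaged.items()):
    print(country, row)
```

Each country then contributes exactly one observation, which is what removes the repeated-measures structure before the ordinary linear model is fitted.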

The linear model had the R formula: Avg. log1p(deaths/million) ∼ Avg. cloud fraction + Latitude. The model explains a statistically significant and substantial proportion of variance ($R^{2}$ = 0.38, F(2, 34) = 10.30, p < 0.001, adj. $R^{2}$ = 0.34). The model’s intercept, corresponding to Avg. cloud fraction = 0 and Latitude = 0, is at 6.97 (95% CI [5.50, 8.43], t(34) = 9.63, p < 0.001). Within this model:

- The effect of Avg. cloud fraction is statistically significant and positive (beta = 5.62, 95% CI [0.98, 10.25], t(34) = 2.46, p = 0.019; Std. beta = 0.74, 95% CI [0.13, 1.36])

- The effect of Latitude is statistically significant and negative (beta = −0.13, 95% CI [−0.20, −0.06], t(34) = −3.91, p < 0.001; Std. beta = −1.18, 95% CI [−1.79, −0.57]). The full statistical details of the model and its validation are presented in Appendix A.6.

Summary: even with time–averaged data, our results suggest that there is a consistent correlation between the COVID-19 mortality and the cloudiness; about 34% of the variance in COVID-19 mortality (adjusted $R^{2}$ = 0.34) is accounted for by the cloud fraction of the sky together with latitude (Figure 6 and Figure 7 for a model visualization across the range of collected variables).
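As a consistency check, the adjusted value quoted above follows from the reported $R^{2}$ = 0.38 with n = 37 countries and p = 2 predictors, using the standard adjustment formula (the symbols n and p are introduced here for illustration):

```latex
\[
R^{2}_{\mathrm{adj}}
  = 1 - \left(1 - R^{2}\right)\frac{n - 1}{n - p - 1}
  = 1 - (1 - 0.38)\,\frac{37 - 1}{37 - 2 - 1}
  = 1 - 0.62 \times \frac{36}{34}
  \approx 0.34
\]
```

The adjustment penalizes the number of predictors relative to the sample size, which matters here because only 37 observations remain after time-averaging.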

3.5. Space–Averaged Modeling

An alternative approach to reduce the longitudinal data dimensions is to average it over the spatial dimensions, retaining chronological information (i.e., losing spatial details, like distribution over latitude and longitude). For each month (March–December) of 2020, we have averaged the values:

- of the insolation values in all countries, obtaining a European average insolation for each month;

- of the cloud fraction in all countries, obtaining a European average cloud fraction for each month;

- of the mortality/million values, in all countries, obtaining a European average mortality for each month; for consistency with the rest of the models, we again log1p transformed this average value before modeling.

We ran the same algorithm as in the previous section (a step–wise regression) with these variables; the major difference is that we could include in the model the insolation and cloudiness in the previous months (as the space–averaging preserves time information). In the space–averaged data, the most significant predictor that correlates with the European average mortality in a month was the European average insolation in the previous month (see Figure 8). The other factors were not significant in space-averaged data.
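The one-month lag used in the space-averaged data amounts to pairing each month’s European average mortality with the European average insolation of the month before. The sketch below illustrates that alignment; the data layout and values are hypothetical, and it assumes that an insolation value for the month preceding the first modeled month is available.

```python
import math

# Hypothetical data: month -> list of (deaths_per_million, insolation_W_m2),
# one tuple per country. Month 2 is included only to supply the lagged
# insolation for month 3.
monthly = {
    2: [(0.0, 60.0), (0.0, 80.0)],
    3: [(4.0, 110.0), (8.0, 130.0)],
    4: [(40.0, 180.0), (20.0, 200.0)],
    5: [(10.0, 230.0), (6.0, 250.0)],
}


def space_average(data):
    """Average over countries: month -> (log1p of mean mortality, mean insolation)."""
    averaged = {}
    for month, rows in data.items():
        mean_deaths = sum(d for d, _ in rows) / len(rows)
        mean_insolation = sum(i for _, i in rows) / len(rows)
        averaged[month] = (math.log1p(mean_deaths), mean_insolation)
    return averaged


def lagged_pairs(averaged, first_month):
    """Rows of (month, mortality in that month, insolation in the previous month)."""
    pairs = []
    for month in sorted(averaged):
        if month < first_month or (month - 1) not in averaged:
            continue
        pairs.append((month, averaged[month][0], averaged[month - 1][1]))
    return pairs


pairs = lagged_pairs(space_average(monthly), first_month=3)
for month, mortality, previous_insolation in pairs:
    print(month, round(mortality, 3), previous_insolation)
```

Each resulting row is one observation of the lagged regression, so a study period of ten months yields ten observations.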

The linear model had the R formula: Avg. log1p(deaths/million) ∼ Average_Previous_Insolation. The model explains a statistically significant and substantial proportion of variance ($R^{2}$ = 0.82, F(1, 8) = 35.90, p < 0.001, adj. $R^{2}$ = 0.79). The model’s intercept, corresponding to Average_Previous_Insolation = 0, is at 6.05 (95% CI [5.16, 6.95], t(8) = 15.62, p < 0.001). Within this model:

- The effect of Average_Previous_Insolation is statistically significant and negative (beta = −0.01, 95% CI [−0.02, $-6.71 \times 10^{-3}$], t(8) = −5.99, p < 0.001; Std. beta = −0.90, 95% CI [−1.25, −0.56]). The full statistical details of the model and its validation are presented in Appendix A.7.

Summary: This linear model implies that in data spatially averaged over the entire European landmass, in 2020, there is a strong temporal association with the insolation in the previous month. About 79% of the variance of the COVID-19 mortality in a month (adjusted $R^{2}$ = 0.79) appears to be accounted for by the insolation in the previous month.

To check this temporal dependence, we performed a Granger causality test [84,99]. On this averaged dataset, with the monthly averaged mortality as outcome and the monthly insolation as predictor, at a lag of 1 (i.e., a 1-month lag), the Granger causality test is strongly significant (F = 92.73, p < 0.001). If the dependence is genuine, the reverse Granger causality test should fail (i.e., testing for spurious correlation by swapping the outcome and the predictor). This is the case: the reverse test is not significant (F = 1.406, p = 0.28). These tests therefore indicate a consistent temporal correlation between the insolation values in one month and the COVID-19 mortality values in the following month.
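For readers unfamiliar with the procedure, a lag-1 Granger test compares two least-squares fits of the outcome: one using only its own previous value, and one that also uses the previous value of the candidate predictor. The sketch below implements that comparison from scratch on synthetic data; it is an assumption-laden illustration (hypothetical function names, simulated series, no p-value computation) and not the software or data used in the study.

```python
import random


def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for row in range(col + 1, n):
            factor = m[row][col] / m[col][col]
            for k in range(col, n + 1):
                m[row][k] -= factor * m[col][k]
    x = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(m[row][k] * x[k] for k in range(row + 1, n))
        x[row] = (m[row][n] - tail) / m[row][row]
    return x


def residual_sum_of_squares(design, target):
    """Fit ordinary least squares via the normal equations and return the RSS."""
    p = len(design[0])
    xtx = [[sum(r[i] * r[j] for r in design) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * t for r, t in zip(design, target)) for i in range(p)]
    coef = solve_linear_system(xtx, xty)
    return sum(
        (t - sum(c * v for c, v in zip(coef, r))) ** 2 for r, t in zip(design, target)
    )


def granger_f_lag1(outcome, predictor):
    """F statistic: does predictor[t-1] improve on outcome[t-1] in explaining outcome[t]?"""
    steps = range(1, len(outcome))
    target = [outcome[t] for t in steps]
    restricted = [[1.0, outcome[t - 1]] for t in steps]
    unrestricted = [[1.0, outcome[t - 1], predictor[t - 1]] for t in steps]
    rss_r = residual_sum_of_squares(restricted, target)
    rss_u = residual_sum_of_squares(unrestricted, target)
    dof = len(target) - 3  # observations minus parameters of the unrestricted fit
    return (rss_r - rss_u) / (rss_u / dof)  # one restriction tested


# Synthetic series (NOT the study data): y follows x with a one-step delay.
rng = random.Random(0)
x = [rng.gauss(0.0, 1.0) for _ in range(40)]
y = [0.0] + [0.8 * x[t - 1] + 0.1 * rng.gauss(0.0, 1.0) for t in range(1, 40)]

forward = granger_f_lag1(y, x)  # does x help predict y?
reverse = granger_f_lag1(x, y)  # does y help predict x?
print(f"forward F = {forward:.1f}, reverse F = {reverse:.2f}")
```

In this simulated setting the forward statistic is large and the reverse one small, which mirrors the asymmetric pattern reported above. With only ten monthly points, as in the space-averaged data, the test has few degrees of freedom and should be read cautiously.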

4. Discussion

Our results suggest that there is a statistically significant temporal correlation between COVID-19 mortality and the meteorological factors investigated (insolation, cloudiness) with a lag of one month. The significant factors and their impact are summarized in Figure 5, which compares the relative factors in both models, for the European continent, during the first year of the COVID-19 epidemic, when no targeted treatments were available; in both models, the latitude appears to have roughly the same modulating effect, with notable differences for insolation and for cloudiness. We discuss these in detail in the following subsections.

4.1. Insolation

We have found that the insolation in the previous month negatively correlates with COVID-19 mortality in the following month: the higher the solar energy at the ground level in a month, the lower the mortality in the next month (Figure 3, Figure 5a). To the best of our knowledge, this time lag has not been reported in previous studies. We propose three possible explanations for the observed correlation:

First, in previous studies of the interactions between sunlight—COVID-19 epidemics, increased solar exposure was linked to increased vitamin D synthesis in the skin and its beneficial immunological roles [10,11,12,13], so this might be a possible explanation, population-wide.

Second, it is known that coronaviruses in general survive better in environments with low temperature and low radiation/sunlight [102]. As the ability of SARS-CoV-2 virions to survive in the environment is impeded by light [14,15,16,17,18,19], presumably in a month with a higher insolation the average number of viable virions in the environment decreases, thus the incidence decreases [5,22] and therefore the fatalities in the next month decrease (as it takes ~1 month from infection to death in fatal cases). Consequently, a consistent effect of decreased mortality should be associated with increased sunlight, regardless of geographical position. We checked for this by averaging data over all surfaces (i.e., all countries), in Section 3.5, Figure 8. Within this explanation, we think we can regard our results as a different confirmation, applied to a large geographical area, of the laboratory studies of [14,15,16].

Finally, it was observed that longer exposure to natural sunlight appears to be beneficial to patients, increasing recovery rates [103] and thus reducing patient mortality purportedly via different mechanisms than vitamin D (other physiological parameters [104] or immuno–modulation [18]).

4.2. Cloudiness

We have found that the cloudiness in the previous month and cloudiness in the current month both correlate with a higher COVID-19 mortality in the current month, albeit with a smaller relative impact than insolation (Figure 4, Figure 5b). This has been, to the best of our knowledge, unreported up to now.

Based on the insolation results presented above, we expected: (i) the relative influence of cloudiness to be in a range comparable with the effect of insolation, but it is smaller; (ii) only the cloudiness in the previous month to be correlated with mortality (as cloudiness heavily influences the amount of insolation at ground level [46]), but we found that both months are significant. We discuss these novel points below.

(i) The cloud fraction does not linearly reduce solar insolation [105,106]; in particular, the shading created by the clouds changes the light spectral distribution at the ground level. The clouds exert a complex influence on UV-B radiation and hence impact vitamin D formation. In past studies, the overcast cumulus-type clouds were found to attenuate almost all (99%) UV-B radiation, but surprisingly a partly cloudy sky with the same clouds can actually increase UV-B radiation at ground level by up to 27% [107]. Also, in some particular conditions, multiple scattering due to cirriform clouds can actually increase the effect of UV radiation on horizontal surfaces [108]. A limitation of our analysis is that we could not distinguish between cloud types and thus account for their particular influence; this would be an interesting point for future research.

(ii) It is surprising that higher cloudiness in both months (current and previous) correlates with higher mortality. This is different from insolation data; we think therefore that cloudiness correlation might have an additional underlying mechanism. We suggest that the cloudiness data may actually be a proxy for persistent pollution data.

Even moderate pollution in the atmosphere helps clouds form [109], and it was also previously found that airborne pollution was linked to worse outcomes for the COVID-19 patients [26,29,30,110]. The airborne pollution particles have complex interactions with SARS-CoV-2 virions [27]; by reducing the UV radiation, air pollutants might promote viral persistence in air [29], thus prolonging the exposure time. Higher pollution levels, especially with smaller particulate matter, impair human health (via increased oxidative stress and altered immune response [111,112]). Therefore, persistent particulate pollution over a long period could theoretically both favor persistent cloudiness and, at the same time, increase mortality via direct immunological effects of the pollutants.

Another possible explanation could be that heavy cloudiness is linked with colder outdoor surfaces. A higher cloud cover quickly cools down the outdoor surfaces, especially at lower latitudes [113]. Colder surfaces facilitate the survival of SARS-CoV-2 for longer periods [114]; thus cloud fraction over a geographical area could also be regarded as a proxy variable for the temperature of the surfaces (not only of the air temperature) and be therefore a confounding factor for an increased incidence.

4.3. Latitude (And Geographical Distribution)

Our results confirm previous studies that, for the European continent, latitude appears to be an important modulator of the epidemiological waves of COVID-19. Many previous studies linked the COVID-19 incidence to latitude (see for instance [9,11,12,13,115]) and somewhat fewer linked it to mortality [10,35,36]. A meta-analysis by Li et al. [2] concluded that the influence of the factors, while important, is uneven and difficult to generalize.

Latitude is a main driver of the average local climate and therefore is a confounding factor for variations in temperature, average sunlight, cloudiness (and other proposed climatological factors that influence COVID-19 epidemic). It is also (somewhat loosely) linked to socioeconomic status [116,117,118] and this in turn correlates with the quality of the healthcare. For these reasons, we think it is perhaps useful to consider the influence of any of these climatological factors not as independent or absolute but always adjusted for latitude.

In this regard, our numerical results on latitude influence on mortality are an independent confirmation of the results of spatio-temporal analysis done by Martínez-Portillo et al. [9]. However, as a difference, in our study we did not find a correlation of COVID-19 mortality with longitude, and this may be due to a methodological difference (they examined the surge dates in space and time). Another possible explanation could be that we examined a time–frame lag of 1 month, which might be too narrow to catch the West–East longitudinal gradient of the spreading pandemic on European countries, but wide enough to capture the latitudinal gradient.

4.4. Results of Granger Causality Tests

Separately from LME modeling, we checked the temporal lag of one month in our data sample with a Granger causality test in the averaged data (Section 3.5). We again have found a significant temporal correlation of mortality in a month with the average insolation in the previous month (a temporal lag of one month). We are fully aware that the Granger causality is not necessarily true causality, and that the test itself might fail to reject the alternative hypothesis if both tested processes are driven by a common unknown third process with a different time lag. As a limitation of this paper, we did not attempt to test for other time lags or for varying time lags, as this would require more granular data.

4.5. Limitations

We stress that our study is a retrospective longitudinal one and we developed the models to answer the starting hypothesis; the aim of the study was not to build predictive models. Our study only modeled a specific year and a specific population and while the sample was large it is unknown if a predictive extrapolation can be done to other populations/geographical regions—this is for future research.

We are aware that this study has limitations: we did not investigate the precipitation rate, relative humidity, temperature, wind velocity, air pressure, air pollution and density. The solar radiation measure was the net amount of energy and we did not analyze its spectral distribution: we were unable to differentiate between visible, infrared and ultraviolet radiation components from the measured data. Due to orbital dynamics, MODIS data is unavailable for some regions on certain days, and different effective imputation methods were developed [119,120,121,122]; we used a monthly aggregated MODIS data set that has no missing data (due to the averaging method), but we cannot exclude variations in its temporal quality.

Of particular concern is the inter-dependence between cloudiness and the rest of the factors. We modeled carefully to isolate it, but in one verification model (Section 3.4) the standardized beta for latitude was greater than 1 in absolute value (−1.18); this is conceivable in multiple regression models, but raises the possibility of collinearity between predictor variables [123]. We checked this, and the VIF statistic found in this case was 4.97. VIF values in the range 5 to 10 indicate strong collinearity. As an extra caution in interpretation, we consider that values greater than 2.5 could be a point of concern [124]. Although the model formally passed the quality criteria we set (Appendix A.6), we therefore cannot exclude a multicollinearity issue here: (i) latitude influences cloudiness; (ii) cloudiness influences mortality, but at the same time, (iii) latitude influences mortality via a different, unknown dynamic. Also, this issue does not appear in the corresponding more precise LME model (Section 3.3), so it might be a result of averaging.
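To give the VIF value a more intuitive reading: in a model with exactly two predictors, the variance inflation factor is the same for both and depends only on their mutual correlation r, as VIF = 1/(1 − r²). The sketch below inverts this relation. It assumes that the reported VIF of 4.97 refers to the two-predictor time-averaged model (average cloud fraction and latitude); the function names are hypothetical, and the sign of r cannot be recovered from the VIF.

```python
import math


def vif_from_correlation(r):
    """VIF of either predictor in a two-predictor linear model."""
    return 1.0 / (1.0 - r * r)


def abs_correlation_from_vif(vif):
    """Absolute correlation between the two predictors implied by a given VIF."""
    return math.sqrt(1.0 - 1.0 / vif)


implied_r = abs_correlation_from_vif(4.97)       # reported VIF
r_at_cautious_threshold = abs_correlation_from_vif(2.5)
r_at_strong_threshold = abs_correlation_from_vif(5.0)

print(f"|r| implied by VIF = 4.97: {implied_r:.3f}")
print(f"|r| at VIF = 2.5: {r_at_cautious_threshold:.3f}")
print(f"|r| at VIF = 5.0: {r_at_strong_threshold:.3f}")
```

Under that assumption, a VIF of 4.97 corresponds to an absolute correlation of about 0.89 between average cloud fraction and latitude, just under the value implied by the conventional threshold of 5.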

As cloudiness is both influenced by and influences the temperature and humidity which are also influencing mortality [1], we could not differentiate the confounding factors. These influences seem to be different at different scales (national scale vs. urban scale, see [2] for an in-depth meta-analysis). We acknowledge that our analysis is limited to the national scale only. We acknowledge that using centroid data and the averaged insolation/cloud fraction at the country level cannot capture details of spatial variation in population density [125].

From the epidemiological perspective, the authors acknowledge that the modeling did not include the major clinical confounding factors, such as age, comorbidities, sex, lifestyle patterns, etc. We also note that especially during the initial phase of the pandemic, mortality data was likely influenced by other factors as well: technical (limited testing capabilities [126]), socio-economic (delayed diagnosis [127]) or governmental (testing at scale [128], introduction and extent of social distancing measures). These were different for each country; in this regard, the sampling time chosen could have biases that we were unable to address.

In spatial analysis in geographical epidemiology, latency and mobility are two factors known to alter statistical analysis of the temporal data [129]. Latency is the time lag between the exposure to a hazardous factor and the emergence of an outcome (disease, death). In this study we chose an interval of one month as this seems to cover the reported pathological findings between the time of viral exposure and death (see Introduction, Figure 1). One month is also broad enough to average out the weekend-weekdays systematic variations of mortality observed during the initial year of the pandemic [130,131], and the variations in mobility caused by different national policies on social distancing.

The level of population mobility across the borders of the spatial unit changes with the area of the unit and this implies that reliability of inference will be lower for areas of high mobility (such as highly urbanized counties) vs. less urbanized ones. In our study, we chose the highest administrative area (country) [129] to minimize this effect, but we acknowledge that we could not account for it.

Therefore, for a future study it might be interesting to investigate the relationship between COVID-19 mortality, cloudiness and solar radiation at a finer temporal and spatial scale, perhaps also taking into account the mobility patterns.

5. Conclusions

The data from the European continent in the spring–winter of 2020, when targeted countermeasures (vaccination, effective treatments) were not available yet for COVID-19, suggest that there might be a longer term (1 month) correlation between weather and mortality. Due to the generally long period between infection and death (in fatal cases), our results suggest that COVID-19 mortality in any given month was negatively correlated with insolation and positively correlated with prolonged cloudiness. This knowledge might help advise public health policies related to COVID-19 mitigation and control; we propose increased vigilance and increased frequency of sanitation of outdoor high-risk surfaces during prolonged overcast weather.

Author Contributions

Conceptualization, Adrian Iftime and Secil Omer; methodology, Adrian Iftime, Secil Omer; formal analysis and data visualization, Adrian Iftime; data curation and storage, Victor-Andrei Burcea; writing—original draft preparation, Adrian Iftime, Secil Omer; writing—review and editing, Adrian Iftime, Secil Omer, Victor-Andrei Burcea, Octavian Călinescu, Ramona-Madalina Babeș; validation, Octavian Călinescu, Ramona-Madalina Babeș; clinical virology expertise: Victor-Andrei Burcea; biology expertise, Ramona-Madalina Babeș; clinical care expertise, Secil Omer. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable; this study used only previously published, anonymous, aggregated human mortality data.

Informed Consent Statement

Not applicable.

Data Availability Statement

We made available the full data-set that we aggregated, curated, computed and analyzed in this study at the Zenodo open-access repository: [132]. The raw data that support the findings of this study are openly available at: [57,60,69].

Acknowledgments

Publication of this paper was supported by the University of Medicine and Pharmacy Carol Davila, through the institutional program Publish not Perish.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

avg.	average
Bosnia_and_H	Bosnia and Herzegovina
CI	confidence interval
COVID-19	Coronavirus disease 2019
LME	linear mixed effect
mil.	million
NASA	National Aeronautics and Space Administration
OLS	ordinary least squares
QQ	quantile–quantile plot
REML	restricted maximum likelihood
SARS-CoV-2	Severe acute respiratory syndrome coronavirus 2
Std. beta	standardized beta coefficient
UK	The United Kingdom of Great Britain and Northern Ireland
UV	Ultraviolet radiation
UV-A	Ultraviolet radiation, type A
UV-B	Ultraviolet radiation, type B
VIF	Variance inflation factor

Appendix A. Statistical Details

Appendix A.1. Monthly Cloudiness Data Points Used in This Study

Figure A1. The average monthly cloudiness calculated for each European country. Each blue dot on the foreground blue line represents the average of the cloud fraction values over the area of the respective country in the respective month. A cloud fraction of 0.0 means a completely sunny sky; 1.0 means a completely overcast sky. To visually compare the values in each country with the rest of the countries, the cloudiness values for all the countries are drawn as background gray lines in each panel. The procedure used for the calculation of these data points is presented in Section 2.2.

Appendix A.2. Monthly Insolation Data Points Used in This Study

Figure A2. The average insolation in each European country. Each red dot on the foreground red line represents the averaged measured energy flux from the sun over the area of the country in each month, expressed in W/m². To visually compare the values in each country with the rest of the countries, the insolation values for all the countries are drawn as background gray lines in each panel. The procedure used for the calculation of these data points is presented in Section 2.3.

Appendix A.3. Monthly Mortality Data Points Used in This Study

Figure A3. The average monthly mortality in each European country in the year 2020. Each black dot on the foreground black line represents the log1p-transformed total deaths in a month in the country. The mortality values of all countries are drawn as background gray lines in each panel to aid the visual comparison. The procedure used for the calculation of these data points is presented in Section 2.4.

Appendix A.4. LME Modeling of Monthly Mortality and Insolation

Table A1. Characteristics of the Linear Mixed-Effects (LME) modeling of mortality vs. insolation (Section 3.2, Figure 3 and Figure 5a).

	Dependent Variable:
	Monthly Mortality as log1p(Deaths/Million)
Previous Insolation Average	−0.01 ***
	(0.001)
Latitude	−0.12 ***
	(0.02)
Constant	11.69 ***
	(0.92)
Observations	342
Conditional $R^{2}$	0.57
Marginal $R^{2}$	0.45
Log Likelihood	−546.46
AIC	1102.93
BIC	1122.10

Notes: *** p < 0.001; the numbers in parentheses are the standard errors.
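The information criteria in Table A1 can be related to the reported log likelihood through the usual definitions, AIC = 2k − 2 ln L and BIC = k ln(n) − 2 ln L. The sketch below reproduces the tabulated values to within rounding, assuming k = 5 estimated parameters (three fixed effects, the random-intercept variance and the residual variance); this parameter count is an assumption made here, not a figure stated in the paper.

```python
import math


def aic(log_likelihood, n_params):
    """Akaike information criterion."""
    return 2.0 * n_params - 2.0 * log_likelihood


def bic(log_likelihood, n_params, n_obs):
    """Bayesian information criterion."""
    return n_params * math.log(n_obs) - 2.0 * log_likelihood


LOG_LIKELIHOOD = -546.46  # Table A1
N_OBS = 342               # Table A1
N_PARAMS = 5              # assumption: 3 fixed effects + 2 variance components

aic_value = aic(LOG_LIKELIHOOD, N_PARAMS)
bic_value = bic(LOG_LIKELIHOOD, N_PARAMS, N_OBS)

print(f"AIC = {aic_value:.2f} (table: 1102.93)")
print(f"BIC = {bic_value:.2f} (table: 1122.10)")
```

The small residual differences are expected because the log likelihood in the table is itself rounded to two decimals.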

R formula:

log1p(deaths/million) ∼ Previous_Insolation_Average + Latitude + (1|Country). The model included the Country as random effect (formula: ∼ 1|Country). The Latitude is the centroid latitude of the country. The model’s total explanatory power is substantial ($R_{c}^{2}$ = 0.57) and the part related to the fixed effects alone is $R_{m}^{2}$ = 0.45. The model’s intercept, corresponding to Previous_Insolation_Average = 0 and Latitude = 0, is at 11.69 (95% CI [9.89, 13.49], t(337) = 12.74, p < 0.001). Within this model:

- The effect of Previous_Insolation_Average (i.e., insolation in previous month) is statistically significant and negative (β = −0.01, 95% CI [−0.01, −0.01], t(337) = −17.73, p < 0.001; Std. beta = −0.69, 95% CI [−0.76, −0.61])

- The effect of Latitude is statistically significant and negative (β = −0.12, 95% CI [−0.15, −0.08], t(337) = −6.68, p < 0.001; Std. beta = −0.47, 95% CI [−0.60, −0.33])

Appendix A.5. LME Modeling of Monthly Mortality and Cloudiness

Table A2. Characteristics of the Linear Mixed-Effects (LME) modeling of monthly COVID-19 mortality vs. cloudiness (Section 3.3, Figure 4 and Figure 5b).

	Dependent Variable:
	Monthly Mortality as log1p(Deaths/Million)
Previous_Cloud_Fraction (i.e., in previous month)	1.34 *
	(0.58)
Cloud_Fraction (i.e., in current month)	2.94 ***
	(0.53)
Latitude	−0.11 ***
	(0.02)
Constant	6.21 ***
	(0.83)
Observations	342
Conditional $R^{2}$	0.24
Marginal $R^{2}$	0.16
Log Likelihood	−625.49
AIC
BIC	1286.00

Notes: * p < 0.05; *** p < 0.001; the numbers in parentheses are the standard errors.

R formula used:

log1p(deaths/million) ∼ Previous_Cloud_Fraction + Cloud_Fraction + Latitude + (1|Country). The model included the Country as random effect (formula: ∼ 1|Country). The Latitude is the centroid latitude of the country.

The model’s total explanatory power is moderate ($R_{c}^{2}$ = 0.24) and the part related to the fixed effects alone is $R_{m}^{2}$ = 0.16. The model’s intercept, corresponding to Previous_Cloud_Fraction = 0, Cloud_Fraction = 0 and Latitude = 0, is at 6.21 (95% CI [4.57, 7.85], t(336) = 7.45, p < 0.001). Within this model:

- The effect of Previous Cloud Fraction is statistically significant and positive (beta = 1.34, 95% CI [0.20, 2.49], t(336) = 2.31, p = 0.022; Std. beta = 0.14, 95% CI [0.02, 0.26])

- The effect of Cloud Fraction is statistically significant and positive (beta = 2.94, 95% CI [1.89, 3.99], t(336) = 5.52, p < 0.001; Std. beta = 0.32, 95% CI [0.21, 0.44])

- The effect of Latitude is statistically significant and negative (beta = −0.11, 95% CI [−0.15, −0.08], t(336) = −5.93, p < 0.001; Std. beta = −0.45, 95% CI [−0.60, −0.30])

Appendix A.6. Time–Averaged Model Details

The model formula:

$\text{Avg. log1p(deaths/million)} = \alpha + \beta_{1} \cdot \text{Avg. Cloud Fraction} + \beta_{2} \cdot \text{Latitude} + \epsilon$

(on time–averaged dataset)

Table A3. Characteristics of the time–averaged model.

	Dependent Variable:
	Avg. log1p (Deaths/Million)
Avg. Cloud Fraction	5.62 *
	(2.28)
Latitude	−0.13 ***
	(0.03)
Constant	6.97 ***
	(0.72)
Observations:	37
R²	0.38
Adjusted R²	0.34
Residual Std. Error	0.58 (df = 34)
F Statistic	10.30 *** (df = 2; 34)
AIC	69.74
BIC	76.19

Notes: * p < 0.05; *** p < 0.001; the numbers in parentheses are the standard errors.

The residuals of this linear model appear to fall on the expected diagonal line on a QQ plot (see Figure A4) and to be normally distributed (formal Lilliefors corrected Kolmogorov-Smirnov test D = 0.10, p = 0.45). The residuals appear to be independent (no autocorrelation between them, Durbin-Watson test DW = 2.39, p = 0.88). There seems to be a little heteroscedasticity of the residuals (Breusch-Pagan test, BP = 6.39, df = 2, p = 0.04); on further investigation we think this is an artifact induced by the log1p transform at the extremely low values of the interval: in the non-transformed model (i.e., just mortality/million), homoscedasticity is present (Breusch-Pagan test BP = 2.33, df = 2, p = 0.31).

Figure A4. Quantile–Quantile plot of the residuals of the time–averaged model.

Appendix A.7. Space–Averaged Model Details

The model formula:

$\text{Avg. log1p(deaths/million)} = \alpha + \beta \cdot \text{Avg. Previous Insolation} + \epsilon$

(on space–averaged dataset)

Table A4. Characteristics of the space–averaged model.

	Dependent Variable:
	Avg. log1p (Deaths/Million)
Avg. previous insolation	−0.01 ***
	(0.002)
Constant	6.05 ***
	(0.39)
Observations	10
R²	0.82
Adjusted R²	0.79
Residual Std. Error	0.46 (df = 8)
F Statistic	35.90 *** (df = 1; 8)
AIC	16.74
BIC	17.65

Notes: *** p < 0.001; the numbers in parentheses are the standard errors.

The residuals of this linear model appear to be normally distributed (Shapiro-Wilk test: W = 0.85, p = 0.055 and Figure A5). The residuals appear to be independent (no autocorrelation of the residuals, Durbin-Watson test: DW = 1.51, p = 0.12) and without heteroscedasticity (studentized Breusch-Pagan test: BP = 2.27, df = 1, p-value = 0.13).

Figure A5. Quantile–Quantile plot of the residuals of the space–averaged model.

Appendix A.8. Diagnostic Plots for Data Transformation

Figure A6. Top panels: the raw values of COVID-19 calculated mortality data (as monthly deaths/1 million): (a) histogram (highly skewed) and (b) rank-value plot; Bottom panels: the same data, log1p transformed: (c) histogram (reduced skewness) and (d) rank-value plot (more linear).
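The effect of the log1p transform on a right-skewed variable can be reproduced with a small example. The sketch below uses hypothetical values (not the study's mortality data) and a plain-Python, moment-based skewness. `skewness` is an illustrative helper, and it may differ from the estimator the authors used. Note that log1p(0) = 0, so months with zero deaths remain defined after the transform, which would not be the case with a plain logarithm.

```python
import math


def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (moments taken about the mean)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three values")
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    if m2 == 0:
        raise ValueError("values have zero variance")
    return m3 / m2 ** 1.5


# Hypothetical right-skewed values (deaths/million), including a zero.
raw = [0, 1, 2, 5, 10, 20, 50, 100, 300, 600]
transformed = [math.log1p(v) for v in raw]

skew_raw = skewness(raw)
skew_transformed = skewness(transformed)
print(skew_raw, skew_transformed)
```

For this made-up sample the raw values are strongly right-skewed, while the transformed values are much closer to symmetric, which mirrors the qualitative change between the top and bottom panels.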



Figure 1. Timeline of the study. By “Current month” we denote the month of death due to COVID-19 in fatal cases; by “Previous month” we denote the month before the month of death.

Figure 2. Two examples of the spatial integration algorithm used at the country level, in the same month. (I) Upper panels: cloudiness maps, with cloudiness values (in decimal cloud fraction) for each 0.25° latitude × 0.25° longitude grid cell. (II) Bottom panels: insolation maps of the same countries, with insolation values (W/m²) on the same grid. Magenta lines: borders of the respective countries, Germany (a,c) and Romania (b,d). The average of all tiles within a border represents the value for the country in that month (in this example, June 2020).

Figure 3. LME model of monthly mortality and insolation in previous month (in W/m²), adjusted for latitude. The dots represent the individual log1p transformed monthly mortality values from the European countries in the period March–December 2020. The regression model is adjusted for latitude of the countries (for visualization purposes, the figure shows the model at three equally partitioned latitudes). The shaded bands are the 95% CI.


Figure 4. LME model of monthly mortality and monthly cloudiness, adjusted for latitude. The dots represent the individual log1p transformed monthly mortality values from the European countries in the period March–December 2020. The regression model is adjusted for latitude of the countries (for visualization purposes, the figure shows the model at three equally partitioned latitudes). The shaded bands are the 95% CI. (a) Modeled influence of the cloudiness in the previous month; (b) modeled influence of the cloudiness in the current month.


Figure 5. Comparative view of the standardized beta estimates of the LME models presented above in Section 3.2 and Section 3.3. Relative impact on COVID-19 mortality in a given month (a) of the insolation in previous month and (b) of the cloudiness in the same month and in the previous month. Notes: * p < 0.05; *** p < 0.001.

Figure 6. Relationship between yearly–averaged cloud fraction and yearly–averaged mortality, adjusted for latitude. Dots: mortality, averaged for the entire year 2020, for each country (i.e., each country is a dot); the shaded areas: the CI limits of the fitted model. The regression model is adjusted for latitude of the countries (for visualization purposes, the figure shows the model at three equally partitioned latitudes). Note: *** p < 0.001.

Figure 7. Visualization of estimated European COVID-19 mortality from the time–averaged model, for the entire year 2020, in the form of a heat-map. To avoid extrapolation, the range of the plotted variables is clipped to the range of collected values.

Figure 8. Relationship between the Europe-averaged monthly COVID-19 mortality (in the period March–December 2020) and the insolation averaged over all of Europe in the previous month. The shaded areas: the CI limits of the fitted model. Note: *** p < 0.001.

Table 1. Statistical summary of the main variables.

Statistic	N	Median	Mean	St.Dev	Min	Max	Skewness
Monthly insolation (W/$m^{2}$)	370	213.34	190.77	97.46	1.31	404.12	−0.27
Monthly cloud fraction (decimal fraction)	370	0.64	0.62	0.17	0.07	0.96	−0.55
Monthly deaths	342	169	1515.23	3846.44	0	33,854	4.47
Monthly mortality (deaths/million)	342	23.16	81.51	120.31	0	600.32	2.01
$\mathrm{log1p}$(deaths/million)	342	3.18	3.29	1.64	0	6.40	−0.03
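
The last two rows of Table 1 are linked by the log1p transformation, ln(1 + x), which remains defined for months with zero deaths. The following minimal sketch (an illustration, not the authors' code) applies it to the minimum, median, and maximum mortality values listed in the table; the function name `log1p_transform` is a hypothetical helper. Because the transformation is monotonic, the minimum and maximum map exactly onto the transformed row, and the median does so approximately; the mean and standard deviation do not transform this way.

```python
import math

# Monthly mortality (deaths/million) as reported in Table 1: min, median, max.
mortality_values = (0.0, 23.16, 600.32)


def log1p_transform(x: float) -> float:
    """Return ln(1 + x); unlike ln(x), this is defined at x = 0."""
    return math.log1p(x)


# Transform each summary value and round to the two decimals used in Table 1.
transformed = [round(log1p_transform(v), 2) for v in mortality_values]
print(transformed)  # [0.0, 3.18, 6.4]

# math.expm1 is the inverse transformation, exp(y) - 1, for back-conversion
# of modeled values to deaths/million.
restored = [math.expm1(log1p_transform(v)) for v in mortality_values]
```

The printed values agree with the Min, Median, and Max columns of the log1p row (0, 3.18, 6.40).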

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Published by MDPI on behalf of the International Society for Photogrammetry and Remote Sensing. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Iftime, A.; Omer, S.; Burcea, V.-A.; Călinescu, O.; Babeș, R.-M. Spatial and Temporal Correlations of COVID-19 Mortality in Europe with Atmospheric Cloudiness and Solar Radiation. ISPRS Int. J. Geo-Inf. 2025, 14, 283. https://doi.org/10.3390/ijgi14080283


Note that from the first issue of 2016, this journal uses article numbers instead of page numbers.
