3D-VAR Data Assimilation of SEVIRI Radiances for the Prediction of Solar Irradiance in Italy Using WRF Solar Mesoscale Model—Preliminary Results

: Solar power generation is highly ﬂuctuating due to its dependence on atmospheric conditions. The integration of this variable resource into the energy supply system requires reliable predictions of the expected power production as a basis for management and operation strategies. This is one of the goals of the Solar Cloud project, funded by the Italian Ministry of Economic Development (MISE)—to provide detailed forecasts of solar irradiance variables to operators and organizations operating in the solar energy industry. The Institute of Methodologies for Environmental Analysis of the National Research Council (IMAA-CNR), participating to the project, implemented an operational chain that provides forecasts of all the solar irradiance variables at high temporal and horizontal resolution using the numerical weather prediction Advanced Research Weather Research and Forecasting (WRF-ARW) Solar version 3.8.1 released by the National Center for Atmospheric Research (NCAR) in August 2016. With the aim of improving the forecast of solar irradiance, the three-dimensional (3D-Var) data assimilation was tested to assimilate radiances from the Spinning Enhanced Visible and Infrared Imager (SEVIRI) aboard the Meteosat Second Generation (MSG) geostationary satellite into WRF Solar. To quantify the impact, the model output is compared against observational data. Hourly Global Horizontal Irradiance (GHI) is compared with ground-based observations from Regional Agency for the Protection of the Environment (ARPA) and with MSG Shortwave Solar Irradiance estimations, while WRF Solar cloud coverage is compared with Cloud Mask by MSG. A preliminary test has been performed in clear sky conditions to assess the capability of the model to reproduce the diurnal cycle of the solar irradiance. The statistical scores for clear sky conditions show a positive performance of the model with values comparable to the instrument uncertainty and a correlation of 0.995. For cloudy sky, the solar irradiance and the cloud cover are better simulated when the SEVIRI radiances are assimilated, especially in the short range of the simulation. For the cloud cover, the Mean Bias Error one hour after the assimilation time is reduced from 41.62 to 20.29 W / m 2 when the assimilation is activated. Although only two case studies are considered here, the results indicate that the assimilation of SEVIRI radiance improves the performance of WRF Solar especially in the ﬁrst 3 hour forecast. Several studies assess the performances of mesoscale models. The Weather Research and Forecasting (WRF) model has been tested in the forecasts of GHI in Spain [13] and in several European and North American sites [14], reporting relative Root Mean Square Error (RMSE) of about 40% for 24 h forecast horizon. Similar results have been found in Europe [15]. In Southern Italy, a comparison of two atmospheric regional models (the Regional Atmospheric Modeling System (RAMS) and the WRF model) has been reported for clear and cloudy sky conditions ﬁnding RMSE values for both models in overcast conditions about 80 W / m 2 larger than in clear sky conditions [16]. A seasonal characterization using the RAMS model over the western Mediterranean Coast has been performed to evaluate the solar radiation using ground-based weather stations measurements for the winter 2010–2011 and the summer 2011. The statistically daily evaluations show absolute bias values varying from − 50 to 160 W / m 2 and a RMSE from 60 to 240 W / m 2 [17]. A similar evaluation has been performed for the WRF / RAMS models over a region in Eastern Spain under distinct atmospheric conditions using in-situ observations and remote sensing data derived from the Spinning Enhanced Visible and Infrared Imager (SEVIRI) aboard the Meteosat Second Generation (MSG) and the uncoupled Land Surface Model (LSM) Global Land Data Assimilation System (GLDAS) during a 7-day period in summer 2011. Both the models show di ﬃ culties to forecast clouds with values of Mean Bias Error (MBE) and RMSE in overcast conditions ranging, respectively, from − 70 to − 80 W / m 2 and from 140 to 190 W / m 2 for WRF and from − 80 to − 110 W / m 2 and from 200 to 400 W / m 2 for RAMS [18].


Introduction
The rise of renewable energy in global energy production requires the development of procedure to better manage these highly variable sources. In recent years, the contribution of solar power to electricity has incremented rapidly; in Italy solar power has grown from 3.6 GW in 2010 to more than 20 GW in 2018, which represents about 7% of the gross national electricity and the 20% of the total energy from renewable sources [1]. Solar energy usage is expected to continue growing for the foreseeable future. Thus, the prediction of solar radiation is becoming crucial to estimate in advance how much energy will be available and what share can be covered with renewable sources. Contrary to the conventional energy sources (fossil and nuclear), solar energy is considered a variable source because the energy production is dependent on the intensity of solar irradiance that is mainly attenuated by atmospheric aerosols and clouds passing between the sun and the solar-powered plants. Given this variability in the generation of solar power, nowadays it becomes imperative to focus on a realistic modelling and accurate prediction of this variable, which is essential for management, strategies operations and regulation of power supplies.
Solar power prediction is pursued through different techniques for various time scales. Indeed, the forecast temporal interval helpful for the operators range from few minutes to days, so different methods are necessary. The strategies used for the prediction of the Global Horizontal Irradiance (GHI) can be divided mainly into two groups. The first group includes numerical weather prediction (NWP) models, which solves the equations of atmospheric dynamics and thermodynamics to infer cloud and radiative information up to several days ahead [2,3]. The second group relies on the analysis of satellite or ground based remote sensing to infer cloud motion and short-term forecasting from minutes to few hours [4][5][6][7][8]. Most of the works [9][10][11] shows that the second group outperforms the first one for forecasts within 4-5 h, with the time window depending on the localization. For a longer forecast horizon, numerical models are more appropriate [12]. Some recent studies suggest that NWP forecasts are becoming competitive with respect to satellite-based methods even on short-time scale [10,12].
Several studies assess the performances of mesoscale models. The Weather Research and Forecasting (WRF) model has been tested in the forecasts of GHI in Spain [13] and in several European and North American sites [14], reporting relative Root Mean Square Error (RMSE) of about 40% for 24 h forecast horizon. Similar results have been found in Europe [15]. In Southern Italy, a comparison of two atmospheric regional models (the Regional Atmospheric Modeling System (RAMS) and the WRF model) has been reported for clear and cloudy sky conditions finding RMSE values for both models in overcast conditions about 80 W/m 2 larger than in clear sky conditions [16]. A seasonal characterization using the RAMS model over the western Mediterranean Coast has been performed to evaluate the solar radiation using ground-based weather stations measurements for the winter 2010-2011 and the summer 2011. The statistically daily evaluations show absolute bias values varying from −50 to 160 W/m 2 and a RMSE from 60 to 240 W/m 2 [17]. A similar evaluation has been performed for the WRF/RAMS models over a region in Eastern Spain under distinct atmospheric conditions using in-situ observations and remote sensing data derived from the Spinning Enhanced Visible and Infrared Imager (SEVIRI) aboard the Meteosat Second Generation (MSG) and the uncoupled Land Surface Model (LSM) Global Land Data Assimilation System (GLDAS) during a 7-day period in summer 2011. Both the models show difficulties to forecast clouds with values of Mean Bias Error (MBE) and RMSE in overcast conditions ranging, respectively, from −70 to −80 W/m 2 and from 140 to 190 W/m 2 for WRF and from −80 to −110 W/m 2 and from 200 to 400 W/m 2 for RAMS [18].
An important milestone in solar radiation applications has been the development of WRF-Solar within the project Sun4Cast [19]. This study asserts that the use of WRF-Solar significantly improves the solar irradiance forecast and the combination of the NWP with satellite-based method produces accurate forecast also in the short range 1-6 h time scale [20]. This paper focuses on the following forecasting challenge: improve the short-term forecast of solar irradiance using satellite data assimilation. The forecast skill is indeed strongly influenced by the accuracy of the cloud and thermodynamics analyses in the initial representations, which can improve significantly using satellite data assimilation [21].
Remote Sens. 2020, 12, 920 3 of 27 Satellite are the primary source of cloud information and therefore can play an important role also for solar power forecasting [21][22][23]. One of the methods used to improve the forecast of the clouds and consequently of the solar irradiance is the assimilation of the satellite radiances from geostationary meteorological satellites. The main advantage of the radiance assimilation is that the observed radiances can be directly ingested into the NWP without the need of retrieval methods. Retrievals provide an estimate of the simulated variable from the observed quantity based on inversion of a physical or empirical model; these techniques usually produce large uncertainties [21].
The main focus in recent years has been on radiance assimilation from polar orbiting satellites to improve the NWP forecast. Two studies analyzed the impact of the assimilation of radiance data from the Advanced Microwave Sounding Unit-A (AMSU-A) [24] and from the Advanced Microwave Scanning Radiometer 2 (AMSR2) [25] for the hurricane Sandy forecasts and the results proved that the radiance assimilation improved the short and medium range forecast [24] and the hurricane structure and cloud distributions [25]. Similar results can be found in References [26,27], where radiances from AMSU-A and AMSU-B [26] and from the Microwave Humidity Sounder (MHS) [27] have been assimilated in the WRF model to simulate an event of heavy rainfall in Beijing [26] and to forecast a binary system typhoons [27].
Besides polar-orbiting satellites, geostationary instruments were also found to be useful source of information and have been introduced to data assimilation more recently. Despite recent progress in data assimilation, the potentiality of geostationary sensors is far from being fully exploited [28]. One advantage of geostationary instruments is the nearly continuous picture of the weather event evolution due to their high temporal resolution. The impact of geostationary radiances from three infrared channels of SEVIRI using a four-dimensional data assimilation has been investigated using the High-Resolution Limited Area Model (HIRLAM) under clear sky and low-level cloud conditions [29][30][31].
The tests have been performed in a domain with a horizontal resolution of 22 km converting the measured radiances by SEVIRI to brightness temperatures and cloud products. Results show a positive impact for almost all upper-air variables; the main improvements are obtained for geopotential height and humidity [29][30][31]. The impact of assimilating Geostationary Operational Environmental Satellite (GOES) imager radiances on the analysis and forecast of a convective process over Mexico was assessed for the first time using a rapid refresh assimilation system with a convection permitting model setting [32]. Improved humidity and temperature analysis and significant standard deviation reductions were produced when the assimilation is activated [32].
In this study we present the preliminary results of the first attempt to assimilate the SEVIRI radiance in the WRF-Solar model in convection-permitting configuration to improve the solar irradiance forecast especially in the short-medium range. The paper is organized as follows: Section 2 describes the WRF-Solar with its configuration, the data assimilation system, the observation data sets, the two simulated periods and the statistical methodology. Section 3 provides the results and the statistical analysis both in clear sky and cloudy conditions and Section 4 draws conclusions and future developments.

Materials and Methods
In this study, the tool used to forecast the solar irradiance is the NWP model WRF, Solar version. The model has been tested both in the control configuration (CNTRL) without any type of assimilation and in assimilated mode (ASSIM) using the three-dimensional (3D-Var) data assimilation technique. The radiances measured by the imager radiometer SEVIRI onboard MSG geostationary satellite have been ingested in the NWP. The model performances are evaluated using two types of data: the ground-based observations from the weather stations of Regional Agency for the Protection of the Environment (ARPA) Lombardy and two European Organization for the Exploitation of Meteorological Satellites (EUMETSAT) products derived by SEVIRI.

Numerical Weather Prediction Model and the WRF-IMAA-Solar Setup
The WRF model is a next-generation mesoscale NWP system designed for flexible purposes from atmospheric investigations to operational forecasting requests. The Advanced Research WRF (ARW) dynamical core system has been developed by the collaborations of several research institutes directed by the National Center for Atmospheric Research (NCAR, http://www.wrf-model.org). The WRF-ARW core is based on an Eulerian solver for the fully compressible nonhydrostatic equations, using a mass vertical coordinate varying with height and it used for time-integration scheme a third-order Runge-Kutta. WRF-ARW supports horizontal nesting (one-or two-way) to allow a spatial resolution enhancement through the introduction of additional domains. The full physics schemes (microphysics, cumulus convection, atmospheric and surface radiation, planetary boundary layer and land-surface) provided by WRF vary from simple and efficient to the most sophisticated and computationally expensive [33].
The Solar version of WRF has been specifically conceived to improve the solar irradiance prediction. WRF-Solar is the first NWP model expressly planned to join the rising request for dedicated forecast variables for solar energy purposes [34]. The direct normal irradiance (DNI) and diffuse (DIF) components of solar irradiance in addition to the GHI component are calculated in the radiative budgets within the NWP system [34]. This calculation is performed by the radiative parameterization Rapid Radiative Transfer Model for Global (RRTMG) that explicitly solves the radiative transport equation for all the components. In addition, WRF-Solar introduces the direct effect in aerosol-radiation feedback to obtain a better representation of the components of diffuse and direct radiation and the indirect effect in the aerosol-cloud feedback that considers the interaction between the hydrometeors and the aerosols [35]. This link between the microphysical and the radiative scheme, leading to a physically more consistent representation of the distribution of the hydrometeors, is a novelty since usually in NWP models the distribution functions are imposed with an effective radius of the constant cloud particle.
The WRF-IMAA-Solar version 3.8.1 is operative at the Institute of Methodologies for Environmental Analysis of the National Research Council (IMAA-CNR) since 2016 and it has been developed during the SolarCloud project financed by the Italian Ministry of Economic Development (MISE), with the aim of providing high temporal and spatial resolution forecasts of solar irradiance variables to solar energy industry purposes.
Two-way nested domains have been selected for the implemented configuration: the larger domain is centered over the Mediterranean basin with 9 km grid spacing while the inner domain includes Italy with 3 km horizontal resolution ( Figure 1). The domains are represented with a Lambertian projection using staggered Arakawa-C horizontal grid and the land-use and the Digital Elevation Model (DEM) are obtained by the Moderate Resolution Imaging Spectroradiometer (MODIS) database at 30 s of arc (about 900 m) updated in 2008. The vertical grid is common to all the domains with 36 vertical levels, ranging from the surface to the model top set to 50 hPa. The simulations are initialized using the weather forecast of the Global Forecast System (GFS) produced by the National Centers for Environmental Prediction (NCEP) at 0.25 degree and 35 vertical levels, upgraded every 6 hours. A preliminary sensitivity analysis was carried out before making the model operational within the SolarCloud project to derive the optimum configurations. The Planetary Boundary Layer (PBL) has been simulated using the non-local-K Yonsei parameterization [36], the microphysics has been computed through the Thompson parameterization [35] and for the shortwave and the longwave the RRTMG radiation scheme [37]. The convective scheme has been activated only for the outer domain using the Kain-Fritsch [38]; the inner domain explicitly resolves the convective motion. The cumulus and the PBL scheme are called every time step, whereas the irradiances (GHI, DNI and DIF) are computed every 3 minutes, each time the radiation scheme is called and the instantaneous values are stored in output every hour.

D-VAR Data Assimilation of SEVIRI Radiance
The idea to assimilate satellite radiance data arises from the need to have the most accurate initial state of the atmosphere to better predict its future state and hence that of the solar irradiance reaching the surface [21,26,39]. Outgoing radiance is related to the geophysical atmospheric state, providing very useful thermodynamic information both over land and over sea. It can be simply measured by a satellite and directly ingested into the NWP using a variational method and a Radiative Transfer Model (RTM).
The radiance measured by SEVIRI is considered for data assimilation in this article. SEVIRI is an imager radiometer onboard MSG geostationary platform providing accurate weather monitoring data through twelve spectral channels. Eight channels are in the thermal infrared, providing data about the temperature of clouds, atmosphere, land and sea surfaces. The remaining four channels are in the visible and near-infrared range. The imaging sampling distance at nadir is 3 km for standard channels and approximately 1 km for the High Resolution Visible (HRV) channel [40]. The radiance data are available through National Centers for Environmental Prediction (NCEP) public interface in The simulations are initialized using the weather forecast of the Global Forecast System (GFS) produced by the National Centers for Environmental Prediction (NCEP) at 0.25 degree and 35 vertical levels, upgraded every 6 h. A preliminary sensitivity analysis was carried out before making the model operational within the SolarCloud project to derive the optimum configurations. The Planetary Boundary Layer (PBL) has been simulated using the non-local-K Yonsei parameterization [36], the microphysics has been computed through the Thompson parameterization [35] and for the shortwave and the longwave the RRTMG radiation scheme [37]. The convective scheme has been activated only for the outer domain using the Kain-Fritsch [38]; the inner domain explicitly resolves the convective motion. The cumulus and the PBL scheme are called every time step, whereas the irradiances (GHI, DNI and DIF) are computed every 3 min, each time the radiation scheme is called and the instantaneous values are stored in output every hour.

D-VAR Data Assimilation of SEVIRI Radiance
The idea to assimilate satellite radiance data arises from the need to have the most accurate initial state of the atmosphere to better predict its future state and hence that of the solar irradiance reaching the surface [21,26,39]. Outgoing radiance is related to the geophysical atmospheric state, providing very useful thermodynamic information both over land and over sea. It can be simply measured by a satellite and directly ingested into the NWP using a variational method and a Radiative Transfer Model (RTM).
The radiance measured by SEVIRI is considered for data assimilation in this article. SEVIRI is an imager radiometer onboard MSG geostationary platform providing accurate weather monitoring data through twelve spectral channels. Eight channels are in the thermal infrared, providing data about the temperature of clouds, atmosphere, land and sea surfaces. The remaining four channels are in the visible and near-infrared range. The imaging sampling distance at nadir is 3 km for standard channels and approximately 1 km for the High Resolution Visible (HRV) channel [40]. The radiance data are available through National Centers for Environmental Prediction (NCEP) public interface in near real time (with 6 h delay) in Binary Universal Form for the Representation of meteorological data (BUFR) format (ftp://nomads.ncdc.noaa.gov/GDAS/).
The data assimilation used in this work is the 3D-Var version 3.8 developed by NCAR and included into the WRF Data Assimilation (WRFDA) system. The data assimilation is a technique by which observations are combined with the NWP product (the first guess or background) and their respective error statistics to provide an improved estimate (the analysis) of the atmospheric state [41,42]. Variational data assimilation achieves this through the iterative minimization of a prescribed cost function J(x): where x is the state vector defined by the atmospheric and surface parameters, x b is the background vector, y is the observation, B is the background error covariance matrix, H is the operator that converts the model state to the observation space and R is the observation error covariance matrix. The aim is to minimize the cost function (1) that measures the distance of a field x from the observations y and from the background x b. A correct evaluation of the error covariance matrices, both B and R, is crucial to a good-quality final analysis. For the direct radiance assimilation, the observation operator H(x) include the RTM that calculates the radiances from the model variables of a state vector x. The RTM evaluates the physical processes modulating the Earth's radiant energy balance. The modulations depend on the cloud microphysical details (particle size, shape, orientation, presence of aerosols), the cloud macrophysical properties (cloud top temperature, morphology, optical thickness) and environment context (water vapor, surface albedo, vertical profile of temperature). Actually, the RTM codes supported by the WRFDA are the Community Radiative Transfer Model (CRTM) and the Radiative Transfer for TOVS (RTTOV). The first has been developed by the Joint Center for Satellite Data Assimilation (JCSDA) and the second one by the European Centre for Medium-range Weather Forecasts (ECMWF) and the United Kingdom Meteorology Office (MetOffice). Both RTM codes can calculate radiances for several current and past instruments in orbit. WRFDA allows to select the desired RTM via the namelist parameter. In this study, the CRTM is chosen because this package is distributed with WRFDA without licensing restrictions and is compiled automatically [41,42]. The CRTM provides fast, accurate satellite radiance simulations under all weather and surface conditions, including both a forward model, which simulates the upwelling radiances for a given sensor and its Jacobian, which calculates the radiance derivatives with respect to the input atmospheric state variables. The CRTM is capable of accounting for the absorption of atmospheric gases as well as the multiple scattering of water clouds, of ice clouds and of a variety of aerosols [43]. The instruments to be assimilated are controlled by four integer namelist parameters: RTMINIT_NSENSOR (the total number of sensors to be assimilated), RTMINIT_PLATFORM (the platforms IDs array to be assimilated), RTMINIT_SATID (satellite IDs array) and RTMINIT_SENSOR (sensor IDs array) [41,42]. For the assimilation of the SEVIRI radiance these parameters are set to 1, 12, 3 and 21, respectively.
As highlighted above, the background error covariance matrix plays a crucial role to provide proper weight to the background term in defining the analysis cost function J. Two different types of B are available for use in the WRF 3D-Var technique: one is the generic background error statistics that can be used for any case and the second one is a domain specific B matrix that considers the climatological aspects of the area. It is strongly recommended to use a domain-specific background error to consider the scale of the motions especially in the area with a complex orography [44]. In this study, a specific background B matrix is computed with a 1-month dataset using the National Meteorological Centre (NMC) technique [45]. This method evaluates the initial state error using differences of pairs of forecasts valid at the same time but with one of them having a delayed start time.
Before assimilating the satellite radiances in the system, a quality control is performed to discard faulty observations and the bias correction is carried out using the Variational Bias Correction (VarBC) scheme to remove possible biases from the measurements. A further procedure applied over the raw radiance data is the 120 km thinning mesh. Thinning methods are necessary to downsize the large Remote Sens. 2020, 12, 920 7 of 27 satellite data sets and to reduce spatial error correlations between the observations in order to obtain the fundamental information of the data for a proper use in data assimilation technique. It should also be mentioned that WRFDA reads directly the BUFR radiance files distributed by EUMETSAT with no need for separate pre-processing. The whole processing, such as quality control, thinning, bias correction and so forth, is carried out within WRFDA. This is different from conventional observation assimilation, which requires a pre-processing package (OBSPROC) to generate WRFDA readable ASCII files. The WRFDA system requires three input files to run: the WRF first guess file (initial condition file for the WRF model), the observations (BUFR file) and the background error statistics file. The outputs of the WRFDA system becomes the inputs for the ASSIM simulations.
Currently, the WRF 3D-Var is able to assimilate the eight thermal infrared channels of SEVIRI (4-11).

Observational Data Sets
GHI data (W/m 2 ) are obtained from the ground-based pyranometers of the Regional Network of Meteorological Survey of the ARPA of Lombardy region ( Figure 2). These data are freely available from the ARPA website (http://www.arpalombardia.it/) in Comma Separated Values (CSV) format. The 23 weather stations are reported in Table 1 and have a sample frequency of 10 minutes. Table 1 reports also the above sea level (asl) elevations of the stations and the corresponding ones extracted from the DEM of the WRF. ARPA agency is responsible for the observation acquirement and the quality control. The pyranometers (mainly CM6B, HE20/K and DPA) used by the ARPA network are heterogeneous and produced by various companies, all belonging to the International Organization for Standardization (ISO) "First Class" classification and the World Meteorological Organization (WMO) "Good quality" classification; the uncertainties on the daily data declared by the manufacturers are ± 5%. [46][47][48][49][50].
Remote Sens. 2020, 12, 920 7 of 26 large satellite data sets and to reduce spatial error correlations between the observations in order to obtain the fundamental information of the data for a proper use in data assimilation technique. It should also be mentioned that WRFDA reads directly the BUFR radiance files distributed by EUMETSAT with no need for separate pre-processing. The whole processing, such as quality control, thinning, bias correction and so forth, is carried out within WRFDA. This is different from conventional observation assimilation, which requires a pre-processing package (OBSPROC) to generate WRFDA readable ASCII files. The WRFDA system requires three input files to run: the WRF first guess file (initial condition file for the WRF model), the observations (BUFR file) and the background error statistics file. The outputs of the WRFDA system becomes the inputs for the ASSIM simulations.
Currently, the WRF 3D-Var is able to assimilate the eight thermal infrared channels of SEVIRI (4-11).

Observational Data Sets
GHI data (W/m2) are obtained from the ground-based pyranometers of the Regional Network of Meteorological Survey of the ARPA of Lombardy region ( Figure 2). These data are freely available from the ARPA website (http://www.arpalombardia.it/) in Comma Separated Values (CSV) format. The 23 weather stations are reported in Table 1 and have a sample frequency of 10 minutes. Table 1 reports also the above sea level (asl) elevations of the stations and the corresponding ones extracted from the DEM of the WRF. ARPA agency is responsible for the observation acquirement and the quality control. The pyranometers (mainly CM6B, HE20/K and DPA) used by the ARPA network are heterogeneous and produced by various companies, all belonging to the International Organization for Standardization (ISO) "First Class" classification and the World Meteorological Organization (WMO) "Good quality" classification; the uncertainties on the daily data declared by the manufacturers are ± 5%. [46][47][48][49][50].   The products from the MSG mission have been used to further evaluate the performance of the WRF-IMAA-Solar. Satellite observations were directly downloaded from the Data Centre of the EUMETSAT Earth Observation Portal where it is possible to order online long-term archived data and generated products from EUMETSAT.
The EUMETSAT products used here are:

The Two Simulated Periods
To evaluate WRF-IMAA-Solar two periods have been selected, one characterized by sunny days used to test CNTRL configuration and one by overcast conditions for ASSIM. For cloudy conditions, 01-02 April 2017 is selected, as during these days the southward regression of the high pressure centered over Italy allows the descent of a deep trough in the northcentral Europe (Figure 4a). This synoptic structure encourages the rising from south-west of warm and wet air (Figure 4a) fostering the development of scattered clouds in Northern Italy (Figure 4b).

The Statistical Methodology
The validation of this preliminary test in the two selected periods against the ground-based and satellite products is performed through the evaluation of statistical skills. Four commonly-used statistical indexes are considered for the analysis of the GHI observed by the ARPA stations and for the MSG SSI [56]: the Mean Absolute Error (MAE), the Mean Bias Error (MBE), the Root Mean Square Error (RMSE) and the correlation (CORR), defined as follows: For cloudy conditions, 01-02 April 2017 is selected, as during these days the southward regression of the high pressure centered over Italy allows the descent of a deep trough in the northcentral Europe (Figure 4a). This synoptic structure encourages the rising from south-west of warm and wet air (Figure 4a) fostering the development of scattered clouds in Northern Italy (Figure 4b).

The Statistical Methodology
The validation of this preliminary test in the two selected periods against the ground-based and satellite products is performed through the evaluation of statistical skills. Four commonly-used statistical indexes are considered for the analysis of the GHI observed by the ARPA stations and for the MSG SSI [56]: the Mean Absolute Error (MAE), the Mean Bias Error (MBE), the Root Mean Square Error (RMSE) and the correlation (CORR), defined as follows:

The Statistical Methodology
The validation of this preliminary test in the two selected periods against the ground-based and satellite products is performed through the evaluation of statistical skills.
Four commonly-used statistical indexes are considered for the analysis of the GHI observed by the ARPA stations and for the MSG SSI [56]: the Mean Absolute Error (MAE), the Mean Bias Error (MBE), the Root Mean Square Error (RMSE) and the correlation (CORR), defined as follows: where m and o represent model data and observations, respectively, cov is the covariance, N indicates the length of data, σ m and σ o are respectively the standard deviation of simulated and observed data. For completeness the normalized scores have also been evaluated, so to reduce the dependence of the statistical indexes on the values of the GHI: where o is the average of the observations and MBE, RMSE and MAE are calculated respectively using the Equations (2)-(4). For the evaluation of the cloud mask additional statistical indexes are introduced. These scores are based on the contingency table (Table 2), which summarizes the relationship between two categorical variables. These statistical scores are defined using the frequency distributions in Table 2: where A indicates event both observed and simulated, B simulated but not observed, C observed but not simulated and D not observed and not simulated.

Results and Discussion
This section presents the comparison of the WRF-IMAA-Solar in the two configurations CNTRL and ASSIM against the observations described in Section 2.3. The statistical analysis is carried out through the indexes reported in Section 2.5. An initial sensitivity test has been performed with the ground-based observations in clear and overcast conditions. The clear sky evaluation has been presented only for the model in CNTRL configuration since in absence of cloud the impact of radiance assimilation is of little relevance. For the clear sky sensitivity test the 20-21 January 2017 period has been used (Figure 3a,b). The comparison is done extracting from the numerical model the GHI over the four nearest grid points to the station coordinates and then performing a distance weighted mean. Figure 5a reports the diurnal cycles of the solar irradiance for the 20-21 January 2017 simulated by WRF-IMAA-Solar (red line) and observed by ARPA radiometers (black line). For clear sky conditions, the model correctly reproduces the observed evolution even though a slight overestimation (~5 W/m 2 ) can be observed in the maximum of the mean solar irradiance (Figure 5a). The mean scatter plot of the GHI, reported in Figure 5b, shows a distribution with grouped points, due to the hourly samples; in clear sky conditions the GHIs increase before noon and decrease after the maximum with a similar trend among the stations. The standard deviation reported in Figure 5a,b for both observations and simulations is calculated between the stations to take into account the variability between them. The widths of the standard deviations for the observations (about 20-30 W/m 2 ) are approximately comparable for all the GHI values (Figure 5b). These gaps are easily visible from Figure 5a at midday when the derivative of the curve approaches zero and they seem negligible in the rapid phase of growth and decrease of the GHI. In addition, note from Figure 5b that the standard deviations calculated among the stations are larger for the observations than for the simulated ones, meaning that the observed GHI of the 23 stations show larger variability than the simulations. That indicates that the model may miss some of the differences between sites that are fairly close. To highlight the differences among the stations, statistical scores have been evaluated separately for each station and reported in Table 3. From Table 1 we note that the model tends to underestimate the elevation of the stations and to smooth the height of the stations, especially the ones with higher elevations, leading to underestimation and less variable GHI among the observation sites. This is evident at stations like Bione, Mezzoldo Passo S. Marco and Tignale Oldesio where the GHI is underestimated, especially in the maximum of the solar irradiance of approximately 20-30 W/m 2 , as demonstrated by the negative values of the MBE (Table 3, clear sky). For some stations, particularly the ones located in urban areas (e.g. Milano, Milano Lambrate and Cinisello Balsamo), the model shows positive MBE > 5 W/m 2 , indicating an overestimation of the GHI probably due to an underestimation of GHI extinction caused by pollution aerosols or to excessively high albedo associated to improper land use category (Table 3, clear sky). Furthermore, larger RMSE, ranging from 20 to 30 W/m 2 , are associated to stations showing the greatest differences between real and model elevation, that is, Bione, Mezzoldo Passo S.Marco and Tignale Oldesio (Table 3, clear sky). In clear sky conditions, significant differences among real and model elevations could explain the model deviations from observations, indeed the vertical profile of global irradiance is determined by the change of the optical path length. associated to improper land use category ( Table 2, clear sky). Furthermore, larger RMSE, ranging from 20 to 30 W/m 2 , are associated to stations showing the greatest differences between real and model elevation, that is, Bione, Mezzoldo Passo S.Marco and Tignale Oldesio (Table 2, clear sky). In clear sky conditions, significant differences among real and model elevations could explain the model deviations from observations, indeed the vertical profile of global irradiance is determined by the change of the optical path length.    To conclude the comparison with the ARPAL stations, the model performances in overcast conditions have been evaluated. For this purpose, the period 01-02 April 2017 is selected (Figure 4a,b). For this case study the evaluation is performed for both the CNTRL and ASSIM configurations. For the ASSIM configuration, two different settings have been tested to choose the best performing one: • ASSIM1: WRFDA uses the default B matrix. • ASSIM2: WRFDA uses a domain specific B matrix that considers the climatological aspects of the area.
For both ASSIM1 and 2, the model assimilates radiances from the eight thermal infrared channels of SEVIRI (4-11) at the start time (00 UTC). The comparison is done in the same manner of clear sky conditions, with the ARPA observations and using the distance weighted mean. The first 24 h of simulation of the two runs have been selected for the comparison. The statistical scores obtained for the cloudy sky and averaged all over the stations are reported in Table 4. The performances of the model in overcast sky get worse than the ones in clear sky, confirming the difficulty for the model to correctly reproduce clouds in both space and in time. The introduction of the assimilation tends to improve the irradiance forecast. In fact, the RMSE, MAE and CORR are better for the ASSIM configurations than the CNTRL one. Only the MBE gets worse for ASSIM tests than the CNTRL one, which is probably due to the underestimation of the cloud cover. The assimilation test with the domain specific B matrix has better scores than the one with the default one.
Even though the ASSIM2 configuration shows slightly higher performances than CNTRL ones, except for the MBE, another test has been performed to further investigate the impact of the assimilation of the SEVIRI radiance at 12 UTC. To better evaluate, indeed, the impact of the assimilation on the forecast of the solar irradiance, it is preferable to choose diurnal hours for the assimilation time, when GHI is relatively high. In addition, at 00 UTC the cloudiness over Lombardy region is quite limited, hinting a little impact of the assimilation on the performance of the model.
For the ASSIM at 12 UTC, the model is set using the configuration with best scores in the preliminary tests (ASSIM2); hence from now on the acronym ASSIM will refer to the configuration mentioned above as ASSIM2. Figure 6a shows the time series of simulated and observed solar irradiance with the respective standard deviations of the stations. Close to the assimilation time (12 UTC), ASSIM shows values closer to observations than CNTRL. Figure 6b confirms the tendency for both the configurations to overestimate the GHI, in fact the points lay above the 1:1 line. The underestimation of the cloud cover and the associated overestimation of radiation is a common problem in NWP modeling due to the incomplete representation of the complex and often nonlinear cloud processes into the parameterizations [12,[57][58][59][60]. Even for this test, the standard deviation reported in Figure 6a,b for observations and simulations is calculated between the stations to take into account the variability between them. As seen for the clear sky in Figure 5b, the standard deviations of the stations are larger for the observed GHI than the simulated ones. To the possible reasons listed above, we can also add the difficulty for the model to simulate the right location and timing of clouds.
Even though the ASSIM2 configuration shows slightly higher performances than CNTRL ones, except for the MBE, another test has been performed to further investigate the impact of the assimilation of the SEVIRI radiance at 12 UTC. To better evaluate, indeed, the impact of the assimilation on the forecast of the solar irradiance, it is preferable to choose diurnal hours for the assimilation time, when GHI is relatively high. In addition, at 00 UTC the cloudiness over Lombardy region is quite limited, hinting a little impact of the assimilation on the performance of the model.
For the ASSIM at 12 UTC, the model is set using the configuration with best scores in the preliminary tests (ASSIM2); hence from now on the acronym ASSIM will refer to the configuration mentioned above as ASSIM2.   Figure 6b confirms the tendency for both the configurations to overestimate the GHI, in fact the points lay above the 1:1 line. The underestimation of the cloud cover and the associated overestimation of radiation is a common problem in NWP modeling due to the incomplete representation of the complex and often nonlinear cloud processes into the parameterizations [12,[57][58][59][60]. Even for this test, the standard deviation reported in Figure 6a,b for observations and simulations is calculated between the stations to take into account the variability between them. As seen for the clear sky in Figure 5b, the standard deviations of the stations are larger for the observed GHI than the simulated ones. To the possible reasons listed above, we can also add the difficulty for the model to simulate the right location and timing of clouds. The statistical scores reported in Table 4 reveal better values for the two simulations initialized at 12 UTC than for the three initialized at 00 UTC. These improvements are due both to the assimilation during diurnal time, when GHI is relatively high and to the interval time considered for the statistical scores closer to the start and assimilation times.
For this test, the statistical indexes have also been evaluated separately for each station (Table 3, cloudy sky). For the MBE and MAE indexes, the number of stations where ASSIM or CNTRL prevail is very similar (respectively 8 for the MBE and 6 for the MAE). For the RMSE and CORR scores, instead, the number of stations where the ASSIM values are better than the CNTRL ones, prevails. The stations that perform best for ASSIM are located on the east and west side of the Lombardy region, leaving a central area (the Milan and Bergamo area) where the CNTRL configuration prevails. This can be explained by looking at Figure 5a where we can distinguish two bands of clouds over northern Italy which leave an area of clear sky in the central part of Lombardy. Therefore, in the areas where the cloud cover is observed, the impact of assimilation of radiance from SEVIRI is positive and improves the model's performance.
The statistical indexes averaged over all the stations for CNTRL and ASSIM are similar, though ASSIM shows better RMSE, decreasing from 104.34 W/m 2 to 100.05 W/m 2 and CORR, increasing from 0.915 to 0.920 (Table 4).
To increase the size of validation points, we extended the comparison to the satellite observations described above. The first evaluation is against the CLM from MSG. The cloud cover computed by WRF-IMAA-Solar-ASSIM is compared with the cloud cover observed by MSG satellite at 13 UTC (1 hour later the assimilation). Figure 7 clearly shows the impact of the assimilation-the ASSIM produce more cloudiness (Figure 7b) with respect to CNTRL (Figure 7c) which is confirmed by MSG observations (Figure 7a). The CNTRL simulation misses most of clouds over Italy, reproducing only the convective clouds related to the incoming front in North-West Italy. The analysis is further divided by cloud height (low, medium and high) as shown in Figure 8. It is clear that the SEVIRI assimilation allows the model to recover both the cirrus (Figure 8c) and the convective activity producing medium and low clouds (Figure 8a,b) towards the western boundary.
ASSIM produce more cloudiness (Figure 7b) with respect to CNTRL (Figure 7c) which is confirmed by MSG observations (Figure 7a). The CNTRL simulation misses most of clouds over Italy, reproducing only the convective clouds related to the incoming front in North-West Italy. The analysis is further divided by cloud height (low, medium and high) as shown in Figure 8. It is clear that the SEVIRI assimilation allows the model to recover both the cirrus (Figure 8c) and the convective activity producing medium and low clouds (Figure 8a,b) towards the western boundary.
at 13 UTC (1 hour later the assimilation). Figure 7 clearly shows the impact of the assimilation-the ASSIM produce more cloudiness (Figure 7b) with respect to CNTRL (Figure 7c) which is confirmed by MSG observations (Figure 7a). The CNTRL simulation misses most of clouds over Italy, reproducing only the convective clouds related to the incoming front in North-West Italy. The analysis is further divided by cloud height (low, medium and high) as shown in Figure 8. It is clear that the SEVIRI assimilation allows the model to recover both the cirrus (Figure 8c) and the convective activity producing medium and low clouds (Figure 8a,b) towards the western boundary.  The comparison between the CLM from MSG and the simulations (CNTRL and ASSIM) is done interpolating the observations on the grid of the model and producing the CLM from the WRF cloud cover. For this comparison the simple persistence method has been introduced as benchmark to evaluate the convenience of using the assimilation technique. The persistence is a very simple method, for which the current status is projected as is to the future assuming that conditions remain unchanged in subsequent time intervals [5].
The impact of the assimilating process is most relevant at 13 UTC of 1 April, that is, one hour after the assimilation. Indeed, the first line of Figure 9 shows that WRF ASSIM gives a cloud mask very similar to the observed one, even though an overestimation of the cloud can be observed. This is confirmed by Figure 10, where PC, FAR, MR and BIAS are reported for the first 6 h after the assimilation. It is evident that the CNTRL configuration misses the 59% of cloud grid points against only the 11% for ASSIM; also the PC confirms that ASSIM well simulates the 81% of pixels cloud/no cloud versus the 63% for CNTRL and the BIAS for ASSIM is very close to the best value (1.09 for ASSIM and 0.43 for CNTRL). On the other hand, FAR increases for ASSIM (18%) with respect to CNTRL (5%). One hour later, at 14 UTC, the CNTRL simulation starts to recover the gap and indeed more clouds appear in central-southern Italy (Figure 9, second row). The statistical scores ( Figure 10) confirm this recovery, especially for PC (82% for ASSIM and 80% for CNTRL), though remaining below the ASSIM performances (MR: 9% for ASSIM and 26% for CNTRL; BIAS: 1.10 for ASSIM and 0.80 for CNTRL). Three hours later the start time (15 UTC, third row Figure 9) the two WRF simulations are very similar even though the CNTRL maintains the tendency to underestimate the presence of clouds, for example around Sardinia, as shown by the MR reported in Figure 10 (MR: 7% for ASSIM and 16% for CNTRL) and the ASSIM to overestimate the cloud cover, as indicated by the FAR reported in Figure 10 (FAR: 16% for ASSIM and 9% for CNTRL). In the following hours ( Figure 9: 16 UTC fourth row and 17 UTC fifth row) the CNTRL and ASSIM cloud masks are close to each other and no particular feature can be detected as confirmed by the PC ranging from 84% of CNTRL to 86% of ASSIM ( Figure 10) and by the BIAS oscillating between 1.03 and 1.06 for both the configurations. For the statistical scores FAR and MR the differences decrease reaching respectively 13% and 8%. The comparison highlights that the persistence outperforms ASSIM only in the first half hour of simulation for the PC and FAR indexes; later in time the enhancement for ASSIM with respect to the persistence is remarkable (Figure 10). The improvement of ASSIM relative to the persistence model is negligible at the first hour of simulation for the MR and BIAS scores and increases with time ( Figure 10). In addition, persistence tends to worsen with time according to all statistical indices, as opposed to the ASSIM and CNTRL configurations for which the performances improve over time ( Figure 10). Therefore, WRF-IMAA-Solar outperforms the persistence model, especially when the assimilation is used. Indeed, all the mean statistical scores are always worse for the persistence than ASSIM ( Table 5). The CNTRL configuration is outperformed by the persistence only for the MR index (Table 5). Summarizing the impact of the radiance assimilation on the cloud mask, it is evident that the benefits are very clear especially within 3 h after the assimilation time; after this interval the performances of the CNTR and ASSIM are very similar. The statistical scores reported in Table 5 assert best results for ASSIM than CNTRL for the PC, MR and BIAS; only the FAR is worst for ASSIM with respect to CNTRL.
The second evaluation with MSG products focuses on the Hourly Shortwave Solar Irradiance. As for the cloud mask, the observations have been interpolated over the WRF model grid.
The GHI at 13 UTC of 1 April 2017 (first row of Figure 11) clearly reproduces what already seen above for the cloud mask ( Figure 9): the overestimation produced by the CNTRL simulation is due to the lack of high clouds over central Italy and also the cloudy system north-west of Corsica is underestimated compared to the observed one. The ASSIM simulation better reproduces the GHI pattern over Italy but overestimates the GHI in the west side of the domain due to a lower extinction by the simulated cloud system, which appears more scattered than the observed one. We note a different behavior of the GHI by MSG over land and water; indeed, clear gaps appear over Sicily and central Italy where the lower values of GHI suddenly interrupt passing from sea to land. These observational gaps can be due to a different retrieval technique over land and over water and could have effects on the evaluation.  The comparison between the CLM from MSG and the simulations (CNTRL and ASSIM) is done interpolating the observations on the grid of the model and producing the CLM from the WRF cloud cover. For this comparison the simple persistence method has been introduced as benchmark to evaluate the convenience of using the assimilation technique. The persistence is a very simple method, for which the current status is projected as is to the future assuming that conditions remain unchanged in subsequent time intervals [5]. The impact of the assimilating process is most relevant at 13 UTC of 1 April, that is, one hour after the assimilation. Indeed, the first line of Figure 9 shows that WRF ASSIM gives a cloud mask very similar to the observed one, even though an overestimation of the cloud can be observed. This is confirmed by Figure 10, where PC, FAR, MR and BIAS are reported for the first 6 hours after the assimilation. It is evident that the CNTRL configuration misses the 59% of cloud grid points against only the 11% for ASSIM; also the PC confirms that ASSIM well simulates the 81% of pixels cloud/no cloud versus the 63% for CNTRL and the BIAS for ASSIM is very close to the best value (1.09 for ASSIM and 0.43 for CNTRL). On the other hand, FAR increases for ASSIM (18%) with respect to CNTRL (5%). One hour later, at 14 UTC, the CNTRL simulation starts to recover the gap and indeed more clouds appear in central-southern Italy (Figure 9, second row). The statistical scores ( Figure 10) confirm this recovery, especially for PC (82% for ASSIM and 80% for CNTRL), though remaining For this reason, the statistical indexes have also been calculated by selecting only the grid points over the sea, which seem more reasonable and more consistent with respect to the observed clouds.
Both the MBE and the NMBE indexes confirm the GHI overestimation more pronounced for the CNTRL configuration than the ASSIM one ( Figure 12). In fact, the values of the MBE range from a maximum 68 W/m 2 to a minimum 54 W/m 2 for CNTRL and from a maximum 62 W/m 2 to a minimum 49 W/m 2 for ASSIM. The NMBE shows an increase of the overestimation with time ( Figure 12, first row), with at least 2% above for ASSIM with respect to CNTRL. The mean values of the NMBE are 19% for ASSIM and 21% for CNTRL as reported in the Table 6. Let us highlight the improvement of both the MBE and the NMBE when only the water grid points are selected. This is more pronounced for ASSIM; in fact, the mean value of the MBE passes from 54.43 W/m 2 to 38.85 W/m 2 for ASSIM and from 60.57 W/m 2 to 58.40 W/m 2 for CNTRL (Table 6).    Later in time ( Figure 11) the GHI decreases because of sun setting and the overestimation in the left side of the domain remains more evident in the CNTRL than ASSIM simulation. The RMSE and MAE apparently seem to improve with time but this is due to the decreasing of the values of the GHI and not to an improvement of the forecast performances (second and third rows of Figure 12). For this reason, it is more appropriate to analyze the normalized indices, which indicate a slight superiority for ASSIM with respect to CNTRL and clearly show the worsening of the forecast skills with the time of simulation. The values of NMAE move from 17% of the first hour of simulation to 45% of the fourth one for ASSIM and from 18% to 47% for CNTRL. The same trend can be found for the NRMSE showing values ranging from 28% to 68% for ASSIM and from 30% to 69% for CNTRL. Slight better performances are given over water, with a maximum improvement for the NRMSE of 5% for ASSIM and 4% for CNTRL; the maximum improvement for NMAE is 3% for both ASSIM and CNTRL.
The correlation shows very similar results for both the configurations with a maximum of 59% for ASSIM WATER one hour after the assimilation (Figure 12, fourth row); values worsen with time, reaching approximately 12-14% after four hours. The persistence method has been used as benchmark also for the evaluation of the GHI. The mean values of the scores confirm that WRF-IMAA-Solar, both CNTRL and ASSIM, outperforms the persistence method. This result is expected, as this simple method can be used in clear sky conditions and is not suitable when the atmospheric dynamics involves rapid cloud cover variations.
Summarizing these preliminary results, it can be asserted that the WRF model overestimated the GHI and the assimilation process tends to reduce the error up to 4 h after the start time, when the SEVIRI radiance has been assimilated. The average scores confirm better performances for ASSIM, especially considering grid points only over water (Table 6).

Conclusions
The solar irradiance is particularly variable both in space and in time because of the complexity of the dynamic evolution of clouds. Geostationary satellite data can be taken into account to produce a more realistic representation of the atmospheric state using a data assimilation tool in a convective permitting model. Therefore, with the aim to improve the prediction of solar irradiance, preliminary tests have been performed to assess the impact of the 3D-VAR data assimilation of SEVIRI radiance in Italy using WRF Solar mesoscale model. The performances of the model have been analyzed both in clear conditions and cloudy sky. In clear sky the model has been tested without assimilation (CNTRL configuration) using the ARPAL radiometric stations. The statistical scores for the clear sky conditions confirm the positive performance of the WRF-IMAA-Solar with a MBE of −0.21 W/m 2 , a RMSE of 15.47 W/m 2 , a MAE of 6.78 W/m 2 comparable to the instrument uncertainty and a CORR of 0.995.
For cloudy sky, simulations with data assimilation (ASSIM configuration) have been performed. A preliminary test allows us to select the best settings for the use of the 3D-VAR data assimilation: the eight thermal infrared channels of SEVIRI have been assimilated at 12 UTC using a domain specific B matrix to consider the climatological aspects of the area, the VarBC scheme correction to remove the biases from the measurements and the 120 km thinning mesh to avoid correlations between adjacent observations. The solar irradiance is better simulated by ASSIM than CNTRL configuration, especially in the 2-3 h nearest to the assimilation time, even though both the configurations overestimate the GHI. The statistical indexes calculated using the ARPAL stations improve for the ASSIM configuration, especially for the RMSE. The improvements introduced by the 3D-VAR data assimilation of SEVIRI radiances are confirmed also by the comparison with the EUMETSAT products: the Cloud Mask and the Hourly Shortwave Solar Irradiance. The use of the radiance assimilation on the simulation of cloud mask clearly produces better results especially in the 3 h later the assimilation time; after this interval the performances of the CNTRL and ASSIM are very similar. In particular one hour after the assimilation time the CNTRL configuration misses the 59% of cloud grid points against only the 11% for ASSIM; the PC statistical score passes from the 63% of CNTRL to the 81% of ASSIM and the BIAS for ASSIM is very close to the best value (1.09 for ASSIM and 0.43 for CNTRL). After the first hour of simulation, the better performance for the ASSIM configuration persists even though the difference is less significant.
The improvement related to the SEVIRI radiance assimilation is confirmed also by the comparison of the WRF-IMAA-Solar with the MSG products of the Hourly Shortwave Solar Irradiance. The statistical indexes confirm the GHI overestimation of the model is more pronounced for CNTRL than ASSIM. In the first hour of simulation the RMSE is reduced from 185 to 173 W/m 2 and the MBE from 54 to 49 W/m 2 when the assimilation is activated. The normalized statistical indexes are better bỹ 2% using the ASSIM configuration. A discontinuity is noted for MSG GHI over water and over land, probably due to a different retrieval technique. Calculating the statistical scores for only the over-water grid points, which seem to have a more consistent pattern with the observed clouds, the values improve further. Indeed, the value of the MBE one hour after the assimilation time passes from 49 W/m 2 to 20 W/m 2 for ASSIM and from 54 W/m 2 to 42 W/m 2 for CNTRL, using only the water grid points and the behavior is similar for the remaining indexes. The statistical scores obtained are comparable or slightly better than the values obtained from similar works, where a NWP is used to estimate the GHI [13][14][15][16][17][18]. The results from these preliminary tests show that the assimilation of geostationary satellite data increase the performances of the WRF-Solar model especially in the short-term range of the forecasts. This confirms the findings of other works [61][62][63], that is the effect of satellite data assimilation has high impact in the first 3-4 h of simulation and the improvement due to the assimilation decreases going forward with the simulation.
Note that this technique has been tested on two case studies only (one in clear sky and one in cloudy conditions) as preliminary tests for checking whether it is worth the effort to implement this change on our operational chain. Encouraged by the positive results, we are planning to increase the dataset and also to focus on the diffuse and direct components of the GHI.
Further improvements could be made considering other SEVIRI channels, for example, in the visible range. In the future, we plan to test assimilation with RTTOV model, since its most recent version allows for the assimilation of visible channels in addition to the infrared channels ones. The visible channels and specially the HRV channel with 1 km spatial resolution, carry more information and detail on the cloud pattern and properties, possibly foreshadowing an improved solar irradiance forecast. In addition, other advanced assimilation methods such as 4D-Var could be tested to take advantage of the frequent interval of scanning (15 min for SEVIRI).