The set of governing equations is based on the fully compressible non-hydrostatic system for the representation of a two-component system (dry air and water in all the three phases). The time integration is performed using an explicit two-time-level predictor–corrector scheme, while the terms describing vertical sound-wave propagation are treated in an implicit way.
2.1. Simulation Set-Up
The domain considered in the present work is shown in
Figure 1a and includes the whole Italian peninsula, extending from 34.199° N to 48.612° N and from 3.8056° E to 23.800° E. The positioning of boundaries was chosen in order to avoid crossing of mountain chains, improving the numerical stability. The computational grid is an R2B10 and is made up of 451,384 triangular cells, with a spatial resolution of about 2.5 km. The geometrical center of the grid is positioned in Gaeta (longitude 13.802° E, latitude 41.560° N).
Moreover, in order to perform a model sensitivity to the domain size, two additional domains were considered (D1 and D2, not shown), with the same geometrical center and the same resolution R2B10; however, with extensions larger, respectively, of 100 and 200 km with respect to the original one. D1 is made up of 498,712 cells, D2 of 562,240 cells.
Analysis was conducted over the Italian area, considering the second level of NUTS2 (Nomenclature des Unités Territoriales Statistiques) repartition but, for simplicity, in this work the sensitivity results are presented in the three macro-areas shown in
Figure 1b.
Two test cases were considered, namely 1–14 January 2019 and 16–31 August 2020. The selected periods were chosen in order to investigate the capabilities of ICON-LAM under different weather conditions and will be identified in the following as winter 2019 and summer 2020. The first period was characterized by several rainy events and strong winds, whereas the second one was characterized by an extreme precipitation event that occurred during 29 and 30 August 2020.
Specifically, hindcasts were initialized once and ran continuously throughout each of the 2-week periods. The initial and boundary conditions are the “analysis” provided by the ECMWF IFS model [
9], characterized by a spatial resolution of 0.075° (about 8.5 km), with the boundary conditions updated every 6 h. This approach was chosen in order to have accurate boundary conditions and minimize the errors introduced by the forcing data, thus, avoiding a long-range error growth.
In any case, it is worth noting that initializing ICON with IFS data could be problematic for soil moisture, as the Land Surface Models used in both models have substantially different soil water climatologies. Model evaluation was executed considering the 2-m temperature (T2m) as the most basic weather variable and precipitation that, at convection permitting scales, is a fundamental parameter.
The simulations were performed on the Supercomputer ZEUS of the Fondazione Centro Euro-Mediterraneo sui Cambiamenti Climatici (CMCC). This is a Lenovo HPC cluster based on 348 Intel Xeon dual processor nodes (for a total of 12,528 cores) interconnected by means of an Infiniband EDR network. Due to the high performance of the supercomputer, each simulation over the domain considered employs less than 20 min for each day, using 576 cores distributed on 16 nodes.
2.2. Observational Datasets
Model evaluation was conducted against SCIA, a gridded observational dataset provided by the Italian Environmental Protection Agency (ISPRA) (
http://www.scia.isprambiente.it/, accessed on 10 March 2022), available at a horizontal resolution of about 5 km for T2m and 10 km for precipitation over the whole Italian area. This dataset comes from the interpolation of data stations at daily resolution, covering the 20th century up to the present date (with different reliability due to the different number of interpolated values available each year) [
10]. SCIA provides daily maximum and minimum values of T2m and daily precipitation values.
Additionally, the E-OBS [
11] (version 23.1) was considered, a widely used gridded daily dataset of precipitation and temperature at 0.11° spatial resolution (about 11 km), available for the period 1950–2020, covering the European area (mean daily values). It is generally used for monitoring the climate in Europe (especially with regard to the assessment of daily extremes) and is an important dataset for model validations, even if, for the present work, some shortcomings are expected, related to the large difference of resolution between the model and the dataset.
Moreover, in order to strengthen the analysis, evaluation was conducted also against selected data stations provided by regional environmental protection agencies (ARPA) scattered over the Italian area, since non-negligible errors could affect gridded data, especially in mountainous regions, since the majority of stations are located in valleys, and the assumptions made for the height regression may not be appropriate on individual days.
2.3. Methodology
The sensitivity analysis was conducted starting from a reference configuration developed at DWD and modified according with the finding of Khain P. (2021, personal communication) obtained over Israel. The external datasets used are GLC2000 for land use, GLOBE for surface altitude and FAO Digital Soil Map of the World for soil types. The convection scheme is the classical Tiedtke/Bechtold, whereas the land surface scheme is the multi-layer model TERRA [
12], which simulates the energy and water balance at the land surface and in the ground, providing the lower boundary-condition for the atmospheric model and is responsible for the exchange of fluxes of heat, moisture and momentum between land surface and atmosphere.
Other main features are:
- -
The time step dt was set equal to 24 s. This value was chosen after a series of preliminary tests. In particular, it was shown that the value 36 s does not guarantee stability, whereas the value 12 s provides almost the same solution as with 24 s.
- -
The number of vertical levels was set equal to 65 as a tradeoff between accuracy and computational costs after preliminary tests were performed also with 50 and 90 vertical levels. The highest level was at 22 km.
ICON allows choices to be made among different parameterizations of physical phenomena and numerical schemes as well as the selection of suitable values for parameters related to them. It is therefore possible to optimize the configuration of the model according to the climatology of the area and the spatial resolution employed.
In the present work, starting from the reference configuration described above, different configurations were defined by varying one scheme (parameter) at a time. A list of the sensitivity runs with changes in the parameters with respect to the control run (ref1) is reported in
Table 1. According with previous experiences on sensitivity analysis [
13], sensitivity tests mainly regard the physics schemes and are combined into five groups.
Sensitivity to convection: Convection is a fundamental process in the atmosphere, since it contributes to the formation of large-scale circulation and local heavy precipitation. In ICON, it is parameterized through a bulk mass flux scheme based on Tiedtke/Bechtold [
14] (the parameter inwp_convection is set to the value 1). In the reference configuration (named ref1 in
Table 1), the shallow convection parameterization is active whereas the parts treating deep and mid-level convection are switched off (the parameter lshallowconv_only is set to TRUE). This option is recommended for convection permitting grids (resolution between 1 and 3 km). The second option (adopted in ref2 of
Table 1) adopts a full convection scheme (both kinds of convection are parameterized). In such a case, the parameter lshallowconv_only is set to FALSE.
Sensitivity to the radiation scheme: Radiation is an important component of a weather model, since local and global temperatures are determined by heating (absorption) and cooling (emission). The shortwave radiation from the sun and the longwave radiation from the earth interact with clouds. The reference configuration (ref1 in
Table 1) uses the radiation scheme ecRad [
15,
16] and the parameter inwp_radiation is set to the value 4.
This allows choices for each component, namely the optical property parameterizations for the atmospheric components and the radiation solver calculating how radiation travels through the optical medium. A second option (adopted in the case ref3 of
Table 1) uses a rapid and accurate radiative transfer model RRTM [
17] and the parameter inwp_radiation is set to the value 1. Moreover, the options with Ritter-Geleyn radiation [
18] and PSRAD [
19] are also tested in cases ref4 and ref5 of
Table 1, respectively, and the parameter inwp_radiation is set to the values 2 and 3, respectively.
Sensitivity to the cloud microphysics schemes: they are made up of a closed set of equations able to calculate the formation and evolution of condensed water in the atmosphere. These provide the latent heating rates for the dynamics. The reference configuration (ref1 in
Table 1) uses a single moment scheme, which predicts the cloud water, rain water, cloud ice, snow and graupel [
12], and the parameter inwp_gscp is set to the value 2.
A second option (adopted in the case ref6 of
Table 1) is based on a double moment scheme, which predicts the mass and number of cloud water, rain water, cloud ice, snow, graupel and hail [
20] and the parameter inwp_gscp is set to the value 4. This scheme is mainly recommended at convection permitting scales, i.e., mesh size smaller than 3 km. Further options (adopted in the cases ref7 and ref8 of
Table 1) are based on the Koehler scheme (improved ice nucleation) and on the Kessler scheme [
21] and the parameter inwp_gscp is set to the value 3 and 9, respectively.
Sensitivity to cloud cover: an estimate of cloud cover is required to prepare optical properties of clouds for the radiative transfer. The reference configuration (ref1 in
Table 1) uses a diagnostic cloud cover scheme (Kohler), which combines information from turbulence, convection and microphysics schemes, and the parameter inwp_cldcover is set to the value 1. A second option (adopted in the case ref9 of
Table 1) is based on the COSMO SGS cloud scheme (cloud cover and cloud water/ice calculation are performed as in the COSMO model), and the parameter inwp_cldcover is set to the value 3.
A third option (adopted in the case ref10 of
Table 1) is clouds as in turbulence approach, in which the area fraction of a grid box covered by stratiform (non-convective) clouds is calculated; an additional saturation adjustment is done assuming a Gaussian distribution for the saturation deficit. In such a case, the parameter inwp_cldcover is set to the value 4. The last option (adopted in the case ref11 of
Table 1) is a simple grid scale cloud cover, in which each computational cell is marked with 1 or 0, according to the presence (or not) of clouds; this is determined by comparing the specific cloud water/ice content of each cell with an assigned threshold value (the parameter inwp_cldcover is set to the value 5).
Sensitivity to the turbulence transfer: this parameterization links the resolvable scales and the non-resolvable fluctuating scales of motion. Turbulent fluxes run an exchange of momentum, heat and humidity between the free atmosphere and the earth’s surface, representing a crucial issue for a proper simulation of atmospheric flows. The reference configuration (ref1 in
Table 1) uses a turbulence scheme [
22] taken from COSMO, based on a prognostic equation for Turbulent Kinetic Energy also considering that some modifications for ICON are still ongoing in order to improve the numerical stability under extreme conditions. In this case, the parameter inwp_turb is set to the value 1.
Another option (adopted in the case ref12 of
Table 1) is based on the Globales Modell (GME) turbulence scheme [
23], and the parameter inwp_turb is set to the value 2. The last option (adopted in the case ref13 of
Table 1) is based on the 3D sub-grid model of Smagorinsky [
24] implemented for Large Eddy Simulations applications, and the parameter inwp_turb is set to the value 5.
Statistical measures for model’s performance were calculated as the Root Mean Square Error (
RMSE) and Mean Absolute Error (
MAE), defined in the following, where
S and
O refer to daily values of model and reference product, respectively:
Here, N represents the number of grid points included in the macro area considered. Moreover, Taylor diagrams and Quantile-quantile plots (QQplot) are proposed. Taylor diagrams provide a graphical framework that allows a comparison among models and reference data in terms of their correlation, their root-mean-square difference and the ratio of their variances. QQ-plots use a graphical method and allow comparison of the probability distributions by plotting their quantiles against each other. If the compared distributions are close to the observations, the points in the QQplot approximately lie close to the 45° inclined line.