Impact of Soil Moisture Initialization in the Simulation of Indian Summer Monsoon Using RegCM4

: Soil moisture is one of the key components of land surface processes and a potential source of atmospheric predictability that has received little attention in regional scale studies. In this study, an attempt was made to investigate the impact of soil moisture on Indian summer monsoon simulation using a regional model. We conducted seasonal simulations using a regional climate model (RegCM4) for two different years, viz., 2002 (deﬁcit) and 2011 (normal). The model was forced to initialize with the high-resolution satellite-derived soil moisture data obtained from the Climate Change Initiative (CCI) of the European Space Agency (ESA) by replacing the default static soil moisture. Simulated results were validated against high-resolution surface temperature and rainfall analysis datasets from the India Meteorology Department (IMD). Careful examination revealed signiﬁcant advancement in the RegCM4 simulation when initialized with soil moisture data from ESA-CCI despite having regional biases. In general, the model exhibited slightly higher soil moisture than observation, RegCM4 with ESA setup showed lower soil moisture than the default one. Model ability was relatively better in capturing surface temperature distribution when initialized with high-resolution soil moisture data. Rainfall biases over India and homogeneous regions were signiﬁcantly improved with the use of ESA-CCI soil moisture data. Several statistical measures such as temporal correlation, standard deviation, equitable threat score (ETS), etc. were also employed for the assessment. ETS values were found to be better in 2011 and higher in the simulation with the ESA setup. However, RegCM4 was still unable to enhance its ability in simulating temporal variation of rainfall adequately. Although initializing with the soil moisture data from the satellite performed relatively better in a normal monsoon year (2011) but had limitations in simulating different epochs of monsoon in an extreme year (2002). Thus, the study concluded that the simulation of the Indian summer monsoon was improved by using RegCM4 initialized with high-resolution satellite soil moisture data although having limitations in predicting temporal variability. The study suggests that soil moisture initialization has a critical impact on the accurate prediction of atmospheric circulation processes and convective rainfall activity.


Introduction
The strong impact of land surface processes is well recognized in modulating the weather and climate system in subseasonal to seasonal and even longer time scale. Land surface acts as an interface between the biosphere and the overlying atmosphere. It interacts with the atmosphere through the exchange of mass, momentum and energy and hence is 2 of 27 considered as the lower boundary of the atmosphere at approximately 30% of the Earth's surface [1]. It is also well understood that the Earth's surface is the reservoir of our main energy resources from solar radiation. Both short and long wave forms of solar radiation are absorbed by the land surface and reemitted. When releasing the energy through the planetary boundary layer, the Earth's surface works like a separator. It redistributes the net incoming radiative energy into various fluxes such as sensible, latent and other ground fluxes. Hence, the energy required to develop and sustain any weather system over landmass, is supplied from the underlying land surface [2][3][4][5]. Therefore, the landatmosphere interaction plays a vital role in modulating the weather and climate systems on a regional and global scale [6][7][8][9][10]. Due to its immense impact, the functions of the land surface have been explored extensively in observation as well as modeling studies across the globe [9][10][11][12][13][14].
Land surface-atmosphere interaction may be either a positive, negative or both feedback mechanism between the atmosphere and different land surface characteristics such as soil moisture, soil temperature, soil types, vegetation cover, snow cover, etc. Each of them is not of similar importance for a weather system over a region. In particular, soil moisture is an important component of the global water budget and hydrology cycle [1,8,15]. The function of soil moisture may be described in two ways. Primarily, rate of evaporation from the land surface is determined by the soil moisture quantity which controls the moisture supply to the atmosphere. Secondly, as mentioned earlier, it mainly partitions the net absorbed solar radiation into fluxes. It is mentioned by Dutta et al. [16] that soil moisture and snow cover are the two leading land surface variables that have a potential impact on the variation of weather systems if the effect of sea surface temperature is excluded.
Climate downscaling using a regional climate model (RCM) is well accepted and widely used for the simulation of various weather and climate systems for the past several decades. It is demonstrated in numerous earlier works of literature [17][18][19][20][21][22] that the RCMs show better competence in simulating climatic features due to better representation of the subgrid scale physical process and topography than the global circulation model. Land surface processes mostly occur at a subgrid-scale but play an important role in controlling weather systems [6,7,23]. Through evaporation, the exchange of heat and moisture fluxes from the land surface to the atmosphere helps form convection and precipitation. Proper representation of soil moisture is therefore extremely crucial for the numerical weather forecasting as well as climate simulations on seasonal, annual and decadal scale using fully coupled RCMs. In each state-of-the-art RCM, physical parameterization of the land surface is taken care of through different land surface models (LSM henceforth). The soil moisture initialization technique is different in different LSMs. However, providing an accurate state of the soil parameters has a serious impact on evaluating the weather and climate modes which are associated with the retrospective research based on the terrestrial hydrology cycle. Therefore, better simulation of atmospheric processes can be achieved through initializing climate models with realistic observational/reanalysis soil moisture datasets.
Several kinds of research have already been carried out to emphasize the impact of land surface model initialization with realistic soil moisture datasets [6,7,[24][25][26][27][28][29][30][31][32][33]. Fennessy and Shukla [24] studied the importance of initial soil wetness in seasonal prediction with dynamical models. They concluded that the effect of initial soil wetness is local and greatest in the near-surface fields, viz. evaporation, surface temperature and precipitation. Douville and Chauvin [25] used a land surface scheme that was forced with meteorological observation and analysis using a relaxation technique and inferred that the relaxation positively impacts both model climatology and variability at an interannual scale. Kanamitsu et al. [27] showed that the predictive ability of the initial soil moisture is higher in arid/semi-arid regions and has a sound impact on surface temperature simulation. Douville [28] investigated the effect of soil moisture on climate variability and potential predictability and highlighted its strong contribution to climate variability.
Moufouma-Okia and Rowell [2] investigated the sensitivity of soil moisture initialization on the West African monsoon by using an RCM and revealed that specification of initial soil moisture is slightly sensitive to the West African monsoon rainfall. Douville [3] highlighted the significant impact of soil moisture on regional climate and suggested further comprehensive and systematic investigation. Bisselink et al. [4] performed a similar study by initializing an RCM with satellite derived soil moisture data and showed greater impact during dry years. Suarez et al. [7] performed a numerical experiment for three synoptic events using two different mesoscale models with varying soil moisture. They illustrated that the rainfall increases (decreases) with enhanced (reduced) soil moisture, respectively. These studies indicate that the soil moisture significantly affects the weather and climate simulation, but varies from region to region. However, no studies have yet been discussed in this context over the Indian region.
Among the various RCMs available, the regional climate modeling system which is commonly abbreviated as RegCM of the International Center for Theoretical Physics (ICTP, Italy) has become remarkably popular due to its successful application towards numerous scientific studies [15,[17][18][19][20][21][22]30,[34][35][36][37]] and many studies have tested its performance over Indian regions [17][18][19][20][21][22]37]. In the context of soil moisture, RegCM is also used over various regions [30,31,38]. Hu et al. [38] argued that the treatment of soil moisture should be paid more attention while performing an experiment on soil moisture data assimilation using RegCM over China. Patarcic and Brankovic [30] investigated the ability of surface temperature seasonal forecasting over Europe using RegCM by initializing it with three different types of soil moisture conditions during summer and winter times. Their study showed that the systematic error was reduced and deterministic ability was improved during summer using realistic soil moisture data. Liu et al. [31] evaluated the impact of soil moisture using RegCM simulation. They showed that initialization with wet (dry) soil moisture anomalies increased (reduced) the subsequent precipitation amount and reduced (increased) surface temperature. Due to sparse observation networks, the availability of accurate soil moisture data (either observation, reanalysis or both) in the past was very rare. Nowadays, different organizations offer accessibility to satellite-derived as well as reanalysis soil moisture datasets. The Climate Change Initiative (CCI) of the European Space Agency (ESA) is one such piece of data publicly released in 2015. This dataset has been successfully applied in some observational [39] and modeling studies [40] over other regions across the globe. However, it is not extensively explored over Indian regions. Although few observational studies over India are available in the literature [8,10,11], it is not comprehensively used in modeling studies. This study mainly deals with the soil moisture initialization over India to understand its role on the seasonal simulation (May-September) of the Indian summer monsoon (ISM) using RegCM. To our knowledge, our attempt to investigate the impact of soil moisture on ISM using a regional model is the first over India. The rest of the paper is structured as follows: brief model information, experimental design, descriptions of the various datasets and validation strategy are discussed in Section 2. Results are described in Section 3 followed by discussion, conclusion, limitation and future scope in Sections 4-6, respectively.

Model Description
In the present study, RegCM v.4.4.5 (RegCM4 henceforth) is employed. It is a compressible, hydrostatic, terrain-following, finite difference, limited area model with a similar dynamical core to that of its previous version (RegCM3 [41]). The model offers a variety of parameterization schemes to represent different physical processes. Cumulus convection is represented using five major schemes such as Kuo [42], Grell [43], MIT [44], Tiedke [45] and Kain-Fritsch [46]. Due to variation in performance, RegCM4 shows flexibility of using different schemes separately over land and ocean, referred to as "mixed" schemes. Land surface processes are represented using two LSMs, namely, the BATS scheme [47] as well as CLM (v.3.5 [48]; v.4.5 [49]). Radiative transfer package from the global model CCM3 [50], planetary boundary layer from Holtslag [51] as well as University of Washington [52] are also available in RegCM4. A detailed description of other available physics schemes, viz., ocean fluxes parameterization schemes, interactive aerosol schemes and interactive lake models are described in Giorgi et al. [15].
In this study, we focused on the soil moisture initialization in the seasonal simulation of ISM using RegCM4. Two LSMs differ with their formulation in various aspects. One of the major disparities is in the description of the soil moisture column. BATS is composed of three soil moisture layers with varying depth from 10 cm to 3 m [35]. On the other hand, the CLM soil column consists of 10 unevenly distributed soil layers at 1.8 cm, 2.8 cm, 4.6 cm, 7.5 cm, 12.4 cm, 20.4 cm, 33.6 cm, 55.4 cm, 91.3 cm and 113.7 cm depth, for a total depth of 3.4 m [53]. In the earlier version of RegCM, soil moisture was initialized using static soil water content relative to saturation as a function of land cover type [54]. Patarcic and Brankovic [30] suggested that this technique is a crude way of defining the initial soil moisture which includes neither seasonal nor interannual variation. Due to that, the model took a higher spin-up time to become stable, particularly for deeper soil layers. Considering this, RegCM4 offers the option to be initialized using climatological soil moisture data both in CLM and BATS [29,55] along with the default static soil moisture. After being initialized from the soil moisture climatology, RegCM4 evolves independently with its own internal water balance equation [30], which would reduce sudden shock to the model at the initial time step and consequently decreases the spin-up time at the deeper soil layer of the model [29,53]. According to the India Meteorological Department (IMD), India did not face any excess monsoon year during the last few decades subsequent to 1988. However, the country witnessed a severe drought in 2002 with 81% ISM rainfall of its long period average [56]. On the other hand, 2011 was a normal year with 102% ISM rainfall of its long period average. Model configuration setup is provided in Table 1 [57]. Simulation during May is considered as spin-up and excluded from the subsequent analysis. Zeng [60] By default, RegCM4 is initialized with the static soil moisture data through the BATS lookup table. In this study, the model is forced to begin with high-resolution satellitederived soil moisture data [61,62] from the Climate Change Initiative (CCI) of the European Space Agency (ESA) (referred as ESA-CCI; http://www.esa-soilmoisture-cci.org; accessed on 15 July 2019). Detailed information of this dataset is given in the following subsection.  By default, RegCM4 is initialized with the static soil moisture data through the BATS lookup table. In this study, the model is forced to begin with high-resolution satellite-derived soil moisture data [61,62] from the Climate Change Initiative (CCI) of the European Space Agency (ESA) (referred as ESA-CCI; http://www.esa-soilmoisture-cci.org; accessed on 15 July 2019). Detailed information of this dataset is given in the following subsection.

Data
The model was forced with six-hourly ERA-Interim reanalysis (EIN75 [63] hereafter) data at 0.75 • × 0.75 • resolution. Topography and land use were obtained from the United States Geological Survey and Global Land Cover Characterization [64] global data at 10 min resolution.
The sea surface temperature from optimum interpolation weekly mean sea surface temperature [65] was fed into the model at 1 • × 1 • resolution from National Oceanic and Atmospheric Administration. Additional datasets including land cover, soil texture, soil color, leaf area index, plant functional type, emission factors, snow data, etc. required for CLM4.5 [49] were obtained from the RegCM data portal (http://clima-dods.ictp.it/Data/ RegCM_Data/CLM45/; accessed on 2 June 2019). Simulated surface temperature and rainfall were validated against high-resolution surface temperature and rainfall analysis data from IMD. The temperature data was constructed by IMD based on 395 station observatories data at 1 • × 1 • spatial resolution [66] covering the land region of India (6.5 • N-38.5 • N, 66.5 • E-100 • E). Similarly, the rainfall data was prepared by IMD by considering the daily rainfall measurements from 6955 rain gauge stations at 0.25 • × 0.25 • spatial grid [67]. These datasets are the finest observation data from IMD so far which RegCM4 was initialized here with ESA-CCI soil moisture datasets (v.02.2). This dataset is a multi-decadal satellite-derived soil moisture product with high spatial resolution at 0.25 • × 0.25 • . The primary data is accumulated through various spaceborne microwave scatterometers, such as ERS-1/2 (SCAT) and METOP-A (ASCAT), as well as microwave radiometers, viz., SMMR, SSM/I, TMI, AMSR-E, WindSat and AMSR2. The detailed information about the different satellite sensors and their specification is mentioned in Dorigo et al. [61,62]. ESA provides three types of soil moisture products, viz., active only, passive only and combined datasets based on these gathered data. Active only data is made by merging all the data from the scatterometers while the passive only product is generated by merging all the data from radiometers. Afterwards these two products are further rescaled to the common platform of Global Land Data Assimilation System version-1 and merged to prepare the combined soil moisture data [68]. Complete procedural technique and further details about this data preparation may be obtained from the literature cited above and the references therein. The datasets are available in the volumetric unit (m 3 m −3 ) at daily scale during 1979-2014. The soil depth of the data varies in the range of 0.5-2 cm. For this study, we used the ESA-CCI combined soil moisture data only. The ESA-CCI dataset are the calibrated data prepared with in situ observation from International Soil Moisture Network (ISMN; [62]). At present, ISMN data consists of 6100 soil moisture datasets from 1400 measurement stations operated by 40 different networks [62]. ISMN holds data globally having 10 station data in India. Validation is carried out for 28 data networks all over the globe. Detailed validation strategy including precise information about the measurement stations is mentioned in Dorigo et al. [62].

Validation Strategy
Model performance was assessed in terms of the spatiotemporal distribution of surface temperature, soil moisture and rainfall considering all India (AI henceforth) as well as its five homogeneous regions, viz., north west India, west central India, central and north-east India, south peninsular India and north east India (NWI, WCI, CNEI, SPI, NEI henceforth) [69]. The simulation forced with a default soil moisture lookup table (ESA-CCI) will be referred to as default (ESA) hereafter. The validation includes some of the basic inferential statistics such as mean, standard deviation (SD) and correlation. In order to further estimate the model ability in predicting the rainfall, equitable threat score (ETS) was computed. ETS is an ability measure generally used for dichotomous (yes/no) forecasting events [70,71]. Mathematically, ETS is defined as follows: where H, M and F stands for number of hits, misses and false alarms while T and H rand refer to the total events and hits due to random chances, respectively. These values are calculated based on a 2 × 2 contingency table. ETS measures the fraction of perfectly forecast points, corrected using hits due to random chance. It varies in the range of -1/3 to 1 with ETS ≤ 0 indicating no ability and ETS = 1 indicating perfect ability.

Surface Temperature
The analysis was started with the discussion about the model simulated surface temperature and its validation with the IMD observation. Results from 2002 (2011) are given in Figure 2 (Figure 3). The first two columns represent bias with default and ESA configuration with their difference in the last column. The first four rows (a-c, d-f, g-i, j-l) correspond to June, July, August and September while the last row (m-o) stands for seasonal (June-July-August-September; JJAS hereafter) mean. It indicates that RegCM4 showed consistent cold bias over peninsular India, irrespective of the years and attained maxima in July 2002 ( Figure 2) and June 2011 ( Figure 3). In contrast to the JJAS mean, it was noticed that the model exhibited cold (warm) bias during 2002 (2011) over north India. In the monthly distribution, RegCM4 experienced warm bias in August and September during 2002 while in July, August and September during 2011. The simulated surface temperature was noticed to be closer to IMD data when initialized with ESA soil moisture data. It indicates that the model was responsive to the soil moisture initialization process and outperformed after being initialized with real-time soil moisture data by reducing existing cold bias in the default soil moisture combination. The present model bias (cold/warm) might be associated with the simulation of rainfall by the model. We have discussed it in later subsections, particularly the model inefficiency in predicting various epochs of rainfall (during initial months of ISM) which might have possible consequences of obtaining different surface temperature biases.
A discussion was further extended by analyzing the daily variation of surface temperature. Time series of surface temperature during 2002 and 2011 over AI and its five homogeneous regions are described in Figures 4 and 5, respectively. Irrespective of the regions, temporal variation of surface temperature was estimated slightly better by the model when initialized with the ESA soil moisture data except for some evident exceptions. As observed earlier, the model showed consistent cold bias throughout the season in both the years over SPI, NEI as well as AI level. Underestimation was higher over SPI and AI compared to NEI. However, the temporal variation was not well simulated by the model. The model's ability was further investigated through different temporal statistics. Temporal correlation and standard deviation over AI and five homogeneous regions during 2002 and 2011 are illustrated in Table 2. Except for SPI, the model showed a significant correlation with the configuration (default and ESA) in both years. Even though minor variations at the regional scale existed, RegCM4 exhibited slightly better ability at the AI level with ESA configuration. Interestingly, the correlation was consistently highest over NWI in both the configuration and the years, which indicated the daily variation of surface temperature was simulated relatively better there. The table also noticed that the spread of surface temperature in ESA simulation was slightly higher in both the years. Overall, it concluded that soil moisture initialization in RegCM4 has a significant impact in simulating surface temperature and subsequently spatiotemporal distribution of surface temperature in individual month and season are better predicted by the model when initialized with realistic soil moisture data from ESA-CCI, albeit having few lacunae.   A discussion was further extended by analyzing the daily variation of surface temperature. Time series of surface temperature during 2002 and 2011 over AI and its five homogeneous regions are described in Figures 4 and 5, respectively. Irrespective of the regions, temporal variation of surface temperature was estimated slightly better by the highest over NWI in both the configuration and the years, which indicated the daily variation of surface temperature was simulated relatively better there. The table also noticed that the spread of surface temperature in ESA simulation was slightly higher in both the years. Overall, it concluded that soil moisture initialization in RegCM4 has a significant impact in simulating surface temperature and subsequently spatiotemporal distribution of surface temperature in individual month and season are better predicted by the model when initialized with realistic soil moisture data from ESA-CCI, albeit having few lacunae.

Soil Moisture
The model simulated seasonally averaged soil moisture from the two combinations (default and ESA) compared for 2002 and 2011 and validated with that from ESA-CCI (Figures 6 and 7). It is worth mentioning that RegCM4 provides soil moisture output in two layers, viz., upper/surface layer (with depth 10 cm) and root zone layer (with depth 100 cm). In this study, only upper layer soil moisture from the simulation and observation were considered for the model validation although they differ marginally in depth. While analyzing JJAS mean, it was observed that RegCM4 simulated the soil moisture reasonably well using both the setups. In both simulation and observation, soil moisture was found to vary in the range of 0.1-0.4 m 3 m −3 over major parts of the Indian landmass.

Soil Moisture
The model simulated seasonally averaged soil moisture from the two combinations (default and ESA) compared for 2002 and 2011 and validated with that from ESA-CCI (Figures 6 and 7). It is worth mentioning that RegCM4 provides soil moisture output in two layers, viz., upper/surface layer (with depth 10 cm) and root zone layer (with depth 100 cm). In this study, only upper layer soil moisture from the simulation and observation were considered for the model validation although they differ marginally in depth. While analyzing JJAS mean, it was observed that RegCM4 simulated the soil moisture reasonably well using both the setups. In both simulation and observation, soil moisture was found to vary in the range of 0.1-0.4 m 3 m −3 over major parts of the Indian landmass. However, soil moisture was seen to be higher than that of ESA-CCI data in both combinations. Atmosphere 2021, 12, x FOR PEER REVIEW 13 of 28  The analysis was further extended by analyzing monthly soil moisture (June, July, August and September). As observed seasonally, simulated soil moisture was distinctly higher than that of ESA-CCI data on a monthly scale as well. It was noticed that ESA-CCI soil moisture was lower in June and it gradually improved in the months of July, August and September. The highest amount of soil moisture was noticed during August in both the years. Comparing both simulations, RegCM4 with default setup overestimated the soil moisture in each of the months. Consequently, soil moisture from ESA configuration was more realistic and hence closer to ESA-CCI. These differences were strongly visible from their differences indicated in the last column of Figures 6 and 7 which was noted to be highest in June and lowest in August. Hence, based on the above analysis, it is concluded that RegCM4 was extremely sensitive to the soil moisture initialization. Therefore,  . As described earlier, the top soil layer in the model was deeper than that of ESA-CCI. Therefore, higher soil moisture in the model simulation may be attributed to this disparity in soil depth. Interestingly, soil moisture from the ESA setup was found to be more realistic in terms of spatial distribution. Soil moisture with default setup was considerably higher than using ESA data in both years and it was very prominent over central India and adjoining regions. Hence, it can be concluded that RegCM4 was appropriate for the soil moisture initialization technique. Moreover, while initialized with ESA-CCI data, the model improved the soil moisture distribution by reducing the non-realistic bias from the default setup.
The analysis was further extended by analyzing monthly soil moisture (June, July, August and September). As observed seasonally, simulated soil moisture was distinctly higher than that of ESA-CCI data on a monthly scale as well. It was noticed that ESA-CCI soil moisture was lower in June and it gradually improved in the months of July, August and September. The highest amount of soil moisture was noticed during August in both the years. Comparing both simulations, RegCM4 with default setup overestimated the soil moisture in each of the months. Consequently, soil moisture from ESA configuration was more realistic and hence closer to ESA-CCI. These differences were strongly visible from their differences indicated in the last column of Figures 6 and 7 which was noted to be highest in June and lowest in August. Hence, based on the above analysis, it is concluded that RegCM4 was extremely sensitive to the soil moisture initialization. Therefore, RegCM4 using the ESA setup showed reasonable enhancement in soil moisture simulation.

Rainfall
In order to investigate the impact of soil moisture initialization on rainfall, model simulated rainfall was analyzed in different spatiotemporal scale. Daily, monthly and seasonal rainfalls from the model simulation were compared with IMD over AI and five homogeneous regions (mentioned earlier). The monthly and seasonal rainfall (mmday −1 ) distribution from the two model combinations during 2002 and 2011 is illustrated in Figures 8 and 9, respectively, in terms of bias and individual difference.  Day-to-day variation of rainfall is an important aspect of ISM which controls the overall performance of the model throughout the season. Hence, daily rainfall variations The model simulated monthly rainfall (June, July, August and September) during 2002; 2011 was also analyzed (Figures 8 and 9) as a part of the validation. During June, July and September of 2002, the model showed wet bias over the major part of the Indian landmass except for NEI and Gangetic West Bengal where dry bias was noticed. Both the biases were found highest in July. Similarly, dry bias regions remained similarly visible in 2011 while the coverage of wet bias regions were reduced with lower magnitude, indicating an improvement in model ability. Nevertheless, it is important to mention that RegCM4 with ESA setup reduced the rainfall bias.
Day-to-day variation of rainfall is an important aspect of ISM which controls the overall performance of the model throughout the season. Hence, daily rainfall variations from the model simulation in both the years were examined over the whole of India and the other five regions against IMD observation (Figures 10 and 11). The rainfall was significantly overestimated (underestimated) over the majority of Indian land throughout the season in 2002 (2011) by the model, except for a few extreme epochs. Moreover, the variation within the season was also not reasonably well simulated by the model. During 2002, it was observed that rainfalls were not initiated on the same dates over all the homogeneous domains and rather maintained intervals of a few days.
Atmosphere 2021, 12, x FOR PEER REVIEW 18 of 28 from the model simulation in both the years were examined over the whole of India and the other five regions against IMD observation (Figures 10 and 11). The rainfall was significantly overestimated (underestimated) over the majority of Indian land throughout the season in 2002 (2011) by the model, except for a few extreme epochs. Moreover, the variation within the season was also not reasonably well simulated by the model. During 2002, it was observed that rainfalls were not initiated on the same dates over all the homogeneous domains and rather maintained intervals of a few days.  The first rainfall peak was noticed over SPI followed by WCI, NWI, CNEI and NEI. While similarly compared with IMD over the corresponding regions, it was noticed to be earlier by a few days in the model simulation. This indicated that the model showed early onset over each of the regions compared to the IMD data. Similarly, RegCM4 exhibited delayed withdrawals from each of the regions during the end of the season. In contrast with 2002, moderately better performance was perceived during 2011. Even though the model showed a large amplitude of over and underestimation during the peak rainfall months of July and August, it followed the daily rainfall pattern of IMD.
Interestingly, the onset and withdrawal of 2011 were also reasonably well simulated by the model. It implies that the model exhibited better ability during a normal monsoon year (2011) as compared to an extreme year (2002). Temporal statistics (correlation and standard deviation) for the two years are provided in Table 3. It shows a 95% significant correlation during 2011 in both the simulation over major parts of India, and therefore, simulation using ESA soil moisture was slightly better in comparison to others. Contrarily, correlations were insignificant and negative over the whole of India in 2002, which indicated deviation in model ability. However, the standard deviation was significantly less than IMD, which inferred limited model performance regarding accurate prediction of magnitude. The first rainfall peak was noticed over SPI followed by WCI, NWI, CNEI and NEI. While similarly compared with IMD over the corresponding regions, it was noticed to be earlier by a few days in the model simulation. This indicated that the model showed early onset over each of the regions compared to the IMD data. Similarly, RegCM4 exhibited delayed withdrawals from each of the regions during the end of the season. In contrast with 2002, moderately better performance was perceived during 2011. Even though the model showed a large amplitude of over and underestimation during the peak rainfall months of July and August, it followed the daily rainfall pattern of IMD.
Interestingly, the onset and withdrawal of 2011 were also reasonably well simulated by the model. It implies that the model exhibited better ability during a normal monsoon year (2011) as compared to an extreme year (2002). Temporal statistics (correlation and standard deviation) for the two years are provided in Table 3. It shows a 95% significant correlation during 2011 in both the simulation over major parts of India, and therefore, simulation using ESA soil moisture was slightly better in comparison to others. Contrarily, correlations were insignificant and negative over the whole of India in 2002, which indicated deviation in model ability. However, the standard deviation was significantly less than IMD, which inferred limited model performance regarding accurate prediction of magnitude. To investigate the model ability during extreme monsoon years, differences of seasonal average  of the three parameters (rainfall, surface temperature and soil moisture) were analyzed ( Figure 12). It was noticed that soil moisture and rainfall were relatively higher during 2011 while the surface temperature was lower in 2011 (last row of Figure 12). Simulated results are depicted in the 1st and 2nd rows using the default and ESA setups. It was noticed that the spatial patterns in both the model combinations were not prominent while compared to observations. Moreover, simulated surface temperature over north India was slightly higher in 2011, contradicting observations. Even though RegCM4 with ESA setup improved the simulation by reducing bias, further enhancement is needed. There was hardly any difference between the two years with regard to soil moisture using both combinations. Surprisingly, the model showed mixed performance in simulating rainfall during the two years and therefore the results were not convincing. Hence, based on the above analysis, it is concluded that the contrasting monsoon features were not well captured by the model when compared to observations.

Quantitative Evaluation: Equitable Threat Score
In our study, ETS is computed over AI and five homogeneous regions (described earlier) for different rainfall categories (0-5, 5-10, 10-20 and 20-50 mmday -1 during 2002 and 2011 and are illustrated in Figures 13 and 14, respectively. Higher ETS in 2011 ( Figure 14) indicates that the precipitation events at an all-India level were better estimated by the model in 2011 compared to 2002 ( Figure 13). Magnitude of ETS was relatively higher in ESA simulation for all rainfall categories in 2011, indicating improvement in rainfall simulation using ESA soil moisture.
Although RegCM4, with default setup, showed similar ability in higher rainfall category (10-20 and 20-50 mmday -1 ), its efficiency deviated in low (0-5 mmday -1 ) or moderate (5-10 mmday -1 ) rainfall cases. At regional scale, highest ETS was noticed over NWI followed by CNEI, WCI, SPI and NEI. In 2011, the model with default configuration showed higher ETS in the low category rainfall over CNEI, WCI and SPI. Moderate rainfall was better estimated using the ESA setup. As observed earlier, ETS values were similar for both the setup in high rainfall cases indicating superior efficiency of the model in predicting high rainfall compared to other categories. During 2002, higher ETS was noticed in higher rainfall cases over NWI followed by AI but failed to estimate other categories. The model was unable to show any ability for other regions. It is noteworthy to mention that RegCM4 consistently showed better ability over NWI in estimating moderate/high rainfall events irrespective of the years. Performance of RegCM4 in 2011 (normal year) was better compared to 2002 (deficit year) and consequently exhibited superior ability in predicting all categories of rainfall while initialized with ESA soil moisture. Atmosphere 2021, 12, x FOR PEER REVIEW 21 of 28

Quantitative Evaluation: Equitable Threat Score
In our study, ETS is computed over AI and five homogeneous regions (described earlier) for different rainfall categories (0-5, 5-10, 10-20 and 20-50 mmday -1 during 2002 and 2011 and are illustrated in Figures 13 and 14, respectively. Higher ETS in 2011 (Figure 14) indicates that the precipitation events at an all-India level were better estimated by the model in 2011 compared to 2002 ( Figure 13). Magnitude of ETS was relatively higher in ESA simulation for all rainfall categories in 2011, indicating improvement in rainfall simulation using ESA soil moisture.  Although RegCM4, with default setup, showed similar ability in higher rainfall category (10-20 and 20-50 mmday -1 ), its efficiency deviated in low (0-5 mmday -1 ) or moderate (5-10 mmday -1 ) rainfall cases. At regional scale, highest ETS was noticed over NWI followed by CNEI, WCI, SPI and NEI. In 2011, the model with default configuration showed higher ETS in the low category rainfall over CNEI, WCI and SPI. Moderate rainfall was better estimated using the ESA setup. As observed earlier, ETS values were

Discussion
In this study, the impact of the soil moisture initialization technique in the model RegCM4 was investigated by incorporating high-resolution satellite-derived soil moisture data from ESA-CCI. In order to evaluate this aspect, seasonal simulations were conducted during two specific years, viz., 2002 (deficit monsoon year) and 2011 (normal monsoon year) with the default as well as modified soil moisture. A comprehensive evaluation was carried out based on the three essential parameters, viz., surface temperature, soil moisture and rainfall. These parameters were investigated regarding their distribution and accuracy at different temporal and spatial scales.
The surface temperature distribution clearly noticed that model ability was relatively better when initialized with soil moisture data from the ESA. The magnitude and distribution of the temperature were better predicted by the model although having warm and cold biases over various regions of the country. In comparison to the default configuration, RegCM4 reduced the surface temperature biases significantly in the ESA setup. Statistical values, such as correlation and standard deviation, are consistently better using ESA soil moisture data. Simulated soil moisture was higher in RegCM4 than ESA-CCI, but when initialized using ESA soil moisture, it lowered the magnitude of soil moisture and portrayed better performance. Rainfall validation demonstrated that the model showed superior ability when initialized with ESA soil moisture on a seasonal and monthly scale. However, the model could not accurately predict the temporal variation of daily rainfall. Studies on soil moisture initialization with RegCM over other regions across the globe also highlighted similar abilities. Over the European region, Patarcic and Brancovic [30] investigated the ability of RegCM3 and found reduction (enhancement) in systematic errors (deterministic ability) of RegCM3 when initialized with high-resolution soil moisture. Over Asia, Liu et al. [31] mentioned that RegCM4 with higher initial soil moisture reduced the surface temperature and consequently increased the rainfall, although the impact was greater in mid-latitude areas compared to the tropics. This study also highlighted that temperature (rainfall) response was stronger (weaker) over India. Hu et al. [38] indicated that description of soil moisture with RegCM2 affected the model bias over China. Similar studies with other models (e.g., Weather Research and Forecasting Model) also showed that the ability scores and frequency bias of rainfall and root mean square of temperature were improved when using soil moisture data from global forecast systems [72].
Although, RegCM4 with the ESA setup appeared to ameliorate the performance, improvement is still necessary. Careful examination proclaimed that the model performance deteriorated, particularly during the extreme monsoon year (2002), although it showed acceptable accuracy during the normal monsoon year (2011). Major association of the poor ability during 2002 was the inefficiency to pick up various epochs of ISM precisely and thereby showed early onset and delayed withdrawal. However, it was also recognized that simulated rainfall was surprisingly low during the peak monsoon months, viz., July and August during 2011 (normal). In addition, rainfall was extremely high in June and July during 2002 (deficit). This indicated that RegCM4 would not be able to capture the contrasting features of ISM accurately. In brief, soil moisture initialization can significantly improve the model's ability in simulating weather/climate features and hence should be paid more attention. Our overall analysis infers significant improvement in the model's ability in simulating surface temperature and rainfall distribution when using high-resolution ESA soil moisture data, albeit that temporal variation in lacunae noticed. ETS of rainfall was higher with the ESA setup.

Conclusions
This study provided a primary assessment of realistic soil moisture initialization through seasonal simulation of ISM using the regional model. In summary, we found RegCM4 was sensitive to soil moisture initialization and consequently imparts potential improvement in simulating surface temperature and rainfall when initialized with highresolution, satellite-derived soil moisture data. Although, the model showed reasonable ability in a normal year, it still had difficulty in simulating different epochs of monsoon, particularly in extreme years. Further investigation is therefore required to enhance the model's ability.

Limitation and Future Studies
The investigations presented here are preliminary ideas for similar modeling studies in the future. Thus, systematic investigation with the added number of extreme years may reproduce more robust results. In addition, it is also important to test the model's ability using soil moisture data from different sources.

Data Availability Statement:
The ERA-Interim reanalysis, sea-surface temperature, soil characteristics, soil moisture and other geophysical data used in this study are obtained from the http: //clima-dods.ictp.it/regcm4/ accessed on 20 September 2020). Rainfall and temperature analysis data used for validation are freely available from the India Meteorological Department.