Spatial Distribution of Soil Moisture in Mongolia Using SMAP and MODIS Satellite Data: A Time Series Model (2010–2025)

: Soil moisture is one of the essential variables of the water cycle, and plays a vital role in agriculture, water management, and land (drought) and vegetation cover change as well as climate change studies. The spatial distribution of soil moisture with high-resolution images in Mongolia has long been one of the essential issues in the remote sensing and agricultural community. In this research, we focused on the distribution of soil moisture and compared the monthly precipi-tation/temperature and crop yield from 2010 to 2020. In the present study, Soil Moisture Active Passive (SMAP) and Moderate Resolution Imaging Spectroradiometer (MODIS) data were used, including the MOD13A2 Normalized Difference Vegetation Index (NDVI), MOD11A2 Land Surface Temperature (LST), and precipitation/temperature monthly data from the Climate Research Unit (CRU) from 2010 to 2020 over Mongolia. Multiple linear regression methods have previously been used for soil moisture estimation, and in this study, the Autoregressive Integrated Moving Arima (ARIMA) model was used for soil moisture forecasting. The results show that the correlation was statistically signiﬁcant between SM-MOD and soil moisture content (SMC) from the meteorological stations at different depths ( p < 0.0001 at 0–20 cm and p < 0.005 at 0–50 cm). The correlation between SM-MOD and temperature, as represented by the correlation coefﬁcient ( r ), was 0.80 and considered statistically signiﬁcant ( p < 0.0001). However, when SM-MOD was compared with the crop yield for each year (2010–2019), the correlation coefﬁcient ( r ) was 0.84. The ARIMA (12, 1, 12) model was selected for the soil moisture time series analysis when predicting soil moisture from 2020 to 2025. The forecasting results are shown for the 95 percent conﬁdence interval. The soil moisture estimation approach and model in our study can serve as a valuable tool for conﬁdent and convenient observations of agricultural drought for decision-makers and farmers in Mongolia.


Introduction
Soil moisture (SM) plays an important role in the terrestrial water cycle and has been assessed in many field studies, e.g., in water management, agricultural irrigation management, crop production, vegetation cover, drought, and global climate change [1][2][3][4]. In addition, soil moisture indicates groundwater conditions and links the exchange of water and energy between the atmosphere and land surface. There are many ways to estimate soil moisture, including direct and indirect methods. The most accurate method is direct measurement in the field (gravimetric method) to estimate soil moisture by point measurement [5], but this is costly [6]. Therefore, remote sensing techniques have become popular for estimating soil moisture at a regional scale due to the sensing ability of the regional SM with low-resolution images. Microwave remote sensing methods have been used at the global and regional scale to establish models [7,8]. To date, some highly advanced SM products have been developed, e.g., soil moisture active passive (SMAP) from the National Aeronautics and Space Administration (NASA), soil moisture ocean salinity (SMOS), and climate change initiative (CCI) from European Space Agency (ESA). In optical and thermal remote sensing, many researchers have established methods based on the relationships between SM and soil reflection/soil temperature and vegetation cover [9][10][11]. Using a combined microwave and optical remote sensing data can give more precise information on soil moisture rather than estimation based only on data from one type of remote sensing.
Remote sensing technology is a powerful method for soil moisture monitoring at the regional level. Many studies have established that SMAP products generate accurate in situ measurements and can be used in various fields of study, such as agriculture, environmental monitoring, and hydrology [12]. They have also been intensively validated by several studies over the past few years [13][14][15]. For instance, Zeng et al. [16] approved a SMAP product for the preliminary evaluation of soil moisture compared to in situ measurements from the three networks that cover different climatic and land surface conditions; moreover, the results show that the SMAP product is in good agreement with in situ measurements. However, it has limited application to agricultural studies because SMAP products only provide 3, 9, or 36 km spatial resolution data at the global or regional scale. In this paper, we used SMAP products at a spatial resolution of 9 km for the development of a soil moisture model by combining SMAP and optical/thermal satellite images. The combination of optical/thermal and microwave remote sensing potentially expands the application possibilities [7].
In previous studies, various researchers have investigated and developed methods based on LST/NDVI, in view of vegetation types and topography and climate parameters, among other factors [9,[17][18][19][20]. These approaches have mainly used reflectance to estimate SM in visible/thermal infrared sensors. Chelsea et al. [21] used LST/NDVI data to obtain an enhanced spatial resolution of soil moisture from SMAP at 9 km down to 1 km. Natsagdorj et al. [5] developed a model for soil moisture using multiple regression analysis, and the model showed that the type of soil moisture index from the satellite measurements depends on the LST, NDVI, elevation, slope, and aspect. The results indicate a good correlation between the developed model and ground truth measurements in the subprovinces of Mongolia. The lack of field measurements for SM makes it challenging to validate remote sensing SM estimates in Mongolia because the territory is so widespread (1565 million km 2 ).
Due to the characteristics of the Mongolian climate, agricultural production is strongly limited by a short growing season (generally 80 to 100 days, but varies from 70 to 130 days depending on the altitude and location), low precipitation, and high evaporation [22]. Mongolian steppe ecosystems are crucial for relieving regional and even global climate variation through their interaction with the atmosphere [23]. Many studies have shown that in Mongolia, due to the harsh continental climate and the distance from the sea, the processes of soil drying, desertification, and degradation are intensifying due to the loss of vegetation and changes in soil moisture due to global warming. Therefore, to study the impacts of climate change, there is an urgent need to consider soil moisture as one of its indicators. Few studies of soil moisture have been conducted with point-scale measurements [24,25]. In Mongolia, SM distribution data with a higher resolution are needed for practical applications such as agricultural management, water management, and flood and drought monitoring. Therefore, time series analysis of long-term soil moisture was conducted using the autoregressive integrated moving average (ARIMA) model [26].
Few previous studies have examined soil moisture and river flow forecasting [27]; on the other hand, various reviews have addressed drought monitoring [28][29][30][31]. The SM forecast data support farmers in organizing their resources for crop production. The ARIMA model is commonly used in time series models. There are many methods and criteria for ranking and selecting the autoregressive (AR), moving average (MA), or ARIMA models for a given purpose. These models are suitable for limited data values and shortterm forecasting [27]. However, the main advantages of ARIMA model forecasting is that it only requires time series data [28]. In this study, we use the ARIMA model to investigate the time series analysis of soil moisture dynamics between 2010 and 2020 based on SMAP and MODIS satellite images. Remarkably, this research focused on that to have a higher spatial resolution (1 km) soil moisture map than SMAP (9 km) provide us then the SMAP data periods 2015-2020 was used in order to build a model. From the model, spatially distributed monthly soil moisture data will contribute return back into time (2010-2020), and it is towards the future by ARIMA model.
The main objectives of this research are to estimate a monthly soil moisture distribution map and to build appropriate models to forecast future trends. Because of the stochastic nature of monthly soil moisture, we used time series analysis for monthly soil moisture forecasting. A process is considered stationary if its statistical properties, such as the average and variance, do not change over time. The monthly soil moisture map was estimated from remote sensing data in Mongolia. The modeling and prediction of soil moisture were done through statistical methods based on ARIMA. In this paper, soil moisture modeling and forecasting was performed by means of the conventional method, the Box-Jenkins time series model. The monthly soil moisture distribution map has not yet been considered in previous studies in Mongolia and is expected to be useful for agriculture, hydrology, and climate science.

Study Area
Mongolia is a landlocked country situated in Central Asia, bordered by Russia and China and located between the latitudes of 41 • 35 N and 52 • 09 N and the longitudes of 87 • 44 E and 119 • 56 E, with a total area of 1565 million km 2 ( Figure 1). Mongolia is covered by 73% agricultural land, 0.5% villages, and other settlements, with 0.35% land representing roads and networks, 9.2% forest and forest resources, 0.4% water and water resources, and 16.1% land for special needs (protected, historical and monumental natural beauty) [32]. The Mongolian climate is highly continental, with arid and semi-arid conditions [33], and has four distinct seasons, high temperature fluctuations, and little precipitation. About 85% of the total precipitation falls from April to September, of which about 50-60% falls during July and August [34]. The mean annual temperature is −8 • C in the northern areas and 6 • C in southern regions [34]. The total soil moisture decreases, going from the north to the south of Mongolia [35] due to the different regions of Mongolian vegetation [36]. The annual precipitation is 300-400 mm in the taiga, high mountain, and forest steppe zones: 150-250 mm in the steppe, 100-150 mm in the desert steppe, and 50-100 mm in the desert (Gobi) region. Most of the cropland is located in the north of Mongolia, which encompasses the forest steppe and steppe zones.

SMAP
On 31 January 2015, NASA launched SMAP, which has an initial L-band with both radar and radiometer to assess soil moisture [37]. The daily coverage started on 31 March 2015 at a spatial resolution of 3-36 km. The average monthly SMAP data were obtained using daily SMAP with 9 km resolution. We downloaded daily SMAP L3 Radiometer Global Daily 9 km EASE-Grid Soil moisture data (SPL3SMP_E.003) through the Application for Extracting and Exploring Analysis Ready Samples (AppEEARS) between 2015 and 2020 [38]. AppEEARS is a useful tool for time series analysis in specific regions and at certain scales. It provides data by enabling users to download only the information (geospatial datasets using spatial, temporal, and band/layer parameters) needed from several federal archives (https://lpdaacsvc.cr.usgs.gov/appeears/). The downloaded images were preprocessed with the ENVI 5.3 and ArcGIS 10.3 software to obtain the monthly soil moisture data.

MODIS
We used MODIS products over a 10-year period (2010-2020) to observe the dynamic range of the NDVI and LST. Zhang et al. [39] used a similar approach to detect anomalies using MODIS land products via time series analysis. Accordingly, monthly composites of 1 km spatial resolution MOD13A3 [40] and MOD11A2 [41] data from MODIS and the National Aeronautics and Space Administration (NASA) Earth Observing system (https://lpdaac.usgs.gov/product_search/) data were used. The MODIS vegetation indices (MOD13A3) version 6 data are provided monthly at 1 km spatial resolution in the sinusoidal projection [40]. The MOD11A2 version 6 product provides an average eightday-per-pixel Land Surface Temperature and Emissivity (LST&E) with a 1 km spatial resolution [41]. We calculated eight-day LST as monthly averages using product version 6 (MOD11A1). The Application for Extracting and Exploring Analysis Ready Samples (AppEEARS) tool offers vegetation and LST products of MODIS for long-term data [42].

CRU Data and Meteorological Data
CRU TS (Climatic Research Unit gridded Time Series) data are broadly used in climate studies and are available at 0.5 × 0.5 degrees over the whole surface of the Earth. It provides a monthly land-based gridded high-resolution dataset from 1901. The CRU dataset is derived by the interpolation of monthly weather station observations of extensive networks. The database is updated annually [43]. The CRU TS global data freely downloadable and accessible online (https://crudata.uea.ac.uk/cru/data/). We applied CRU TS v4 to this research for available temperature and precipitation data between 2010 and 2019.
The meteorological station data were provided by the Information and Research Institute of Meteorology, Hydrology and Environment (IRIMHE) of Mongolia website (http://tsag-agaar.gov.mn/). There were limited stations for soil moisture measurement in croplands over Mongolia. Soil moisture content was acquired at depths of 0-20 cm and 0-50 cm at monthly intervals (the 7th, 17th, and 27th days of each month) from April to September due to seasonal conditions [44]. Soil moisture content was averaged over the monthly intervals from May to August (2015-2020). The selected meteorological stations are shown in Figure 1, and the locations of the stations have been categorized into two vegetation zones (Table 1).

Crop Yield Statistical Data
Soil moisture is one of the most important factors in the agricultural sectors of Mongolia. The National Statistical Organization (NSO) website (http://nso.mn/) of Mongolia provides information on crop yields every year in the subprovinces. The NSO has been accumulating data on croplands and harvests collected from agricultural enterprises and local farmers through the statistical departments and offices of each province. Crop yield information was applied for the validation of soil moisture distribution in Mongolia from 2010 to 2019. The total harvest includes the amount of potatoes, fodder crops, cereals, fruits, vegetables, etc. (in thousands of tons), and the crop yield information is averaged every year. Crop yield is shown as the amount of agricultural production per unit area (from 1 hectare). The crop yield per hectare is estimated as the ratio of total harvest to total sown area [45].

Methodology
In this study, the structure of the spatial distribution of the soil moisture and time series analysis based on SMAP and MODIS products is given in the flowchart in Figure 2. The first involved downloading the data and processing it, e.g., by the SMAP satellite, NDVI, and LST from the MODIS satellite. After data processing, a multiple linear regression model was developed. Prepared monthly soil moisture content from the station, temperature/precipitation of CRU data, and yearly crop yield information from the NSO were applied for validation of the estimated soil moisture over Mongolia. Finally, time series analysis and forecasting model were used for the prediction of estimated monthly soil moisture.

Multiple Linear Regression-SM-MOD
Multiple linear regression models have often been used in natural resource studies, and these involve calculating the dependent variable Y by applying a linear combination of independent variables X i . The linear regression form is shown in the following equation [46]: where β 0 , β m (m > 0) are constant terms corresponding to the connection, the regression coefficients, and the model error, X m,i , respectively. The input variables (X i ) describe the output variable (Y i ) according to the results of the multiple regression model. Therefore, each input variable has different information that means all input variables should not be collinear. We used p-value statistics to estimate the significance of each variable for input variable selection [46]. The variance inflation factor (VIF) is applied to detect collinearity (also called multicollinearity) among predictors in regression models [47,48]. The VIF values are between 1 and 10, which means that there is no multicollinearity for the regression model. After these analyses, a combination of p-value and VIF measures was used. Additionally, the independent variables are normally distributed for the assumptions of the linear regression model. Table 2 shows the statistical variables computed from satellite images.

ARIMA Model
The Box-Jenkins time series models are named after the statisticians George Box and Gwilym Jenkins [49]. These models generate forecast values based on the statistical parameters of observed time series data and are used in many fields. The Box-Jenkins Autoregressive Integrated Moving Average (ARIMA) model is a combination of the Autoregressive (AR), Integrated (I), and Moving Average (MA) terms. The Box-Jenkins model describes a wide class of models forecasting univariate time series that can be made stationary by applying transformations such as differencing of nonstationary series one or more times to achieve stationarity [50,51].
A seasonal ARIMA model is denoted by ARIMA (p, d, q), where p is the number of autoregressive terms, q is the number of moving average terms, and d represents the number of differences applied to the series [52].
The AR (p) model is defined as where ϕ 1 , ϕ 2 , . . . , ϕ n are the autoregressive coefficients, c is a constant, and u t is white noise. In the autoregressive model of order p, the value of the time series at t, X t depends upon its previous p-values and a random disturbance (the stochastic part). The MA (p) model is defined as where θ 1 , θ 2 , . . . , θ q are the moving average coefficients and {z t } is a white noise process with mean 0 and variance σ 2 . Combing autoregressive and moving average models, we get Equation (4): where x t denotes the dth difference of X t . This defines the autoregressive moving average (ARIMA) process of p and q order and difference d, or ARIMA (p, q) [26].
There are many methods and criteria for selecting the order of an AR, MA, or ARIMA model. One of them is based on the so-called information criteria and computes the values of Akaike's information criterion (AIC) and the Bayesian information criterion (BIC) or Schwarz criterion, with smaller values of AIC and BIC preferred [53,54]. The most commonly used approach for checking the model's adequacy is to examine the residuals by using the autocorrelation function (ACF) and partial autocorrelation function (PACF) graphs. If the selected model is appropriate, the residual graphs of both correlation functions should be white noise, indicating no remaining correlation.

Model Validation
The model was implemented to evaluate the next step. Pearson's correlation (r) [55] values were applied for the comparison for estimated soil moisture and observed soil moisture and crop yield values. The coefficients of the Pearson's correlation (r) are given in Equation (5): where X i and Y i are the individual derivations and measurements of variables X and Y, respectively. X and Y are the means of X and Y, respectively. The correlation coefficient (r) ranges between −1 and 1. If r is equal to zero, this means that there is no linear association between the variables. If r is equal to 1, there is a perfect positive linear relationship between the variables, and all individuals sampled would be exactly on the same straight line with a positive slope. If 0 < r < 1, this means a positive linear trend, but sampled individuals would be scattered around this common trend line; the smaller the absolute r value, the less well the data can be characterized by a single linear relationship. If r is positive and r values are close to 1, this describes a valuable relationship between variables [56]. Linear Pearson's correlation (r) was determined at a monthly timescale [44,55] for the satellite-derived and meteorological station/NSO data.

SM-MOD-Multiple Linear Regression Model
The linear regression model in Equation (1) was applied to estimate soil moisture in Mongolia, with 1 km resolution. The multicollinearity tests for all variables (NDVI and LST) were determined as in Table 3. The VIF values were lower than five, which shows that there was no multicollinearity for the regression model. Also, histogram normality test has been checked by the Jarque-Bera test for the linear regression model. In this test, if the probability of Jarque-Bera greater than 0.05% or 5%, then we accepted the null hypothesis, which means that the residuals were normally distributed. Table 3 summarizes the multiple linear regression model coefficients, p-values, standard error, t-statistics, VIF statistics. The monthly NDVI and LST explained 78% of the variation in soil moisture. The F-statistic was less than 0.05, which means that this model can be used for soil moisture analysis. In this research, we assume that SM is derived from the satellite and depends on the independent variables NDVI and LST, while SMAP is the dependent variable. From the assumption, a multiple regression model has been developed. Finally, the MLR model resulted in Equation (6): where SM MOD is the modeled soil moisture; constant coefficients were estimated from Table 3. Figure 3 shows graphs of the actual, fitted, and residual values of the linear regression. The figure suggests that in most of the studied months, the correlation between the real-life situation and the model was high. Soil moisture (SM-MOD) was calculated using Equation (6), with values in m 3 /m 3 . The lowest value of 0 indicates dry areas and the highest value of 0.35 m 3 /m 3 indicates wet areas. Figure 4 represents the spatial distribution of monthly soil moisture over Mongolia. Monthly SM maps were averaged by month between 2010 and 2020. During the years 2010-2020, the winter had the lowest soil moisture (November, December, and January), and spring also had low soil moisture (February, March, and April). The summer showed high soil moisture in May, June, and July. However, the autumn showed the highest soil moisture in August, September, and October ( Figure 4). Apparently, increased soil moisture is observed in the northern part, which is taiga, forest steppe, and steppe zones, while low soil moisture is restricted to the southern part of Mongolia, which is mostly desert steppe and desert vegetation.

Comparison between MLR Model and SMC from the Meteorological Station
In general, the estimation of soil moisture from the model was reasonably accurate, as confirmed by applying satellite images. Figure 5a-d describe the correlation between SMAP and SM-MOD with the SMC from the meteorological station at 0-20 and 0-50 cm depths. Table 4 shows the correlations of the SMAP and SM-MOD with the SMC from the meteorological stations at different depths. The correlation coefficients (r) between SMAP and SMC from meteorological stations were 0.279 (Figure 5a) and 0.181 (Figure 5b) at 0-20 and 0-50 cm depths, respectively. This was statistically significant, with root mean square error (RMSE) values of 0.094 m 3 /m 3 and 0.098 m 3 /m 3 , as shown in Table 4a. Table 4b indicates that the values of correlation coefficients (r) were 0.191 between SM-MOD and SMC at 0-20 cm depth from the meteorological station, which was statistically significant, with an RMSE of 0.090 m 3 /m 3 (p < 0.0001; Figure 5c), and 0.126 between SM-MOD and SMC at 0-50 cm depth from the meteorological station, which was also statistically significant, with an RMSE of 0.091 m 3 /m 3 (p < 0.005; Figure 5d). The confidence intervals of 95% were between SM-MOD and SMC at 0-20 cm depth from the meteorological station from 0.104 to 0.276, and with SMC at 0-50 cm from the meteorological station from 0.038 to 0.213.

Comparison between SM-MOD and CRU Data
We also examined the trends of monthly precipitation and temperature from the CRU data. Figure 6 displays the time series of monthly precipitation, temperature, and SM-MOD from 2010 to 2020 in Mongolia. The highest precipitation was observed in July 2018, and the highest SM-MOD value was in August 2018. From the comparison, when the precipitation was high, then the following month's soil moisture was high, which means that in Mongolia, the soil moisture directly depends on precipitation.   Table 5 shows the correlations of the SM-MOD with the temperature and precipitation from the CRU data over Mongolia. It indicates that the values of correlation coefficients (r) were 0.802 between SM-MOD and temperature, which was statistically significant (p < 0.0001; Figure 7b), and 0.826 between SM-MOD and precipitation, which was also statistically significant (p < 0.0001; Figure 7c). The confidence intervals of 95% were between SM-MOD from the model and monthly temperature from 0.728 to 0.858, and with monthly precipitation from 0.759 to 0.876 over Mongolia.

Comparison between SM-MOD and Crop Yield
We considered the crop yield information for every year to correlate with SM-MOD from the model. The National Statistical Organization (NSO) provides every province's crop yield information since 2010. Figure 8   To apply the model to the use, we examined the relationship between SM-MOD and crop yield over Mongolia. The results show that there is a significant trend in the SM-MOD. It was statistically significantly (p < 0.003) correlated with the crop yield, with r = 0.835 (Table 6).

ARIMA Model of Soil Moisture
We selected the most appropriate model for the time series from the possible models. We selected our model using these criteria: first, the most significant coefficients; second, the lowest volatility; third, the highest adjusted R-squared; and last, the lowest Akaike's information criterion (AIC)/Schwarz information criterion (SIC). Since the theory behind ARMA estimation is based on a stationary time series, we first used the transformation based on a logarithm and considered the first difference of the soil moisture time series. According to the unit root test, the differenced series is a stationary series, so we applied the plots of ACF and PACF to identify the structure of the model. The plots and statistical tests showed that the ARIMA (12, 1, 12) model was suitable for the log time series of soil moisture. The estimation results and the actual, fitted residual graphs of the model are shown in Table 7 and Figure 9, respectively. The residual diagnostics tests suggest that the estimation residuals are white noise. The final results indicate the good performance of the model, and we can say that about 82% of the variability in the soil moisture was predicted by the selected model (Table 7).  The selected model is written as follows: d(X t ) = 0.0006 + 0.9993X t−12 + 0.0006X t−1 − 0.95z t−12 (7) or, equivalently, as where X t is the log values of soil moisture at time t and z t is the error term at time t.
The above model shows that the soil moisture at time t depends on the value of soil moisture of previous months and also on the error terms of 12 months ago.
Partial autocorrelation (PAC) measures the correlation between observations that are p periods apart after controlling for correlations at intermediate lags (i.e., lags less than p). The correlogram of the residuals is flat, which indicates that all information has been captured ( Figure 9). A flat correlogram of the residuals is ideal. Therefore, the forecast will be based on this model.
One of the main purposes of the ARMA and ARIMA models is to provide short-term forecasts. Hence, we have predicted the values of soil moisture from 2020 to 2025 using the selected model. Figure 10 shows the forecasting results for soil moisture from 2020 to 2025 with a 95% confidence interval, which is the range within which the actual dependent value should fall a given percentage of the time (the level of confidence). For forecasting, the root mean squared error was 0.002, and the bias proportion was 0.044. Figure 11 compares the actual soil moisture and soil moisture forecasting (in m 3 /m 3 ) from January to May 2020. The prediction from February and April is almost the same as the actual soil moisture and there were slight deviations when predicting March and May. Overall, the model demonstrated accurate forecasting of soil moisture.  The essence of an appropriate ARIMA model is to forecast the future trends of series. Hence, we used the past information of the soil moisture series itself. The forecast was based on the final selected model, ARIMA (12,1,12).
The forecasting values of soil moisture are given in Figure 12, with the kernel density of the values on the y axis.

Discussion
In the present study, we used a linear regression model to estimte the spatial distribution of soil moisture over Mongolia by considering satellite images (SMAP and MODIS). We estimated the monthly (January-December) soil moisture during the period 2010-2020 in Mongolia. The SM model performance was validated by comparison with SMC from the agricultural meteorological stations, with data on precipitation, temperature, crop yield, etc. The correlation has shown that the model (SM-MOD) gives accurate information on the soil moisture for each month. Moreover, the present model has the advantage of recognizing soil moisture spatial distribution with a high spatial resolution (1 km); this is the first time such information has been gathered for Mongolia. Therefore, we established the ARIMA model for soil moisture forecasting based on estimated soil moisture between 2010 and 2020. The results provide the monthly spatial distribution of soil moisture, which is valuable data for use in numerous contexts, including agricultural management, drought monitoring, assessment of climate change, flooding, determining pasture and land degradation. Land degradation in central Mongolia is mostly caused by overgrazing; however, changes in summertime precipitation have also occurred [57]. Mongolian grassland is decreasing, and drought is increasing [58]. Our research on time series analysis for monthly SM-MOD forecasting is vital for the monitoring of land degradation and drought.
The results of correlation coefficients are low because of limited data availability at the agricultural meteorological station. However, the correlation was statistically significant at p < 0.0001 (0-20 cm) and p < 0.005 (0-50 cm), respectively, between SM-MOD and SMC from the meteorological stations at different depths. From Figure 6, we see that the previous month's precipitation directly impacted the soil moisture during the growing season (June-September). The correlation between SM-MOD and temperature had correlation coefficients (r) of 0.80 (statistically significant at p < 0.0001) and 0.83 (statistically significant at p < 0.0001). However, SM-MOD compared with the crop yield for each year (2010-2019) had a correlation coefficient (r) of 0.84. Therefore, the time series analysis for monthly soil moisture forecasting was developed over Mongolia based on the established ARIMA model. From the study, we selected the ARIMA (12, 1, 12) model, which was most suitable for the SM-MOD time series, for prediction, in which the values of soil moisture (SM-MOD) were predicted from 2020 to 2025 using the selected model. Forecasting results are shown with a 95% confidence interval. Time series SM-MOD data will provide valuable information for decision-makers and researchers. SM-MOD time series and forecasting data are good sources of data for longterm agricultural management, planning, and climate change and drought monitoring.
In terms of applications, this multiple linear regression model is a practical tool for the reliable and timely monitoring of droughts; thus, the advantage of this research lies in providing valuable information for decision-makers and farmers. In further studies, we will investigate seasonal soil moisture in different vegetation zones using this method along with field measurements.

Conclusions
Soil moisture is an important factor for the agricultural land in Mongolia. The model used in this paper is suitable for use in agricultural areas and has useful applications for agricultural management (irrigation, pasture, and hayfield yield) and drought monitoring in Mongolia. Time series analysis is one of the main tools for analyzing and predicting future trends of soil moisture. Most previous studies have examined soil moisture by comparing it with climate factors that were analyzed based on correlation analysis and multilinear regression [7,59,60]. The LST/NDVI combination method may prove to be a robust method for estimating SM; this combination is easy to operate and has a strong physical basis [7].
In general, the model's performance in determining soil moisture was practically assessed using satellite images. This study took Mongolia as the study area, divided into six vegetation zones. The linear regression method was applied in soil moisture estimation using SMAP and MODIS satellite images. From the model, the spatial distribution of soil moisture was developed monthly from 2010 to 2020. The soil moisture was high in the north, while low soil moisture was observed in southern Mongolia, especially during the warm season. Then output maps were compared with the soil moisture content from the agricultural meteorological stations and precipitation/temperature from the CRU data. The results show that the estimated soil moisture was statistically significantly correlated with the actual soil moisture content reported by the station. Moreover, the estimated soil moisture (SM-MOD), when compared with the crop yield, showed a high correlation, though there is a need for more accurate, detailed ground-measured data. Finally, we performed a time series analysis of soil moisture from 2010 to 2020 and predicted soil moisture until 2025 in this study area. Overall, the developed SM model and time series method can both be used to investigate the changes in soil moisture in Mongolia, so it is reasonable to use them in agriculture, hydrology, and climate science. However, this linear regression model should be elaborated to suit each vegetation zone or eco-climate regions in the applied study area.