Landsat Satellite Image-Derived Area Evolution and the Driving Factors Affecting Hulun Lake from 1986 to 2020

: The area ﬂuctuation of lakes directly affects the stability of the surrounding ecological environment. Research on the area evolution of lakes and the driving factors affecting it plays an important role in sustainable water resource management. In this study, Hulun Lake, located in the Hulunbuir grassland, was taken as the research object. Based on remote sensing images of the Hulun Lake area from 1986 to 2020, MNDWI interpretation was used to obtain the change law of lake surface area over a long time frame. Combined with natural factors and anthropogenic factors, Pearson correlation analysis and principal component analysis were used to analyze the driving force. The results showed that (1) in the past 35 years, the water surface area of Hulun Lake has decreased signiﬁcantly. The dynamic change in water area could be divided into four stages. The areas with dramatic changes in water area are distributed mainly in the northeast and south of Hulun Lake. (2) In terms of natural factors, the meteorological factors based on evaporation and relative humidity, the runoff of rivers entering the lake, and the vegetation with medium-high coverage and medium-low coverage had signiﬁcant effects. In terms of anthropogenic factors, the population had the most signiﬁcant impact. The artiﬁcial water diversion project had different degrees of inﬂuence on the response of the Hulun Lake area change to natural factors. (3) Anthropogenic factors were the main driving force causing the rapid change in the Hulun Lake area from 2000 to 2016, explaining 48% of the change in the Hulun Lake area. These research results can provide a scientiﬁc basis for the development and utilization of water resources and sustainable development in the Hulun Lake area.


Introduction
Lakes are sensitive indicators of climate and environmental change.They are the connecting point of the interaction between the layers of the Earth's surface system and an important part of the terrestrial hydrosphere [1].Lakes play an important role in regulating water storage, degrading pollutants, and protecting biodiversity.However, in recent years, due to global climate change and the intensification of human activities, the balance of lake ecosystems has been broken, and changes in their state have seriously affected local ecological balance and security.Water surface area is an important characteristic parameter of lakes.A reduction in water surface area leads to the shrinkage of wetlands around the lake, destroys fish and bird habitats, and threatens the ecological security of lakes [2,3].Studies have shown that lakes in arid and semiarid regions of China are facing serious shrinkage problems under the dual interference of climate change and human activities [4,5].According to statistics, the total area of lakes in Inner Mongolia decreased from 4160 km 2 in 1987 to 2900 km 2 in 2010, a decrease of 30.3%.The number of lakes decreased from 427 to 282, and 145 lakes disappeared completely [6].The loss of lake water resources has annual precipitation was 263 mm.The precipitation is mainly concentrated from June to September.The rivers connected to Hulun Lake include the Kerulen River, the Wuerxun River, and the Hailar River.The first two rivers are the main natural recharge sources.
rectangular.According to the CLCD land use type data in 2021, 87.9% of the area is grassland.The climate is a semiarid continental monsoon climate.It has the characteristics of severe cold and long winters, dry winds in spring, sudden temperature drops, and early frost in autumn.The average annual temperature from 1986 to 2019 was 0.85 °C, and the average annual precipitation was 263 mm.The precipitation is mainly concentrated from June to September.The rivers connected to Hulun Lake include the Kerulen River, the Wuerxun River, and the Hailar River.The first two rivers are the main natural recharge sources.

Data Source and Processing
The remote sensing image data are Landsat series satellite images (TM, ETM+, OLI), and the spatial resolution of the images is 30 m.The images came from the USGS (United States Geological Survey, https://earthexplorer.usgs.gov,accessed on 8 December 2022) and were from 1986 to 2020.It is worth noting that in our study, the remote sensing images used to extract the lake surface were acquired from September to October each year.Meanwhile, the images used to calculate the vegetation coverage were acquired from May to September each year.This is important to ensure that the data used for the lake surface and vegetation coverage calculations are representative of the seasonal changes in the area.The quality of the remote sensing images selected in this study was good, and the cloud cover was less than 10%.To eliminate the influence of sensor factors on the spectral information of ground objects, ENVI 5.3 software was used to preprocess the Landsat series satellite remote sensing data.The preprocessing included cloud removal, orthorectification, radiometric calibration, atmospheric correction, and other steps.Landsat 7 ETM+ data were processed using strip restoration.
The water body data came from the JRC (Joint Research Centre of European Commission) global surface water data provided by Google Earth Engine.JRC is a global surface water product with 30 m resolution produced using all corrected Landsat 5, 7, and 8 archived data and other auxiliary data since 1984.The overall omission error is less than 5%, and the commission error is less than 1%, which has high accuracy on a global scale

Data Source and Processing
The remote sensing image data are Landsat series satellite images (TM, ETM+, OLI), and the spatial resolution of the images is 30 m.The images came from the USGS (United States Geological Survey, https://earthexplorer.usgs.gov,accessed on 8 December 2022) and were from 1986 to 2020.It is worth noting that in our study, the remote sensing images used to extract the lake surface were acquired from September to October each year.Meanwhile, the images used to calculate the vegetation coverage were acquired from May to September each year.This is important to ensure that the data used for the lake surface and vegetation coverage calculations are representative of the seasonal changes in the area.The quality of the remote sensing images selected in this study was good, and the cloud cover was less than 10%.To eliminate the influence of sensor factors on the spectral information of ground objects, ENVI 5.3 software was used to preprocess the Landsat series satellite remote sensing data.The preprocessing included cloud removal, orthorectification, radiometric calibration, atmospheric correction, and other steps.Landsat 7 ETM+ data were processed using strip restoration.
The water body data came from the JRC (Joint Research Centre of European Commission) global surface water data provided by Google Earth Engine.JRC is a global surface water product with 30 m resolution produced using all corrected Landsat 5, 7, and 8 archived data and other auxiliary data since 1984.The overall omission error is less than 5%, and the commission error is less than 1%, which has high accuracy on a global scale [19].In this study, JRC data every 10 years from 1990 to 2020 were selected as the validation dataset for the comparative verification of the lake area extraction results.
The meteorological data mainly came from the daily average temperature, precipitation, average relative humidity, and evaporation of three meteorological stations around Hulun Lake (Manzhouli, New Barag Right Banner, and New Barag Left Banner) from 1986 to 2019 and were provided by the China Meteorological Administration.The vapor pressure data came from Climatic Research Unit (CRU), which provides a long-timescale gridded dataset on a global scale [20].The runoff data came from the Kunduleng and Alatan Frontal Moller hydrological stations from 2000 to 2020.The socioeconomic data came from the population, GDP, industrial output value, and agricultural output value of Manzhouli city, New Barag Right Banner, and New Barag Left Banner from 2000 to 2016.

Water Area Extraction
The methods for extracting lake water information mainly include the single-band threshold method, spectral relationship method, exponential model method, supervised classification method, and decision tree method.Among them, the water index method combined with threshold segmentation to extract water bodies is the fastest and most reliable [21].Based on the idea of the normalized difference vegetation index (NDVI), McFeeters [22] constructed the normalized difference water index (NDWI) by using the green band and near-infrared band of the TM image.Xu Hanqiu [23] proposed a modified normalized difference water index (MNDWI) based on the analysis of the NDWI, and it replaced the original near-infrared band with the mid-infrared band and improved the water extraction effect.The studies by Dong et al. [24] and Xu et al. [25] also verified the reliability of the MNDWI method.Moreover, studies [26] have shown that for eutrophic lakes, the MNDWI can better bypass the effects of suspended solids and algae and is better for land-water separation.Therefore, we used the modified normalized difference water index (MNDWI) to extract the water area of Hulun Lake.The calculation formula is as follows: where ρG is the wavelength of the green band, and ρMIR is the mid-infrared wavelength.
Numerous studies [27][28][29] have shown that lakes in arid and semi-arid regions have seasonal differences and are affected by winter snow cover.Therefore, the months obtained by remote sensing images in this study were mainly concentrated in September-October, during which the lake area reached the stable maximum value of the year.When extracting the lake surface, the average value of available images from September to October was calculated as the lake area of the year.The extraction range included Hulun Lake and its upper right affiliated small lake (Xin Dalai Lake).

Precision Validation
To verify the accuracy of water extraction, the image taken by Maxar Technologies on Google Earth on 2 October 2012 was used as the reference data.The lake water boundary was obtained by visual interpretation, and 400 sample points were selected using random sampling method, including 200 water samples and 200 non-water samples.According to the water extraction results, the confusion matrix was generated using producer accuracy (PA), user accuracy (UA), overall accuracy (OA), and kappa coefficient (Kc) as the bases for the accurate evaluation of the Hulun Lake water body.PA represents the consistency between the reference data and the classified pixels.UA is the evaluation index of the degree of coincidence between the classified pixels and the reference data.OA is the percentage of correctly classified pixels in the total classified pixels, and this index measures the overall performance of the water body recognition algorithm [30].The calculation formulas of the confusion matrix method are as follows: where S is the sum of correctly classified pixels, N is the sum of verified pixels, r is the number of rows, S ij is the observed value of row j in column i, S i is the marginal total number of row i, and S j is the marginal total number of column j.

Pixel Dichotomy Model and Vegetation Coverage Classification
As the most widely used linear mixed pixel decomposition model, the pixel dichotomy model [31] has the advantages of a simple and reliable calculation model, universal data parameters, and high inversion accuracy.In this study, the pixel dichotomy model was used to invert the vegetation coverage in the study area.The calculation method is as follows: where FVC is the vegetation coverage; NDV I is the normalized difference vegetation index; NDV I soil is the NDVI value of completely bare soil or no vegetation coverage area; and NDV I veg is the NDVI value of a pixel completely covered by vegetation, that is, the NDVI value of a pure vegetation pixel.According to the research of many scholars, we used 5% and 95% confidence to intercept the upper and lower thresholds of the NDVI on the cumulative frequency histogram to determine the NDVI soil and NDVI veg , respectively.We selected May to September as the vegetation growth season in the Hulun Lake area and selected a total of 35 remote sensing images from 1986 to 2020 to calculate and analyze the vegetation coverage.We referred to the estimation method proposed by Li [31] and Gutman et al. [32] to classify the vegetation coverage, as shown in Table 1.In this study, the nonparametric rank Mann-Kendall (M-K) test was used to analyze the trend of data change.The nonparametric test does not make any assumptions about the distribution of data, which helps to detect monotonic trends, and is widely used in trend analysis.The M-K test further analyzes the trend changes of the sequence in a certain period by analyzing the statistical sequence UF and BF and determines whether the sequence has a mutation, as well as the time and region of the mutation.The calculation formula is as follows [33]: where x i is a variable that is an independent variable and a random variable with the same distribution, and n is the sample size.The expected value E(S i ) and variance Var(S i ) can be estimated as follows: where UF i is a standard normal distribution.At the significance level α (generally α = 0.05, Uα = ± 1.96), |UF i | > |Uα| indicates that the sequence has a significant increasing or decreasing trend.A positive UF i value indicates an increasing trend, and a negative UF i value indicates a decreasing trend.All UF i form UF curves, and the reliability test can be used to determine whether there is a significant change trend.The above method is applied to the inverse sequence, and the above calculation process is repeated.The calculated value is multiplied by −1 to obtain UB i .UB i is represented as UB in the figure.If the intersection of UF and UB is between the reliability line, this point may be the beginning of the mutation point.

Pearson Correlation Coefficient and Principal Component Analysis
In this study, the Pearson correlation coefficient method was used to analyze the correlation between the water area of Hulun Lake and natural factors (precipitation, temperature, relative humidity, evaporation, vapor pressure, wet day frequency, surface runoff, vegetation coverage) and anthropogenic factors (population, GDP, industrial output value, agricultural output value).On this basis, principal component analysis (PCA) was used to extract the principal components of the influencing factors, and the shrinking process of Hulun Lake and its driving forces were analyzed. (

1) Pearson Correlation Coefficient
Pearson correlation analysis is used to study the relationship between quantitative data, including whether there is a relationship and the degree of the relationship.The correlation coefficient is calculated as follows: In the formula, X(x 1 , x 2 , . . ., x n ) and Y(y 1 , y 2 , . . ., y n ) are two continuous correlation variable sequences; r is the correlation coefficient, which is between −1.0 and 1.0.The greater the absolute value of r, the higher the correlation is between the dependent variable and the independent variable: 0-0.09 is no correlation, 0.1-0.3 is a weak correlation, 0.3-0.5 is a moderate correlation, and 0.5-1.0 is a strong correlation.
(2) Principal Component Analysis (PCA) Principal component analysis adopts a mathematical dimension reduction method to find several comprehensive variables to replace the original numerous variables so that these comprehensive variables represent the original variable information as much as possible and are not related to each other.For sample data, the data matrix of the sample is [34]: where p is the number of observed variables; n is the total number of samples; and x y is the jth observed value of the ith sample.
Based on the idea of dimension reduction, the original variables x 1 , x 2 , . . ., x p with certain correlations were recombined into a new set of mutually independent comprehensive variables Z 1 , Z 2 , . . ., Z m (m p) through linear combination and screening, then: where a y denotes the coefficients of the original variables x 1 , x 2 , . . ., x p on the principal components Z 1 , Z 2 , . .., Z m (m p), which is the unit eigenvector corresponding to the larger m eigenvalues in the correlation coefficient matrix of x 1 , x 2 , . . ., x p .According to the calculated principal component, its contribution rate is as follows: where r represents the contribution rate of each principal component; p i denotes the variance of the ith principal component.
The number of principal components is mainly determined by the cumulative contribution rate of the principal components.The greater the contribution rate is, the stronger the information of the original variables contained in the principal component.The cumulative contribution rate is generally required to be greater than 70% to ensure that the comprehensive variable can include the vast majority of the original variable information.

Hulun Lake Area Extraction Accuracy Verification
In this study, the accuracy of the Hulun Lake water area extracted based on the MNDWI was high.The overall accuracy of the 400 sample points reached 95.25%, and the kappa coefficient was 0.905 (Table 2a).Compared with similar studies, based on Landsat 8 images, Wang et al. [26] used the MNDWI method to obtain a 95.17% overall accuracy and a 0.903 kappa coefficient in the extraction of Taihu Lake with a high cyanobacteria content.Based on the Resourcesat-2 AWiFS satellite image and the MNDWI, Singh et al. [35] obtained an overall accuracy of 96.9% and a kappa coefficient of 0.89 in water extraction in India.The results showed that the extraction of surface water based on the MNDWI was reliable and accurate.
The extraction results were compared with the JRC global water body data.As shown in Figure 2, the change trend of the lake area extracted by the two was the same as the time trend.The water area of Hulun Lake extracted in this study was generally smaller than that of the JRC global water body data.The average relative error of the area extracted by the two was 3.46%.The confusion matrix of the water body extracted by JRC (Table 2b) showed that the overall accuracy was 79.75%, and the kappa coefficient was 0.595.The extraction accuracy of this product in Hulun Lake was lower than that of this study.The extraction results were compared with the JRC global water body data.As s in Figure 2, the change trend of the lake area extracted by the two was the same time trend.The water area of Hulun Lake extracted in this study was generally s than that of the JRC global water body data.The average relative error of the area ext by the two was 3.46%.The confusion matrix of the water body extracted by JRC (Tab showed that the overall accuracy was 79.75%, and the kappa coefficient was 0.59 extraction accuracy of this product in Hulun Lake was lower than that of this study

Evolution Characteristics of the Water Area of Hulun Lake
According to the results of this study, the water area of Hulun Lake varied 1748.81 km 2 to 2314.96 km 2 from 1986 to 2020, with the smallest value recorded in and the largest value recorded in 1991 (Figure 3a).The results of the M-K tren showed that the water surface area of Hulun Lake has decreased significantly in th 35 years (Z = − 3.238, p < 0.01).From 1986 to 2020, the dynamic change in the water a Hulun Lake could be divided into four stages: 1986-1991, the stage with an area inc and the area increase rate of Hulun Lake was 38.6 km 2 /a.From 1991 to 2000, the a Hulun Lake remained at approximately 2300 km 2 , and the maximum difference wa km 2 .From 2000 to 2012, the area of Hulun Lake decreased by 560.82 km 2 , and th reduction rate was 46.7 km 2 /a.From 2012 to 2020, which was the recovery phase, th of Hulun Lake increased significantly and stabilized at 2060 km 2 , reaching the le 2004.
Based on the results of lake area extraction, this study used the water level water volume curve established by Li et al. [36] in Hulun Lake to estimate the wate and water volume of Hulun Lake.The results (Figure 3b) showed that the trend water level and the water volume in Hulun Lake was basically consistent with the area.This is consistent with the results of Guo et al. [16] and Sun et al. [17].In the p years, the water level of Hulun Lake changed between 545.2 m and 541.1 m, and the volume changed between 13.66 billion m 3 and 5.10 billion m 3 .

Evolution Characteristics of the Water Area of Hulun Lake
According to the results of this study, the water area of Hulun Lake varied from 1748.81 km 2 to 2314.96 km 2 from 1986 to 2020, with the smallest value recorded in 2012 and the largest value recorded in 1991 (Figure 3a).The results of the M-K trend test showed that the water surface area of Hulun Lake has decreased significantly in the past 35 years (Z = − 3.238, p < 0.01).From 1986 to 2020, the dynamic change in the water area of Hulun Lake could be divided into four stages: 1986-1991, the stage with an area increase, and the area increase rate of Hulun Lake was 38.6 km 2 /a.From 1991 to 2000, the area of Hulun Lake remained at approximately 2300 km 2 , and the maximum difference was 43.4 km 2 .From 2000 to 2012, the area of Hulun Lake decreased by 560.82 km 2 , and the area reduction rate was 46.7 km 2 /a.From 2012 to 2020, which was the recovery phase, the area of Hulun Lake increased significantly and stabilized at 2060 km 2 , reaching the level of 2004.
Based on the results of lake area extraction, this study used the water level-area-water volume curve established by Li et al. [36] in Hulun Lake to estimate the water level and water volume of Hulun Lake.The results (Figure 3b) showed that the trend of the water level and the water volume in Hulun Lake was basically consistent with the water area.This is consistent with the results of Guo et al. [16] and Sun et al. [17].In the past 35 years, the water level of Hulun Lake changed between 545.2 m and 541.1 m, and the water volume changed between 13.66 billion m 3 and 5.10 billion m 3 .The distribution map of the water area of Hulun Lake in the representative years of 1986-2020 was as follows (Figure 4).The areas with more dramatic changes were distributed mainly in the northeastern and southern waters of Hulun Lake.The small lake in the northeast has not recovered since it disappeared in 2005.The southern waters of Hulun Lake showed a decreasing trend before 2012, with the most serious shrinkage in 2012, and then the lake began to recover gradually.The distribution map of the water area of Hulun Lake in the representative years of 1986-2020 was as follows (Figure 4).The areas with more dramatic changes were distributed mainly in the northeastern and southern waters of Hulun Lake.The small lake in the northeast has not recovered since it disappeared in 2005.The southern waters of Hulun Lake showed a decreasing trend before 2012, with the most serious shrinkage in 2012, and then the lake began to recover gradually.

Meteorological Factors
The annual temperature, precipitation, evaporation, and relative humidity in the Hulun Lake area were represented by the average values of meteorological elements from the three meteorological stations.The annual precipitation, average temperature, relative humidity, evaporation, vapor pressure, and wet day frequency in the Hulun Lake area from 1986 to 2019 are shown in Figure 5.The evaporation in the non-freezing period (May-September) was converted according to the evaporation conversion coefficient [37].The frequency of wet days was calculated according to the number of days with precipitation ≥ 0.1 mm.The vapor pressure data were extracted according to the mask of the study area.The changes in the main meteorological factors showed that the annual precipitation and vapor pressure in the region showed a nonsignificant downward trend, the annual average temperature, annual evaporation, and wet day frequency showed a nonsignificant upward trend, and the average relative humidity showed a significant downward trend (Z = −3.469,p < 0.01).The meteorological data in the last 34 years showed that the Hulun Lake area tended to be warm and dry.To reveal the possible impact of the artificial water diversion project on the response of the Hulun Lake water surface area to climate change, the correlations between the Hulun Lake water surface area and meteorological factors in 1986-2019, 1986-2008 before water transfer, and 2009-2019 after water transfer were analyzed.
Pearson correlation analysis showed that the water surface area of Hulun Lake was To reveal the possible impact of the artificial water diversion project on the response of the Hulun Lake water surface area to climate change, the correlations between the Hulun Lake water surface area and meteorological factors in 1986-2019, 1986-2008 before water transfer, and 2009-2019 after water transfer were analyzed.
Pearson correlation analysis showed that the water surface area of Hulun Lake was significantly positively correlated with the average relative humidity (p < 0.01) and vapor pressure (p < 0.01) from 1986 to 2019 (Figure 6a), with correlation coefficients of 0.45 and 0.46, respectively.The average relative humidity increased from 57.09% to 63.97% from 1986 to 1991.From 1991 to 2000, it maintained a relatively high level, ranging from 59.28% to 63.76%.From 2001 to 2012, it was relatively low, with the highest value recorded in 2002 (62.01%) and the lowest value recorded in 2006 (55.35%).In 2013, because the precipitation was significantly higher than the average precipitation, the relative humidity reached 63.98% and remained at approximately 56% after 2014.The variation trend of vapor pressure is similar to that of relative humidity, which can also be divided into four stages: rising from 1986 to 1991, relatively stable from 1991 to 2000, declining from 2001 to 2012, peaking in 2013, and maintaining at about 6 hPa after 2014.The average relative humidity and vapor pressure were similar to the overall change law of the water area of Hulun Lake.It was speculated that the change in the water surface area of Hulun Lake may be affected by relative humidity and vapor pressure.If considering only the data before the water transfer project (Figure 6b), compared with 1986-2019, the Pearson correlation coefficient between the water surface area of Hulun Lake and relative humidity increased from 0.45 to 0.69, the correlation coefficient with precipitation increased slightly from 0.16 to 0.19, the correlation coefficient with evaporation increased from −0.06 (no correlation) to −0.43 (moderate correlation), the correlation coefficient with temperature changed from a positive correlation to a negative correlation, and the correlation coefficient with wet day frequency changed from a negative correlation to a positive correlation.In addition to the vapor pressure (R = −0.40,p = 0.06), the correlation between the other meteorological elements and the water surface area was enhanced.In addition to relative humidity, the relationship between the water surface area and evaporation of Hulun Lake was significant (p < 0.05).The increase in evaporation also influenced the decrease in the water surface area of Hulun Lake, which was weaker than that of relative humidity.Consistent with the results from 1986 to 2019, relative humidity may be the main factor affecting the change in water area in Hulun Lake.
From 2009 to 2019, after the implementation of artificial water diversion (Figure 6c), Pearson correlation analysis showed that the water surface area of Hulun Lake was significantly positively correlated with the temperature, and the correlation coefficient was 0.79, which was inconsistent with the reduction effect of temperature increases on the water surface area of non-glacial snowmelt recharge lakes.Compared with that before water diversion, the correlation between water area and meteorological factors was seriously reduced, which showed that artificial water diversion seriously interfered with the response of the water area of Hulun Lake to climate change from 2009 to 2019.

Surface Runoff
In addition to atmospheric precipitation and groundwater, the natural supply of Hulun Lake water mainly comes from the Kerulen River and the Wuerxun River. Figure 7 shows the dynamic change map of the annual runoff of the two inflow rivers and the water area of Hulun Lake from 2000 to 2020.It can be found from the figure that the annual runoff of the Wuerxun River and the Kerulen River was at a low level from 2000 to 2012, and then increased.The flood peak of the Kerulen River appeared earlier.To eliminate the influence of the artificial water diversion project on the correlation between water area and surface runoff, the correlation between lake area and runoff before water diversion was analyzed (Table 3).The results showed that the water area of Hulun Lake had a significant positive correlation with the annual runoff of the Kerulen River and an insignificant positive correlation with the annual runoff of the Wuerxun River.If considering only the data before the water transfer project (Figure 6b), compared with 1986-2019, the Pearson correlation coefficient between the water surface area of Hulun Lake and relative humidity increased from 0.45 to 0.69, the correlation coefficient with precipitation increased slightly from 0.16 to 0.19, the correlation coefficient with evaporation increased from −0.06 (no correlation) to −0.43 (moderate correlation), the correlation coefficient with temperature changed from a positive correlation to a negative correlation, and the correlation coefficient with wet day frequency changed from a negative correlation to a positive correlation.In addition to the vapor pressure (R = −0.40,p = 0.06), the correlation between the other meteorological elements and the water surface area was enhanced.In addition to relative humidity, the relationship between the water surface area and evaporation of Hulun Lake was significant (p < 0.05).The increase in evaporation also influenced the decrease in the water surface area of Hulun Lake, which was weaker than that of relative humidity.Consistent with the results from 1986 to 2019, relative humidity may be the main factor affecting the change in water area in Hulun Lake.
From 2009 to 2019, after the implementation of artificial water diversion (Figure 6c), Pearson correlation analysis showed that the water surface area of Hulun Lake was significantly positively correlated with the temperature, and the correlation coefficient was 0.79, which was inconsistent with the reduction effect of temperature increases on the water surface area of non-glacial snowmelt recharge lakes.Compared with that before water diversion, the correlation between water area and meteorological factors was seriously reduced, which showed that artificial water diversion seriously interfered with the response of the water area of Hulun Lake to climate change from 2009 to 2019.

Surface Runoff
In addition to atmospheric precipitation and groundwater, the natural supply of Hulun Lake water mainly comes from the Kerulen River and the Wuerxun River. Figure 7 shows the dynamic change map of the annual runoff of the two inflow rivers and the water area of Hulun Lake from 2000 to 2020.It can be found from the figure that the annual runoff of the Wuerxun River and the Kerulen River was at a low level from 2000 to 2012, and then increased.The flood peak of the Kerulen River appeared earlier.To eliminate the influence of the artificial water diversion project on the correlation between water area and surface runoff, the correlation between lake area and runoff before water diversion was analyzed (Table 3).The results showed that the water area of Hulun Lake had a significant positive correlation with the annual runoff of the Kerulen River and an insignificant positive correlation with the annual runoff of the Wuerxun River.Note: * represents a significant correlation at the 0.05 level.

Fractional Vegetation Cover
From 1986 to 2020, the average vegetation coverage in the Hulun Lake area was 0.56, which generally represents a medium-high coverage level.The regional vegetation coverage showed a downward trend, and the change rate was −0.002/a.Figure 8a shows that the change trend of vegetation coverage in the Hulun Lake area was roughly the same as that of the water surface area of Hulun Lake and was divided into four stages.From 1986 to 1990, the vegetation coverage increased significantly, and the level of vegetation coverage increased from medium to high.From 1991 to 2000, the vegetation coverage was relatively stable and was maintained at the middle and high coverage levels.From 2001 to 2010, the vegetation coverage decreased at a rate of −0.009/a to a medium coverage level.From 2012 to 2020, the vegetation coverage increased and returned to a medium-high level.Due to the influence of other external factors, vegetation coverage may increase or decrease suddenly in a short time.As shown in Figure 8b, the M-K mutation test was used to analyze the vegetation coverage in the Hulun Lake area from 1986 to 2020.The results showed that the UF and UB curves intersected in 2000, that is, the vegetation coverage mutation occurred in 2000.After 2000, the vegetation coverage decreased rapidly, and it was the same year that the water area of Hulun Lake entered a stage of drastic reduction.Table 3. Correlation coefficient between the area of Hulun Lake and the annual runoff of rivers entering the lake.

Lake Area Annual Runoff of Wuerxun River Annual Runoff of Kerulen River
Lake Area Note: * represents a significant correlation at the 0.05 level.

Fractional Vegetation Cover
From 1986 to 2020, the average vegetation coverage in the Hulun Lake area was 0.56, which generally represents a medium-high coverage level.The regional vegetation coverage showed a downward trend, and the change rate was −0.002/a.Figure 8a shows that the change trend of vegetation coverage in the Hulun Lake area was roughly the same as that of the water surface area of Hulun Lake and was divided into four stages.From 1986 to 1990, the vegetation coverage increased significantly, and the level of vegetation coverage increased from medium to high.From 1991 to 2000, the vegetation coverage was relatively stable and was maintained at the middle and high coverage levels.From 2001 to 2010, the vegetation coverage decreased at a rate of −0.009/a to a medium coverage level.From 2012 to 2020, the vegetation coverage increased and returned to a medium-high level.Due to the influence of other external factors, vegetation coverage may increase or decrease suddenly in a short time.As shown in Figure 8b, the M-K mutation test was used to analyze the vegetation coverage in the Hulun Lake area from 1986 to 2020.The results showed that the UF and UB curves intersected in 2000, that is, the vegetation coverage mutation occurred in 2000.After 2000, the vegetation coverage decreased rapidly, and it was the same year that the water area of Hulun Lake entered a stage of drastic reduction.Figure 9 shows a five-year statistical map of vegetation coverage in the Hulun Lake area.The proportion of medium-and high-coverage vegetation in the whole study area was the largest, at approximately 33.13%, followed by medium-coverage vegetation, highcoverage vegetation, and medium-and low-coverage vegetation, accounting for 27.70%, 27.38%, and 10.54% of the study area, respectively.The proportion of low-coverage vegetation was the lowest, at only 1.25%.In terms of spatial distribution, high-coverage vegetation was distributed mainly in the southeast and north of the region, and medium-coverage and low-coverage vegetation was distributed mainly in the southwest of the region.With time, the vegetation coverage in the west of the region changed greatly, while the vegetation coverage in the east of the region remained relatively stable.It is speculated that this result may be caused by the distance from lakes and rivers.Figure 9 shows a five-year statistical map of vegetation coverage in the Hulun Lake area.The proportion of medium-and high-coverage vegetation in the whole study area was the largest, at approximately 33.13%, followed by medium-coverage vegetation, highcoverage vegetation, and medium-and low-coverage vegetation, accounting for 27.70%, 27.38%, and 10.54% of the study area, respectively.The proportion of low-coverage vegetation was the lowest, at only 1.25%.In terms of spatial distribution, high-coverage vegetation was distributed mainly in the southeast and north of the region, and medium-coverage and low-coverage vegetation was distributed mainly in the southwest of the region.With time, the vegetation coverage in the west of the region changed greatly, while the vegetation coverage in the east of the region remained relatively stable.It is speculated that this result may be caused by the distance from lakes and rivers.
The change trend and mutation time of vegetation coverage were similar to the change in the Hulun Lake water surface area.To reveal the possible influence of artificial water diversion on the response relationship between the water surface area and vegetation coverage of Hulun Lake, the correlations between the water surface area and vegetation coverage of Hulun Lake in 1986-2020, 1986-2008 before water transfer, and 2009-2020 after water transfer were analyzed.
Pearson correlation analysis (Figure 10a) from 1986 to 2020 showed that the water area of Hulun Lake was significantly negatively correlated with medium-and low-coverage vegetation (R = −0.43),negatively correlated with low-coverage vegetation (R = −0.27,p = 0.12) and medium-coverage vegetation (R = −0.19,p = 0.28), and positively correlated with medium-and high-coverage vegetation (R = 0.33, p = 0.06) and high-coverage vegetation (R = 0.29, p = 0.10).The results show that the area change in Hulun Lake was related to the change in vegetation with medium and low coverage.
27.38%, and 10.54% of the study area, respectively.The proportion of low-coverage vegetation was the lowest, at only 1.25%.In terms of spatial distribution, high-coverage vegetation was distributed mainly in the southeast and north of the region, and medium-coverage and low-coverage vegetation was distributed mainly in the southwest of the region.With time, the vegetation coverage in the west of the region changed greatly, while the vegetation coverage in the east of the region remained relatively stable.It is speculated that this result may be caused by the distance from lakes and rivers.water diversion on the response relationship between the water surface area and vegetation coverage of Hulun Lake, the correlations between the water surface area and vegetation coverage of Hulun Lake in 1986-2020, 1986-2008 before water transfer, and 2009-2020 after water transfer were analyzed.
Pearson correlation analysis (Figure 10a) from 1986 to 2020 showed that the water area of Hulun Lake was significantly negatively correlated with medium-and low-coverage vegetation (R = −0.43),negatively correlated with low-coverage vegetation (R = −0.27,p = 0.12) and medium-coverage vegetation (R = −0.19,p = 0.28), and positively correlated with medium-and high-coverage vegetation (R = 0.33, p = 0.06) and high-coverage vegetation (R = 0.29, p = 0.10).The results show that the area change in Hulun Lake was related to the change in vegetation with medium and low coverage.
The correlation analysis of the Hulun Lake area and vegetation coverage before water diversion (Figure 10b) showed that the water area of Hulun Lake was significantly negatively correlated with medium-and low-coverage vegetation (R = −0.55)and significantly positively correlated with medium-and high-coverage vegetation (R = 0.42).There was a negative correlation with low-coverage vegetation (R = −0.32,p = 0.13) and medium-coverage vegetation (R = −0.27,p = 0.21) and a positive correlation with high-coverage vegetation (R = 0.36, p = 0.10).Compared with 1986-2020, the correlation coefficients of this period increased, the significance level generally decreased, and the change in mediumand high-coverage vegetation became another influencing factor.
The correlation analysis of 2009-2020 after artificial water diversion (Figure 10c) showed that the water area was significantly negatively correlated with low coverage vegetation (R = −0.61), the correlation coefficients with other levels of coverage vegetation were less than 0.14, and the significance level was higher than 0.68, which indicated no correlation.The correlation between water area and vegetation coverage was seriously reduced compared with that before water transfer.The artificial water diversions seriously disturbed the response of the water surface area of Hulun Lake to the change in vegetation coverage from 2009 to 2020.

Correlation Analysis between Water Area and Anthropogenic Factors of Hulun Lake
As a product of human activities, anthropogenic factors also have different degrees of impact on the ecological environment, especially in arid climate areas, where human activities have a more obvious impact on the environment.In this study, the four indicators of population, regional GDP, agricultural output value, and industrial output value were selected to analyze human activities.
Pearson correlation analysis results (Figure 11) showed that the water area of Hulun Lake was significantly negatively correlated with the population (R = −0.65)and negatively correlated with the GDP (R = −0.44,p < 0.1), the agricultural output value (R = −0.45,p < 0.1), and the industrial output value (R = −0.48,p < 0.1).In addition, there was a significant positive correlation between anthropogenic factors.The correlation analysis of the Hulun Lake area and vegetation coverage before water diversion (Figure 10b) showed that the water area of Hulun Lake was significantly negatively correlated with medium-and low-coverage vegetation (R = −0.55)and significantly positively correlated with medium-and high-coverage vegetation (R = 0.42).There was a negative correlation with low-coverage vegetation (R = −0.32,p = 0.13) and mediumcoverage vegetation (R = −0.27,p = 0.21) and a positive correlation with high-coverage vegetation (R = 0.36, p = 0.10).Compared with 1986-2020, the correlation coefficients of this period increased, the significance level generally decreased, and the change in mediumand high-coverage vegetation became another influencing factor.
The correlation analysis of 2009-2020 after artificial water diversion (Figure 10c) showed that the water area was significantly negatively correlated with low coverage vegetation (R = −0.61), the correlation coefficients with other levels of coverage vegetation were less than 0.14, and the significance level was higher than 0.68, which indicated no correlation.The correlation between water area and vegetation coverage was seriously reduced compared with that before water transfer.The artificial water diversions seriously disturbed the response of the water surface area of Hulun Lake to the change in vegetation coverage from 2009 to 2020.

Correlation Analysis between Water Area and Anthropogenic Factors of Hulun Lake
As a product of human activities, anthropogenic factors also have different degrees of impact on the ecological environment, especially in arid climate areas, where human activities have a more obvious impact on the environment.In this study, the four indicators of population, regional GDP, agricultural output value, and industrial output value were selected to analyze human activities.
Pearson correlation analysis results (Figure 11) showed that the water area of Hulun Lake was significantly negatively correlated with the population (R = −0.65)and negatively correlated with the GDP (R = −0.44,p < 0.1), the agricultural output value (R = −0.45,p < 0.1), and the industrial output value (R = −0.48,p < 0.1).In addition, there was a significant positive correlation between anthropogenic factors.
Based on the results of correlation analysis, the relative humidity, evaporation, surface runoff, medium-and low-coverage vegetation, medium-and high-coverage vegetation, population, GDP, agricultural output value, and industrial output value were analyzed using principal component analysis.Considering that the dimensions of each factor were different, statistical analysis could not be directly performed.Therefore, the data were first standardized, and the range standardization method was used to eliminate the influence of variable dimension and variation range.To test whether the data were suitable for principal component analysis, we conducted the KMO test and Bartlett spherical test on the standardized data.The results are shown in Table 4.The KMO was 0.667, greater than 0.6, and the Sig was less than 0.05, indicating that the data support the use of principal component analysis.According to the principle of eigenvalues greater than 1, two principal components  Based on the results of correlation analysis, the relative humidity, evaporation, surface runoff, medium-and low-coverage vegetation, medium-and high-coverage vegetation, population, GDP, agricultural output value, and industrial output value were analyzed using principal component analysis.Considering that the dimensions of each factor were different, statistical analysis could not be directly performed.Therefore, the data were first standardized, and the range standardization method was used to eliminate the influence of variable dimension and variation range.To test whether the data were suitable for principal component analysis, we conducted the KMO test and Bartlett spherical test on the standardized data.The results are shown in Table 4.The KMO was 0.667, greater than 0.6, and the Sig was less than 0.05, indicating that the data support the use of principal component analysis.
According to the principle of eigenvalues greater than 1, two principal components were extracted, as shown in Table 5.The variance contribution rate of the first principal component was 47.85%.Among them, GDP (R = 0.99), industrial output value (R = 0.98), agricultural output value (R = 0.98), and population (R = 0.96) were highly correlated with the first principal component, and these are all anthropogenic factors.Thus, the dominant factors of the first principal component are anthropogenic factors, that is, anthropogenic factors can explain 48% of the change in the Hulun Lake water area.The variance contribution rate of the second principal component was 26.54%.Among them, evaporation (R = −0.86)was highly correlated with the second principal component, followed by medium-and high-vegetation coverage (R = 0.74), medium-and low-vegetation coverage (R = −0.73),and relative humidity (R= 0.66), all of which are natural factors; thus, natural factors explained 27% of the change in the Hulun Lake water area.In summary, the first principal component with anthropogenic factors as the main factors and the second principal component with natural factors as the main factors can explain 75% of the change in the Hulun Lake area.

Discussion
In this study, the overall accuracy of extracting the water area of Hulun Lake based on the MNDWI was 95.25%, which proved that the index could be effectively used to extract the water body in this area.Additionally, the accuracy of the JRC global water body data in this area was limited, but the extracted lake area had the same trend, and the data processing was fast and convenient.This method has advantages in analyzing the dynamic changes in lake areas on a large scale and over long time scales.Based on the GEE platform, Peng et al. [38] used JRC water products combined with the water index method to study the area change in Bosten Lake in the last 20 years and achieved good results.Many studies have shown that the applicability of JRC global surface water datasets in different regions is different.Jin et al. [39] found that JRC global surface water data could minimize the interference of cloud shadows.This product was suitable for the high cloud coverage plateau area of the Three-River Headwaters region.Wu et al. [40] compared the lake area extracted using different water indices with JRC data in the Hulun Lake area and found that the accuracy of the JRC data was poor.Therefore, the applicability of the dataset needs further study.The variation law of the Hulun Lake area extracted in this study was consistent with the conclusion obtained by Zhao et al. [27] through visual interpretation.It was roughly divided into four stages.The water area fluctuated between 1748.81-2314.96km 2 , and the variation range was the same.
In this study, climate change, surface runoff change, vegetation succession, and economic and social development were fully considered when analyzing the influencing factors of the Hulun Lake water area.Our findings suggest that evaporation and relative humidity are most closely related to the change in the Hulun Lake area.This result is consistent with the findings of Wang et al. [41] and Zhao et al. [27] in their studies.Water surface evaporation is the primary mechanism by which inland lakes consume water [41].Sun et al. [17] estimated that the average annual evaporation of Hulun Lake was approximately 4.8 times the direct precipitation based on the water balance.Relative humidity is a comprehensive indicator that reflects the conditions of precipitation, temperature, evaporation, vapor pressure, and other factors.Although precipitation, vapor pressure, and wet day frequency did not show a significant correlation with the water area of Hulun Lake, they have an indirect correlation with relative humidity.This suggests that relative humidity plays an important role in the water area changes in Hulun Lake, and variations in relative humidity can be used as an indicator of the lake's hydrological conditions.The runoff of the river into the lake was also a very significant factor affecting the dynamic change in the lake area.The amount of water in the river flowing into the lake would directly affect increases and decreases in lake area, and the change in climatic factors would also lead to a change in runoff.Further analysis of the correlation between meteorological factors found that evaporation and precipitation and relative humidity were significantly negatively correlated (p < 0.01).The warming and drying of the regional climate led to an increase in evaporation, an increase in lake water consumption, and a decrease in precipitation, which reduced lake surface precipitation and surface runoff.Guo et al. [16] thought that the decrease in river runoff caused by climate warming and drying was the main reason for the decrease in water level and area of Hulun Lake.Vegetation coverage was the most direct indicator of a change in water surface area.We found that the smaller the area of low-medium coverage vegetation was, the larger the water surface area; additionally, the larger the area of medium-high coverage vegetation was, the larger the water surface area.This shows that compared with mediumlow coverage vegetation, medium-high coverage vegetation has more obvious effects on sand fixation and soil conservation, water conservation, and soil erosion prevention in lakes in semiarid areas.
As a product of human activities, social and economic factors seriously restrict the development of the ecological environment in semiarid areas.Tao et al. [6] analyzed the relationship between the number and area of lakes in Inner Mongolia and climatic and anthropogenic factors and found that the changes in lakes in Inner Mongolia were affected mainly by human activities, especially coal mining.By analyzing the main driving factors of the rapid change in the Hulun Lake area from 2000 to 2016, we found that anthropogenic factors were the main driving force causing the shrinkage of the Hulun Lake area.After 2000, the GDP of the Hulun Lake region increased rapidly, and industry became the main source of economic growth in the region.As the basic resource of industry, coal mining accounted for nearly 10% of the secondary industry [16].Relevant studies [42][43][44] have shown that there are numerous significant deposits in the study area, with Manzhouli being a crucial coal-producing region in Hulunbuir, where coal mining has been a severe issue.On the one hand, the overexploitation of coal destroyed the native vegetation, aggravated grassland degradation, accelerated soil erosion and desertification, and destroyed the local ecological environment [45].On the other hand, coal mining led to land subsidence and the destruction of underground aquifers, affecting the regulation and storage of runoff by the groundwater system, resulting in a decline in groundwater levels and a sharp decrease in surface runoff [46,47].These indirectly led to the shrinking of Hulun Lake.The data on agricultural output value indirectly reflected the increase in cultivated land area and livestock number and the increase in population, which directly led to the increase in water consumption in this area.Furthermore, the region has also experienced significant human activities such as deforestation and urbanization [48].The combination of these factors has resulted in a significant reduction in the water area of Hulun Lake.
In 2013, the area of Hulun Lake rebounded rapidly.To explore the reasons why, we believed that it was the result of the combined action of meteorological factors and artificial water diversion.In 2013, anthropogenic factors, as the main driving factor, did not change much, while meteorological factors changed dramatically (Figure 5).The precipitation, vapor pressure, and relative humidity increased significantly, and the evaporation reached the lowest value in the past 35 years.The response of the lake area to the change in meteorological factors was rapid in a short time.However, to curb the shrinking of Hulun Lake, the state began building the "diversion into the lake" project beginning in 2007, introducing the water of the Hailar River into the lake.It was planned to divert 750 million m 3 of water each year, and the project was completed and diversion started in 2009.In the first two years, due to the small runoff of the Hailar River, the amount of water diversion did not reach the planned value, and the shrinkage of the Hulun Lake area was not curbed but was actually further aggravated.The lake storage capacity decreased by 1.27 billion m 3 in two years.Since 2013, the annual water diversion into the lake has been stable at 600 million m 3 [49], which has played a crucial role in the increase in the Hulun Lake area.
There are several limitations to this study.The method of extracting the water body was relatively singular, and other methods were not comprehensively utilized.In the future, an accuracy comparison of various water body methods can be carried out to explore the best method of water body extraction in this area.The runoff data for inflow rivers were not long enough to cover the whole research period.In the future, a hydrological model or remote sensing inversion method can be used to extend and interpolate runoff data.Human activities were considered as a whole in this study; however, quantifying the effects of different human activities remains an important issue [50].In the future, we will determine the proportion of different human activities in anthropogenic factors based on field surveys or statistical models.Groundwater recharge is also an important factor that affects the area of Hulun Lake.Unfortunately, due to the lack of data, this study did not consider the impact of underground runoff on the lake area.In the future, GRACE satellites can be used to estimate groundwater reserves in the area or collect meteorological and hydrological data to calculate groundwater reserves using the water balance formula.This will help to explore the impact of groundwater recharge on the water area of Hulun Lake.Later, with the improvement of data, we will carry out more work on the physical mechanism to further clarify the driving mechanism of the dynamic changes in Hulun Lake.

Conclusions
Based on Landsat remote sensing image data from 1986 to 2020, this paper extracted the water area of Hulun Lake in a long time scale and analyzed and discussed the shrinking process and driving force of Hulun Lake by combining natural factors and socioeconomic factors.The following conclusions were drawn: (1) The water surface area of Hulun Lake decreased significantly in the past 35 years.The dynamic change in the water area of Hulun Lake could be divided into four stages: 1986-1991, a stage with area increases; 1991-2000, a relatively stable stage; 2000-2012, a stage with a sharp decrease; and 2012-2020, the recovery phase.The areas with more dramatic changes in the water area of Hulun Lake were distributed mainly in the northeastern and southern waters of Hulun Lake.
(2) The dynamic change in the Hulun Lake area was affected by both natural factors and anthropogenic factors.In terms of natural factors, meteorological factors such as evaporation and relative humidity, runoff of inflow rivers, medium-to high-coverage, and medium-to low-coverage vegetation had significant effects.In terms of anthropogenic factors, the population had the most significant impact.The artificial water diversion project had different degrees of influence on the response of the Hulun Lake area change to natural factors.
(3) Anthropogenic factors were the main driving force causing the rapid change in the Hulun Lake area from 2000 to 2016, and they explained 48% of the change in the Hulun Lake area.Natural factors were the secondary driving force and explained 27% of the change in the Hulun Lake area.

Figure 2 .
Figure 2. Comparative verification of water area extraction results (The unit is km 2 ): (a) M and (b) JRC.

Figure 2 .
Figure 2. Comparative verification of water area extraction results (The unit is km 2 ): (a) MNDWI and (b) JRC.

Figure 3 .
Figure 3. Characteristics of temporal variation in Hulun Lake.(a) Lake area and (b) water level and water volume.

Figure 3 .
Figure 3. Characteristics of temporal variation in Hulun Lake.(a) Lake area and (b) water level and water volume.

21 Figure 4 .
Figure 4. Distribution of the water area of Hulun Lake in key years.

3. 3 .
Correlation Analysis between Water Area and Natural Factors of Hulun Lake3.3.1.Meteorological FactorsThe annual temperature, precipitation, evaporation, and relative humidity in the Hulun Lake area were represented by the average values of meteorological elements from

Figure 4 .
Figure 4. Distribution of the water area of Hulun Lake in key years.

21 Figure 5 .
Figure 5. Distribution of meteorological factors in the Hulun Lake region.

Figure 5 .
Figure 5. Distribution of meteorological factors in the Hulun Lake region.

Figure 7 .
Figure 7. Dynamic changes in the water area and surface runoff in Hulun Lake from 2000 to 2020.

Figure 7 .
Figure 7. Dynamic changes in the water area and surface runoff in Hulun Lake from 2000 to 2020.

21 Figure 8 .
Figure 8. 1986-2020 annual variation in FVC in the Hulun Lake region (a) and M-K mutation test results (b).

Figure 8 .
Figure 8. 1986-2020 annual variation in FVC in the Hulun Lake region (a) and M-K mutation test results (b).

Figure 9 .
Figure 9. Grading map of FVC in the Hulun Lake area.The change trend and mutation time of vegetation coverage were similar to the change in the Hulun Lake water surface area.To reveal the possible influence of artificial

Figure 9 .
Figure 9. Grading map of FVC in the Hulun Lake area.

21 Figure 11 .
Figure 11.The correlation coefficient between the area of Hulun Lake and anthropogenic factors.

Figure 11 .
Figure 11.The correlation coefficient between the area of Hulun Lake and anthropogenic factors.

Table 3 .
Correlation coefficient between the area of Hulun Lake and the annual runoff of rivers entering the lake.

Table 4 .
KMO test and Bartlett spherical test.
Note: ** represents an extremely significant correlation at the 0.01 level.

Table 4 .
KMO test and Bartlett spherical test.

Table 5 .
Total variance explained by PCA.(Component 1-8 is a comprehensive variable obtained from data dimensionality reduction, which can fully reflect the data characteristics of the original information.The weight of each load in each comprehensive variable is different, which represents the influencing factors of different biases of each comprehensive variable.)