Application of Gaofen-6 Images in the Downscaling of Land Surface Temperatures

Abstract: The coarse resolution of land surface temperatures (LSTs) retrieved from thermal-infrared (TIR) satellite images restricts their usage. One way to improve the resolution of such LSTs is downscaling using high-resolution remote sensing images. Herein, Gaofen-6 (GF-6) and Landsat-8 images were used to obtain original and retrieved LSTs (Landsat-8- and GF-6-retrieved LSTs) to perform LST downscaling in the Ebinur Lake Watershed. A downscaling model was constructed, and the regression kernel was explored. Downscaling LST with the random forest regression method, using the GF-6 normalized difference vegetation index with red-edge band 2, the ratio built-up index, the normalized difference sand index, and the normalized difference water index as multiple remote sensing indices, provided the optimal downscaling results, with R² of 0.836, 0.918, and 0.941, root mean square differences of 1.04 K, 2.06 K, and 1.80 K, and proportions of pixels with LST errors between −1 K and +1 K of 87.2%, 76.4%, and 81.9%, respectively. The spatial distribution of the 16 m downscaled LST corresponded with that of the Landsat-8- and GF-6-retrieved LSTs and provided additional spatial detail on LST variations that was absent in the Landsat-8- and GF-6-retrieved LSTs. The downscaled LST could satisfy the application requirements for LST spatial resolution.


Introduction
Land surface temperature (LST) is a critical parameter in the surface energy balance. It is also an important indicator of land degradation, salinization, desertification, and erosion, and is widely used in studies of evaporation estimation, the water cycle, drought monitoring, the "urban heat island" effect [1,2], and the cold island effect in oases [3]. In early studies, surface temperature was mostly obtained from ground measurements, which suffer from an insufficient density of stations and a limited space-time range for monitoring spatiotemporal changes in the surface thermal environment [4]. With the development of remote sensing technology, satellite images from thermal infrared sensors (TIRS) have become an important means of obtaining LST because of their wide coverage, relatively low cost, and periodic acquisition [5,6].
Human activities and urban expansion substantially change the natural surface, causing a series of environmental impacts. The spatial and temporal distribution of the thermal environment varies widely with the heterogeneity of underlying surface components and the complexity of atmospheric conditions [7][8][9]. However, the LST retrieved from TIRS images typically has a coarse resolution. The Landsat-8 TIRS images provided by the United States Geological Survey (USGS) website (http://earthexplorer.usgs.gov, accessed on 12 March 2022) have been resampled to a resolution of 30 m by the cubic convolution method, although the information expressed remains similar to the physical resolution of [...] China within China's National Plan for Long- and Medium-Term Forestry Development, and since the state-level Lake Ebinur Wetlands nature reserve was established in 2007, conservation plans have enabled the recovery of the area of Lake Ebinur to a certain extent. However, the rate of recovery of the lake is still far slower than its rate of degradation.
The dynamic succession of the wetland ecosystem caused by fluctuations in lake surface area has become a barometer of the improvement and deterioration of the ecological environment, and has attracted widespread interest from academia, governmental organizations, the media, and the public. Numerous studies have been conducted on drainage system changes, soil salinization, dust storms, desertification, landscape patterns, and the cool island effect in the context of this region's deteriorating ecological environment [25][26][27][28][29][30][31]. However, studies on drought assessment, irrigation monitoring, and water and heat balance require the practical application of LST at the field scale, such as soil moisture monitoring at the field scale [32]. This requires the spatial downscaling of LST to meet the resolution requirements for water resource management over small-scale, highly heterogeneous underlying surfaces; such studies are lacking for the study area. The acquisition of high-resolution LST helps in analyzing the complex interactions between human (socio-economic) and natural (ecological) systems, which is a prerequisite for environmental protection and restoration in the study area.
To understand the relationship between LST and the surface variability caused by human activities, this study took the Ebinur Lake watershed as the study area. Landsat-8 and GF-6 WFV images were used as data sources to perform LST downscaling using the DisTrad, TsHARP, and MIRF methods. Remote sensing indices, namely the GF-6 NDVI, the normalized difference vegetation index with red-edge band 1 (NDVI_RE1), the normalized difference vegetation index with red-edge band 2 (NDVI_RE2), the NDSI, the ratio built-up index (RBI), and the normalized difference water index (NDWI), were selected as regression kernels according to the band characteristics of the GF-6 WFV images and the underlying surface features of the study area. This study provides preliminary data on the viability of using GF-6 images for LST downscaling in the study area, and the effects of the two newly added red-edge bands on the downscaling results of the three methods were evaluated, in order to obtain satisfactory downscaling results for subsequent applications.

Overview of the Study Area
The study area is the Ebinur Lake watershed, which is a classic example of an arid oasis. The area lies within the north temperate zone and has a desert-continental climate. It is located in the Bortala Mongol Autonomous Prefecture in the Xinjiang Uyghur Autonomous Region (81°46′–83°51′ E, 44°02′–45°10′ N). The Gurbantünggüt Desert, Borohoro Mountains (western branch of the Tian Shan Mountain system), Dzungarian Alatau (northern-most branch of the Tian Shan Mountain system), and Mount Mayili (western mountains of Dzungaria) lie to the east, south, west, and north of the study area, respectively. Alashankou valley (with a width of approximately 10 km) is located between the Dzungarian Alatau and Mount Mayili, as shown in Figure 1.
The study area is situated in the wetland ecotone of Ebinur Lake. This region has an extremely fragile ecological environment that has been strongly affected by human activities and environmental factors. Owing to its unique geographical location, this study area is extremely important for conducting research on climate regulation, reducing the occurrence of salt-dust storms, and conserving the endemic biodiversity [33].
Agricultural and industrial sectors have developed rapidly together with the population in this region in recent years, and water consumption for agricultural, municipal, and industrial needs in regions upstream of Ebinur Lake has also increased. Due to this stress, Kuitun River, which is one of the rivers feeding Lake Ebinur, has been completely cut off, leaving only the Bortala River and Jinghe River to feed Ebinur Lake. Combined with a large amount of evaporation and dust weather, the lake area is shrinking rapidly, the lakeshore region has become severely desertified, and the salt desert formed on the dried lakebed has become a major source of severe aeolian sandstorms. Therefore, the regional ecological issues caused by changes in the area of Ebinur Lake have become a direct threat to the sustainable development of the TNSEZ and the safety of the New Eurasian Land Bridge [34]. Understanding human and social dynamics and quantifying and mapping the spatial-temporal distribution of environmental vulnerability caused by natural and man-made impacts are needed for environmental protection and restoration [35][36][37][38].
Remote Sens. 2022, 14, 2307

Data Sources
The overpass times of Landsat-8 and GF-6 over the study area were similar, at 13:14:40 and 13:43:20 (Beijing Time), respectively, providing a reliable data source for this study. Landsat-8 images can be contaminated by cloud, particularly in winter in the study area. Considering the season and quality of image acquisition, images with thick aerosol or heavy cloud cover were removed, and images obtained during the growing season (spring, summer, and autumn) were selected: Landsat-8 Operational Land Imager (OLI) and TIRS images from 12 April, 17 July, and 3 September 2019 (downloaded from the United States Geological Survey (USGS) website, http://earthexplorer.usgs.gov, accessed on 12 March 2022) and GF-6 WFV images from 9 April, 24 July, and 26 August 2019 (obtained from the Satellite Application Center of the Xinjiang Uygur Autonomous Region). This selection minimized interference and maximized image quality, so the spectral and textural features of these images were clear and distinct.
Considering that the Landsat-8 TIRS images from USGS have been resampled to a resolution of 30 m, GF-6 and Landsat-8 TIRS images were first scaled up to 100 m to obtain the remote sensing indices and the original LST at the low resolution, and to establish the downscaling model. GF-6 images at the original resolution were used to construct the high-resolution remote sensing indices, which were used to downscale the original LST and yield the 16 m LST.
Ground LST measurements were obtained from three ground stations (downloaded from the China Meteorological Data Service Centre, http://data.cma.cn, accessed on 12 March 2022, and the Hydrographic and Water Resources Survey Bureau of Xinjiang Bortala Mongolia Autonomous Prefecture), measured using ground temperature sensors: a glass liquid cryometer and a platinum resistance cryosensor. The actual LST was estimated from the upwelling and downwelling longwave radiation observed by pyranometers using the following equation:

$$T_s = \left[\frac{R_{lu} - (1 - \varepsilon) R_{ld}}{\varepsilon \sigma}\right]^{0.25} \qquad (1)$$

where R_lu (R_ld) is the surface upwelling (downwelling) longwave radiation, ε is the land surface emissivity (LSE), T_s is the LST, and σ is the Stefan-Boltzmann constant. The temporal resolution is 1 h, and the underlying surface around each ground station was homogeneous according to the field visit. The remote sensing images used in this study are listed in Table 1. Table 2 shows the geographic coordinates of the ground stations and their underlying surface types. The locations of the ground stations are shown in Figure 1.
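As an illustration of Equation (1), the following minimal sketch inverts the longwave radiation balance for T_s. The function name and the sample radiation values are hypothetical and are not taken from the station records of this study.

```python
SIGMA = 5.670374419e-8  # Stefan-Boltzmann constant, W m^-2 K^-4


def lst_from_longwave(r_up, r_down, emissivity):
    """Invert R_lu = eps * sigma * Ts^4 + (1 - eps) * R_ld for Ts (kelvin)."""
    # Remove the reflected part of the downwelling radiation,
    # leaving only the radiation emitted by the surface itself.
    emitted = r_up - (1.0 - emissivity) * r_down
    return (emitted / (emissivity * SIGMA)) ** 0.25


# Hypothetical hourly reading (W m^-2), for illustration only.
print(round(lst_from_longwave(r_up=480.0, r_down=350.0, emissivity=0.96), 2))
```

The emissivity must be known for the surface around the station; with ε = 1 the expression reduces to the plain Stefan-Boltzmann inversion.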

Normalization of Remote Sensing Images
Because radiometric calibration and spectral response functions differ between sensors, and because observation geometry and atmospheric conditions affect the images, the collaborative application of multi-source sensor images is difficult. Therefore, normalization must be performed to eliminate the data discrepancies caused by these factors before multi-source remote sensing images are applied together.
Considering the characteristics of GF-6 coverage and large-angle observation, the GF-6 images were normalized using Landsat-8 images as a reference to ensure consistency between them and to standardize the input data for the LST downscaling algorithm. This process included radiometric cross-calibration [39][40][41], orthorectification, geometric correction, atmospheric correction using the Fast Line-of-sight Atmospheric Analysis of Spectral Hypercubes (FLAASH) algorithm, image cropping, and resampling. In radiometric cross-calibration, the angle information of the OLI image and the slope and aspect information of [...]
The normalized and unnormalized GF-6 NDVI were compared to the Landsat-8 NDVI. Following normalization, the GF-6 NDVI was more similar to the Landsat-8 NDVI, as shown in Figure 2, confirming that normalization reduced the disparities between the GF-6 and Landsat-8 NDVI. The statistical parameters of the data are shown in Table 3: normalization reduced the root mean square difference (RMSD) and increased R², which is evidence of its efficacy.
The single-channel algorithm [42] requires very few parameters to estimate LST and is applicable within a certain range of atmospheric water vapor content. When the water vapor content is high, the errors of the related parameters in the derivation increase, reducing the retrieval accuracy. Conversely, when the atmospheric water vapor content decreases to 2 g·cm⁻², the retrieval error of LST decreases to between 1.53 K [43,44]. In this study, the atmospheric water vapor content was less than 2 g·cm⁻²; the single-channel algorithm was, therefore, used to retrieve LST for the study area. Landsat-8 TIRS and GF-6 WFV, which after cross-radiation calibration replaced Landsat-8 OLI, were used to retrieve LST. For convenience, the 30 m Landsat-8 images and 16 m GF-6 images were both resampled into 100 m images. Furthermore, the 16 m GF-6 images were also resampled into 30 m images, so that 100 m and 30 m LSTs could be retrieved with TIRS. The 100 m-retrieved LSTs were the original LSTs for downscaling, and the 30 m-retrieved LSTs were used to evaluate the downscaling results.
The governing equations of the single-channel algorithm are as follows:

$$T_s = \gamma \left[ \varepsilon^{-1} \left( \psi_1 L + \psi_2 \right) + \psi_3 \right] + \delta$$

$$\gamma = \left\{ \frac{c_2 L}{T^2} \left[ \frac{\lambda^4 L}{c_1} + \frac{1}{\lambda} \right] \right\}^{-1}$$

$$\delta = -\gamma L + T$$

where ε is the surface emissivity, L is the radiance measured by the sensor at the altitude of the satellite (W·m⁻²·sr⁻¹·µm⁻¹), T is the brightness temperature, λ is the central wavelength (band 10 of Landsat-8 has a central wavelength of 10.9 µm), c₁ = 1.19104 × 10⁸ W·µm⁴·m⁻²·sr⁻¹, and c₂ = 14,387.7 µm·K. ψ₁, ψ₂, and ψ₃ are atmospheric functional parameters that depend on the atmospheric water vapor content ω, which is estimated from the absolute vapor pressure e (hPa), the relative humidity RH, and the air temperature T₀ measured at 2 m above the surface. RH and T₀ are meteorological data of the Jinghe County meteorological station (ID: 51334), which were downloaded from the China Meteorological Data Service Centre (http://data.cma.cn, accessed on 12 March 2022).
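The core of the single-channel retrieval can be sketched as below. This is a minimal illustration under stated assumptions, not the processing chain used in this study: it takes the standard generalized single-channel form T_s = γ[ε⁻¹(ψ₁L + ψ₂) + ψ₃] + δ, uses c₁ = 1.19104 × 10⁸ W·µm⁴·m⁻²·sr⁻¹ (the standard value of the first radiation constant in these units), and expects the atmospheric functions ψ₁, ψ₂, ψ₃ to be supplied by the caller, because their dependence on ω is not reproduced here. The function name and sample inputs are hypothetical.

```python
C1 = 1.19104e8  # W um^4 m^-2 sr^-1
C2 = 14387.7    # um K


def single_channel_lst(radiance, bt, emissivity, psi, wavelength=10.9):
    """Single-channel LST (K) from at-sensor radiance and brightness temperature.

    radiance   : at-sensor radiance L (W m^-2 sr^-1 um^-1)
    bt         : at-sensor brightness temperature T (K)
    emissivity : land surface emissivity
    psi        : (psi1, psi2, psi3), atmospheric functions supplied by the caller
    wavelength : central wavelength in um (10.9 um for Landsat-8 band 10)
    """
    psi1, psi2, psi3 = psi
    gamma = 1.0 / ((C2 * radiance / bt ** 2)
                   * (wavelength ** 4 * radiance / C1 + 1.0 / wavelength))
    delta = -gamma * radiance + bt
    return gamma * ((psi1 * radiance + psi2) / emissivity + psi3) + delta


# Hypothetical inputs, for illustration only.
print(single_channel_lst(9.5, 300.0, 0.97, (1.05, -0.6, 0.4)))
```

With ψ = (1, 0, 0) and ε = 1, that is, no atmospheric effect and a blackbody surface, the function returns the brightness temperature unchanged, which is a convenient sanity check.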

Three Classic LST Downscaling Methods Used
GF-6 and Landsat-8 TIRS images were first scaled up to 100 m to obtain the remote sensing indices and the original LST at the low resolution, and to establish the downscaling model. GF-6 images at the original resolution were then used to construct the high-resolution remote sensing indices with which the original LST was downscaled.

DisTrad
The DisTrad method was first proposed by Kustas et al. in 2003. Based on the statistical scale invariance of the relationship between NDVI and LST, Kustas et al. downscaled thermal infrared images from 1 km to 100 m. The governing equations of DisTrad are as follows:

$$\widehat{LST}_L(NDVI_L) = a + b\,NDVI_L \qquad (11)$$

$$\Delta LST_L = LST_L - \widehat{LST}_L(NDVI_L)$$

$$LST_H = a + b\,NDVI_H + \Delta LST_L$$

where LST_L is the LST value at the original resolution, \widehat{LST}_L is the simulated LST value at the coarser resolution, NDVI_L is the NDVI value at the coarser resolution, a and b are constants, ΔLST_L is the regression residual, NDVI_H is the NDVI value at the finer resolution, and LST_H is the downscaled result.
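The DisTrad workflow described above (regress LST on NDVI at the coarse scale, apply the regression to fine-scale NDVI, and add back the coarse residual) can be sketched as follows. This is a minimal sketch, assuming a linear regression as in Equation (11) and flat lists of pixels; the function names and the `parent` mapping from each fine pixel to its enclosing coarse pixel are hypothetical conventions introduced for this example.

```python
from statistics import fmean


def fit_line(x, y):
    """Ordinary least squares for y = a + b * x (x must not be constant)."""
    mx, my = fmean(x), fmean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    b = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    return my - b * mx, b


def distrad(ndvi_coarse, lst_coarse, ndvi_fine, parent):
    """Downscale coarse LST using fine NDVI.

    parent[i] is the index of the coarse pixel that contains fine pixel i.
    """
    a, b = fit_line(ndvi_coarse, lst_coarse)
    # Residual of each coarse pixel with respect to the regression line.
    residual = [t - (a + b * v) for v, t in zip(ndvi_coarse, lst_coarse)]
    # Fine-scale prediction plus the residual of the enclosing coarse pixel.
    return [a + b * v + residual[p] for v, p in zip(ndvi_fine, parent)]


# Hypothetical values: three coarse pixels, three fine pixels.
print(distrad([0.2, 0.6, 0.4], [310.0, 298.0, 305.0], [0.1, 0.3, 0.6], [0, 0, 1]))
```

Because the regression is linear and the residual is constant within a coarse pixel, the mean of the downscaled values equals the coarse LST whenever the fine NDVI values average to the coarse NDVI.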

TsHARP
The TsHARP method is an improved algorithm proposed by Agam [19] on the basis of the DisTrad algorithm. This method assumes that a relatively stable functional relationship between NDVI (or vegetation coverage) and LST is maintained at different spatial scales. In this method, a function between LST and NDVI is constructed at the coarser resolution and then applied at the higher spatial resolution. Residual correction is then conducted to obtain the LST at the required resolution. The governing equations of the TsHARP method are as follows:

$$f(NDVI) = a_0 + a_1 (1 - NDVI)^{0.625} \qquad (16)$$

$$\Delta T = T_{s,L} - f(NDVI_L)$$

$$T_{s,H} = f(NDVI_H) + \Delta T$$

where f is the regression function between NDVI and T_s, ΔT is the regression residual, a₀ and a₁ are the regression coefficients, and the subscripts L and H denote the coarser and finer resolutions, respectively.
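A minimal sketch of the TsHARP steps follows, assuming the transformed predictor (1 − NDVI)^0.625 of Equation (16), a least-squares fit at the coarse scale, and the same hypothetical `parent` mapping (fine pixel to enclosing coarse pixel) used for illustration only.

```python
from statistics import fmean


def tsharp(ndvi_coarse, lst_coarse, ndvi_fine, parent, power=0.625):
    """Downscale coarse LST with the regression f = a0 + a1 * (1 - NDVI)^power."""
    # Transform NDVI before fitting; the fit itself is ordinary least squares.
    x = [(1.0 - v) ** power for v in ndvi_coarse]
    mx, my = fmean(x), fmean(lst_coarse)
    a1 = (sum((xi - mx) * (t - my) for xi, t in zip(x, lst_coarse))
          / sum((xi - mx) ** 2 for xi in x))
    a0 = my - a1 * mx

    def f(ndvi):
        return a0 + a1 * (1.0 - ndvi) ** power

    # Residual correction: each fine pixel inherits its coarse pixel's residual.
    delta_t = [t - f(v) for v, t in zip(ndvi_coarse, lst_coarse)]
    return [f(v) + delta_t[p] for v, p in zip(ndvi_fine, parent)]


# Hypothetical values, for illustration only.
print(tsharp([0.1, 0.5, 0.8], [312.0, 303.0, 296.0], [0.2, 0.7], [1, 1]))
```

The only difference from the DisTrad sketch is the transformed predictor; NDVI values are assumed to be at most 1 so that the power is well defined.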

MIRF
Considering that the LST is affected by multiple factors, Yang et al. [20] proposed the MIRF algorithm. In MIRF, a downscaling model is constructed using random forest regression based on multiple surface-related remote sensing indices, which include the soil-adjusted vegetation index (SAVI), normalized multi-band drought index (NMDI), modified normalized difference water index (MNDWI), normalized difference dust index (NDDI), and normalized difference building index (NDBI). The governing equations of MIRF are as follows:

$$LST_F = f_{RF}(SAVI_L, NMDI_L, NDBI_L, MNDWI_L, NDDI_L)$$

$$e = LST_O - LST_F$$

$$LST_H = f_{RF}(SAVI_H, NMDI_H, NDBI_H, MNDWI_H, NDDI_H) + e$$

in which f_RF denotes the random forest regression function; SAVI_L, NMDI_L, NDBI_L, MNDWI_L, and NDDI_L are the SAVI, NMDI, NDBI, MNDWI, and NDDI values at the coarser resolution, respectively; e is the regression residual; LST_O is the LST at the original resolution; LST_F is the simulated LST at the coarser resolution; SAVI_H, NMDI_H, NDBI_H, MNDWI_H, and NDDI_H are the SAVI, NMDI, NDBI, MNDWI, and NDDI values at the finer resolution, respectively; and LST_H is the value of the downscaled LST.
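The MIRF workflow (fit a multi-index regression at the coarse scale, predict at the fine scale, add the coarse residual) can be illustrated with the toy example below. It is a sketch under loud assumptions: the regressor is a small hand-written ensemble of bootstrapped regression trees with random feature subsets, standing in for a full random forest so that the example needs only the standard library; in practice a library implementation such as scikit-learn's RandomForestRegressor would normally be used. All function names, hyperparameters, and the `parent` mapping are hypothetical.

```python
import random
from statistics import fmean


def build_tree(X, y, rng, depth=0, max_depth=4, min_leaf=2):
    """Grow a small regression tree; each node tries a random half of the features."""
    if depth >= max_depth or len(y) < 2 * min_leaf:
        return fmean(y)  # leaf: mean LST of the samples that reached it
    n_feat = len(X[0])
    best = None
    for j in rng.sample(range(n_feat), max(1, n_feat // 2)):
        values = sorted({row[j] for row in X})
        for lo, hi in zip(values, values[1:]):
            thr = (lo + hi) / 2.0
            left = [i for i, row in enumerate(X) if row[j] <= thr]
            right = [i for i, row in enumerate(X) if row[j] > thr]
            if len(left) < min_leaf or len(right) < min_leaf:
                continue
            ml, mr = fmean(y[i] for i in left), fmean(y[i] for i in right)
            sse = (sum((y[i] - ml) ** 2 for i in left)
                   + sum((y[i] - mr) ** 2 for i in right))
            if best is None or sse < best[0]:
                best = (sse, j, thr, left, right)
    if best is None:  # no admissible split among the sampled features
        return fmean(y)
    _, j, thr, left, right = best

    def grow(idx):
        return build_tree([X[i] for i in idx], [y[i] for i in idx],
                          rng, depth + 1, max_depth, min_leaf)

    return (j, thr, grow(left), grow(right))


def predict_tree(node, row):
    while isinstance(node, tuple):
        j, thr, left, right = node
        node = left if row[j] <= thr else right
    return node


def fit_forest(X, y, n_trees=30, seed=0):
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(y)) for _ in y]  # bootstrap sample
        forest.append(build_tree([X[i] for i in idx], [y[i] for i in idx], rng))
    return forest


def predict_forest(forest, row):
    return fmean(predict_tree(tree, row) for tree in forest)


def mirf_downscale(idx_coarse, lst_coarse, idx_fine, parent):
    """idx_* are lists of per-pixel index vectors; parent maps fine -> coarse pixel."""
    forest = fit_forest(idx_coarse, lst_coarse)
    residual = [t - predict_forest(forest, r) for r, t in zip(idx_coarse, lst_coarse)]
    return [predict_forest(forest, r) + residual[p] for r, p in zip(idx_fine, parent)]
```

Each row of `idx_coarse` and `idx_fine` holds the index values of one pixel in a fixed order, for example (NDVI, NDWI, RBI, NDSI). A fine pixel whose indices equal those of its coarse parent receives exactly the parent's LST, because the prediction and the residual cancel.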

Remote Sensing Indices Based on GF-6 Images
According to the field investigation and the GF-6 images of the study area, the underlying surface types of the study area mainly include vegetation, water bodies, impermeable surfaces, and barren soil. The remote sensing indices relating to these underlying surface types were selected. As NDDI cannot distinguish sand from soil, this indicator negatively affects the accuracy of LST downscaling, and it was, therefore, replaced by the normalized difference sand index (NDSI). Furthermore, as the GF-6 images do not provide the shortwave infrared bands needed to compute the NDBI, which makes it difficult to identify built-up land using a single band or combination of bands, the ratio built-up index (RBI) was used instead of the NDBI [45]. Finally, based on the underlying surface characteristics of the study area and the available bands of the GF-6 images, MIRF regression was conducted using the GF-6 NDVI, NDWI, RBI, and NDSI indices. To highlight the effects of the red-edge band 1 (RE1) and red-edge band 2 (RE2) on LST downscaling, three different NDVIs (NDVI_GF6_Nir, NDVI_RE1, and NDVI_RE2) were constructed based on the common bands and the RE1 and RE2 bands of the GF-6 image. The equations used to compute these indices are as follows:

$$NDVI_{GF6\_Nir} = \frac{\rho_{Nir} - \rho_{red}}{\rho_{Nir} + \rho_{red}}$$

$$NDVI_{RE1} = \frac{\rho_{ver1} - \rho_{red}}{\rho_{ver1} + \rho_{red}}$$

$$NDVI_{RE2} = \frac{\rho_{ver2} - \rho_{red}}{\rho_{ver2} + \rho_{red}}$$

where ρ_Nir and ρ_red are the reflectance values of the NIR and red bands, NDVI_GF6_Nir is the NDVI based on the NIR band, ρ_ver1 is the reflectance of RE1, NDVI_RE1 is the NDVI based on RE1, ρ_ver2 is the reflectance of RE2, and NDVI_RE2 is the NDVI based on RE2.
$$NDWI = \frac{\rho_{green} - \rho_{Nir}}{\rho_{green} + \rho_{Nir}}$$

where ρ_green and ρ_Nir are the reflectance values of the green and NIR bands, respectively.
The RBI [45] is computed from ρ_blue, ρ_green, ρ_red, and ρ_Nir, which are the reflectance values of the blue, green, red, and NIR bands, respectively. Furthermore, the NDSI equation is as follows:

$$NDSI = \frac{\rho_{red} - \rho_{blue}}{\rho_{red} + \rho_{blue}}$$

where ρ_red and ρ_blue are the reflectance values of the red and blue bands, respectively.
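The normalized-difference indices above share one arithmetic form, which the following minimal sketch makes explicit. The function names and the sample reflectances are hypothetical; the red-edge variants are assumed to substitute the red-edge reflectance for the NIR reflectance in the NDVI, and the RBI is omitted because its formula is not reproduced in this section.

```python
def normalized_difference(a, b):
    """Return (a - b) / (a + b); 0.0 is returned when both inputs are zero."""
    total = a + b
    return (a - b) / total if total != 0 else 0.0


def ndvi(nir, red):
    # Also usable for NDVI_RE1 / NDVI_RE2 by passing the red-edge
    # reflectance in place of nir (an assumption of this sketch).
    return normalized_difference(nir, red)


def ndwi(green, nir):
    return normalized_difference(green, nir)


# Hypothetical surface reflectances of a vegetated pixel.
print(ndvi(nir=0.45, red=0.08), ndwi(green=0.10, nir=0.45))
```

All such indices fall between −1 and +1 for non-negative reflectances, which keeps them on a comparable scale when they are combined as regression inputs.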

Evaluation Measures
Two measures were selected to evaluate the LST downscaling results, R² and RMSD, which were calculated as follows:

$$R^2 = 1 - \frac{\sum_{i=1}^{n} \left( \widehat{LST}_i - LST_i \right)^2}{\sum_{i=1}^{n} \left( LST_i - \overline{LST} \right)^2}$$

$$RMSD = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \left( \widehat{LST}_i - LST_i \right)^2}$$

where \widehat{LST}_i is the downscaled result, LST_i is the reference LST, \overline{LST} is the mean reference LST, and n is the total number of pixels in the image. R² is the coefficient of determination between the reference and downscaled images. RMSD was used to test the difference between the reference and downscaled LSTs. A high R² and a low RMSD indicate satisfactory downscaling. The Landsat-8- and GF-6-retrieved LSTs were used to evaluate the downscaling results resampled to 30 m.
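The evaluation measures can be computed as in the sketch below. It is a minimal illustration, assuming R² is taken as one minus the ratio of the residual sum of squares to the total sum of squares of the reference, and it adds the share of pixels with errors between −1 K and +1 K that is reported alongside R² and RMSD; the function names and sample values are hypothetical.

```python
from math import sqrt
from statistics import fmean


def rmsd(downscaled, reference):
    """Root mean square difference between downscaled and reference LST (K)."""
    return sqrt(fmean((d - r) ** 2 for d, r in zip(downscaled, reference)))


def r_squared(downscaled, reference):
    """Coefficient of determination, computed as 1 - SS_res / SS_tot."""
    mean_ref = fmean(reference)
    ss_res = sum((d - r) ** 2 for d, r in zip(downscaled, reference))
    ss_tot = sum((r - mean_ref) ** 2 for r in reference)
    return 1.0 - ss_res / ss_tot


def share_within(downscaled, reference, tol=1.0):
    """Percentage of pixels whose error lies between -tol and +tol (K)."""
    hits = sum(1 for d, r in zip(downscaled, reference) if abs(d - r) <= tol)
    return 100.0 * hits / len(reference)


# Hypothetical four-pixel example.
ref = [300.0, 302.0, 304.0, 306.0]
est = [301.0, 302.0, 303.0, 308.0]
print(rmsd(est, ref), r_squared(est, ref), share_within(est, ref))
```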

Downscaling Results
According to the field investigation and the classification of GF-6 images of the study area, region A (Figure 3), unlike other regions, contains all four types of underlying surfaces (vegetation, water bodies, impermeable surfaces, and barren soil) in a concentrated distribution; Figure 4 shows an enlarged view of region A to present the downscaling results more clearly. Figure 5 shows the underlying surface types classified from the GF-6 images using support vector machines (SVM), with classification accuracies of 92.7%, 92.5%, and 94.4%. Figure 6 shows the Landsat-8- and GF-6-retrieved LST of region A, where panels a, b, and c show the original 100 m LSTs for downscaling, which were retrieved from GF-6 and Landsat-8 images scaled up to 100 m, and panels d, e, and f show the 30 m LSTs used to evaluate the downscaling results, which were retrieved from GF-6 images resampled to 30 m and the 30 m Landsat-8 TIRS images. Figure 7 shows the 16 m downscaled LST, taking 26 August 2019 as an example. As shown in Figure 7, all nine downscaled results in the three seasons were consistent with the overall spatial patterns of the Landsat-8- and GF-6-retrieved LST, and their high- and low-temperature zones were generally consistent with those of the Landsat-8- and GF-6-retrieved LST. Based on the images and underlying surface types of this region (Figure 5), barren soils had the highest temperatures, followed by impermeable surfaces. Vegetation had significantly lower temperatures than barren soil and impermeable surfaces, while water bodies had the lowest temperatures. This is consistent with the regular pattern in which temperature increases in the order of water, vegetation, impervious surface, and barren soil. Comparing the DisTrad and TsHARP results, those based on NDVI_GF6_Nir and NDVI_RE2 were relatively similar to each other, but the LST in vegetated areas changed noticeably when the RE1 band was used.
Compared to the Landsat-8- and GF-6-retrieved LST, the DisTrad- and TsHARP-downscaled results provided no additional details of LST variations. The MIRF-downscaled results from the common bands and the RE1 and RE2 bands of GF-6 describe the detailed spatial variations in LST. The temperature variations of the MIRF-downscaled results were also much milder.
The spatial distribution of the differences between the Landsat-8- and GF-6-retrieved LST and the downscaled LST (resampled to 30 m) shows that the temperature in vegetated areas was overestimated by every downscaling method when the RE1 band was used, and the range of overestimation gradually decreased in the order of DisTrad, TsHARP, and MIRF, as shown in Figure 8 (taking 26 August 2019 as an example). In the MIRF results, the temperature of impervious surfaces around vegetation- and water-covered areas was higher, and the temperature of vegetated areas in the middle of areas covered by impervious surfaces and bare soil was lower. The MIRF-downscaled LST results also revealed roads between vegetated areas, impermeable surfaces around water bodies, greenified areas among built-up areas, and temperature changes between impermeable and barren soil surfaces. These small temperature variations were absent from the Landsat-8- and GF-6-retrieved LST. Furthermore, the LST variations obtained by MIRF downscaling with the contribution of GF-6 images were consistent with the natural LST variations.
Figures 9 and 10 and Table 4 show that the evaluation results for the three groups of images follow a consistent pattern. In a horizontal comparison between the downscaled results obtained with the same downscaling method but based on different NDVIs, downscaling with NDVI_RE2 yielded the highest R², the lowest RMSD values, and the largest number of pixels with residuals between −1 K and +1 K. Therefore, NDVI_RE2 provided the optimal downscaling precision. In the vertical comparison between the downscaled results with the same NDVI and different downscaling methods, the MIRF-downscaled results consistently provided the highest R², the lowest RMSD, and the largest number of pixels with residuals between −1 K and +1 K. Owing to the cost of ground temperature observation equipment, there are few observation stations in the study area; therefore, the observation data were used as a supplement to evaluate the downscaling results. Table 5 shows that the bias of the retrieved LST at the three stations was within 2.42 K. Comparing the three downscaling methods, the bias at the three stations was within 3.95 K for DisTrad, within 3.47 K for TsHARP, and within 1.96 K for MIRF. Comparing the downscaling results of the three NDVIs, the bias at the three stations was within 3.70 K for GF-6 NDVI, within 3.95 K for GF-6 NDVI_RE1, and within 3.04 K for GF-6 NDVI_RE2.
Comparing the downscaling results of the three seasons, the bias of the downscaling results at the three stations on 9 April, 24 July, and 26 August 2019 was within 2.95 K, 2.76 K, and 3.95 K, respectively. MIRF LST_RE2 improved the accuracy of LST at all stations.
Therefore, MIRF was considered to be the most precise downscaling method, and the optimal downscaling regression kernel was NDVI_RE2, which provided additional spatial details.

Effects of the RE1 and RE2 Bands on LST Downscaling
The MIRF-downscaled results were used to analyze the effects of the new RE1 and RE2 bands on LST downscaling. A scatter plot was drawn using downscaled LSTs based on the three different NDVIs (Figure 11), where it is evident that NDVI and LST have a "triangle-like" relationship that varies between the dry and wet edges; NDVI and LST are negatively correlated at the dry edge but positively correlated at the wet edge. Compared with NDVI_GF6_Nir and NDVI_RE2, the value of NDVI_RE1 is lower at the dry edge, resulting in a higher value of the corresponding LST, and higher at the wet edge, resulting in a lower value of the corresponding LST. The correlation coefficients of NDVI_GF6_Nir, NDVI_RE1, and NDVI_RE2 with respect to LST were −0.85, −0.77, and −0.87 on 9 April 2019; −0.93, −0.89, and −0.94 on 24 July 2019; and −0.89, −0.86, and −0.91 on 26 August 2019, respectively. Therefore, NDVI_RE2 correlated the most strongly (and negatively) with LST, and it is, thus, the most useful regression kernel for LST downscaling.
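The correlation coefficients compared above are taken here to be Pearson coefficients between per-pixel NDVI and LST, which is an assumption of this sketch. Under that assumption they can be computed as follows; the function name and sample values are hypothetical.

```python
from math import sqrt
from statistics import fmean


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = fmean(x), fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical pixels: LST falls as NDVI rises, so r is negative.
ndvi_values = [0.10, 0.25, 0.40, 0.60, 0.75]
lst_values = [312.0, 309.5, 305.0, 301.5, 297.0]
print(pearson_r(ndvi_values, lst_values))
```

A coefficient closer to −1 indicates a tighter negative linear relationship, which is the sense in which one NDVI variant is called a stronger regression kernel than another.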


Discussion
In this study, GF-6 and Landsat-8 images were used for downscaling experiments. The verification results indicate that LSTs downscaled using the MIRF method with a regression kernel based on the GF-6 RE2 band were generally accurate.
The validation of the LST downscaling experiment (Figure 7) showed that all the downscaled results preserved the overall spatial pattern of the Landsat-8- and GF-6-retrieved LST, and that the high- and low-temperature zones were generally consistent with natural patterns, as in the MIRF results. The DisTrad- and TsHARP-downscaled results provided no additional details regarding LST variations. The MIRF-downscaled results reflect the small-scale spatial distribution of LST, making it possible to detect temperature differences among vegetation-covered roads, impervious surfaces around water bodies, green areas within built-up areas, and impervious surfaces and bare soil. These details are absent in the Landsat-8- and GF-6-retrieved LST. The spatial distribution of the differences between the Landsat-8- and GF-6-retrieved LST and the downscaled LST (Figure 8) showed that every downscaling method overestimated the temperature in vegetated areas when the RE1 band was used, and that the extent of overestimation decreased in the order DisTrad, TsHARP, MIRF. The quantitative results (Figure 9 and Table 4) show that the downscaled results deviated from the Landsat-8- and GF-6-retrieved LST to different degrees. Among the NDVIs, NDVI RE2 yielded the highest R² and the lowest RMSD for every downscaling method. Among the methods, MIRF yielded the highest R² and the lowest RMSD for every NDVI. This suggests that downscaling with NDVI RE2 and the MIRF method had the highest accuracy and the smallest scaling effect.
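The three accuracy measures used in this comparison (R², RMSD, and the share of pixels with errors between −1 K and +1 K) can be computed as in the sketch below. It assumes that R² is the squared Pearson correlation between the reference and downscaled LST and that the ±1 K interval is inclusive; both are assumptions, and the sample values are hypothetical.

```python
import math

def rmsd(reference, estimate):
    """Root mean square difference between reference and estimated LST (K)."""
    n = len(reference)
    return math.sqrt(sum((e - r) ** 2 for r, e in zip(reference, estimate)) / n)

def r_squared(reference, estimate):
    """Squared Pearson correlation (assumed definition of R² in this sketch)."""
    n = len(reference)
    mr = sum(reference) / n
    me = sum(estimate) / n
    cov = sum((r - mr) * (e - me) for r, e in zip(reference, estimate))
    var_r = sum((r - mr) ** 2 for r in reference)
    var_e = sum((e - me) ** 2 for e in estimate)
    return cov * cov / (var_r * var_e)

def fraction_within(reference, estimate, tol=1.0):
    """Share of pixels whose absolute error is at most tol kelvin."""
    hits = sum(1 for r, e in zip(reference, estimate) if abs(e - r) <= tol)
    return hits / len(reference)

# Hypothetical toy values (not data from the study); errors are 0.5, -1.0, 2.0, 0.0 K.
reference_lst = [300.0, 302.0, 305.0, 298.0]
downscaled_lst = [300.5, 301.0, 307.0, 298.0]

error_rmsd = rmsd(reference_lst, downscaled_lst)
error_r2 = r_squared(reference_lst, downscaled_lst)
share_ok = fraction_within(reference_lst, downscaled_lst)
```

Reporting the three measures together is useful because R² captures agreement in spatial pattern, whereas RMSD and the ±1 K share capture the magnitude of the errors.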
As shown in Figure 10, the distribution of differences is left-skewed for every method using NDVI RE1. Therefore, the MIRF results for the three NDVI versions were analyzed further (Figure 11). Compared with NDVI GF6_Nir and NDVI RE2, NDVI RE1 is lower at the dry edge, which corresponds to a higher LST, and higher at the wet edge, which corresponds to a lower LST. This is related to the central wavelength of band RE1, which is 710 nm, and to the vegetation condition of the study area. The reflectance of vegetation rises steeply from the visible bands to the spectral region around 710 nm [46,47]. NDVI RE2 had the strongest negative correlation with LST and NDVI RE1 the weakest, which is consistent with the results shown in Figure 9. The spatial distribution of LST is correlated with the spatial distribution of the landscape features of the underlying surface. In this study, we introduced NDVI RE2 into three typical algorithms in place of NDVI GF6_Nir and obtained the most satisfactory downscaled LSTs with MIRF. Thus, the four remote sensing indices selected in this experiment can better express the surface features of the study area, which helps to improve the accuracy of LST downscaling.
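The role of the regression kernel can be illustrated with a simplified single-kernel scheme: a relationship between LST and the index is fitted at coarse resolution, applied to the fine-resolution index, and corrected with the coarse-scale residual. The sketch below uses a straight-line fit as a stand-in; it is not the implementation of DisTrad, TsHARP, or MIRF used in this study, and all names and values are hypothetical.

```python
def fit_line(x, y):
    """Ordinary least squares fit y ≈ a + b * x; returns (a, b)."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    b = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sum((xi - mx) ** 2 for xi in x)
    a = my - b * mx
    return a, b

def downscale(coarse_lst, coarse_index, fine_index, parent):
    """
    Downscale LST with one regression kernel.

    coarse_lst[i], coarse_index[i]: LST (K) and kernel value of coarse pixel i
    fine_index[j]: kernel value of fine pixel j
    parent[j]: position of the coarse pixel that contains fine pixel j
    """
    a, b = fit_line(coarse_index, coarse_lst)
    # Residual of the coarse-scale fit, added back to every fine pixel in that cell.
    residual = [t - (a + b * v) for t, v in zip(coarse_lst, coarse_index)]
    return [a + b * fine_index[j] + residual[parent[j]] for j in range(len(fine_index))]

# Hypothetical toy scene: three coarse pixels, each containing two fine pixels.
coarse_lst = [310.0, 305.0, 298.0]
coarse_index = [0.2, 0.4, 0.6]
fine_index = [0.1, 0.3, 0.3, 0.5, 0.5, 0.7]   # each pair averages to its coarse value
parent = [0, 0, 1, 1, 2, 2]

fine_lst = downscale(coarse_lst, coarse_index, fine_index, parent)
```

In this linear toy case the fine-resolution LSTs average back to the coarse LST of their parent pixel, so the residual term preserves the coarse-scale temperatures while the kernel distributes the sub-pixel variation. Swapping one NDVI for another changes only the `coarse_index` and `fine_index` inputs.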
We discussed the feasibility and accuracy of using GF-6 WFV images and statistical models to perform LST downscaling of the study area in this paper. High-spatial resolution LST contains more abundant textural information and can effectively reflect the surface temperature in terms of small-scale spatial heterogeneity. It would be helpful to further study the causes and characteristics of heat island effect in the study area, and the results can aid in agricultural irrigation, regional planning, drought assessment, ecological monitoring, water-heat balance research, and water resources allocation. Based on the analysis of the high-resolution spatial distribution of LST and its driving factors, a high-resolution geospatial resilience map can be drawn, which is critical for understanding the complex interactions between human (socioeconomic) and natural (ecological) systems. Evaluation of these systems is essential for determining sustainable development policies for the study area.
Errors in geometric correction between images from different sources lead to errors in the experimental results. In addition, further studies are needed on how other remote sensing indices constructed from GF-6 data perform as regression kernels for LST downscaling. Based on the 16 m LST obtained in this study, GF-6 PMS images can be used to conduct research at higher spatial resolutions. Further research is also needed to understand how the superior vegetation recognition ability of the GF-6 red-edge bands affects the downscaling results.

Conclusions
A preliminary study was conducted on the feasibility of using GF-6 images for LST downscaling in the Ebinur Lake Watershed. Landsat-8 and GF-6 WFV images obtained during the growing season (spring, summer, and autumn) were used as data sources, a downscaling model was constructed, and the selection of the regression kernel was explored. The following conclusions were obtained by comparative analysis:
(1) Compared with the Landsat-8- and GF-6-retrieved LST, the LST downscaled using NDVI RE2 as a single-factor regression kernel had the highest R², the lowest RMSD, and the largest number of pixels with LST errors between −1 K and +1 K. As NDVI RE2 was strongly and negatively correlated with the downscaled LSTs, it might be an excellent indicator of the spatial variations in LST and provide outstanding LST downscaling performance.
(2) The downscaling method based on multiple remote sensing indices is better than the single-factor method; the correlation between LST and NDVI is not obvious in highly heterogeneous areas, which causes large errors in the downscaling results of the single-factor method. The spatial patterns of the LSTs downscaled using NDVI RE2, RBI, NDSI, and NDWI as multiple remote sensing indices with the MIRF method were consistent with the Landsat-8- and GF-6-retrieved LST, and the accuracy of LST improved at all stations; hence, the downscaled LSTs provide additional spatial detail on LST variations that was absent in the Landsat-8- and GF-6-retrieved LSTs. Furthermore, the temperature gradations of the downscaled LSTs were smoother and more consistent with the natural variations in LST.
(3) The results of this study demonstrate the viability of downscaling LSTs based on GF-6 and Landsat-8 images. Furthermore, 16 m-resolution images were successfully used to improve the medium-resolution LST. The downscaling results also proved to be reliable and highly precise, and can meet the application requirements of LST spatial resolution in the study area.