Analysis of the Land Surface Temperature Scaling Problem : A Case Study of Airborne and Satellite Data over the Heihe Basin

This study analyzed the scaling problem of land surface temperature (LST) data retrieved with the Temperature Emissivity Separation (TES) algorithm. We compiled a remotely sensed dataset that included Thermal Airborne Hyperspectral Imager (TASI) and satellite-based Advanced Spaceborne Thermal Emission Reflection (ASTER) data, which were acquired simultaneously. This dataset provided the range of spatial heterogeneities of land surface necessary for the study, which was quantified by the dispersion variance. The LST scaling problem was studied by comparing the remotely sensed LST products in two ways. First, the LST products calculated in the distributed method and the lumped method were compared. Second, the airborne and satellite-based LST products derived from the TES algorithm were compared. Four upscaling methods of LST were used in the process. A scaling correction methodology was developed based on the comparisons. The results showed that the scaling effect could be as large as 0.8 K when the spatial resolution of the TASI LST data was coarse. The scaling effect increases quickly with the spatial resolution until it reaches the characteristic scale of the landscape and is positively correlated with the spatial heterogeneity. The first two upscaling methods denoted as Methods 1–2 can upscale the LST more effectively when compared with the other two scaling methods (Methods 3–4). The scaling effect for the ASTER data is not notable. The comparison between the TASI and OPEN ACCESS Remote Sens. 2015, 7 6490 ASTER data showed that they were highly consistent, with a root mean square error (RMSE) of approximately 0.88 K, when the pixels were relatively homogeneous. When the spatial heterogeneity was significant, the RMSE was as large as 2.68 K. The scaling correction methodology provided resolution-invariant results with scaling effects of less than 0.5 K.


Introduction
Because of the spatial heterogeneity of the land surface and the nonlinear relationship between remotely sensed radiance and biophysical parameters [1,2], the scaling problem causes biases in the computation of biophysical parameters such as the leaf area index (LAI), net primary productivity (NPP), and sensible and latent heat flux from moderate-resolution remote sensing data [3][4][5].Additionally, the scaling problem limits the usefulness of these biophysical parameters in climate models and validation campaign that require consistent and accurate biophysical parameters at different spatial and temporal scales [6].The scaling problem arises from (i) the inconsistencies between the retrieval model of the land surface's biophysical parameters, which are usually established and calibrated at small spatial scales and assume that the land surface is homogeneous, and the remotely sensed data, whose pixels often exhibit strong spatial variation [7][8][9][10]; and (ii) the discrepancies among the products derived from different remotely sensed data at different spatial resolutions, which are attributed to the heterogeneity of the land surface, the scaling approaches and the limited accuracy of the estimation [11].
Land surface temperature (LST) is a key parameter for the Earth's surface energy balance and is required for many applications, including agrometeorology, hydrology, climatology, ecology and environmental studies [12][13][14][15].The scaling problem of LST has attracted much interest due to the desire to (i) apply local-scale approaches to regional-scale data [5]; (ii) validate LST products at the regional scale by comparisons with products at the local scale [5,9] and (iii) provide the opportunity for data fusion studies to generate LST products with both high spatial and temporal resolutions [5,6].
Many studies have focused on LST scaling problems, which can be categorized into three groups.The first is the definition of LST at the pixel scale.Becker et al. [15] provided a theoretical definition of the radiometric temperature for heterogeneous non-isothermal surfaces in which the spatial scale was taken into consideration.Li et al. [14] recommended a definition of the radiometric temperature for nonisothermal surfaces that depends on the distribution of the LST and the emissivity within a pixel for deriving LST products from satellite data.The second group comprises the scaling characteristics of LST.Moran et al. [5] compared two LST products over a semiarid rangeland that were upscaled using two aggregation schemes.The first scheme calculated the LSTs at the pixel resolution and averaged them to a coarser resolution, and the second calculated the LSTs directly at the coarse resolution by aggregating the fine-resolution data to the coarse scale.The results showed that the error in aggregating the LST was negligible for a wide range of conditions.However, the incoming sky radiation was assumed to be zero, and the emissivity used in the Stefan-Boltzmann (S-B) law was estimated using a logarithmic relation between emissivity and the normalized difference vegetation index (NDVI), which was assumed to be the broad band emissivity (BBE).Moreover, simple averaging was used to upscale the LST product at a fine resolution, which might violate the energy balance equation [9,11].These assumptions might greatly affect the results.The third group is the comparison of different LST products at different spatial resolutions.Lakshmi et al. [16] compared the mean LSTs of a study site in the Little Washita Watershed in the Southern Great Plains Region of the United States that were derived from field measurements, aircraft observations and satellite observations at different spatial resolutions.The average temperature differences between scales were found to be 2 °C to 5 °C.Jacob et al. [17] compared the surface emissivity and radiometric temperature derived from data collected with the MODerate resolution Imaging Spectroradiometer (MODIS) and Advanced Spaceborne Thermal Emission Reflection (ASTER) sensors over semi-arid and savanna areas.The results showed that the MODIS and ASTER products were consistent and that the combined effects of the surface heterogeneity and spatial resolution were not critical for the study area.Liu et al. [11] proposed scaling approaches for LST over flat areas and areas of high relief to explore the combined uncertainties in scaling based on the S-B law by comparing the upscaled ASTER LST product and the MODIS LST product.The results indicated that the difference between the different satellite sensor LST products in this case study was greater than the difference caused by the spatial resolution and scaling approaches.However, in these comparisons, the LST products were derived from different algorithms.The inconsistency in the algorithms between the sensors made the comparisons difficult to interpret and introduced uncertainties when the data were resampled [6].Additionally, most of the comparisons were between LST products at coarse spatial resolutions in which the LST's spatial heterogeneity might be lost compared with the LSTs at fine resolutions (e.g., 3 m in our study) [18].The loss of heterogeneity may affect the study results.
Under the circumstances, we wonder whether there is a scaling problem in the estimation of LST using the Temperature Emissivity Separation (TES) algorithm; if the scaling problem is present in the LST estimation, what factors attribute to the scaling problem and how we can solve it.In the investigation of these problems, we compiled a dataset including Thermal Airborne Hyperspectral Imager (TASI) and satellite-based ASTER data that were acquired simultaneously during the Heihe Watershed Allied Telemetry Experimental Research (HiWATER) experiment.The LST scaling problem was studied by comparing the remotely sensed LST products in two ways.First, the LST products calculated at the pixel resolution and upscaled to coarser scales using four different scaling methods were compared with those calculated directly at the coarse resolution by aggregating the fine-resolution data to the coarse data.Second, the airborne and satellite-based LST products, which were both derived with the TES algorithm, were compared.These two methods are consistent with the two aspects of the LST scaling problem.A scaling correction methodology was developed based on these methods.

HiWATER Experiment
HiWATER is an ongoing watershed-scale eco-hydrological experiment designed from an interdisciplinary perspective to address problems that include heterogeneity, scaling, uncertainty and closing of the water cycle at the watershed scale.The experiment was performed in the Heihe River Basin, which is in the arid region of northwestern China (Gansu Province) between 97.1°E-102.0°Eand 37.7°N-42.7°N.Three Key Experimental Areas (KEAs) were selected to conduct intensive and long-term observations: the cold region experimental area in the upper reaches, the artificial oasis experimental area in the middle reaches, and the natural oasis experimental area in the downstream region [19,20].The airborne and satellite-based observations in this study were derived simultaneously from the KEA of the middle stream at approximately 12:00 am (local time) on 10 July 2012.

Study Regions
Four main types of land cover are present in the middle-stream KEA: cropland, village, Gobi and river.The cropland accounts for more than half of the TASI data used in this study and consists of large areas of corn and some wheat.The village is composed of houses and small roads.The Gobi land cover consists of small gravel, bare soil and small areas of Alhagi sparsifolia Shap., which is a spiny, perennial subshrub that grows mainly in saline and arid regions in the native ranges of northwestern China, Central Asia, India, and the Middle and Near East [19].A small branch of the Heihe River is present in the Gobi land.Within these four land surface types, five study areas (1260 m × 2880 m) were selected, including two Gobi areas, two areas of cropland mixed with villages, and one Gobi area that contained the river.Ground photographs of the four land cover types and the geographical locations of these study regions are shown in Figure 1.
The land cover types in these regions were derived from a dataset, which is composed of the classification results of HJ-1/CCD data on a monthly basis (Table 1) [21,22].The classification data in July, 2012 was selected.Based on the validation data from the ground measurements and high-resolution data, the classification accuracy is 93.06%, and the Kappa coefficient is 0.92.The main land surface types, average LST values and their standard deviations are summarized in Table 2.The different standard deviations of these five study areas indicate that the spatial heterogeneity increases from the Gobi areas to the cropland areas to the mixed Gobi and river area.These differences provide the range of spatial heterogeneity that is necessary for this study.

Experimental Data Preprocessing
TASI airborne data and ASTER satellite data collected during the HiWATER experiment were used in this study.The data were acquired at nearly the same time; thus, the temporal difference between the datasets was neglected.The image registration between the TASI and ASTER data was performed using the georegistration information in the images, which can reach an accuracy of 1.5 m.

Description and Preprocessing of ASTER Data
ASTER was launched on NASA's Terra satellite in December 1999.The thermal infrared (TIR) sensor of ASTER is a five-band nadir viewing scanner (±3°) that is pointable to ±8.5° and has a 90 m spatial resolution and a 63 km wide swath [6,17].The overpass time of the ASTER data used in this study is 00:13 pm (local time) on 10 July 2012.
The ASTER L1B products were used to estimate the LST from the ASTER data using the TES algorithm [23].The atmospheric correction was performed using the radiative transfer model of MODTRAN with input atmospheric profiles generated by the National Center for Environmental Prediction (NCEP).The waveband integrated values of three atmospheric radiative property parameters, including transmittance, upwelling and downwelling radiance, were acquired from the MODTRAN calculations.The Water Vapor Scaling (WVS) algorithm [24] was utilized in the correction process to improve the performance of the atmospheric correction.The surface outgoing radiance was then derived from the at-sensor radiance using the radiative transfer function: where  is the band,  , is the surface outgoing radiance,  , is the observed radiance,  , ↑ is the upwelling atmospheric radiance,  , is the transmissivity, ε  is the surface emissivity,   (  ) is the Planck radiance,   is the LST, and  , ↓ is the downwelling atmospheric radiance.

Description and Preprocessing of TASI Data
TASI is a new generation of commercial hyperspectral infrared sensor that was released by ITRES of Canada in 2006.TASI is designed to detect land surface temperature and emissivity characteristics and has 32 bands in the TIR region (8-11.5 μm) of the electromagnetic spectrum, a wavelength spacing of 0.1095 μm, a full width at half maximum (FWHM) of 0.0548 μm, and a total field of view of 40° at a spatial resolution of 3 m [25].
To be consistent with the acquisition time of the ASTER data, the slice of TASI data began at 00:14 pm (local time) on 10 July 2012, and ended at 00:22 pm (local time).The flight duration was 8 min, during which the LST was assumed to be stable.In this case, the temporal effect was negligible.
The preprocessing of the TASI data included three main steps: radiometric calibration, geometric correction and atmospheric correction.The first two steps were completed by the software provided by ITRES.In this study, the In-Scene Atmospheric Compensation (ISAC) algorithm was used to perform the atmospheric correction [26].
Based on the TES algorithm for the ASTER data, a modified TES algorithm for the TASI data was proposed to derive the LST and spectral emissivities.The inversion result was validated by a root mean square error (RMSE) of approximately 1.3 K for temperature.The emissivity spectrum was very similar to that of the field measurements from a Bomem Fourier transform infrared (FTIR) spectrometer [25].

Quantification of the Spatial Heterogeneity of LST
The spatial heterogeneity of LST should be quantified to analyze the scaling problem [8,11].In this study, the dispersion variance was used as a quantifiable parameter to describe the LST's spatial heterogeneity by integrating the fitted variogram over a spatial scale.The calculation process followed two steps.First, a variogram describing the correlation between neighboring sites as a function of the spatial separation [18] was fitted to the LST data using the semi-automatic fitting software Isatis in a probabilistic framework.The dispersion variance was then calculated as follows: where (, ) is the dispersion variance; (| − |) is the fitted variogram; | − | represents the distance between two LST points, which are denoted as  and  in domain ; and || represents the area of .
A variogram was fitted to the estimated LST in each study regions.Because of the loss in the spatial heterogeneity of the data at coarse resolution [18], the variograms in the five regions were fitted with the TASI LST data rather than with the ASTER LST data.

Methods of Studying the LST Scaling Problem
The two aspects of the LST scaling problem were studied by comparing the remotely sensed LST products in two ways.First, the LST products calculated at the pixel resolution and upscaled to coarser scale using upscaling methods were compared with the products calculated at a coarse resolution by aggregating the fine-resolution data to the coarse data.Second, the TASI and ASTER LST products, both derived from the TES algorithm, were compared.The two methods of comparison were used to investigate the two aspects of the LST scaling problem.

Comparison between Distributed LST and Lumped LST
Figure 2 shows a schematic of the first comparison.Two methods were developed to acquire the LST products at large scale with the same sensor: the distributed method and the lumped method.In the distributed method, the LST product was calculated at the pixel resolution by the TES algorithm and upscaled to a coarse scale.In the lumped method, the radiance data were first aggregated to the coarse scale based on the law of energy conservation, and the LST was then calculated with the coarse data using the TES algorithm.Four different methods were used in the LST upscaling process [11].Because the study areas are very flat, the relief effect was not considered in the four upscaling methods.Combined with the S-B law and irradiance average theory, the large-scale LST was computed from the small-scale LST as follows: where the subscript  denotes pixel ; ε  and   are the BBE and the LST at the small scale, respectively;   is the areal ratio of the small-scale grid spacing i within the large-scale spacing, it equals 1 for the small-scale grid spacing within the large-scale spacing; and  is the BBE for the large-scale pixel.The narrow band emissivities were estimated using the TES algorithm.The BBE for the ASTER data was estimated from the narrow band emissivities based on the linear equation provided by Cheng et al. [27].
For the BBE of the TASI data, the emissivity spectrum was first transformed to the five TIR bands of ASTER with the spectral response function of ASTER, and the BBE was then calculated using the same equation as for the ASTER BBE.Based on Zhang et al. [28], ε was derived from ε  with the areally weighted averaging approach.The large-scale LST is determined by the LST and the emissivity at the local scale, as well as the emissivity at the large scale.When the emissivity is neglected, Equation (3) becomes where the LST at the large scale is only affected by the fine-resolution LST and their areal ratio.If the exponent is eliminated, Equation (3) becomes If the surface emissivity is neglected, it becomes The upscaling methods described by Equations (3-6) are denoted as Method 1, Method 2, Method 3, and Method 4, respectively.Correspondingly, the distributed LSTs upscaled by the equations are denoted as  1  ,  2   3  and  4  , respectively.
A rectified method of LST estimation was developed to acquire the effective LST at large scales based on the TES algorithm.The emissivity used to retrieve the LST at large scales was estimated by aggregating the emissivity derived from the original data by the TES algorithm instead of calculating it with the TES algorithm at coarse resolutions [28].The LST was then calculated in the same way as the TES algorithm with the aggregated emissivity.This rectified LST is virtually the ensemble radiometric temperature calculated with the ensemble r-emissivity, which was defined by Norman et al. [29].In this case, the rectified LST can be viewed as the effective LST at large scales.
The difference between the lumped LST and the distributed LST is defined as the scaling effect, which is expressed as where   is the scaling effect,   is the lumped LST, and   is the distributed LST.To measure the scaling effect at a given spatial resolution for a specific study site, the mean scaling effect is introduced as where N is the total number of large-scale pixels in the study area; and    and    are the ith LST pixels in the study area derived from the lumped method and the distributed method, respectively .

Comparison between the TASI and ASTER LSTs
In this study, the TASI and ASTER LSTs were both retrieved with the TES algorithm to eliminate the inconsistency of the retrieval algorithm.To analyze the factors that account for the scaling uncertainties, the TASI and ASTER LSTs were compared in the study regions.

Correction Methodology for the LST Scaling Effect
In this study, we assumed the distributed LST to be the accurate effective temperature on a large scale.To ensure consistency between the LSTs estimated from the distributed and lumped methods, corrections should be performed on the lumped LST.The empirical relationship between the emissivity and the spectral contrast in the TES algorithm may cause biases in the estimated LST [17].Based on the approach developed by Hu and Islam [2], we proposed a scaling error correction model to correct this relationship where   is the minimum emissivity in the emissivity spectrum; ,  and  are the three regression coefficients; and  denotes the maximum-minimum emissivity difference among the normalized emissivity spectrum and represents the spectral contrast.By assuming that (i) the  values vary slowly, (ii) the relationship is continuous and weakly nonlinear, and (iii) continuous derivatives of the relationship exist at least up to the second order, the relationship was corrected by a second-order Taylor series expansion as follows: =  − .  +  ′′ (). 2 /2 (10) where  ′′ () is the second derivative of the original empirical relationship (Equation ( 9)) and  2 is the local variance of the .The  at large scale was calculated by inputting the aggregated radiance into the TES algorithm, of which the scaling effect was not considered.The lumped LST was then retrieved using this corrected TES algorithm.

Analysis of the TASI LST Data
In the analysis, the distributed LST is viewed as the effective LST at large scales.The rationality of this assumption will be demonstrated in the subsequent experiment.Four upscaling methods were used to estimate the distributed LST; as a result, four different graphs of the scaling effects that were calculated with Equation (8) were acquired (Figure 3).The scaling effects of Methods 1 and 2 are similar, and the scaling effects of Methods 3 and 4 are similar.Thus, the following analyses are divided into two groups (Methods 1-2 vs. Methods 3-4).The purpose of the comparison is to investigate the first aspect of the LST scaling problem.The key to the TES algorithm is the empirical relationship between the emissivity values and the spectral contrast [23].The laboratory samples used to calibrate the TES's empirical relationship are homogeneous.When the spatial resolutions of the remotely sensed data are fine (e.g., 3 m), the TES algorithm can be applied successfully.When the data are coarse, the significant differences between the laboratory measurements and the coarse data may induce significant errors in the LST estimation [17].
In Method 1 and Method 2, the scaling effects initially increase with the spatial resolution and then become asymptotic.This behavior can be explained by the saturation of heterogeneity.An investigation of spatial heterogeneity [18,30] showed that the heterogeneity increases as the spatial resolution becomes larger until the scale approaches the characteristic scale of the landscape.The uncertainty of the empirical relationship in the TES algorithm is affected by the spatial heterogeneity.In this case, the scaling effect is closely related to the spatial heterogeneity; thus, it becomes saturated when the spatial resolution reaches the characteristic scale.Three types of study areas with different characteristic scales were selected in this study.Correspondingly, the scaling effect becomes asymptotic at these distinctive spatial scales.The spatial heterogeneity increases from Gobi land to cropland to the mixed area.Under these circumstances, the scaling effects in the Gobi sites are the smallest at all scales, where they have a maximum of 0.7 K, and they are greater in cropland, where they reach a maximum of 0.75 K.The scaling effect at Site 5 is less than at the cropland sites when the spatial resolution is less than approximately 110 m, but it exceeds the scaling effect at the cropland sites when the spatial resolution is larger.This pattern occurs because Site 5 is a mixed area of Gobi, cropland and river.When the spatial resolution is fine, the number of mixed pixels in the landscape is small and the mean scaling effect is small; when the spatial resolution is coarse, mixed pixels with different land cover types make up a larger proportion of pixels; thus, the mean scaling effect increases because of the greater heterogeneity.In general, the rules of change for the scaling effect in Method 1 and Method 2 are nearly identical.In Methods 3 and 4, the relationships between the spatial resolution and the scaling effect at Site 1 and Site 2 are similar to those in Methods 1 and 2. The relationships are different for Sites 3-5, especially at large scales.The difference is inferred to be caused by the upscaling methods.In this study, the upscaled LST is considered to be the accurate LST; its accuracy determines the subsequent analysis of the scaling effect.In contrast to Methods 1 and 2, Methods 3 and 4 neglect the exponent in the upscaling process, which may cause errors in the upscaled LST when the land surface heterogeneity is significant.The land surfaces at Sites 1 and 2 are relatively homogeneous compared with the other sites; thus, neglecting the exponent in the upscaling process does not significantly affect the accuracy of the upscaled LST.The land surfaces at the other three sites are heterogeneous, and neglecting the exponent will cause a large upscaling bias.The smaller scaling effects at Sites 3-5 for Methods 3 and 4 are notable.This difference is attributed to the underestimated distributed LST values with Methods 3 and 4 in these areas compared to Methods 1 and 2 (Table 3).The means at Sites 3-5 are lower by approximately 0.15 K when they are upscaled by Methods 3 and 4 than when they are upscaled by Methods 1 and 2. Because the lumped LST values are generally smaller than the distributed LST values (Figure 4), the difference between the distributed and lumped LSTs decreases when the distributed LST is underestimated, which accounts for the decreasing scaling effect.
The LST estimated using the rectified method and the distributed LST were compared to ensure that the distributed LST is the effective LST at large scales.The comparison result is shown in Figure 5.The distributed LSTs estimated from Methods 1 and 2 are nearly the same as the rectified LST, which proves the assumption described at the beginning of this section that the distributed LST should be viewed as the effective LST rather than the lumped LST.The biases in Method 1 are slightly smaller than those in Method 2. In Methods 3 and 4, the biases are also negligible at Sites 1 and 2, whereas the biases at Sites 3-5 are significant, especially when the spatial scale is greater than 100 m.This result corresponds to the inference that the distributed LST values with Methods 1 and 2 are more effective than those with Methods 3 and 4, especially when the spatial heterogeneity is significant.Additionally, this result ensures that the scaling effect in the TES algorithm is mainly attributed to the empirical relationship between the emissivity values and the spectral contrast.
The analysis presented above demonstrates that the TES algorithm will cause a bias in the retrieved LST at large scales because of the inconsistency between the laboratory measurements and the large-scale data, which directly affects the estimated emissivity.Methods 1 and 2 can upscale the LST more effectively than Methods 3 and 4.
Table 3. Statistics of the distributed TASI LST (300 m) from four different upscaling methods in terms of the minimum, maximum and mean LST values at the five sites.Methods 1-4 are labeled as M1-M4, respectively.

LST at Site 1 (K)
LST at Site Figure 4 shows a pixel-by-pixel comparison between the distributed LST upscaled using Method 1 and the lumped LST for the five study areas at a spatial resolution of 300 m.The lumped LST is underestimated in all of the areas.The scaling effects and RMSEs at Sites 1-4 are similar (approximately 0.7 K), which is also reflected in Figure 3 (at a resolution of 300 m in Figure 3a).The similarity is attributed to the saturated scaling effect at the spatial scale of 300 m.The scaling effect and RMSE at Site 5 are obviously larger than those at the other sites (approximately 0.8 K) because of the large heterogeneity in the mixed area.
Scatter plots between the spatial heterogeneity and the scaling effect for the five study sites are shown in Figure 6.The scaling effect is proportional to the intra-pixel heterogeneity, which is quantified by the dispersion variance (, ), at all the sites.As the dispersion variance increases, the scaling effect becomes more notable.In conclusion, the scaling effect is largest in the mixed area of Site 5, which is the most heterogeneous site at the landscape level, and is moderate at the cropland sites and smallest at the most homogeneous Gobi sites.The scaling effect increases rapidly with the spatial resolution until the resolution reaches the characteristic scale of the landscape, and the scaling has a positive correlation with the spatial heterogeneity, which is quantified by the dispersion variance.

Analysis of the ASTER LST Data
The ASTER LST scaling effect is described in Figure 7.Because the results from upscaling Methods 1 and 2 are nearly the same, only the results from Method 1 are shown.The scaling effect is not significant for the ASTER data.The largest scaling error at all scales at all of the study sites is only 0.06 K, and the range of the scaling effect is 0.003-0.06K, which can nearly be neglected.The scaling effect does not increase with the spatial scale; it remains nearly constant for all spatial scales.This quasi-constant scaling effect can be attributed to the significant spatial heterogeneity in the 90-m ASTER pixels.Because of the heterogeneity in the ASTER data, there is an inconsistency between the laboratory measurements and the satellite data that causes a bias in the empirical relationship when it is used to estimate the emissivity at a spatial resolution of 90 m.The bias propagates to the estimated LST.The bias is still present when the TES algorithm is used at larger scales and it does not increase significantly.Thus, the biases in the distributed and lumped ASTER LSTs cancel each other out in the calculation of the scaling effect.
It should be noted that our purpose is not to prove the ineffectiveness of the TES algorithm.When the aim is to retrieve an LST product with an accuracy of 1 K or higher, the scaling effect in the TES algorithm should be considered.

Analysis of the Comparison between the ASTER and TASI LSTs
To investigate the relationship between the ASTER LST data and the TASI LST data, the TASI LST data were upscaled to the spatial resolution of the ASTER LST data.The RMSEs between the ASTER LST and the distributed TASI LST derived by Methods 1 and 2 are shown in Table 4.Because of the upscaling biases in Methods 3 and 4, these methods were not used to decrease the uncertainty caused by the upscaling method.The RMSE values in Method 1 and Method 2 were nearly the same in all five study regions.The differences between the ASTER and TASI LSTs are closely related to the land cover types.In the sites covered by Gobi, the RMSEs were both approximately 0.9 K.In the sites covered by cropland, the RMSEs were slightly greater than 1.9 K.In the mixed site, the RMSE reached 2.68 K.The comparisons between the ASTER and TASI LSTs over different land covers are shown in Figure 8.Most of the points cluster on the 1:1 line, which indicates a good consistency between the ASTER and TASI LST.The LST points in the Gobi areas cluster between 319 K and 322 K on the 1:1 line because of the high temperatures at the Gobi sites.Numerous points cluster on the 1:1 line between 300 K and 305 K because of the relatively lower temperatures in the cropland sites.The mixed area is the most heterogeneous; accordingly, the points cluster in a larger range between 315 K and 322 K on the 1:1 line.The points in Figure 8a are the most clustered because of the homogeneity in the Gobi regions, whereas the points are the most dispersed in Figure 8c because of the significant heterogeneity of the mixed site.The inconsistency between the ASTER and TASI LSTs can be attributed to the land surface heterogeneity and the limited accuracy in the retrieved LST products.The more significant the spatial heterogeneity of the study site, the greater the RMSE; this pattern reflects the close relationship between the spatial heterogeneity and the inconsistency.The TES algorithm was used to estimate the LSTs for both the TASI and the ASTER data.However, the accuracy of the TES algorithm is affected by the atmospheric correction errors of the images, especially over gray bodies [31].The WVS algorithm was used to refine the atmospheric correction for the ASTER data, whereas the ISAC algorithm was used to correct the atmospheric effect.The two different atmospheric corrections can greatly affect the accuracy of the products.Additionally, the different spatial scales of the ASTER and TASI data can cause different uncertainties in the empirical relationship between the emissivity values and the spectral contrast, which affects the retrieved LST.

Correcting the Scaling Effect
Because of the significant scaling effect of the TASI LST data, the scaling effect was corrected with a scale-transformation model based on a Taylor expansion (Figure 9).The scaling effect after the scaling correction decrease in the five study regions and are always less than 0.5 K.The scaling effects in the more heterogeneous sites are still greater than in the relatively homogeneous sites after the correction.Additionally, the correction provided resolution-invariant results for the scaling effect.
The correction results for Site 1, Site 3, Site 4 and Site 5 at a spatial scale of 300 m are shown in Figure 10.Because of the similar pattern of scaling effect at Site 2 with that at Site 1, the result at Site 2 is not shown.The distributed LSTs were upscaled using Method 1.The proportion of pixels with a large scaling effect clearly decreases, whereas the proportion of pixels with a slight scaling effect increases.The peaks move from large scaling effect values to smaller values.The peaks move from −0.7 K to −0.3 K and from −0.7 K to −0.4 K for Site 1 and Site 2, from −0.7 K to −0.3 K and from −0.8 K to −0.4 K for Site 3 and Site 4. For Site 5, the peak moves from −1.0 K to −0.5 K.

Conclusions
The goal of this study was to investigate the scaling problem of LST.Five areas were selected for study using simultaneous multi-sensor observations from the HiWATER experiment.Sites 1 and 2 are Gobi areas, Sites 3 and 4 are cropland areas, and Site 5 is a mixed area of Gobi and water.The LST's spatial heterogeneity increases from the Gobi areas to the cropland areas to the mixed area.The multi-sensor data included airborne TASI data and satellite ASTER data.The data were acquired at nearly the same time; thus, the temporal change of the LST data was not taken into consideration.The LSTs for the ASTER and TASI data were both retrieved using the TES algorithm.To describe the spatial heterogeneity of the LSTs at the landscape level in the study areas, the dispersion variance was estimated based on variogram modeling.
The LST data were upscaled to coarser resolutions according to four different upscaling methods based on the S-B law and were compared to investigate the relationship between the spatial resolution and the scaling effect.Methods 1 and 2 provide similar results, and Methods 3 and 4 provide similar results.Additionally, the nearly identical results of the upscaled TASI LSTs from the first two methods and the rectified TASI LSTs indicate that Methods 1 and 2 can accurately upscale fine-resolution LST data, which is because of the conservation of the exponent relationship in the S-B law in Methods 1 and 2.
The scaling effect of the TASI LST data is clear and could be as large as 0.8 K.This effect increases with the spatial resolution until the spatial resolution reaches the characteristic scale of the study area.The scaling effect is greatest in the mixed area of Site 5, which is the most heterogeneous site at the landscape level, and is moderate for cropland sites and smallest for the most homogeneous Gobi sites.The scaling effect of the ASTER LST data is not significant and had a maximum of 0.06 K.This finding may be caused by the significant heterogeneity in the original ASTER pixels.The comparison between the TASI data and the ASTER data showed that they were highly consistent, with a RMSE of approximately 0.88 K, when the pixels were relatively homogeneous.When the spatial heterogeneity was significant, the RMSE could be as large as 2.68 K.
A scale-correction model based on a Taylor expansion was developed based on the analysis of scaling effect.In this model, the scaling effect caused by the nonlinear relationship between the MMD and the radiance was not considered.The scaling-correction methodology showed that the scaling effect was resolution invariant and was always less than 0.5 K, which demonstrates the effectiveness of the scalingcorrection methodology.
Because of the limited spatial range of the data, the conclusions in this study may not be generalized; this would require a larger database and a more detailed analysis.However, this study provides useful insights into the scaling problem of LST.

Figure 1 .
Figure 1.Spatial distribution and ground photographs of the five study sites.The upper right image is the ASTER L1 B VNIR image of the study area taken on 10 July 2012.The RGB components are channels 3 (0.81 μm), 2 (0.66 μm) and 1 (0.56 μm), and the spatial resolution is 15 m.The land cover types include (a) cropland, (b) Gobi, (c) river and (d) village.

Figure 2 .
Figure 2. Schematic of the comparison between the distributed and lumped LSTs.The same inversion model is used in the distributed method and the lumped method.

Figure 3 .
Figure 3. Relationship between the spatial resolution and the LST scaling effect.The distributed LSTs are acquired from (a) Method 1; (b) Method 2; (c) Method 3 and (d) Method 4.

Figure 7 .
Figure 7.The relationship between the spatial resolution and the LST scaling effect for the ASTER LST data in the five study areas.The distributed LST was derived from Method 1.

Figure 8 .
Figure 8.Comparison between the ASTER LST data and the TASI LST data in (a) Gobi areas; (b) cropland areas and (c) the mixed area.The TASI LSTs were upscaled using Method 1.The sizes of the dots in the scatterplots correspond to their density in the swarm of points.The larger dots are given hotter colors in the dense particle region.

Figure 9 .Figure 10 .
Figure 9. Relationship between the spatial resolution and the LST scaling effect after the scaling correction.The distributed LSTs were upscaled using Method 1.

Table 1 .
Percentages of the four land cover types in the five study regions.These results are derived from the classification of HJ-1/CCD data.The Gobi and bare soil in the cropland are not distinguished in this classification scheme.

Table 2 .
The main types of land surface cover and the mean values and standard deviations of LST in the five study areas.mLST and σLST represent the mean and standard deviation, respectively, and were calculated from both the ASTER and TASI LST images.