Evaluation of TsHARP Utility for Thermal Sharpening of Sentinel-3 Satellite Images Using Sentinel-2 Visual Imagery

: A spatially distributed land surface temperature is important for many studies. The recent launch of the Sentinel satellite programs paves the way for an abundance of opportunities for both large area and long-term investigations. However, the spatial resolution of Sentinel-3 thermal images is not suitable for monitoring small fragmented ﬁelds. Thermal sharpening is one of the primary methods used to obtain thermal images at ﬁner spatial resolution at a daily revisit time. In the current study, the utility of the TsHARP method to sharpen the low resolution of Sentinel-3 thermal data was examined using Sentinel-2 visible-near infrared imagery. Compared to Landsat 8 ﬁne thermal images, the sharpening resulted in mean absolute errors of ~1 ◦ C, with errors increasing as the di ﬀ erence between the native and the target resolutions increases. Part of the error is attributed to the discrepancy between the thermal images acquired by the two platforms. Further research is due to test additional sites and conditions, and potentially additional sharpening methods, applied to the Sentinel platforms.


Introduction
The land surface temperature (LST) at both high spatial and temporal resolutions is a key variable in many environmental studies, including vegetation monitoring, moisture estimation [1,2], drought assessment, evapotranspiration mapping [3][4][5], land-cover classification [6], and surface energy balance quantification [7]. However, until recently, satellite-derived LST has been provided at either a high temporal resolution (1 day) but low spatial resolution (1 km), e.g., National Oceanic and Atmospheric Administration-Advanced Very High Resolution Radiometer (NOAA-AVHRR) and Moderate Resolution Imaging Spectrometer (MODIS)-Terra/Aqua, or at a higher spatial resolution (60-120 m) but a low revisit time of 16 days, e.g., Landsat Thematic Mapper (TM) -Enhanced Thematic Mapper Plus (ETM+)/ Operational Land Imager (OLI).
To bridge this gap, attempts have been made to downscale frequently acquired low spatial resolution thermal images in order to assess LST at a high spatial resolution, and further use the data for the above-mentioned applications. The downscaling methods can be roughly classified into using Sentinel-2 NVIR data has recently been published [32]. A methodology to derive land-surface energy fluxes using the DMS approach was tested in agricultural and forest sites located in temperate zones. The results of the study indicated an improvement in derived energy fluxes from sharpened LST in comparison with low-resolution fluxes. However, the fluxes were still less accurate than those derived from high-resolution LST. The overarching goal of the current project was to examine the utility of TsHARP to sharpen the Sentinel-3 thermal data using Sentinel-2 VNIR imagery.

TsHARP Method
The TsHARP algorithm is based on the linear correlation between LST and fractional vegetation cover (fc). A detailed description of the TsHARP procedure is given in Agam et al. [17]. An empirical linear regression model is fitted to the LST and f c data at the native coarse resolution of the LST image after aggregating the fine (target) resolution f c image to match the coarse resolution of the LST image (LST coarse and f c coarse , Equation (1)).
f c = 1 − NDVI max − NDVI i NDVI max − NDVI min 0.625 (2) where NDVI i is the index value in a specific pixel while NDVI max and NDVI min are the maximum and minimum values from the scene, respectively [33].
Assuming that the correlation between LST and f c is scale-independent, the resulting regression coefficients (a and b) obtained at the low resolution are applied to the fine spatial resolution vegetation index together with the fine resolution f c data (f c fine ) to predict the regression-based LST at the target fine resolution (LST fine , Equation (3)). Lastly, the residual error (i.e., the difference between the native, LST native , and the estimated, LST coarse , temperatures) for the corresponding coarse resolution pixel is added to the predicted temperature to increase the prediction accuracy (Equation (4)).
Stage 1-sensor comparison and inter-calibration-before using Landsat-8 to validate the sharpening product, the comparability between the Sentinel-3 and the Landsat-8 thermal images had to be evaluated. To this end, six pairs of Sentinel-3 vs. Landsat-8 thermal images, acquired within a maximum of a 15-minute time difference, were compared. To minimize potential additional sources of error due to different atmospheric corrections between the two sensors [34], the comparison, as well as the sharpening, was applied on top-of-atmosphere brightness temperature (BT) images, which has been previously done (e.g., [19,25,35]). The six BT image-pairs were compared at the coarse resolution of 960 m (see more in Section 2.4 hereafter). For the following analyses, two of the six pairs of images, for which a coinciding same-day Sentinel-2 overpass was available (Table 1), were chosen. For these two pairs, the linear regression coefficients resulting from regressing the Sentinel-3 BT values against the Landsat-8 BT values were used to calibrate the images. The calibrated Landsat-8 BT images were used in Stage 3 for validation (see below).

Methodological Workflow
Testing the accuracy of TsHARP for sharpening Sentinel-3 thermal images using Sentinel-2 VNIR images was assessed by comparing the sharpened thermal images to Landsat-8 thermal images that were acquired at a fine resolution. This was applied in three stages ( Figure 1).

Study Sites
Six areas representing different climatic conditions and land-cover types were selected (Table 1 and Figure  2).
-Site 1 -Campo Grande, Brazil A relatively homogeneous landscape characterized by extensive savanna formations crossed by forests and stream valleys. The altitude of the site ranges from 298 to 764 m. The area includes irrigated and rainfed wheat, corn, barley, rice, and soybeans fields, orchards, natural sand banks, savannahs, open pastures, and bushes. The climate is classified as subtropical and tropical with an annual mean temperature of 24 °C and annual mean precipitation of 1471 mm [36].
-Site 2 -Podebrady, Czech Republic Lying in a temperate climate zone with annual precipitation of about 660 mm, this site is characterized by mild and humid summers with an average annual temperature of 8.8 °C. The altitude varies from 146 to 1,385 m. This site contains mixed evergreen and broadleaves forests, urban areas, croplands, pastures, orchards, wetlands, and water bodies. Most agricultural fields are planted with winter wheat, barley, sugar beet, rapeseed, and maize.
-Site 3 -Iowa-Nebraska border near Omaha, Nebraska, USA A relatively flat plain with an average altitude of 310 m above mean sea level (a.m.s.l) dominated by croplands, mainly corn and soybean fields (93%), with a minority of sorghum, oats, and wheat. Small areas of forest, grassland, riparian vegetation, and urban regions occur within the scene. The area is characterized by a humid continental climate, with an annual precipitation rate of 700 mm and an annual temperature of 10.5 °C.

Study Sites
Six areas representing different climatic conditions and land-cover types were selected (Table 1 and Figure 2). -Site 1-Campo Grande, Brazil A relatively homogeneous landscape characterized by extensive savanna formations crossed by forests and stream valleys. The altitude of the site ranges from 298 to 764 m. The area includes irrigated and rainfed wheat, corn, barley, rice, and soybeans fields, orchards, natural sand banks, savannahs, open pastures, and bushes. The climate is classified as subtropical and tropical with an annual mean temperature of 24 • C and annual mean precipitation of 1471 mm [36].
-Site 2-Podebrady, Czech Republic Lying in a temperate climate zone with annual precipitation of about 660 mm, this site is characterized by mild and humid summers with an average annual temperature of 8.8 • C. The altitude varies from 146 to 1,385 m. This site contains mixed evergreen and broadleaves forests, urban areas, croplands, pastures, orchards, wetlands, and water bodies. Most agricultural fields are planted with winter wheat, barley, sugar beet, rapeseed, and maize.
-Site 3-Iowa-Nebraska border near Omaha, Nebraska, USA A relatively flat plain with an average altitude of 310 m above mean sea level (a.m.s.l) dominated by croplands, mainly corn and soybean fields (93%), with a minority of sorghum, oats, and wheat. Small areas of forest, grassland, riparian vegetation, and urban regions occur within the scene. The area is characterized by a humid continental climate, with an annual precipitation rate of 700 mm and an annual temperature of 10.5 • C.
-Sites 4 and 5 are located near Sacramento, California, USA These areas are predominantly agricultural landscapes with mainly irrigated rice fields. Other common crops are grapes, citrus, almonds, and lettuce. A small fraction of the area comprises grasslands, hay fields, and cattle ranches. The climate in the region is Mediterranean with average annual high and low temperatures of 24.2 • C and 9.2 • C, respectively, and average precipitation of 485 mm that falls predominantly during the winter season (November-March). The altitude reaches a maximum of about 2,230 m. Site five has a 15-minute gap between Landsat 8 and Sentinel-3 imagery, while images were extracted at the same time at Site 4.
-Site 6-Madhya Pradesh District, India This is a fertile land region with the main crop production of wheat, pulses, rice, oilseeds, sorghum, soybean, and sugarcane. The average field size is 0.5 x 0.8 km, which is the smallest among all sites, with the most diverse land cover. Other land covers found in the scene are ponds, grasslands, savannas, hills, urban areas, and bushes. The climate is humid subtropical, with hot and dry summers and cold and wet winters. The annual rainfall of the region is 1218 mm with an average annual high and low temperatures of 33 • C and 18 • C, respectively. The altitude of the region varies from 312-970 m.
All scenes correspond to entire tiles of Landsat 8. The outlines in Panels b and e in Figure 2 delineate the analyzed areas for Site 2 and Site 5 since Sentinel-3 did not cover the whole Landsat scene (Site 2) and to avoid clouds (Sites 5).

Data
Thermal images from Landsat 8 OLI and Sentinel-3 Sea and Land Surface Temperature Radiometer (SLSTR) were collected in June-October 2017. Landsat 8 OLI images containing VNIR and shortwave infrared (SWIR) with 30-m spatial resolutions were downloaded from the official USGS website (http://earthexplorer.gov.by). Landsat 8's native thermal resolution is 100 m, but only the 30-m images disaggregated by the Landsat Science Team were available publicly. The Sentinel-3 SLSTR products were provided by the Scientific Sentinel Hub (http://scihub.copernicus.eu/). They consist of two thermal bands at a 1,000-m spatial resolution in addition to the VNIR bands at a 500-m spatial resolution.
The Sentinel-2A data are composed of 13 spectral bands in the VNIR region resolutions ranging from 10 to 60 m. The Sentinel-2 tile has a small fixed size of 100 x 100 km, so the selection of satellite images was limited to the overlapping areas of the Landsat and Sentinel overpasses.

Data Processing
Landsat 8 images were calibrated to top-of-atmosphere reflectance and brightness temperature using the Landsat Ecosystem Disturbance Adaptive Processing System [37]. Then, the reflectance maps were atmospherically and geometrically corrected using the ATCOR module included in the ERDAS IMAGINE software [38]. The atmospherically corrected red and near-infrared bands were used to calculate NDVI. Water and artificial pixels were masked out using supervised classification since they do not conform to the NDVI-BT relationship. Landsat 8 BT images were aggregated to 960 m, which is the closest multiple of 30 m to the 1,000-m resolution of the Sentinel-3 BT images. These images were used to sharpen Landsat-to-Landsat to eliminate errors stemming from the differences between sensors. In addition, Landsat 8 BT images were aggregated to 60, 120, and 240 m. All three were used for validating the sharpening into these three resolutions to test the accuracy dependency on the target resolution. Aggregation was conducted after converting BT to radiance, according to the Stefan-Boltzmann Law, and converted it back to BT at the coarse resolution. Lastly, Sentinel-3 BT images were resampled to 960 m, to match the aggregated resolution of Landsat 8. The Sentinel-3 Remote Sens. 2019, 11, 2304 7 of 17 BT images were re-projected and geo-registered to the same reference system of the UTM Zone to match Landsat 8 thermal imagery. The Sentinel-2A dataset was atmospherically corrected with sen2cor software [39]. Band 8a (near-infrared, 20-m pixel size) and band 4 (Red, 10-m pixel size) were used to calculate NDVI.
soybean, and sugarcane. The average field size is 0.5 x 0.8 km, which is the smallest among all sites, with the most diverse land cover. Other land covers found in the scene are ponds, grasslands, savannas, hills, urban areas, and bushes. The climate is humid subtropical, with hot and dry summers and cold and wet winters. The annual rainfall of the region is 1218 mm with an average annual high and low temperatures of 33 °C and 18 °C, respectively. The altitude of the region varies from 312-970 m.
All scenes correspond to entire tiles of Landsat 8. The outlines in Panels b and e in Figure 2 delineate the analyzed areas for Site 2 and Site 5 since Sentinel-3 did not cover the whole Landsat scene (Site 2) and to avoid clouds (Sites 5).

Data
Thermal images from Landsat 8 OLI and Sentinel-3 Sea and Land Surface Temperature Radiometer (SLSTR) were collected in June-October 2017. Landsat 8 OLI images containing VNIR and shortwave infrared (SWIR) with 30-m spatial resolutions were downloaded from the official USGS website (http://earthexplorer.gov.by). Landsat 8's native thermal resolution is 100 m, but only the 30-m images disaggregated by the Landsat Science Team were available publicly. The Sentinel-3 SLSTR products were

TsHARP Performance Assessment
To evaluate the sharpening accuracy of Sentinel-3 BT images (BT sentinel3 ) to a finer resolution based on VNIR data acquired by Sentinel-2, fine-resolution BT images acquired by Landsat 8 (BT landsat ) were used. To evaluate the uncertainty caused by comparing results from different sensors, a two-step analysis was performed. First, BT landsat was aggregated to the coarse resolution of BT sentinel3 on which the sharpening algorithms were applied. The sharpened images were then compared to the original BT landsat . After that, BT sentinel3 were sharpened and compared to BT landsat . This allowed the isolation of errors that were solely due to differences between the two sensors, which are irrespective of the sharpening procedure. The level of agreement between the reference and the sharpened temperatures was estimated by root-mean-square error (RMSE), mean absolute error (MAE), bias, the coefficient of determination (R 2 ), and the normalized root-mean-square error (nRMSE); Equations (5)-(9), respectively [40].
where T sharp is the sharpened temperature, T ref and T ref_mean are the reference temperature and the average of the reference temperature in the entire image, respectively, and T max and T min are the maximum and minimum temperature values, respectively.

Sensor Intercalibration of Brightness Temperature
The surface temperature rapidly changes, especially during the mid-morning. The typical overpass time of many satellites including Landsat and Sentinel. A comparison between sensors is, thus, challenging, and, unless the acquisition time is exactly simultaneous, some of the differences should be attributed to the time gap. As mentioned above, time acquisition was the primary criterion for choosing the sites for this study, and, even with this careful screening, some gaps in time acquisition were inevitable (Table 1). Figure 4 displays the results of the correlation between the Sentinel-3 BT and the reference aggregated Landsat-8 BT images, and Table 2 depicts the associated statistics. Strong positive correlations were found between the images with the intercepts ranging from 0.10-1.73 • C and the slopes from 0.96-1.0. The highest intercepts were detected for areas with a wide altitude range: Sites 2, 4, and 6.
RMSE and MAE ranged from 0.71-1.37 and 0.61-1.01 • C, respectively. The normalized RMSE showed that all sites had nearly the same error range with values from 4.2% to 7.4% ( Table 2). The smallest errors were found for homogeneous and relatively flat regions (Sites 1 and 3), despite the time difference between image acquisitions.
The positive biases signify that the Sentinel-3 derived BTs were slightly higher than those derived by Landsat 8. The near-unity slopes and the relatively high R 2 suggest that, despite the uncertainties related to the calibration and different viewing geometries, the responses of both sensors are similar. The regression results also indicate that it is possible, to a reasonable degree, to calibrate the derived temperatures, i.e., "to correct" the Sentinel-3 data to better match the Landsat 8 data. The quotation marks are added to indicate that we do not intend to define which of the two sensors is more accurate. Rather, we inter-calibrate them for the purpose of testing the TsHARP algorithm.
intercepts were detected for areas with a wide altitude range: Sites 2, 4, and 6. RMSE and MAE ranged from 0.71-1.37 and 0.61-1.01 °C, respectively. The normalized RMSE showed that all sites had nearly the same error range with values from 4.2% to 7.4% ( Table 2). The smallest errors were found for homogeneous and relatively flat regions (Sites 1 and 3), despite the time difference between image acquisitions.  Table 2. Quantitative analysis of the differences between the simulated Landsat 8 thermal data and Sentinel-3 TM data at a coarse spatial resolution. The statistics include: the regression coefficients slope and intercept, coefficient of determination (R 2 ), root mean square error (RMSE), mean absolute error (MAE), mean bias (Bias = Sentinel-3 (S3) minus Landsat 8 (L8)), and normalized RMSE (nRMSE).

Sites
Date of Image Acquisition   Table 2. Quantitative analysis of the differences between the simulated Landsat 8 thermal data and Sentinel-3 TM data at a coarse spatial resolution. The statistics include: the regression coefficients slope and intercept, coefficient of determination (R 2 ), root mean square error (RMSE), mean absolute error (MAE), mean bias (Bias = Sentinel-3 (S3) minus Landsat 8 (L8)), and normalized RMSE (nRMSE). To the best of our knowledge, the relationship between Sentinel-3 and Landsat 8 thermal bands has not yet been reported. Studies comparing land surface temperature from MODIS and Landsat platforms [41] revealed a significant difference between sensors induced by sensor measurement errors and systematic errors that have led to alterations in radiance at the sensors. Liu et al. [42] found a discrepancy between MODIS and aggregated ASTER surface temperature of about 3 • C due to the difference in the retrieval algorithm, atmospheric correction, and sensor performance. Merlin et al. [43] also reported errors between MODIS and ASTER data of about 2.4 • C. Weng et al. [44] pointed out that thermal data from different sensors at a close acquisition time can be comparable after applying pre-processing procedures, such as radiometric calibration, geometric detection, and atmospheric correction. Nevertheless, biases can be observed due to the differences in acquisition time, bandwidth, view geometry, swath width, geo-location errors, and a spectral response function [44,45]. Satellites' altitude can also lead to temperature discrepancies [46].

Date of Image Acquisition
Despite the relatively high R 2 , which indicates that it is feasible to apply the inter-comparison regression coefficients and use the Landsat 8 high-resolution images to evaluate the accuracy of the sharpening approach, we conducted one additional intermediate step. To assess the accuracy of TsHARP for the study sites, the sharpening procedure was first applied to the aggregated Landsat scenes, using only one source of data. This allowed the assessment of the founding errors of the method itself, excluding additional errors due to the differences between the sensors.  Figure 3 for the RGB and false color images of the sites). A visual qualitative assessment reveals that the spatial distribution of the reference image and disaggregated images showed a similar spatial distribution both when using a single (Landsat 8) platform and when using two platforms for the sharpening (Sentinel-2 and 3) and comparing the results to a third platform (Landsat 8). The maps were comparable in regions with high NDVI values, and the sharpened thermal maps showed more spatial detail than the coarse resolution BT imagery. However, the thermal maps obtained from a single platform demonstrated a much higher contrast between low and high temperatures compared to the reference image.  The TsHARP Landsat and TsHARP Sentinel images were compared with the reference BT images at a 60-m resolution by computing the error difference (reference BT minus downscaled BT). Maps of the differences between the three images (the two sharpened temperatures TsHARP Landsat minus TsHARP Sentinel and the Landsat reference image) for Sites 1 and 2, are presented in Figures 7 and 8, respectively. In Site 1, both TsHARP Landsat and TsHARP Sentinel imagery underestimated the temperature over forests and dense vegetation and overestimated the temperature over sparse vegetation fields and bare soils compared with the reference temperature. Differences >3 • C were observed in 0.09% of the area (1283 pixels out of a total of 1,357,185 pixels in the scene) for TsHARP Landsat , and, in 7.2% of the area (98,216 pixels), the errors ranged from 1.5 to 3 • C, mostly along the borders of fields and riparian areas. This can be explained by the high correlation between BT and NDVI (Figure 7a). Agam et al. [21] reported that sharpening of homogeneously vegetated areas results in smaller errors due to the accurate fitting of the BT-NDVI regression model. For TsHARP Sentinel errors > 3 • C were observed in 1.4% of the area (19,128 pixels) and errors ranging from 1.5-3 • C were observed in 20% of the area (270,511 pixels) (Figure 7b). This is likely due to loss of accuracy between different satellite platforms. Over this site, the Sentinel-based sharpened temperatures were mostly lower than those from TsHARP Landsat (Figure 7c).   Sharpening over Site 2 yielded less accurate (±3 • C) temperatures for both TsHARP Landsat and TsHARP Sentinel . The lowest errors were found over agricultural landscapes for both platforms and sites. However, densely vegetated areas and bare soils resulted in larger errors due to the variation in NDVI values in different land covers. In this case as well, most errors were generally along the borders of fields and riparian areas.

TsHARP Validation
borders of fields and riparian areas. This can be explained by the high correlation between BT and NDVI ( Figure  7a). Agam et al. [21] reported that sharpening of homogeneously vegetated areas results in smaller errors due to the accurate fitting of the BT-NDVI regression model. For TsHARPSentinel errors > 3 °C were observed in 1.4% of the area (19,128 pixels) and errors ranging from 1.5-3 °C were observed in 20% of the area (270,511 pixels) ( Figure  7b). This is likely due to loss of accuracy between different satellite platforms. Over this site, the Sentinel-based sharpened temperatures were mostly lower than those from TsHARPLandsat (Figure 7c). Sharpening over Site 2 yielded less accurate (±3 °C) temperatures for both TsHARPLandsat and TsHARPSentinel. The lowest errors were found over agricultural landscapes for both platforms and sites. However, densely vegetated areas and bare soils resulted in larger errors due to the variation in NDVI values in different land covers. In this case as well, most errors were generally along the borders of fields and riparian areas.
The error statistics analysis indicates that the accuracy of the sharpening algorithm decreases as the target resolution becomes finer (Table 3), i.e., that the greater the difference between the original resolution and the target resolution becomes, the greater the errors. Agam et al. [17] and Essa et al. [47] also observed a decrease in accuracy with a finer resolution due to the increasing degree of subpixel variability. RMSE for TsHARPLandsat ranged from 0.68-0.83 °C and 0.85-1.17 °C, for Sites 1 and 2, respectively. The results of RMSE for TsHARPLandsat are comparable with the results from Agam et al. [17] where RMSE ranged between 0.67 and 1.39 °C, depending on the date and sensor. At both sites, BT obtained from TsHARPSentinel was less accurate with RMSE ranging from 1.1-1.5 °C. The bias of TsHARPLandsat was small and negative for Site 1, which signifies an overestimation of the sharpened temperature. The bias of TsHARPSentinel indicates that the sharpened temperature was considerably   Figure 8). This is likely due to the weaker correlation between BT and NDVI for Site 2. The potential limitation of using NDVIbased sharpening approaches for agricultural areas was shown by Karnieli et al. [28]. The authors suggested that, at high latitude regions, where plant growth is limited by availability of energy rather than by availability of water, the NDVI-BT correlation is positive. Being located at 50 N latitude, this may feature an additional explanation to the larger errors observed for Site 2.  The error statistics analysis indicates that the accuracy of the sharpening algorithm decreases as the target resolution becomes finer (Table 3), i.e., that the greater the difference between the original resolution and the target resolution becomes, the greater the errors. Agam et al. [17] and Essa et al. [47] also observed a decrease in accuracy with a finer resolution due to the increasing degree of subpixel variability. RMSE for TsHARP Landsat ranged from 0.68-0.83 • C and 0.85-1.17 • C, for Sites 1 and 2, respectively. The results of RMSE for TsHARP Landsat are comparable with the results from Agam et al. [17] where RMSE ranged between 0.67 and 1.39 • C, depending on the date and sensor. At both sites, BT obtained from TsHARP Sentinel was less accurate with RMSE ranging from 1.1-1.5 • C. The bias of TsHARP Landsat was small and negative for Site 1, which signifies an overestimation of the sharpened temperature. The bias of TsHARP Sentinel indicates that the sharpened temperature was considerably lower than the reference temperature for both sites. The MAEs for TsHARP Landsat compared to the reference BTs were also lower than for TsHARP Sentinel . Compared to TsHARP Landsat , the MAE for TsHARP Sentinel was higher by~0.3 • C for Site 1 and by~0.2 • C for Site 2 at 60-m spatial resolution. The R 2 values for TsHARP Sentinel were always less than those estimated for Landsat-8 data. An overall weaker performance was observed for Site 2 in comparison to Site 1 ( Table 3, Figure 8). This is likely due to the weaker correlation between BT and NDVI for Site 2. The potential limitation of using NDVI-based sharpening approaches for agricultural areas was shown by Karnieli et al. [28]. The authors suggested that, at high latitude regions, where plant growth is limited by availability of energy rather than by availability of water, the NDVI-BT correlation is positive. Being located at 50 N latitude, this may feature an additional explanation to the larger errors observed for Site 2.
Most TsHARP research is based on disaggregating MODIS, ASTER, or Landsat images from TIR to VNIR resolutions. Most of these studies have tested the sharpening using the same satellite platform. Only a few of them attempted to sharpen thermal infrared images using multiple platforms [15,20,22,40,43,44]. Mukherjee et al. [25] pointed out that using TsHARP to downscale a MODIS image from 1000 m to 250 m resulted in an RMSE of about 1.59 • C and R 2 = 0.82, while downscaling aggregated Landsat 7 from 960 to 120 m yielded an error of 0.78 • C. Our results are consistent with studies by Bisquert et al. (2016a) for whom RMSE values ranged from ± 1.6-± 1.7 • C using one Landsat platform and ±1.8-±2.1 • C using MODIS versus Landsat satellites.
Lastly, to emphasize the advantage of TsHARP over a simple resampling, the Sentinel-3 coarse resolution images for both sites were resampled to 60 m without applying a sharpening algorithm and compared to the Landsat reference scenes (Figure 9). 0.78 °C. Our results are consistent with studies by Bisquert et al. (2016a) for whom RMSE values ranged from ± 1.6-± 1.7 °C using one Landsat platform and ±1.8-±2.1 °C using MODIS versus Landsat satellites.
Lastly, to emphasize the advantage of TsHARP over a simple resampling, the Sentinel-3 coarse resolution images for both sites were resampled to 60 m without applying a sharpening algorithm and compared to the Landsat reference scenes (Figure 9). The resulting lower R 2 (0.53 vs. 0.75 and 0.30 vs. 0.65, for Sites 1 and 2, respectively) indicates that TsHARP outperforms a simple sharpening. One should also bear in mind the possibility that the errors obtained are, in fact, smaller than the ones reported in this paper. The lack of ground measurements prevents us from determining whether the Sentinel-3 or the Landsat-8 thermal images better agree with the actual surface temperatures. Thus, it is possible that we compared the TsHARP results to a less accurate reference. This remains an open question.

Conclusions
The mean absolute differences between Sentinel-3 and Landsat 8 BT images were up to 1 °C in all six sites, with the Sentinel-3 temperatures being typically higher. The smallest errors were found for homogeneous and relatively flat regions despite the time difference between image acquisitions. In comparative studies, such as the one presented in this paper, it is possible to intercalibrate between these two sensors when needed.
Applying TsHARP on Sentinel-3 BT data using Sentinel-2 VNIR data resulted in mean absolute errors of ~1 °C, with errors increasing as the difference between the native and the target resolutions increases. Part of the error is attributed to the discrepancy between the BT images acquired by the two platforms. This is reflected by the smaller errors observed when sharpening was applied on synthetic-aggregated Landsat-8 BT images, with errors ranging between 0.53 and 0.87 °C. These results show the potential for sharpening Sentinel-3 BT images The resulting lower R 2 (0.53 vs. 0.75 and 0.30 vs. 0.65, for Sites 1 and 2, respectively) indicates that TsHARP outperforms a simple sharpening. One should also bear in mind the possibility that the errors obtained are, in fact, smaller than the ones reported in this paper. The lack of ground measurements prevents us from determining whether the Sentinel-3 or the Landsat-8 thermal images better agree with the actual surface temperatures. Thus, it is possible that we compared the TsHARP results to a less accurate reference. This remains an open question.

Conclusions
The mean absolute differences between Sentinel-3 and Landsat 8 BT images were up to 1 • C in all six sites, with the Sentinel-3 temperatures being typically higher. The smallest errors were found for homogeneous and relatively flat regions despite the time difference between image acquisitions. In comparative studies, such as the one presented in this paper, it is possible to intercalibrate between these two sensors when needed.
Applying TsHARP on Sentinel-3 BT data using Sentinel-2 VNIR data resulted in mean absolute errors of~1 • C, with errors increasing as the difference between the native and the target resolutions increases. Part of the error is attributed to the discrepancy between the BT images acquired by the two platforms. This is reflected by the smaller errors observed when sharpening was applied on synthetic-aggregated Landsat-8 BT images, with errors ranging between 0.53 and 0.87 • C. These results show the potential for sharpening Sentinel-3 BT images using Sentinel-2 VNIR data for a range of applications. Further research is needed to test additional sites and conditions, and potentially additional sharpening methods, which can be applied to the Sentinel platforms.