1. Introduction
The variety of land cover in urban regions can significantly affect temperature distributions across many spatial scales [
1,
2,
3,
4,
5]. Urban areas with varied surface features, such as vegetation, building density, and land use, can create localized heat pockets, complicating temperature distribution forecasts [
6,
7]. High-resolution remote sensing and ground meteorological data are needed to obtain micro-scale temperature differences. Some related research works can be listed as [
8,
9,
10,
11]. Landsat data, which provides reliable multi-decadal records of LST, has enhanced urban thermal assessments. They have made it possible to study how urbanization affects UHI in various geographic settings. For instance, using the same Landsat-based analysis revealed high intensities of UHI in Java, Indonesia, whereby urban expansion increased LST by more than 0.25 °C every year [
12]. While similar trends were observed in Chengdu, China, the built-up area increased sixfold with the increase in both summer and winter LST. Ambon Island of Indonesia and Nagpur, India, recorded a significant loss in vegetation cover with a corresponding rise in LST depicted through the inverse relationship between NDVI and LST [
13,
14]. The study in Valencia underlined the amplification of UHI events due to heat waves and the contribution of LST variability depending on LCZs. Investigations in Colombia and Doha also confirmed urbanization in thermal discomfort [
15,
16]. High-resolution LST and precision are enhanced, including the increased capability of UHI mitigation, by models such as WRF/UCM, Sentinel-2, and ECOSTRESS. These findings point to the strategic place Landsat will hold when assessing urban microclimates and sustainable planning on a global scale in the decades to come [
17,
18].
Climate change-related research findings indicate that the urban heat island effect will increase in power, increasing the frequency, severity, and duration of heat events [
19,
20,
21]. However, the traditional meteorological observation stations, typically distributed in an intra-city area without regular intervals, cannot accurately provide high spatial resolution LST data. On the contrary, these observation stations are usually distributed at less representative places like airports, whose surface conditions, such as scarce vegetation, provide higher results in terms of temperature [
22]. Moreover, the air temperature is conventionally measurable, but as an indicator, it does not say much about the actual magnitude of surface heating experienced in a heavily urbanized area. Urban heat islands (UHIs) and land surface temperature (LST) variations are among the main challenges to urban sustainability in urban regions, especially under rapid urbanization. The current study uses satellite data and algorithms such as the Mono Window Algorithm, Radiative Transfer Equation, and Single-Channel Algorithm for LST retrieval [
23,
24]. Incorporating techniques like machine learning with spatiotemporal fusion has recently enhanced LST monitoring [
25,
26]. This research underlines a significant inverse relation between vegetation and land surface temperature and a positive relation to developed regions, amplifying green development mitigation approaches [
27,
28]. This study merges advancements in LST and UHI fluctuation estimations across cities in Indian Punjab and translates them into practical recommendations for sustainable urban development.
Given the high cost of airborne data collection, many studies rely on satellite-based data to capture LST, though satellite data often have limitations in spatial resolution [
22,
29]. Satellite-derived LST provides essential information on the pattern of surface heating that directly affects urban microclimates.
However, despite their advantages, satellite-derived temperature observations come with several limitations. For instance, even the most extended continuous global datasets about Earth observation from space, such as those from Landsat-30 years and MODIS-22 years, have insufficient climatological information for comprehensive long-term urban climate studies [
3,
30]. In addition, satellites can mainly monitor daytime land surface temperature, LST, instead of air temperature, AT, and only when the skies are free from clouds. For geographically smaller areas like Manhattan, about 60 km
2, satellite imaging may poorly consider micro-scale variations in temperature; atmospheric conditions such as cloud cover, scattering, and absorption could result in inaccuracies [
22]. Landsat provides a resolution of 30 m for its visible bands; by comparison, higher-resolution sensors like Sentinel-2 and WorldView-3 can achieve spatial resolutions as high as 10 m or better. That being said, it is essential to note that even high-resolution advanced satellite imagery may not be able to capture small-scale temperature changes within urban environments [
31,
32]. Therefore, the complexity of urban heat islands presents significant challenges in predicting localized temperature patterns, given the lack of fine-scale LST data. This limitation is addressed through interpolation and statistical downscaling techniques to increase the spatial resolution of the satellite data itself [
33]. Spatial downscaling of LST overcomes the shortage of course TIR data, while high-resolution LST is imperative for various applications. Of the existing algorithms, temperature-sharpening algorithms, random forests, and machine learning-based RFATPK and XGBoost have achieved variable successes in both urban and natural environments [
34,
35,
36,
37]. Fully mechanized methodologies, like HSR-LST, are expected to reach fine-scale downscaling with the help of high spatial resolution RGB images, making it easier to have intricate LST analyses on complex urban geometries [
38]. Following [
39], regression methods like Kriging Regression intrinsically capture variability in surface materials resulting from diverse landscapes and provide workable solutions for urban thermal assessments. Commonly, ensemble learning models such as MFGWML improve accuracy and effectively tackle nonlinearity and spatial non-stationarity across heterogeneous landscapes, as cited in [
40]. The findings of the comparison analysis demonstrate an improvement in the applicability of the random-forest-based algorithms for global downscaling. This was also confirmed by [
41]. Other ways of generating alternative methods include ResNet and NLST downscaling using spectral and mechanistic explanatory variables for land surface temperature applicability extension in urban microclimate and UHI research. For example, one finds [
42,
43]. Another technique adopted is the XGBoost-based method for deciphering the relationship within the pattern of land surface temperature and inequalities, along with socio-environmental features such as urban heat islands, as performed by [
44]. Hence, these LST downscaling methods mark a step in the right direction, with many barriers standing in the path of determining and handling spatial variability to boost accuracy. This study integrated these improvements to enhance the downscaling of LST over complex landscapes and urban microclimates. This study focuses on the prediction of land surface temperature (LST) at high spatial-temporal resolution, rather than air temperature (AT), by fusing a combination of Landsat satellite data and meteorological inputs from the North America Land Data Assimilation System (NLDAS). The model developed here is called the urban climate (UC) model and combines the NLDAS surface characteristics such as vegetation cover, building height, albedo, and water fraction with the NLDAS hourly meteorological measurements to simulate, on an hourly time-step at a 60 m spatial resolution, surface values for shortwave radiation and LST. The surface temperature observations derived from thermal bands, such as those from Landsat or NLDAS, have a much coarser spatial resolution than the visible bands. This discrepancy arises because thermal sensors require larger pixel sizes to effectively detect weaker thermal radiation signals. For example, while Landsat visible bands have a resolution of 30 m, its thermal bands typically operate at a coarser resolution of 60 m or 100 m. Similarly, temperature datasets derived from NLDAS have resolutions at the kilometer scale. This disparity affects the precise capturing of localized temperature variations, especially in urban areas with complex land surface characteristics. Therefore, fine-scale interpolation techniques, such as universal kriging, are often applied to enhance the spatial resolution of thermal data. It is conducted to match the resolution of visible bands for urban temperature modeling.
The model aims to resolve the complex interactions between urban land cover and atmospheric conditions, particularly in Manhattan and New York City. This urban microclimate model is designed to simulate local temperature variability at the neighborhood scale, accounting for land use heterogeneity and urban build-ups. Additionally, the model can be applied to other urban studies related to air pollution dispersion, pedestrian thermal comfort, and building energy consumption.
The study focuses on the summers of 2013 and 2014 in Manhattan, New York City, a region with notable urban heat island effects. The years 2013 and 2014 have been chosen because they were relatively free from solid interference due to extreme climatic events. Events like heatwaves or storms could distort the pattern of temperature and UHI. This provided a perfect base for evaluating urban microclimates. This period also coincided with the availability of consistent, high-quality Landsat and NLDAS datasets. These datasets were crucial for accurately modeling land surface temperature, which ensured the comparability of results with similar studies conducted worldwide. The methodology developed is still applicable today. It showed how historical patterns could give insight into future urban planning and measures for sustainability in the face of urbanization and climate change.
This model’s LST prediction is achieved through statistical techniques, including the Generalized Additive Model (GAM) and spatial autoregression (SAR), which account for spatial dependencies and nonlinear relationships between surface characteristics and temperature. This research provides a detailed understanding of urban surface heating dynamics by utilizing LST as the model output and incorporating surface and atmospheric variables as inputs. It offers insights for mitigating UHI through urban design strategies.
  2. Data and Methodology
  2.1. Study Area
The study focuses on the heavily urbanized area of Manhattan, NYC, which is recognized as the most densely populated and geographically smallest of the five boroughs of NYC (
Figure 1). The climatic conditions of Manhattan and NYC are summarized here to provide a climatic characterization. Manhattan has a humid subtropical climate, Köppen classification Cfa, characterized by hot summers and cold winters. The average temperature in summer ranges between 25 °C and 30 °C, with frequent heatwaves that raise the temperature above 32 °C [
45]. Winters are cold, with average temperatures ranging from −2 °C to 4 °C. Annual precipitation is around 1200 mm, uniformly distributed during the year. A high urban density and scarce vegetation increase the urban heat island effect. It produces a temperature rise of 3 °C to 5 °C in the surrounding countryside locally [
46]. These climatic features decisively influence the surface temperature in the region and should be considered within the framework of urban microclimatology.
The region regularly suffers from increased temperatures compared to suburban or exurban areas. It has been reported that some areas in Manhattan are among the most high-risk areas for heat stress, where the temperature could rise to ~35 °C (Pioppi et al., 2020 [
46]). The ongoing effect of climate change would only exacerbate the current scenario, where the number of days above 32.2 °C (90 °F) is expected to double in Manhattan by the year 2050 [
45].
The high urban density, scarce vegetation, and humid subtropical climate of Manhattan, with considerable temperature variations, draw greater attention to the need for an integrated approach to modeling urban microclimates. These aspects require using advanced physical parameters, such as albedo, building fraction, elevation, vegetation indices, and meteorological variables. These variables were integrated into the urban meteorological model. Therefore, LST variations at satisfactory spatial resolutions could be emulated correctly. It will also address unique challenges resulting from the heterogeneity of urban Manhattan.
  2.2. Data Type
The urban factors in cities, such as an increase in temperature, humidity, and winds, are affected by the retention of heat in buildings, the pavement, and the lack of vegetation coverage; therefore, simulating the local micro-scale requires incorporating land cover data from fine resolution cataloging physical parameters such as the albedo, building fraction, building height, elevation, and water fraction, which are used to formulate surface properties that are most important in temperature variation in cities caused by spatial variation in land cover. The data source, type, and resolution of each variable used in the UC model can be seen in 
Table 1.
According to 
Table 1, the data sources and types, resolutions, and necessary transformations in the analysis were summarized in 
Table 1. The thermal bands of Landsat were naturally 100 m, but they have been transformed here using interpolation to a resolution of 60 m. This way, their resolution will fit finer resolution datasets, such as NDVI, and could capture even slight urban-scale temperature variations more effectively. The 60 m resolution enabled a trade-off between computation efficiency and the demand for detailed spatial analysis in areas as densely urbanized as Manhattan. The LST reading accuracy was also enhanced with atmospheric correction algorithms on the Landsat image, such as the Mono Window Algorithm. This could be obtained by retrieving Landsat data with LST and NDVI products from USGS Earth Explorer (
https://earthexplorer.usgs.gov/, accessed on 20 July 2024).
The Landsat satellite program, operated by NASA-USGS, provides Earth observation data in moderate resolution in multi-spectral and thermal infrared. In addition, it has often been used in UHI and microclimate studies for its consistency, high spatial resolution (30 m for visible bands), and long-term data records (since 1972). Thermal bands derived from LST were taken at a 100 m resolution and then resampled for spatial consistency with visible bands. NLDAS data were obtained from NASA’s North American Land Data Assimilation System (NLDAS-2: 
https://ldas.gsfc.nasa.gov/nldas/, accessed on 10 July 2024), which combined large-scale reanalysis and observational data to generate hourly meteorological parameters. It included temperature, relative humidity, and shortwave radiation at a spatial resolution of 12.5 km. In this work, the meteorological variables mentioned were downscaled to 60 m using universal kriging to make them compatible with the Landsat-derived parameters. For more information about NLDAS products, refer to the NASA NLDAS Data Portal. These datasets were interpolated at 60 m resolution for better results. The interpolation details, which are the downscaling of NLDAS products, have been provided in 
Section 3.1. The interpolation from 12.5 km to 60 m was achieved using a universal kriging method. This geostatistical approach is suitable for downscaling because it considers both the spatial autocorrelation of the data and the underlying trend, allowing us to capture fine-scale variations in urban environments. While the original resolution of the NLDAS data is coarser, kriging provides for an accurate temperature estimation at a finer resolution by incorporating neighboring data points and accounting for the spatial structure of the dataset.
The meteorological parameters were obtained from Phase 2 of the NLDAS-2, a collection of large-scale reanalysis and observation-based products from 1979 to the present [
47]. Specifically, this study utilized the bias-corrected meteorological forcing from the NLDAS modeling framework known as NLDAS-FORA [
33,
48,
49]. The spatial resolution of this gridded data is 1/8° latitude–longitude resolution covering the conterminous US with an hourly (60 min) time step. The NLDAS forcing dataset is derived initially from the North American Regional Reanalysis (NARR) modeling framework, which is later spatially downscaled to a high-resolution NLDAS 1/8° (~12.5 km). Key meteorological parameters such as 2 m air temperature, 2 m relative humidity, and shortwave solar radiation were obtained from the NLDAS-FORA products.
With the longest continuously acquired collection of space-based moderate-resolution land remote sensing data, Landsat was used for its high spatial resolution measurements of the Earth’s surface properties to add to temporal resolution measurements of NLDAS data. Landsat was also used to estimate the above-ground amount of green vegetation coverage (NDVI), albedo, and land surface temperature. The Landsat observations were used to find the total impact of vegetation present in cooling and reducing the ambient air temperature. Albedo was calculated to determine how much sun energy is absorbed by different land surfaces. NDVI is calculated while obtaining LST data. The results of NLDAS downscaling were compared with three weather stations in New York City (NYC) for validation purposes. Other significant parameters, including building height, building fraction, albedo, and presence of water body, were obtained from the NYC Department of the Building database.
  2.3. Proposed Models and Statistical Methods
The urban meteorological model (UC) is a land surface meteorological model designed for the urban surface to quantify the built environment’s impact and meteorological parameters, considering physical and environmental indicators and finding areas most affected by UHI. The environmental risk factors are associated with changes in temperature, lack of vegetation, and urban development (including building height and density). The UC model bridges the gap between the land surface temperature layer and forecast models to represent the effect of urban land cover and changing micro-scale climate on the densely urbanized area. 
Figure 2 shows the key steps, data processing, method(s), and significant outputs related to the development of the UC model. The workflow is generalized for land surface temperature prediction and urban microclimate analysis. While this workflow is generalized to be applied to urban areas, the specific study area analyzed in this paper is Manhattan, NYC, in 
Figure 1.
The model was developed mainly to predict the land surface temperature at a much finer level and was designed to run in any urban area with the same variables used. It is designed to simulate the local micro-scale climate variability based on the urban surface diversity at the city or neighborhood scale. The modeling framework involves several geospatial and statistical components to comprehensively assess the relationship between meteorological parameters and various micro-scale urban characteristics.
Urban surfaces have unique emissivity characteristics compared to their natural counterparts in composition, texture, and reflectivity. Built-up concrete, asphalt, and metal areas offer lower emissivity than natural surfaces like vegetation and water bodies. This study rectified these differences by estimating the LSE using the NDVI method. The NDVI-based approach is practical in distinguishing the surfaces into three categories: fully vegetated areas, bare ground, and mixed surfaces. A high emissivity value close to 0.98 was assigned for fully vegetated areas. In contrast, urban and built-up surfaces were assigned lower emissivity values due to their higher heat retention and lower radiative cooling capacity [
50].
Further, to ensure the accuracy of emissivity correction, MWA has been employed for atmospheric effect consideration due to interference created by water vapor and aerosol, which can distort the satellite thermal band readings. This will help refine the LST estimations by compensating for atmospheric attenuation of the emitted radiation [
51]. In addition, parameters related to urban environments, including building fraction, height, and albedo, were included in the modeling framework. These parameters have helped account for the complex thermal behavior of urban areas with low emissivity and high thermal inertia, resulting in elevated temperatures. Therefore, this nuanced approach ensures that the LST retrieval accurately reflects the heterogeneous thermal properties of urban and natural surfaces [
52].
Temperature data for the study area were extracted from the Landsat thermal band and NLDAS dataset for the duration of the study. In processing the Landsat thermal band and converting it to land surface temperature (LST), 12 cloud-free images for the summers of 2013–2014 were downloaded. While we initially performed only surface emissivity estimation using the NDVI method, we recognize that atmospheric correction is essential for accurate LST retrieval. To correct this, we will incorporate the Mono Window Algorithm (MWA) for atmospheric correction, which has been widely used in similar studies [
53]. Atmospheric parameters necessary for this correction will be derived from the National Centers for Environmental Prediction (NCEP) profiles using the web-based tool. We expect to achieve more accurate LST values by accounting for atmospheric interference. LSE was calculated using the NDVI method following three cases (bare ground, fully vegetated, and a mixture of bare soil and vegetation) [
53].
In this example, the third case (Equation (1)) is applied; hence, the following equation is used to extract LSE:
        where 
ε is the LSE and 
Pv is the proportion of vegetation obtained and is calculated by
        where 
NDVImax = 0.5 and 
NDVImin = 0.2
The next step involves calculating the sensor radiance (
Lλ), which is the amount of energy that reaches the satellite sensor:
        where 
DN = the quantized calibrated pixel value, 
LMin = the spectral radiance that is scaled to 
QCalMin in watt/m
2 × ster × µm, 
LMax = the spectral radiance that is scaled to 
QCalMax in watt/m
2 × ster × µm, 
QCalMin = the minimum quantized calibrated pixel value (corresponding to 
LMin) in 
DN, and 
QCalMax = the maximum quantized calibrated pixel value (corresponding to 
LMax) in 
DN.
The at-sensor radiance is, in turn, converted to the effective at satellite temperatures of the viewed earth–atmosphere system under an assumption of unity emissivity. This is also referred to as blackbody temperature and denotes a surface that absorbs all the electromagnetic radiation that reaches it:
        where 
K1 = Calibration constant 1 (666.09 watt/m
2 × ster × µm), 
K2 = Calibration constant 2 (1282.71 K), and 
Lλ= At sensor radiance calculated from Equation (5).
A comprehensive list of radiometric calibration coefficients for Equations (3) and (4) can be obtained from [
54]. Since 
T is a reference blackbody temperature, the final step involves correcting for spectral emissivity according to the nature of the surface:
        where 
Tb = Blackbody temperature from Equation (4), λ = Wavelength of emitted radiance (11.5 µm), 
ρ = h × c/σ = 1.438 × 10
2 mK (σ = Boltzmann constant = 1.38 × 10
23 J/K, h = Planck’s constant = 6.626 × 10
34 Js, c = velocity of light = 2.998 × 10
8 m/s), and 
lnε = Land surface emissivity calculated from Equation (1).
The thermal bands are later converted to temperature and geocoded to each zip code in the study area (
Figure 3). In the raster processing, points were used to obtain the temperature value with a 30 m × 30 m distance, with temperature extracted at each point. The last step was to join extracted temperature point data to the shapefile of the New York City zip code (
Figure 3). GIS conducted this to spatially enter the temperature with the geocoded zip code shapefile. The converted LST map could depict the micro-scale variations for New York. It also shows the regions with higher temperatures due to land surface properties and weather parameters on that day.
Assuming the NLDAS data had a normal distribution and a specific dominant trend in the sample points, the data were interpolated for a higher resolution (60 m) using the universal kriging method. Universal kriging is a method of interpolating a surface under non-stationary conditions. The mean values vary in a deterministic manner in different locations while the variance stats are constant. As NLDAS 0.125° topography differs significantly from the topography of the 40 km EDAS grid, the following processing step involved adjusting the 2 m temperature NLDAS grid points using Equation (1):
        where T
NLDAS is the 2 m temperature (K) in the NLDAS, T
EDAS is the EDAS 2 m temperature (K), γ is the lapse rate (assumed to be −6.5 °K per km), and ΔZ is the elevation difference (m) between the NLDAS and EDAS topography [
55]. Later, these downscaled NLDAS products were interpolated to match Landsat thermal band resolution using the kriging interpolation technique [
56,
57].
The results of the interpolated temperature were compared with the available weather stations within NYC. A buffer analysis in GIS was carried out with only three gridded data points over NYC, and the locations of the three points did not overlap the three main weather stations in NYC, which are JFK, La Guardia Airport, and Central Park WS. The average NLDAS temperature within a buffer zone was used to calibrate NLDAS data and WS. R-square for 1 km, 300 m, and 100 m for the three WS and NLDAS data points were also calculated to assess the accuracy of the downscaled NLDAS temperature (refer to 
Figure 4 for the results).
These buffer zones were used to analyze the spatial sensitivity of the interpolated NLDAS data to varying scales of proximity around the weather station locations. The 1 km buffer represented a broader area that captures regional trends. In contrast, the 100 m and 300 m buffers captured more minor spatial scales closer to the station for finer-scale validation. They represented the extent to which the downscaling process could capture localized variations in temperature and allowed alignment with ground-based observations. The buffer zoning comparison also represented the scale at which the NLDAS data best reflected the observed temperatures (
Figure 4).
  2.4. Developing Statistical Models
This study adopted two statistical models to build the relationship between the satellite-derived LST and NLDAS products and various surface characteristics. In the first stage, the Generalized Additive Model (GAM) was implemented to identify the model’s performance in explaining the sophisticated spatial variation of LST.
GAMs are a specific class of statistical model in which the linear relationship between the response and the predictors is interpreted by implementing nonlinear smooth functions to model and capture the nonlinearity in the data. These flexible and smooth techniques help us fit linear models, which can be either linearly or nonlinearly dependent on several predictors, to capture nonlinear relationships between response and predictors.
The GAM is a semi-parametric regression model, presented in the following form in Equation (8) [
58]:
        where 
Y, 
E(
Y), 
g, 
, and 
fi (
Xi) represented the response variable, a smooth function of the predictor 
Xk, and the link function connecting the expected value of 
Y to the linear predictor, respectively.
This structure allowed each predictor to have its own, perhaps nonlinear, effect on the response variable, especially in complex datasets. Then, the spline smoothing methods were implemented to model such relationships.
The GAM was used to predict Landsat 8 (LC8) temperature as a function of NLDAS variables and the physical parameters, albedo, water, and NDVI. Spline smoothing was used to fit the data.
Although GAMs are better at explaining the complex relationship among variables, they fail to accurately depict the spatial autocorrelation among the variables. While GAMs can model nonlinear relationships, they may not fully account for spatial dependencies from adjacent regions, potentially leading to suboptimal performance in predicting LST at the micro-scale.
To resolve this issue, this study first identifies if a spatial correlation exists in the dataset, as it would lead to implementing a spatial regression technique. This study conducted the Moran’s I test for the degree of spatial autocorrelation for the land surface temperature. The Moran’s I index indicates the appropriateness of a spatial autoregression model for this study [
59,
60]. The presence of spatial autocorrelation among atmospheric and surface properties can influence the accuracy of the regression equation. Simple linear regression techniques do not account for the degree of spatial dependency. Therefore, they misrepresent the outcome of the regression. This study implemented the spatial autocorrelation (SAR) technique to model the relationship among Landsat-derived LST, NLDAS products, and associated surface characteristics. Specifically, a spatial lag model was implemented, considering LST as the dependent variable. Equation (8) shows the mathematical formulation of the SAR model utilized herein.
        where 
 is the spatially lagged dependent variable for weights matrix 
W, 
 is defined by the matrix of observations on the explanatory variable, 
 is the vector of error terms, and 
 and 
 are the parameters for the regression equation. The values of its corresponding neighbors influence the dependent variable 
y through a spatial weight matrix (SWM). For this study, the neighbors were defined by the ’queen’ neighborhood relationship.
  3. Results and Discussion
  3.1. Performance of NLDAS Downscaling
The finding comparisons captured in 
Figure 4 show the temporal variability and high correlation of the aggregated mean temperature for the three weather stations and NLDAS at different resolutions. The three WS and 60 m resolution NLDAS data showed a perfect correlation of ~0.98 for stations (
Figure 4). Next, the interpolated NLDAS data at 60 m resolution were compared and cross-validated with ground-level measurements for the three available weather stations. The results indicated a higher correlation for the interpolated NLDAS temperature than the observed weather station data. The R
2 varied from 0.88 to 0.93 for the three weather stations, with a more significant correlation for Central Park and La Guardia (R
2 > 0.91) (
Figure 4). The results from 
Figure 4 indicate that the resolving variability over time is strong for NLDAS and WS at different resolutions.
Balancing the coarse resolution of large-scale meteorological data like NLDAS with coarse resolutions was challenging. This challenge intensified with a heterogeneous urban environment in Manhattan, producing thermal variations at far finer scales. This presented a methodological approach to retain the heterogeneous features of space and provide more localized urban heat effects. The choice of universal kriging addressed spatial variability by incorporating observed spatial autocorrelation patterns. It also ensured accurate interpolation across diverse urban land covers. Moreover, the incorporation of elevation adjustments accounted for topographic influences on temperature distributions. These enhancements mitigated over-smoothing risk in downscaling large-scale datasets to urban microclimates.
Another challenge was a temporal mismatch between the high-frequency NLDAS data at an hourly scale and the coarser temporal resolution of Landsat imagery at 16 days. These mask the short-term thermal dynamics crucial for understanding urban microclimates. Therefore, continuous temporal coverage by NLDAS was used to interpolate and refine thermal patterns during Landsat’s data gaps. This integration allowed for a better temporal representation of surface temperature trends. It provided diurnal and seasonal variability for the model to consider. Moreover, the methodology for selecting cloudless and representative summer days was highly focused on temporal consistency. It reduced noise across all datasets and allowed for more robust predictions in urban heat island analyses.
  3.2. Results from the Statistical Models
  Generalized Additive Models (GAMs)
The GAM was implemented to assess the linear relationship between the response and predictors in the data, modeling and capturing the nonlinearity in the data. 
Table 2 shows each variable’s calculated approximate significance of smooth terms in the GAM multivariate regression model and calculated 
p-values.
The significance of smooth terms for each of the variables is strong (
p-value < 0.05). All variables are statically significant, indicating a substantial contribution in predicting Landsat temperature at 60 m resolution. Next, the model was used to predict the LC8 temperature from the variables (i.e., NLDAS, bfrac (building fraction), albedo, NDVI, etc.) at 60 m resolution. It should be noted that the ’bfrac’ parameter in this study quantifies the proportion of land covered by buildings in each grid cell. This variable is key to understanding the built environment’s contribution to the urban heat island effect, as higher building fractions typically lead to elevated land surface temperatures due to decreased vegetative cooling and increased heat retention by urban structures. The results for the accuracy of the fitting were analyzed using statistical parameters, as shown in 
Table 3.
The correlation factor of 0.3979 exhibits a poor to moderate agreement between model prediction and LC8 data. The simplistic linear modeling did not capture the micro-scale variation in surface temperature. The temperature radiating from surrounding environments and surfaces was not reflected in the GAM. The results for the GAM showed very low R-squared values and repeated p-values for each variable.
  3.3. Results from the SAR Model
The SAR model was implemented to overcome the limitations of the GAM. First, we conducted the Moran’s I statistical analysis to identify the spatial pattern or presence of clusters in the data. Moran’s I index represents the degree of autocorrelation among nearby objects across a spatial area. The relationship between land surface temperature (LST) variations and urban characteristics is influenced by the urban form, seasonality, and changes in land use. These factors collectively highlight the complex dynamics shaping thermal environments in cities. Urban form increases the heat absorbed and decreases natural cooling, primarily because of building density and impervious surfaces. These factors have a direct relation to increasing LST values. Conversely, vegetation can reduce this effect by shading and through evapotranspiration. These findings align with previous studies [
25,
61]. The Single-Channel Algorithm has been shown to retrieve LST reliably. It shows good agreement with MODIS-LST datasets (R
2 > 0.75), hence proving its accuracy [
23]. Summer seasons intensify UHI effects in Indian Punjab cities. This is due to drier conditions and reduced soil moisture, which agrees with global observations made by [
24,
28]. Strategic urban planning should focus on preserving vegetation and compact urban development to minimize critical heat zones. Spatiotemporal modeling offers valuable guidance for integrating these strategies into urban design frameworks. These efforts align with the principles of sustainable urban development, as identified by [
26,
27].
The results from Moran’s test suggest that the LC8 temperature is primarily affected by spatial autocorrelation, as seen from Moran’s I (
Figure 5).
A Moran’s I index of 0.58 indicates strong evidence of clustering in the surface temperature derived from Landsat. The SAR model was used for spatial weighting matrices to incorporate spatial autocorrelation in the regression model to account for spatial correlation. A Landsat image was used to run a multi-regression model with all the variables (albedo, water, build, NDVI, etc.) and NLDAS, and the results were compared with the added SAR function into the model.
To improve the regression model and account for the spatial correlation using the SAR model, the results were spatially auto-regressed to close eight neighboring distances using the spatial autoregressive model (Equation (7)) and to develop the UC model. 
Table 4 shows the result of incorporating the adjacent point using Equation (7).
According to 
Table 4, the value of Rho in the SAR model was determined through spatial autoregression. This method quantified the impact of spatially lagged dependent variables and described the degree of spatial autocorrelation among neighboring observations. This study calculated a Rho of 0.903, indicating high spatial dependence. This result underlined the incorporation of spatial relationships in making the LST prediction within heterogeneous urban environments.
The calculated Rho value of 0.903 increased the results. It also indicated the neighboring values’ positive effect on the outcome (LC8 temp) to a degree of 0.903, which was statistically significant. 
Table 5 shows a quick comparison between the two models (i.e., GAM and SAR).
The R2 improved substantially to 0.85 for the SAR model, suggesting the model’s validation as the UC model. Model RMSE and MAE also decreased for the SAR model. Even though the GAM increased the efficiency of the regression model, the UC model further increased the prediction of the regression model by incorporating the neighboring values.
  3.4. Comparison of UC-GAM and Landsat LST for 2013 and 2014
The model’s accuracy is tested for overall fit by comparing the mapped result for spatial autoregression (UC), the GAM, and the actual LC08 values for each day. The maps in 
Figure 6 show the produced results for the predicted temperature at 60 m resolution using the GAM and UC model compared to the actual measured temperature from Landsat for each day.
With its significant fit, the UC model picked up many of the micro-features within the study area, showing the same hotspots as demonstrated in each LC08 image for each day. The errors between the actual Landsat measured temperature and UC predicted values for each image were calculated to further test the model’s overall fit. A few errors were observed, and the model predicted the LST to be at a 60 m resolution. In addition, the R-square was calculated for the observed and predicted values for the Landsat images, and the results show a robust correlation of 0.79–0.95 for each day of the study.
Figure 7 shows a firm agreement between measured and predicted LST values, with R
2 in the range of 0.79 and 0.95 and RMSE in the range of 0.561 and 1.013. However, a minor bias was noticeable in scatterplots. The calculated MBE also showed a systematic deviation. Possible sources of this bias included interpolation errors during kriging, using static land surface characteristics (e.g., NDVI), and the integration of datasets with different temporal resolutions.
 Future work could explore integrating advanced nonlinear modeling techniques, such as random forest or XGBoost, to mitigate the bias observed in the predictions. This method may improve the model performance efficiently. Additionally, incorporating dynamic parameters such as wind speed and soil moisture or refining the interpolation process may improve model performance. Residual analysis and sensitivity tests could further elucidate the specific drivers of bias.
Improvements in the predictive accuracy of the SAR model were in line with the findings of [
62]. They presented evidence that integrating Moran’s I in their approach remarkably improved the clustering analysis in urban surface temperature studies. Similarly, the spatial weighting matrices and the very close neighboring points, with a Rho equal to 0.903, underlined the importance of spatial dependencies in urban thermal modeling. The alignment of these studies showed the solidity of the applicability of spatial autocorrelation techniques for re-evaluating the urban LST models with a particular focus on densely compact cities like Manhattan.
Additionally, the SAR model accounted for spatial dependencies in agreement with [
63], who employed a geographically weighted autoregressive model in downscaling MODIS LST. Their findings revealed that consideration of spatial heterogeneity and autocorrelation reduced the RMSE while improving the performance of their model. The present study had reductions in RMSE almost parallel to theirs through downscaling. Thus, this confirmed that spatial autoregression was useful in dealing with non-stationarity in LST variations.
Furthermore, the comparison between the GAM and SAR models in this study mimicked [
64] findings. They showed that linear models, like GAM, may not appropriately capture the spatial variability inherent in LST. Their application of machine learning and statistical techniques revealed similar trends. Their results indicated that linear models could not deal with spatial autocorrelation well, whereas the spatially explicit models did. This study’s improvement in R
2 from 0.397 to 0.85 proved the trend, underlining the superiority of spatial models for heterogeneous urban environments. This also supported their assertion that addressing uneven LULC changes and integrating neighborhood effects improved the predictive accuracy of urban thermal studies.
Accordingly, the results of this study were similar to [
65], which used geographically and temporally weighted autoregressive models for LST downscaling. Their approach achieved similarly high correlations (R
2 > 0.85) and low RMSE. This underlined the efficiency of using temporal and spatial weighting in LST prediction. Their findings extended this study’s results by showing the SAR model’s applicability to high-resolution urban data. The strong agreement between observed and predicted LST values in both studies validated the use of advanced spatial models in urban microclimate research, especially for mitigating urban heat islands.
Furthermore, universal kriging primarily addresses spatial autocorrelation and elevation differences through lapse rate correction. This approach ensured compatibility with high-resolution Landsat data and provided spatially refined inputs for urban microclimate modeling. However, it did not incorporate critical land cover characteristics such as vegetation density, impervious surface fraction, or water bodies. These variables were well-established determinants of urban temperature variations. For instance, dense vegetation typically could reduce surface temperature through evapotranspiration. Accordingly, impervious surfaces could elevate local temperatures due to heat retention. The absence of these factors in the interpolation framework may limit the model’s ability to fully capture the complexity of urban heat dynamics [
66,
67]. However, the methodology presented in this study represented significant progress in spatial downscaling for urban-scale analyses. Universal kriging provided a robust geostatistical framework, enabling precise alignment of NLDAS air temperature with Landsat-derived LST data. This approach must be enhanced by integrating additional land cover variables, such as NDVI, albedo, and building density, into the interpolation framework. This integration will enable the modeling of nonlinear and spatially variable relationships between land cover and urban temperatures. Advanced techniques, such as Kriging Regression or machine learning algorithms like random forest, will also be explored to improve predictive accuracy further. These enhancements build on the foundation laid by this study, paving the way for more comprehensive models that account for both atmospheric and surface-level factors influencing urban temperatures.
  3.5. Limitations of the Study and Contributions
Cloud cover over urban areas is higher due to increased condensation nuclei and convective air movements. This poses a significant challenge in accurately retrieving land surface temperature by thermal infrared remote sensing. Clouds mask the satellite’s thermal sensors, creating gaps in the data that impede precise analysis of urban heat. This aspect is particularly critical for methodologies that rely on clear-sky observations. Additionally, more cloud cover reduces satellite visibility and complicates accurate thermal data retrieval for analyzing city heat. Urban surface heterogeneity impacts thermal emissivity, causing errors in estimations of land surface temperature. Correspondingly, there are limited satellite revisit times to capture fast-changing temperatures in cities, hence not creating critical temporal gaps in data [
68,
69,
70,
71].
However, this study’s method was highly adaptable, given its reliance on multi-source data fusion, advanced statistical models, and state-of-the-art reconstruction techniques. By addressing cloud cover challenges through reconstruction methods and spatial models, the method maintained its accuracy and robustness, as validated by studies such as [
19,
49]. This work coupled high-resolution Landsat data with detailed meteorological inputs from the North American Land Data Assimilation System (NLDAS). Further refinement was achieved using universal kriging interpolation to downscale coarse-resolution NLDAS temperature data into a finer 60 m grid. This approach was practical for urban microclimate modeling in Manhattan, as shown by a strong correlation of up to 0.95. The correlation was observed between downscaled NLDAS temperature estimates and weather station data in Manhattan. The study used both Generalized Additive Models (GAMs) and spatial autoregression (SAR) to predict land surface temperatures (LSTs). The GAM with LST captured nonlinear relationships between surface characteristics, such as vegetation, albedo, and building fraction. However, the GAM failed to account for spatial autocorrelation between neighboring data points. In contrast, the SAR model included spatial dependencies and delivered better performance, with a coefficient of determination of R
2 = 0.85. The SAR model also had a lower root mean squared error (RMSE) of 0.736, improving prediction accuracy. The SAR analysis indicated that neighbor land characteristics are crucial for predicting LST variations in heterogeneous urban areas like Manhattan. The results also showed the robustness of the urban microclimate (UC) model when tested against Landsat-derived LST values.
The tests were conducted over several days during the summers of 2013 and 2014. The UC model accurately captured micro-scale temperature variations, with R2 values between 0.79 and 0.95. RMSE values for the UC model ranged from 0.561 to 1.013, highlighting its precision. These results showed the method’s adaptability in addressing spatial and temporal gaps in thermal data. The study acknowledged the limitation posed by cloud cover in satellite-based observations. Despite this, interpolation and statistical models effectively overcame cloud-induced data gaps. This ensured the accuracy and applicability of the UC model for fine-scale urban temperature predictions.
Furthermore, this study focused on the urban microclimate of Manhattan during summer seasons when snow’s presence and impact on albedo are negligible. Summer was selected to enable the examination of the most prominent UHI effects and thermal variations related to urbanization. It was crucial for assessing heat stress and urban sustainability strategies. While the methods were robust and adaptive, excluding snow scenarios did limit direct applicability to environments where albedo changes due to snow are a dominant factor. Avenues for future work would be to extend the methods presented here through the inclusion of datasets representing winter months. The methods specifically incorporate dynamic albedo and emissivity changes within the model. They can further determine seasonal applicability and sharpen predictions across various climatic conditions.
  4. Conclusions
This study developed an LST modeling framework in urban microclimates by fusing high-resolution Landsat data with NLDAS meteorological inputs. The urban microclimate (UC) model was far superior to other standard statistical methods. This was mainly because of the incorporation of spatial dependencies using a spatial autoregressive approach of SAR. The final model specification of SAR showed an R2 of 0.85. It reduced RMSE by 0.736, indicating its high ability to capture temperature variations at a micro-scale accurately.
The study has demonstrated that surface characteristics such as building density, vegetation, and albedo significantly shape urban temperature distributions. The results showed that urban form and neighboring influences impacted LST, as validated by Moran’s I clustering analysis (index = 0.58). The ability of the SAR model to account for spatial autocorrelation was important in improving the prediction accuracy of densely built environments like Manhattan.
High-resolution UC model predictions effectively captured urban hotspots, with a solid agreement to observed Landsat LST values (R2 = 0.79–0.95). The agreement demonstrated the utility of the model for urban thermal analysis, in which micro-scale variability is significant. Integrating the meteorological and land surface data into the model proved its adaptability and applicability in diverse urban settings.
The study primarily focused on summer months, but the methodology offered the potential for seasonal extension to evaluate albedo changes in snow-covered environments. Future improvements could include dynamic parameters like wind speed and direction to enhance predictive accuracy and broaden the model’s application.
This work highlighted spatial models’ critical role in urban climate studies. The UC model provided an instrumental tool for assessing urban heat islands and developing sustainable urban planning strategies while addressing challenges and opportunities related to rapidly urbanizing areas.