1. Introduction
Urbanization is reshaping city size and morphology worldwide. The global urban population is expected to rise from 56% to 68% by 2050 [
1]. This expansion replaces natural landscapes with built environments, altering regional climates and generating distinct urban microclimates. One of the most studied effects is the Urban Heat Island (UHI), characterized by elevated temperatures in urban centers compared to surrounding rural areas. First documented by Howard [
2], UHIs arise from high heat retention in dense urban structures, reduced vegetation, and anthropogenic activities [
3]. Recent satellite studies confirm its intensification in metropolitan areas, such as Houston, Texas [
4], underscoring the value of remote sensing for tracking UHI dynamics and evaluating mitigation strategies.
UHI development results from a complex interaction of urban morphology, land cover, vegetation, and building materials, all influencing heat absorption and retention. Increased building height and density exacerbate heat accumulation, particularly in megacities with restricted airflow [
5]. UHI intensities vary widely, from 2 °C in small cities to over 12 °C in dense metropolitan areas [
6]. However, although UHI intensities exceeding 12 °C have been reported under specific conditions, typical values range from 2 °C to 4 °C depending on urban morphology and climatic context. Differences in magnitude stem from methodological approaches, climatic conditions, and urban configurations, highlighting the need for standardized measurement frameworks [
7]. Moreover, urban design plays a key role in air circulation and thermal radiation, with compact, multi-story structures amplifying the UHI effect [
8].
Beyond temperature increases, UHIs influence energy consumption, thermal comfort, and public health. Higher urban temperatures drive up cooling demands, increasing energy use and peak electricity loads [
9,
10]. Additionally, temperature rises correlate with higher mortality rates, with up to a 2.72% increase in mortality per additional degree above optimal thresholds [
11,
12]. Meteorological factors such as wind speed and atmospheric stability modulate UHI intensity by affecting heat dispersion [
13,
14]. Studies in cities like Seoul and Łódź demonstrate how wind dynamics and land use shape UHI distribution [
15,
16]. In the context of global warming and rapid urbanization, UHIs pose a major challenge for sustainable urban development and livability [
17,
18].
Despite extensive research, the main drivers of UHIs and the most effective mitigation strategies remain subjects of debate. Some studies emphasize urban morphology [
8], while others focus on wind dynamics and heat transport [
7]. Modeling urban wind behavior remains complex due to data limitations, whereas urban morphological factors—such as building height, density, and spatial configuration—are more readily quantifiable using satellite imagery and geospatial databases. Research confirms that urban morphology significantly influences microclimates, with elements like street canyon morphology, vegetation cover, and land surface permeability affecting UHI intensity [
19,
20,
21]. In several cities across France, specific morphological characteristics, such as height-to-width ratios and sky view factors, have been linked to UHI variation [
22].
This study calculates Land Surface Temperature (LST) at the road segment level, using 50 m road sections as analytical units to integrate morphological and environmental assessments. LST is considered as a sign of a UHI, being a parameter that measures its expression, Surface Urban Heat Island (SUHI) [
23]. Roads are considered in this work as a measurement unit due to their extensive urban presence and their role in thermal dynamics. Specifically, roads can be used as containers of the surrounding morphological and environmental conditions. This structured approach enables a standardized evaluation of localized heat accumulation and its contribution to UHIs, positioning streets as key spaces for large-scale mitigation measures.
Urban climate research increasingly utilizes the Local Climate Zone (LCZ) classification to understand spatial patterns of UHI intensity [
24]. Recent studies, such as [
25], have demonstrated the value of LCZs in relating Land Surface Temperature (LST) and air temperature distributions at fine spatial scales. Although this study does not explicitly apply an LCZ framework, it indirectly incorporates key morphological and environmental descriptors analogous to LCZ parameters, ensuring comparability while maintaining flexibility based on openly available raw datasets.
Existing UHI assessment methods often face limitations related to data accessibility (especially atmospheric and meteorological, as well as morphological) and computational demands [
7,
26]. While physical models require detailed morphological, atmospheric, and meteorological data, remote sensing methods may lack the resolution to capture localized thermal variations [
27,
28]. To bridge these gaps, this study adopts a morphology-based approach, analyzing how building height, density, design, and green space distribution influence heat retention. Additionally, it incorporates moisture levels via surface water bodies and solar radiation through the Summer Climate Severity (SCS) index.
The inclusion of a climate severity metric is grounded in prior efforts to quantify summer thermal stress and its effects on urban systems. One of the earliest anthropocentric indices [
29] was originally developed for the northeastern United States, though its global applicability remains unverified. In Europe, a Climatic Severity Index was proposed [
30] to define summer climatic zones in Italy, linking them directly to air conditioning energy demand in buildings. In a similar manner, the Spanish Building Code (CTE) introduced a classification of summer climatic zones in the Iberian Peninsula based on accumulated summer degree-days. This national-level approach provides a standardized, regulation-backed index for quantifying summer climatic severity, which serves as the foundation for the SCS used in this study.
This methodological framework aligns with recent research advocating for morphology-based UHI assessments, particularly in Mediterranean contexts where compact urban structures significantly modulate temperature distributions [
31,
32].
To enhance practical applicability, this work develops a custom QGIS plugin for streamlined data processing and broad usability across different urban contexts. Statistical techniques, including ANOVA and correlation analyses, reinforce the reliability of the findings. This comprehensive framework not only advances LST-based UHI research but also provides actionable insights for urban planning, supporting evidence-based strategies for mitigating UHIs in diverse climatic zones.
The Iberian Peninsula was selected as the study area due to its varied climatic and urban characteristics. The findings will serve as a foundation for replicating this approach in other cities with similar geographical contexts, advancing UHI research and supporting data-driven urban policies.
The remainder of this paper is structured as follows:
Section 2 details the data sources, parameters, and methodology for UHI modeling.
Section 3 presents statistical results and discussion.
Section 4 presents model applicability across urban environments, and
Section 5 summarizes key insights.
2. Materials and Methods
The proposed methodology for characterizing UHIs (
Figure 1) involves morphological and environmental parameters to understand their influence on the gradual accumulation of heat in city centers. This approach characterizes UHIs at a 50 m street or road scale, as illustrated in
Figure 1. Key factors considered include street and building geometry, SVF, altitude, vegetation, and water bodies (using spectral indices), and SCS. The study covers the summers of 2021, 2022, and 2023, considering July and August for each year as the peak heat period.
To ensure precision and efficiency, we developed a custom plugin for Quantum GIS (QGIS ©, available at:
www.qgis.org (accessed on 10 January 2025)), which automates all necessary operations using predefined information layers. The plugin enhances reproducibility and minimizes human error, ensuring that parameter calculations adhere to rigorous scientific standards.
This methodology assesses the relationship between each selected parameter and nighttime surface temperature, specifically to the 90 m synthetic average summer LST at night. Below, we detail the data sources for parameter computation, followed by the statistical and correlation analyses conducted using the R programming language (v.4.3.0).
2.1. Data Sources
To ensure global applicability and replicability, all data sources follow the Open Data principle and provide global coverage. Based on the two main evaluation criteria—urban morphology and urban environmental conditions—the data sources are categorized as follows:
2.1.1. Urban Morphology Data Sources
Urban morphology describes the physical structure of human settlements, and several parameters, such as wind corridors, altitude, green spaces, and hydromorphology, are considered. The data sources include:
OpenStreetMap (OSM): Provides street-level data on roads, buildings, and urban infrastructure, accessed via the QuickOSM plugin (
Section 3).
Cadastral Data: Offers detailed land use and property boundary information. Data for this study was sourced from the General Directorate of the Spanish Cadastre.
Digital Terrain and Surface Models (DTM/DSM): all data was sourced from the Spanish Geographic Institute (IGN), aligned with the INSPIRE Directive. The following specific data were used:
DTM at 2 m resolution, providing elevation data, crucial for computing street altitudes.
DSM at 2 m resolution, used to calculate the SVF.
Building height model at 2.5 m resolution that helps estimate wind corridors, influencing UHI formation.
Coastline and Municipality Boundaries: used to determine the distance of a city to the sea and to classify cities as coastal or non-coastal. These data were sourced from the IGN.
2.1.2. Urban Environmental Data Sources
Urban environmental conditions influence urban heat islands (UHIs) through vegetation cover, water presence, SCS, and LST. The data sources used in this study include:
Sentinel-2 BOA Reflectances: Captured by the Multispectral Imager (MSI) sensor aboard Sentinel-2 satellites of the European Satellite Agency (ESA), which have 10 m spatial resolution. These atmospherically corrected measurements are used to derive spectral indices (e.g., the NDVI and NDWI) for monitoring vegetation and water bodies in the city.
Summer degree-days in base 20: This historical summer temperature data servers to classify cities based on the severity of their summers, the SCS index was used, following the Spanish Building Code (CTE). The index relies on degree-days (DD) at a base temperature of 20 °C, using data from AEMET (Spanish Meteorological Agency).
MODIS and ASTER Nighttime LST: To use summer nocturnal LST as reference parameter in the model, we created a synthetic night LST image with a 90 m spatial resolution based on the average summer nighttime LST. This approach addresses the absence of an open product from ESA or NASA that offers both adequate spatial and temporal resolution for this purpose. Thus, we integrated the spatial detail of ASTER with the temporal frequency of MODIS, following methods previously validated in UHI studies (e.g., ref. [
33]). Quality control was applied to remove extreme temperature anomalies or cloud interference.
MODIS with 1 km resolution, daily nighttime LST data.
ASTER with 90 m resolution, on-demand nighttime LST data.
2.2. Urban Morphology Parameters
From the data sources selected, different parameters are calculated to describe effectively the urban morphology and design (green spaces, hydromorphology) and the SCS. These parameters are detailed in the following subsections. These parameters represent morphological drivers of urban heat patterns.
2.2.1. Wind Corridor (WC)
This parameter is calculated by averaging the height of buildings around the 50 m street/road segment and dividing it by twice the average distance of the buildings to the axis of this 50 m street/road segment. This metric follows the classical height-to-width ratio approach proposed by [
6] to estimate potential urban ventilation.
Building heights were extracted from cadastral geometry data. Since the DSM often lacks height values at building centroids, all DSM points within each building footprint from the cadastral data were analyzed, and then their average height was computed.
2.2.2. Buildings on Both Sides (BBS)
This categorical parameter assesses the presence of buildings on both sides (value 2), one side (value 1), or neither side (value 0) of each 50 m road/street segment. To obtain this information, data on streets and roads from OSM, the building footprints from cadastral data, and the DSM are required.
Specifically, a 20 m buffer from the axis of each 50 m road/street segment is automatically created. This buffer is analyzed to determine whether it intersects with building footprints. The center of each intersecting building and the angle formed between the building and the street/road segment are calculated.
If all the angles have the same sign, there are buildings on one side of the road (value 1 for the parameter). If the angles have different signs, there are buildings on both sides (value 2 for the parameter). Finally, if no buildings intersect with the buffer, there are no buildings on either side (value 0 for the parameter).
2.2.3. Sky View Factor (SVF)
This parameter quantifies the proportion of the sky visible from a specific point on the road. To compute this parameter, the 2 m DSM is used. The calculation is performed using the SAGA-GIS tool, which processes raster-based elevation data. Each raster point is analyzed to determine angles and distances in multiple directions, with the number of directions typically set to eight in the default plugin.
The algorithm follows a structured sequence: first, it computes the slope and aspect (the direction of the steepest slope) for each point. Then, it calculates Phi as the arctangent of the point’s angle, followed by the sine and cosine of both Phi and the slope. Finally, the SVF value is obtained as the average result of a specific formula applied to each directional calculation.
After calculating the SVF raster, it is assigned to each 50 m road/street segment by calculating the average of the point values which match with the street points.
2.3. Urban Environmental Parameters
From the data sources selected, different parameters are calculated in order to describe effectively the urban morphology and design (green spaces, hydromorphology) and the SCS. These parameters are detailed in the following subsections.
2.3.1. Normalized Difference Vegetation Index
The NDVI is used to quantify the presence of vegetation and its level of greenness, so that the vegetation density and changes in plant health can be determined from the index value. In this work, Sentinel-2 Bottom of Atmosphere (
BOA) reflectance data is used, with bands 4 (
Red band) and 8 (near infrared band), both at a spatial resolution of 10 m. This index is calculated using Equation (1):
The average summer
NDVI at 10 m resolution is computed using Google Earth Engine © [
34], providing insights into vegetation cover. The average
NDVI value within the same 20 m buffer as described in
Section 2.2.2 is then assigned to each 50 m street/road segment.
2.3.2. Normalized Difference Water Index
The NDWI quantifies water and humidity levels for vegetation and soil. In this case, Sentinel-2 BOA reflectance data from bands 8 (near infrared,
NIR, band) and 3 (
Green) is used, through Equation (2):
The average summer
NDWI at 20 m resolution is computed using Google Earth Engine © [
34], providing insights into water bodies and humidity levels. The average NDWI value within the same 20 m buffer as described in
Section 2.2.2 is computed and assigned to each 50 m street/road segment.
2.3.3. Altitude
The altitude of each 50 m road/street segment is determined by averaging the altitude values of the 2 m DTM points within each 50 m road/street segment from OSM.
2.3.4. Distance to the Sea (DtS)
This parameter is calculated by averaging the distance of the 50 m street/road segment from the ocean. To obtain this information, streets and roads from OSM and lines of Spain from the IGN are required.
2.3.5. Coastal
This categorical parameter is used to determine whether a city is near the sea (value 1) or not (value 0). To determine if it is close to the sea or not, the distance from the city center to the sea is considered. This information is obtained from coastline and municipality data provided by the IGN.
2.3.6. Summer Climate Severity (SCS)
This index classifies cities into four climatic types based on the DD at a base temperature of 20 °C for the specified summer months. This classification aims to understand extreme heat conditions of cities during the summer, from June to September in the Northern Hemisphere. DD are calculated by obtaining historical temperature data for the location (from weather stations or meteorological services), calculating the daily average temperatures, and then summing the amount by which these averages exceed the base temperature of 20 °C. Then, SCS is calculated using Equation (3), in accordance with the CTE regulations:
where
a,
b, and
c are the regression coefficients, with values
, respectively.
By integrating these data, we can better quantify and compare the intensity and duration of heat waves across different cities, providing valuable insights for urban planning and heat mitigation strategies. The SCS classification for the Spanish cities according to this model is illustrated in
Figure 2.
2.4. UHI Control Parameter
The summer nighttime LST serves as the reference for the model of UHIs. Its creation involves generating synthetic 90 m resolution night LST images by combining data from MODIS and ASTER. This is the process followed:
Correlation analysis: Initially, the correlation between the nocturnal LST values from MODIS and ASTER and the MODIS and ASTER LST images of the same locations, dates, and times is calculated.
MODIS data processing: The 1 km average summer nocturnal LST from MODIS is calculated using the daily available nocturnal data from this sensor.
Synthetic image creation: The 1 km summer average LST product from MODIS is then transformed into a synthetic 90 m average summer nocturnal LST image by applying the correlation between ASTER and MODIS nocturnal LST values.
The LST values of this synthetic image are used as the reference for the proposed method to model UHIs.
2.5. Sample Data
To ensure the robustness of the model, we established criteria for selecting a set of Spanish cities for methodology calibration, aiming to achieve a representative sample across the four SCS climatic zones. This approach provides comprehensive and balanced data for accurate calibration. Specifically, a selection of 20 cities out of the 52 Spanish cities, representing approximately 40% of the total, was randomly performed, ensuring diversity while maintaining manageability in data collection and analysis.
Five cities were selected from each of the four climatic zones. This number ensures around 10% representation from each climatic zone and balances the total number of road segments analyzed, targeting approximately 44,770 road segments per SCS zone. This ensures that the data from each zone is comparable in scale and scope, enhancing the robustness and generalizability of the methodology across different environmental conditions.
Thus, the Spanish cities that were considered for calibrating the model were those shown in the
Table 1.
In order to apply this methodology in other European cities, it will be necessary to have the same input data. Furthermore, it is important to mention that the implementation of this methodology must be aligned with the European Union’s INSPIRE Directive, which establishes a framework to ensure that geospatial data are compatible and can be shared and used effectively across Europe.
2.6. Statistical Analysis and Correlation Assessment
2.6.1. Analysis of Variance (ANOVA)
To identify significant differences between the four SCS climatic zones, representative samples were first obtained for each zone. ANOVA, a widely used statistical test that assumes normality, independence, and homoscedasticity [
35], was employed. This method tests the null hypothesis that group means are equal. If the False-Positive Risk (FPR) is below a conventional threshold (typically 0.05), the null hypothesis is rejected, indicating significant differences between groups [
36].
The initial step involved verifying three statistical assumptions:
Normality: Checked using the Kolmogorov–Smirnov–Lilliefors test due to the sample sizes exceeding 50.
Independence: Ensures that sample values from one population were not related to those from another.
Homoscedasticity: Evaluated using Levene’s test, the most common test for this purpose.
If any of these assumptions were violated and considering more than two groups, the Kruskal–Wallis H test, a non-parametric ANOVA, was applied to detect significant differences between the four zones. A significant Kruskal–Wallis statistic indicated average differences between at least two climate types.
Post hoc tests, such as the Holm test (less strict than Bonferroni) [
37], were conducted following significant ANOVA results to identify pairwise differences among groups, providing detailed insights into the data.
2.6.2. Correlation Analysis
After confirming significant differences between the four zones, a correlation analysis was conducted.
Correlation coefficients, including Pearson’s correlation test for quantitative parameters and Kendall’s correlation test for qualitative parameters [
38], were used to assess relationships. Pearson’s correlation was used to assess relationships between the 90 m average summer LST at night and the following variables: Wind corridor, Altitude, NDVI, NDWI, and SVF. Kendall’s correlation was applied separately to evaluate the relationship between LST at night and the Buildings on Both Sides parameter. Correlation coefficients range from −1 to 1, with 0 indicating no correlation and ±1 indicating high correlation. Positive or negative values denote simultaneous increases or decreases in correlated parameters.
2.6.3. Linear Model
Once the differences between the four zones and the correlating parameters in each zone with the LST are studied, these correlations were used to fit one mathematical model per zone using the corresponding significant parameters. Next, each zone was divided into 70:30% training/test sets to fit the linear models. This approach, based on zone-specific multiple linear regression, was chosen to ensure model interpretability and generalizability across diverse urban contexts, aligning with recent UHI studies that favor transparency and replicability over algorithmic complexity [
39].
2.6.4. Statistical Evaluation of Error
In line with the recommendations from the American Statistician [
40], this study refrained from using
p < 0.05 as a threshold for defining statistical significance and avoided the term “significant”. Instead, all the testing of hypothesis was conducted using
p-value to Bayes Factor Bound (BFB) calibrations, following the guidelines of Benjamin and Berger [
41]. False-Positive Risk (FPR) values, as advocated by Colquhoun [
42], were also calculated to assess the ratios between the Null Hypothesis (H0) and Alternative Hypothesis (Ha).
p-values were evaluated using a robust threshold of 0.003 (3σ) for more conclusive results. This threshold corresponds to an FPR of 4.5 ± [1.2, 15.9]%, with priors set at 0.5 ± [0.2, 0.8] [
43], unless otherwise stated.
In addition to using
p-values to evaluate statistical hypotheses, effect size was calculated using the Kruskal–Wallis ε
2 statistic in [
44], and 95% confidence intervals for correlation analysis [
45].
For model evaluation, Root Mean Square Error (
RMSE), Mean Absolute Error (
MAE), and Mean Absolute Percentage Error (
MAPE), as in Equation (4), were employed as the metrics to measure the discrepancy between predicted and actual Land Surface Temperature (LST) values, provided their adequateness to evaluate mathematical models [
46]:
where
and
represent the measured and predicted values of the LST for the 50 m road/street segment
i, respectively, and
n is the total number of roads/streets segments considered in the validation dataset.
5. Conclusions
This study presents a robust and scalable methodology for modeling nighttime Urban Heat Island (UHI) intensity across different climatic contexts, using a combination of morphological and environmental urban parameters. Leveraging open-access geospatial data and a custom QGIS plugin, the method enables high-resolution characterization of urban thermal behavior through indicators such as Wind Corridor, Altitude, NDVI, NDWI, and Sky View Factor (SVF).
Statistical analysis confirmed that all parameters exhibited significant differences across climatic zones and strong correlations with nighttime Land Surface Temperature (LST), although the strength and direction of these relationships varied notably by zone. Accordingly, four zone-specific linear models were developed (based on Summer Climate Severity categories), showing good predictive accuracy (RMSE between 1.40 °C and 1.59 °C on average). Their generalizability was validated using four additional cities representative of each zone.
These findings highlight the context-dependent role of urban parameters. Vegetation (NDVI) showed a strong cooling effect, especially in cooler zones, whereas Altitude had contrasting impacts depending on the climatic context. The NDWI and SVF also played relevant but variable roles. These insights provide valuable guidance for localized urban planning and UHI mitigation strategies.
In terms of contributions, this study offers the following:
A transferable and replicable framework for UHI modeling based on standardized, open geospatial inputs.
A demonstration of how climatic zoning enhances model performance and interpretability.
Practical tools for city planners to evaluate microclimatic conditions at the street level.
From a planning perspective, the results can inform decision-making under current Spanish urban legislation. Instruments such as the Ley del Suelo and the PGOU general urban development plans offer a regulatory basis for integrating environmental indicators into zoning and building design. The LST predictors identified (vegetation, morphology, and moisture) can support strategies to mitigate urban heat. Moreover, alignment with climate adaptation frameworks like Sustainable Energy and Climate Action Plans (SECAPs) enhance the practical relevance of this methodology for municipal action.
Limitations of the study include the scarcity of validation data due to the limited availability of nighttime LST imagery from the ASTER sensor, and the reliance on climatic classification tailored to the Iberian Peninsula. This restricts applicability to the Mediterranean context, requiring further testing in other regions.
Future research could build upon this work as follows:
Incorporating additional parameters such as wind direction, surface roughness, and solar radiation to better capture microclimatic interactions.
Exploring the influence of coastal proximity and differentiating between sea types (e.g., Mediterranean vs. Atlantic).
Applying artificial intelligence techniques to automate climate zone classification and improve predictive capacity beyond linear assumptions.
Extending the model temporally to assess seasonal and diurnal UHI variations.
Developing city-specific models to reflect local thermal behavior and improve generalizability.
Enhancing spatial validation through mobile data collection or dense urban climate sensor networks.
Overall, the proposed methodology represents a significant step toward a context-aware, data-driven strategy for urban climate management, enabling cities to better understand and mitigate the localized effects of heat accumulation under varying climatic pressures.