Within-Class and Neighborhood Effects on the Relationship between Composite Urban Classes and Surface Temperature

Understanding the relationship between urban structure and ecological function—or environmental performance—is important for the planning of sustainable cities, and requires examination of how components in urban systems are organized. In this paper, we develop a Structure of Urban Landscape (STURLA) classification, identifying common compositions of urban components using Berlin, Germany as a case study. We compute the surface temperature corresponding to each classification grid cell, and perform within-cell and neighborhood analysis for the most common composite classes in Berlin. We found that with-class composition and neighborhood composition as well as the interaction between them drive surface temperature. Our findings suggest that the spatial organization of urban components is important in determining the surface temperature and that specific combinations, such as low-rise buildings surrounded by neighborhood trees, or mid-rise buildings surrounded by high-rise buildings, compound to create a cooling effect. These findings are important for developing an understanding of how urban planning can harness structure-function relationships and improve urban sustainability.


Introduction
Urbanization is a core process affecting our ability to address the local and global challenges of sustainability.The scope and pace of urbanization are described extensively in the literature.It is by now clear that urban sustainability is among the most important global challenges of our era [1].A determinant feature of such urban areas is their spatial heterogeneity and the disturbance they cause to natural processes.Recent literature on urban ecosystem services (ES) [2] and urban ecology [3] highlights the importance of natural processes within cities and the ways in which they benefit human wellbeing.Understanding cities as integrated human-nature systems can support the development of sustainable urban solutions [4].However, for urban planners to be able to integrate ecological principles in planning practices requires both a better understanding of the relationship between urban structures and ecological phenomena, and the translation of this understanding into tools that can be easily integrated into decision making processes.Urban spatial structure is important in understanding urban ES provision, and it can provide a bridge to planning sustainable cities [5].Urban structure characteristics-including vegetation and other land covers-can be used to estimate ecological functions [6,7].However, defining urban structures-and key relationships between urban structures-and ecological processes is especially challenging in landscapes characterized by density and patchy spatial patterns [8].A functional classification of urban structure is necessary for understanding the nature of social-ecological relationships in urban areas [3,9,10].The Structure of Urban Landscape (STURLA) classification system was developed to provide a simple, unbiased, multi-dimensional classification of urban land covers, their composition, and neighborhood structures.In previous papers [11,12], we outlined the rationale of STURLA and presented initial case studies in Berlin and New York City (NYC).The premise of the classification is that the relationship between urban structure and ecological function/environmental performance, requires an examination of the complexity of urban structure, in particular how components within urban systems are organized.Particularly important is the vertical dimension, which, thus far, is rarely addressed [13,14].To achieve an unbiased approach to urban structure classification, Hamstead et al. [11] developed a procedure that depends on fusing fine scale urban land cover with building height information and overlaying this with a classification grid cell of a predetermined scale.Each cell in the classification grid is characterized by a combination of the components that fall within that cell.Different classification units and scales were tested, along with their relationship to one biophysical function that is important for human health in cities: urban surface temperature.
The mechanisms by which urban atmospheric temperatures differ from non-urban regions are complex and relate to urban structures and human activity [15].Oke [16] describes the main ways by which urban landcover and land use influence temperatures.In essence, human-made structures and surfaces alter the natural balance of surface energy and radiation through mechanisms such as: absorption of radiation by vertical surfaces; retention of infrared radiation in street canyons; uptake and retention, or delayed release of heat by buildings and paved surfaces; conversion of solar radiation to sensible heat rather than latent heat due to decline in surface transpiration; and the release of sensible and latent heat from the combustion of fuels related to urban activities [15,16].Thus, analyzing urban structures for these mechanisms, including the adsorption by vertical dimensions is critical for understanding temperature patterns across urban areas.Remotely sensed surface temperatures have been used in diverse ways: as an indicator of environmental performance and a proxy of multiple ecological processes in urban systems [17,18] to support the development of an urbanization index [19], to study its relationship to land cover [20]; as an urban heat island [21][22][23][24][25]; and for urban structures and configurations [24][25][26][27][28][29].Studying the surface temperature of different green spaces, Chen et al. [30] found that configurations such as edges, shape, size, and connectivity, are correlated with surface temperature and that this relationship varies over different types of green surfaces.Some studies focus only on the difference in surface temperature between green and impervious surfaces [22,24,25], while other studies consider more specific urban landcovers [21,28] or combine measures of vegetation with demographic and socioeconomic variables [21,26].A remotely sensed surface temperature is an appealing indicator because specific data is available at a rare combination of conditions continuous spatial coverage and at both moderate spatial and temporal resolution.This thus allows for analysis of variations within urban systems.Few other indices or proxies are available that provide all three of these necessary characteristics.Findings commonly associate the occurrence of vegetation and vegetation density with cooler temperatures than when associates with paved surfaces.
In previous research using STURLA, empirical evidence from two cities-Berlin and NYC-suggest that it is possible to identify common urban structure units that encompass the majority of the cities' land area, at least in the case study examples from the western world [12].Moreover, when correlated to surface temperature, these common urban structure classifications exhibit distinct temperature signatures for different urban structure units with temperature trends dramatically similar between Berlin and NYC [11,12].These results suggest the potential to transform our understanding of the relationship between urban structure and ecological function, but they require further study to expand the classification system-to new geographic locations, ecological functions and environmental performance indicators and to verify their stability across scales and test the initial results from the cited papers.
Although previous research showed a clear distinction between the temperature signatures of most of the composite classes, it also showed a large variability in temperature within certain (mixed) classes.This variability may be due to both a local (within cell) and a neighborhood effect of urban land unit composition on surface temperature [12].
In this paper, we continue to explore STURLA classification as it relates to surface temperature.Here, however, we examine the surface temperature variability within the most common composite class in Berlin.This class is examined as a function of the proportion of the individual components in the class (within-class influence) and of neighborhood influence.We hypothesize that the variation in temperature may be influenced by the proportion of the different land cover components in and around the grid cell, and that the influence of some components will show a stronger effect from neighborhood parameters (for example trees) while other components will show stronger local effects (low buildings).
Therefore, the main research questions in this paper are: • Within STURLA classes, which land cover class components have the most significant impact on surface temperature?

•
What are the neighborhood effects on surface temperature?

•
How do land cover components and neighborhood effect interact in driving the surface temperature of particular STURLA classes?
Testing the classification system to understand the quantitative influence of the individual components will help test the basic STURLA assumption that we can use a simple present/absent test to determine urban composite classes.Understating the drivers behind the relationship between surface temperature and classification components will also strengthen the classification method by testing the importance of each of the different urban components in determining surface temperature and by suggesting any potential neighborhood effect that might guide further scale considerations classification and analysis.

The Case Study of Berlin
Berlin has a population of about 3.5 million within an area of about 900 sq km.In comparison to other European cities that have more than 500,000 inhabitants, Berlin is ranked 23rd in terms of population density.Open and green spaces comprise 45% of Berlin's land use, the majority of which is forested.The amount of green space increases with distance from the city center.However, the presence of large urban green spaces of more than two sq km in inner-city areas is a unique feature of Berlin.Built-up areas represent 55% of Berlin's land use.About half of the built-up area is residential and most residential areas are made up of low-rise buildings.A midrise building structure composed of what are typical five-story Wilhelminian buildings, forms a ring around the city center.High-rise buildings can be found within the dense city center or elsewhere locally on locations of prefabricated high-rise residential buildings [12].The city of Berlin is used as a case study here to present a continuation of previous STURLA research.In addition, Berlin's land use and landscape structures are representative in terms of the area's building stock erected continuously from the times of early industrialization in the 1850s until today.Also, population densities in the high-, mid-and low-rise buildings are typical of Germany and other Central European cities [31,32] so that findings for Berlin are representative of at least Central European cities.Moreover, Berlin offers a great variety of built and non-built neighborhoods to extensively test the STURLA-approach.Berlin had been used as a sample city for many similar social-ecological studies and, thus, the results we show in this paper can be compared to and contextualized by this body of knowledge.To create the STURLA classification we used an integrated land cover and building dataset that was overlaid with the classification grid of 120 sq m.This study improves on Larondelle et al. [12] by creating a new integrated dataset utilizing previously unavailable, high resolution data from Berlin's Environmental Atlas [33,34].The new building and vegetation height dataset, derived from high resolution, multispectral airborne imagery using varying segmentation methods [34] was integrated with a dataset on soil sealing [33].Subsequently [35], we categorized a height of five meters and above as tree cover and below five meters vegetation as grass/shrub cover.The building height dataset was classified as low-rise (1-3 stories), mid-rise (4-9 stories) and high-rise (above 9 stories).Roads and water classes were created using roads and water layers from the Urban Atlas [36].Surface sealing data from the Environmental Atlas Berlin [33] was used to distinguish between bare soil and other paved land covers.First, all pixels with no vegetation and a sealing rate of 0 were reclassified as bare soil.Pixels with a sealing rate higher than 0, which could not be classified using other input data as building or roads were reclassified as other paved.Next, the five raster datasets (building, tree canopy/grass-shrubs, roads, water, and other paved/bare soil) were resampled into aligning 2.5 m raster and were mosaicked to create an interceded dataset that includes the following classes: trees, grass/shrub, bare soil, water, low-rise, mid-rise, high-rise, roads and other paved surfaces.Finally, roads and other paved surfaces classes were collapsed into one class and termed "paved".
All data we have used for the city of Berlin has been based on data collected around the year 2010.For this reason, the surface temperature data was selected from 2010.All GIS analyses in this study were performed using ESRI ArcGIS 10.3.1 (ESRI, Redlands, CA, USA).
The integrated dataset was overlaid with a classification grid of 120 sq m.Each grid cell was assigned a STURLA class by identifying and concocting all urban components present within the grid cell [11].Next, we identified the most common STURLA class as class 'GTLMPB' -grass/shrubs (g) -tree canopy (t) -low-rise (l) -mid-rise (m) -roads-other paved (p) -bare soil (b), which covers 35% of the total area of Berlin.GTLMPB pixels were extracted for further analysis.

Within-Class and Neighborhood Analysis
Within-class Composition.We derived the percent cover of all components within each classification grid cell in the 'GTLMPB' class by running the Zonal Analysis tool 'Zonal Statistics as Table '.In ArcGIS, Zonal Statistics allows the calculation of a particular statistic (such as min, max or mean) for each zone in a zone database, using values from another dataset.Here the classification grid serves as the zone dataset and the landcover dataset is used to calculate the proportion of different landcovers within each zone (grid cell).
Neighborhood Composition.We used the results from the zonal analysis described above to assign a percent value to each STURLA cell for each urban component in the study area.We then computed the mean of each component in neighboring cells using the 'Focal Statistics' tool.Neighborhood was defined as the immediate adjacent STURLA cells to each 'GTLMPB' cell (a 3 × 3 matrix) and excluding the cell itself.Figure 1 illustrates the within-cell and neighborhood analysis concepts.

Surface Temperature
The surface temperature was obtained from the Landsat thermal band 6(1).We obtained monthly composite data for the month of July 2010 from the Global Web-enables Landsat Data (WELD) website.WELD data is terrain-corrected and radiometrically calibrated Landsat data [37].There are known uncertainties associated with utilizing temporally composite surface temperature datasets [38].However, Landsat 7 scanline error and resulting data gaps means that a composite of at least two scenes is necessary.WELD compositing procedures rely on an hierarchical pixel by pixel decision rationale documented by Roy et al. [39].
As the basis for surface temperature we used the WELD top of the atmosphere brightness temperature.We converted the top of the atmosphere brightness temperature to surface temperature using the following equation [22,25]: where ST = surface temperature (C); TB = brightness temperature obtained from WELD data; λ = wavelength of emitted radiance (11.5 μm);  = ℎ × ⁄ = 1.438 × 10 −2 mK ( = Boltzmann constant = 1.38

Surface Temperature
The surface temperature was obtained from the Landsat thermal band 6(1).We obtained monthly composite data for the month of July 2010 from the Global Web-enables Landsat Data (WELD) website.WELD data is terrain-corrected and radiometrically calibrated Landsat data [37].There are known uncertainties associated with utilizing temporally composite surface temperature datasets [38].However, Landsat 7 scanline error and resulting data gaps means that a composite of at least two scenes is necessary.WELD compositing procedures rely on an hierarchical pixel by pixel decision rationale documented by Roy et al. [39].
As the basis for surface temperature we used the WELD top of the atmosphere brightness temperature.We converted the top of the atmosphere brightness temperature to surface temperature using the following equation [22,25]: where ST = surface temperature (C); T B = brightness temperature obtained from WELD; λ = wavelenth of emitted radiance (11.5 µm) ρ = h × c/σ = 1.438 × 10 −2 mK (σ = Boltzmann constant = 1.38 × 10 −23 J/K, h = Planck's constant = 6.626 × 10 −34 Js, c = velocity of light = 2.998 × 108 m/s; ε = land surface emissivity.Land surface emissivity was estimated based on landcover properties and on NDVI [25,[40][41][42].We used the NDVI layer obtained from WELD [39] based on the same composite data as the thermal data.An NDVI greater than 0.5 was assumed to have an emissivity of 0.99.The emissivity of pixels with NDVI values between 0.2-0.5 were calculated as: where ε v = emissivity of vegetation (0.99), ε s = emissivity of soil (0.97).P v was calculated as follows [42]: Pixels with NDVI < 0.2 were assigned an emissivity of 0.97 [42].Finally, LANDSAT cloud classification was obtained through WELD and pixels suspected as clouds were removed from the image.

Analysis of Relationship between Surface Temperature and Within-Class and Neighborhood Variables
To investigate the relationship between surface temperature and within-class and neighborhood variables, we developed a least squares regression model, with surface temperature as the response variable and percentage covers of urban components (within-class and neighborhood) as potential explanatory variables (see Table 1).Figure 2 shows our overall statistical modelling scheme for analyzing the data.The details are included in the following four major steps.

Data Exploration
First, we explored data and removed the observations where there were missing surface temperatures.Then the Box-Cox transformation [43] was applied on the response variable to correct non-normality and/or non-constant variance, which are major violations of regression assumptions.In the final model, the power transformation y −2 was found to be the appropriate transformation.Hence 1/ST 2 was used as the response variable.The later step of model diagnostics showed that this transformation made large improvements in model fit.

Variable Selection
The preliminary exploration of the relationship between surface temperature and model variables is presented in Appendix A as a matrix of plots with bivariate scatterplots below the diagonal, histograms on the diagonal, and the Pearson correlation coefficients above the diagonal.The figure shows that high correlation occurs in the same urban components for within-class and neighborhood variables, which is to be expected, as well as across components, such as NH_2 & NH_9 and NH_1 & NH_4.All of these suggest a problem of multicollinearity.Due to the multicollinearity issue in the data, we used the Lasso method [44] to perform variable selection-that is, to exclude irrelevant and highly correlated explanatory variables from the regression model.The variance inflation factor (VIF) [45] is an important assessment of the severity of the collinearity problem.A large VIF value may indicate a serious collinearity problem.Thus, we adjusted the tuning parameter in the Lasso selection to remove enough irrelevant variables such that the VIFs of all the remaining variables in the model were controlled under 5.

Interaction Terms Selection
In addition to the individual explanatory variables, we were also interested in the meaningful interactions among the explanatory variables to see if these interactions could further contribute to the model.We selected the best subset of interaction terms using model selection criteria, such as Akaike information criterion (AIC) [46], Bayesian information criterion (BIC) [47], and crossvalidation.First, for each model size, an AIC (or equivalent BIC, given fixed model size) was used to determine the best set of interactions to enter the model, while keeping in the model all the individual variables selected by the Lasso and some preselected interactions of particular interest.Then, we used a separate validation dataset to assess the model fit and chose the most appropriate model size based on the validation prediction errors.

Model Verification
After the 'best' model was determined, we checked the adequacy of the model for violation of linearity, variance homogeneity and normality.We performed residual analysis to identify outliers and removed those from the data with studentized residuals greater than 5.To be sure not to exclude observations that have studentized residuals falling between 3 and 5, which is a grey area when defining outliers, we also compared our regression model with two robust regression models.Robust regression is a procedure that is not sensitive to small deviations from the normal assumption of random errors and is often useful for confirming the reasonableness of the least squares regression

Variable Selection
The preliminary exploration of the relationship between surface temperature and model variables is presented in Appendix A as a matrix of plots with bivariate scatterplots below the diagonal, histograms on the diagonal, and the Pearson correlation coefficients above the diagonal.The figure shows that high correlation occurs in the same urban components for within-class and neighborhood variables, which is to be expected, as well as across components, such as NH_2 & NH_9 and NH_1 & NH_4.All of these suggest a problem of multicollinearity.Due to the multicollinearity issue in the data, we used the Lasso method [44] to perform variable selection-that is, to exclude irrelevant and highly correlated explanatory variables from the regression model.The variance inflation factor (VIF) [45] is an important assessment of the severity of the collinearity problem.A large VIF value may indicate a serious collinearity problem.Thus, we adjusted the tuning parameter in the Lasso selection to remove enough irrelevant variables such that the VIFs of all the remaining variables in the model were controlled under 5.

Interaction Terms Selection
In addition to the individual explanatory variables, we were also interested in the meaningful interactions among the explanatory variables to see if these interactions could further contribute to the model.We selected the best subset of interaction terms using model selection criteria, such as Akaike information criterion (AIC) [46], Bayesian information criterion (BIC) [47], and cross-validation.First, for each model size, an AIC (or equivalent BIC, given fixed model size) was used to determine the best set of interactions to enter the model, while keeping in the model all the individual variables selected by the Lasso and some preselected interactions of particular interest.Then, we used a separate validation dataset to assess the model fit and chose the most appropriate model size based on the validation prediction errors.

Model Verification
After the 'best' model was determined, we checked the adequacy of the model for violation of linearity, variance homogeneity and normality.We performed residual analysis to identify outliers and removed those from the data with studentized residuals greater than 5.To be sure not to exclude observations that have studentized residuals falling between 3 and 5, which is a grey area when defining outliers, we also compared our regression model with two robust regression models.Robust regression is a procedure that is not sensitive to small deviations from the normal assumption of random errors and is often useful for confirming the reasonableness of the least squares regression when suspicious outliers exist in the model.We considered robust regression models based on the Huber [48] and the Tukey bisquare estimators, respectively.Of the two estimators, the Tukey bisquare estimator is more robust and hence even less sensitive to the outliers.
As shown in Figure 2, the model development process was iterative and the final model was not picked until the model adequacy was verified and free of outliers.It is worth mentioning that in the very beginning of this model developing process, we partitioned the original data into 50% training data and 50% validation data.The models were trained from Section 2.5.1 to Section 2.5.4 using only the training set.The validation set only participated in selecting the optimal model size in the last step of Section 2.5.3.Hence, the validation set can be treated as a fairly independent dataset to evaluate model performance.In fact, in Section 2.5.4,we checked the model adequacy, including model fit, assumptions and outliers.We checked on both the training and validation sets for each considered model from the iterations.The final model was verified to have consistent performance across both the training and validation sets, which demonstrates final model robustness.After the final model was fit, to understand the relative importance of the explanatory variables, we also obtained the standardized coefficients, i.e., the coefficients for zero-centered and unit-scaled explanatory variables, and used them to determine which variables have a greater impact on the response, independent of their scale of units.

Berlin Classification, Surface Temperature, GTLMPB
We found that 90% of the city area of Berlin can be explained with 19 STURLA classes.Three of the classes comprise one urban component and 16 are composite of multiple components.The surface temperature is lowest for the water-only class and in the combinations of trees, grass, and bare soil.It is warmer for composite classes that include components of the built environment.Figure 3 shows the 19 classes broken down by mean percent of within class composition and the mean surface temperature of grid cells in each class.when suspicious outliers exist in the model.We considered robust regression models based on the Huber [48] and the Tukey bisquare estimators, respectively.Of the two estimators, the Tukey bisquare estimator is more robust and hence even less sensitive to the outliers.As shown in Figure 2, the model development process was iterative and the final model was not picked until the model adequacy was verified and free of outliers.It is worth mentioning that in the very beginning of this model developing process, we partitioned the original data into 50% training data and 50% validation data.The models were trained from Section 2.5.1 to Section 2.5.4 using only the training set.The validation set only participated in selecting the optimal model size in the last step of Section 2.5.3.Hence, the validation set can be treated as a fairly independent dataset to evaluate model performance.In fact, in Section 2.5.4,we checked the model adequacy, including model fit, assumptions and outliers.We checked on both the training and validation sets for each considered model from the iterations.The final model was verified to have consistent performance across both the training and validation sets, which demonstrates final model robustness.After the final model was fit, to understand the relative importance of the explanatory variables, we also obtained the standardized coefficients, i.e., the coefficients for zero-centered and unit-scaled explanatory variables, and used them to determine which variables have a greater impact on the response, independent of their scale of units.

Berlin Classification, Surface Temperature, GTLMPB
We found that 90% of the city area of Berlin can be explained with 19 STURLA classes.Three of the classes comprise one urban component and 16 are composite of multiple components.The surface temperature is lowest for the water-only class and in the combinations of trees, grass, and bare soil.It is warmer for composite classes that include components of the built environment.Figure 3 shows the 19 classes broken down by mean percent of within class composition and the mean surface temperature of grid cells in each class.The 'GTLMPB' class comprises 35% of the total area of Berlin and can be found across the city with main concentrations outside the inner circle of the city center.The total land area of the 'GTLMPB' class is 326.5 sq km. Figure 4 shows the spatial distribution of GTLMPB and surface temperature in Berlin.The GTLMPB class has the largest temperature range of all Berlin STURLA classes 11.7-47.8The 'GTLMPB' class comprises 35% of the total area of Berlin and can be found across the city with main concentrations outside the inner circle of the city center.The total land area of the 'GTLMPB' class is 326.5 sq km. Figure 4

Relationship between Surface Temperature, and Within-Class and Neighborhood Variables
Following the analysis in Section 2.5, the final model between surface temperature, and within-class and neighborhood variables, is given by: where ŜT is the estimated surface temperature; βi is the coefficient of variable x i and represents the change in 1/ ŜT 2 per unit increase in x i , holding all other explanatory variables x j (j = i) fixed.
The estimated coefficients in the models are given via R [49], output in Table 2. Table 3 compares the standardized regression coefficients in the least squares model (our final model) and two robust regression models.Here, we standardized the coefficients in order to give a relatively fair assessment of how close the regression coefficients are between the three models.
From Table 3, the regression coefficients of our model and two robust regressions agreed quite well (in fact, all the differences are within two standard errors), except for the coefficients for the interaction NH_2 × NH_8 where the differences are above three standard errors.Although it is an open question which model gives the best estimated coefficient for NH_2 × NH_8, all three models rank NH_2 × NH_8 as the most important interaction predictor variable.Overall, the agreement between the three models reassures that our final model is a reasonable one, not unduly influenced by any observations with large deviations.
Then, we interpreted the results in the final model.Notice, since the response variable in model (1) was 1/ST 2 , if an explanatory variable yields a positive regression coefficient, it means that the corresponding urban component has the effect of reducing the surface temperature.

Main Effects
Individual components that are negatively related to surface temperature include within-class and neighborhood tree cover (IC_2; NH_2), and neighborhood water cover (NH_7).The variables that are positively related to surface temperature include within-class and neighborhood mid-rise buildings (IC_4; NH_4), neighborhood high-rise buildings (NH_5), neighborhood bare soil (NH_8) and within-class and neighborhood other paved surfaces (IC_9; NH_9).Note that the within-class low-rise buildings (IC_3) were kept in the model due to the presence of the related higher order terms, NH_2 × IC_3 and NH_4 × IC_3.However, the main effect of the within-class low-rise buildings is not significant (p-value = 0.49) in its association with surface temperature when holding other variables in the model constant.
The effects of the same component in within-class and neighborhood (i.e., within-class and neighborhood trees) typically respond similarly when related to surface temperature.However, some variables were only selected for the model in either within-class or neighborhood form.For example, bare soil is positively related to surface temperature in the within-class variable but the neighborhood variable was excluded by the Lasso method due to its weak association with surface temperature.Such is also the case for neighborhood low-rise buildings (NH_3); the within-class variable (IC_3) has turned insignificant in the model after interaction terms were added into the model.In addition, high-rise buildings and water cover are only represented as neighborhood variables because these components are not part of the GTLMPB class and hence not represented in within-class variables.Grass/Shrubs and roads were not selected for the model in either the within-class or neighborhood form, because of their low association with surface temperature and their high correlation with the other explanatory variables in the model.

Interaction Effects
The interactions between explanatory variables play an important role in the model.These interaction terms were selected by a combination of the authors' expertise and model selection techniques.The interactions involving neighborhood tree cover (NH_2) typically have an additional cooling effect, except for NH_2 × IC_9 (within-class paved).For example, when neighborhood tree cover (NH_2) and bare soil (NH_8) coexist, they will produce an extra cooling effect on temperature, in addition to their individual effects.On the contrary, when neighborhood tree cover is accompanied by the within-class paved component, the cooling effect of tree cover will be dampened by the interaction.Additionally, the presence of neighborhood high-rise (NH_5) around within-class mid-rise (IC_4) produces an extra cooling effect on temperature, whereas neighborhood mid-rise (NH_4) and within-class low-rise (IC_3) exhibit an additional warming effect.

Variable Importance
Table 4 ranks the predictor variables in model ( 4) from most important to least important, based on the absolute values of their standardized coefficients in descending order.
From Table 4, the most important main effects on temperature reduction are tree cover in both local and neighborhood forms.Neighborhood and within-class mid-rise cover are the most important warming main effects.Neighborhood tree cover (NH_2) and bare soil (NH_8) are selected as the most important interaction effects and ranked 4th overall.

Discussion
In this paper, we aimed to answer two distinct questions raised by previous research: what is the relationship between class composition and surface temperature, and to what extent is surface temperature driven by the within-class components versus the importance of neighborhood effect on surface temperature?By asking these questions, we aim to continue to develop the STURLA classification system as a tool for analyzing commonly occurring urban building blocks and their relationship with ecological and environmental phenomena in cities.
In previous papers using the STURLA classification, it was demonstrated that classes created by the random application of a grid cell other than urban landscape can create meaningful composite classes and these can help explain the nuanced interaction between urban structure and surface temperature [11].Examining the distribution of components within each STURLA cell and how such interactions impact surface temperature clarifies the relationships between urban components and surface temperature that drive these previous results.While STURLA classification was able to show a clear relationship with surface temperature on average, a main challenge with the methodology was the wide range of temperatures associated with certain classes, and particularly with highly mixed composite classes.For this reason, GTLMPB was chosen for this study.It is the most common class in Berlin, it is highly composite, and it encompasses the widest range of surface temperatures out of all STURLA classes.In addition, it is in the middle of the pack in terms of variance, and standard deviation (STD) values in this class are closer to the mean than (for example) values in the single component class of grass or the hyper composite class GTLMPWB.
As explained in the case study description, the mixture of built-up area and open space can be found everywhere in Berlin.Thus, it is not surprising that the most common class is a highly composite one which can be found in all areas of the city.As illustrated in Figure 4, the class of GTLMPB can be found all over Berlin except from the very dense city center ("Berlin Mitte"), which is mainly characterized by high-rise and larger urban green spaces such as the Tiergarten.Closer to the city center, the class GTLMPB will be defined by the main components: mid-rise, paved, and low-rise (in this particular order), while with distance to the city center the share of grass/shrubs and tree canopy will increase and the share of mid-rise will gradually decrease.
Analyzing the GTLMPB class, we found that within-class and neighborhood variables, as well as the interaction between them, drive surface temperature.For within-class, our findings support common literature findings that particular urban components exert a strong influence on surface temperatures, both negative (e.g., trees) and positive (e.g., mid-rise buildings and paved surfaces).For example, Schwarz et al. [17] and Jenerette [21], like others, find vegetation (especially trees) to be associated with cooling, while paved surfaces are associated with warming at parcel and neighborhood scales.Our results indicate that local tree cover and neighborhood tree cover are the most important cooling components.In general, vegetation differs from the human made materials used in urban environments for their moisture and their aerodynamic and thermal properties [50].The main process by which vegetation influences surface temperatures is evapotranspiration, in which plants use energy from solar radiation to evaporate water into the atmosphere.This process increases latent rather sensible heat and cools the leaf surface of plants and their immediate environment [51].Trees provide another important mechanism-shade-by which cooling occurs.Tree shade reduced the storage and convection of heat on the surfaces that are covered by the tree canopy [52].It is notable that, compared to tree cover, percent grass cover is not included in any of the final models, either at local or neighborhood scales.While in bivariate analysis, grass shows a negative association with surface temperature, grass becomes insignificant when other variables are considered.This may be attributed to the large variation in temperatures of grassy patches as exhibited in the grass only class.Moreover, the state of the grass cover depends on the soil properties, particularly its water holding capacity, which can be very distinct in urban areas and which makes different grass areas vary in their cooling effects [53].In Berlin, many inner city grasslands were created at former rail brownfields with low quality sand or gravel-sand substrata, meaning that they present diminished capacity for cooling through evapotranspiration.
Neighborhood analysis was conducted to develop an understanding of the importance of the larger urban context in which composite classes are ingrained.It has been demonstrated in the existing literature that landscape configuration and context play an important role in the association with surface temperature [10,27,28].In addition, preliminary findings from previous STURLA analysis indicate that neighborhood effects may partly explain the variation in the association of composite classes [12].Results of this study confirm significant urban neighborhood effects on surface temperature.An increase in the percent of neighborhood green and water surrounding a composite class pixel contributed to a pixel's cooling while an increase in the percent of neighborhood mid-rise, high-rise, paved and bare soil contributes to a pixel's warming.Further research is required to determine the scale threshold of the neighborhood effects and whether they depend on the pure pixel size of the STURLA classification.
As expected, all paved and building components show a positive association with surface temperature.However, an important contribution of the STURLA approach is in the inclusion of the vertical dimension of the built environment, which enables the distinction between types of buildings and their urban context.Here, we find that different types of buildings show a significant difference in their association with surface temperature.Both neighborhood (overall rank 3) and within cell (rank 5) cover of mid-rise buildings are important warming components, followed by neighborhood high-rise buildings (rank 6).The importance of mid-rise and high-rise buildings in explaining surface temperature can be explained by the heightened capacity of buildings to uptake and retain heat not only because of the materials properties, but also due to reduced convective losses and ventilation, and by the additional warming due to trapping of radiation by vertical surfaces that causes an increased absorption of solar radiation [16].These findings provide empirical evidence supporting the need identified in the literature to address structure and architecture in studies of urban surface temperature [27] and more generally in the context of regulating urban ecosystem services [13].Low-rise building cover was found to be significant only in its interactions with neighborhood tree cover and neighborhood mid-rise buildings.Results show that the presence of increasing neighborhood tree cover offers additional cooling, particularly in the case of low-rise buildings and in between paved surfaces.This can be explained by the effectiveness of trees in shading areas that are of similar or lower elevation.
The context in which low-rise occurs in Berlin is twofold.First, there are the classic detached or terrace housing areas, characterized by a low density, allocated in the peripheries of the city.Mostly, these detached and terrace houses have green or open spaces attached.Other pavement here occurs in the form of parking spaces, roads and sealed patios.Second, the other context refers to denser low-rise mixed with mid-rise and less green space, which occurs sometimes in the vicinity of high-rise, and is located closer to the inner city.
Interestingly, we find similar cooling effects in the interaction between neighborhood high-rise buildings and within class mid-rise buildings, further emphasizing the importance of artificial shading (through buildings) for surface temperature regulation.The contrary is seen in the interaction between within-class low-rise and neighborhood mid-rise, where a warming effect is compounded by the interaction.This is in accordance with recent findings by [54].Finally, the strong cooling effect in the interaction between neighborhood tree cover and neighborhood bare soil indicates the importance of open space in urban structure.

Conclusions
The Structure of Urban Landscape classification offers a simple way to systematically classify urban areas into composite building blocks including land cover and building categories.In this paper we delved into the relationship between surface temperature and the composition of one such composite class, an urban building block common to Berlin: -grass/shrubs (g) -tree canopy (t) -low-rise (l) -mid-rise (m) -roads-other paved (p) -bare soil (b).We find that with-class composition as well as neighborhood composition and the interaction between them works to drive surface temperature.Our findings support the previously reported relationship between surface temperature and urban composition and structure and adds empirical evidence to a growing literature addressing the relationship between ecosystem services and complex urban structure.In this study, we confirm the importance of trees in cooling both local and neighborhood areas, and emphasize the importance of the urban context in which trees are present such as the effectiveness of trees in cooling low-rise buildings.Most importantly, we demonstrate the influence of the vertical dimension for surface temperature such as the warming effect of mid-rise building.Additional research is necessary to replicate these findings in different types of urban areas with different urban development histories, so as to test the utility of the STURLA classification in explaining other ecological and environmental processes, and in identifying the approaches' sensitivity to scale and orientation.Nonetheless, these findings, can support urban sustainability planning by providing insight for urban planners in considering the configuration of urban components when designing new development, rebuilding city centers and implementing urban ecological restoration.

Figure 1 .
Figure 1.Same data frame from the Berlin map; (a) the classification cells with their different classes (above); (b) with fused land cover and building height underlying data in the background (below).

Figure 1 .
Figure 1.Same data frame from the Berlin map; (a) the classification cells with their different classes (above); (b) with fused land cover and building height underlying data in the background (below).

Figure 2 .
Figure 2. Strategy diagram for data analysis.

Figure 2 .
Figure 2. Strategy diagram for data analysis.

Figure 3 .
Figure 3. Mean surface temperature and percent land cover in Structure of Urban Landscape (STURLA) classes.

Figure 3 .
Figure 3. Mean surface temperature and percent land cover in Structure of Urban Landscape (STURLA) classes.
shows the spatial distribution of GTLMPB and surface temperature in Berlin.The GTLMPB class has the largest temperature range of all Berlin STURLA classes 11.7-47.8°C (1.95 STD).

Table 2 .
Coefficient estimates and model summary.

Table 3 .
Standardized regression coefficients of the least squares regression model versus two robust regression models.

Table 4 .
Variable importance from most important (top) to least important (bottom) based on standardized coefficients.