1. Introduction
The world’s urban population has been growing quickly from only 30% of the global population residing in urban areas in the 1950s to as much as 54%, or 3.9 billion urban residents, in 2014. It is projected to increase toward 66% by 2050 [
1], so it is important to understand the effects of urban areas on the environment. One such effect is the elevated temperature in urban areas, which in turn affects the growth cycle of plants known as plant phenology.
Urban heat island (UHI) is the term used to describe higher temperatures in an urban environment compared to the surrounding rural areas [
2]. UHI is mainly a result of changing a natural landscape into a man-made urban texture [
3] and arises due to the modified surface affecting the storage and transfer of heat, water, and airflow [
4]. The UHI has a measurable effect, as previous research has shown a small UHI effect of 0.17 °C for cities as small as 2–3 km
2 and a larger effect of 2.9 °C during the day and 2.3 °C during the night for larger cities of >100 km
2 [
5]. Since the UHI is comparable to projected global temperature increases [
6], urban environments have been used as a surrogate for future climate change (i.e., [
7]).
The relationship between urban intensity and the course of annual developmental events in plants, known as plant phenology, has been documented [
8]. From a biological perspective, phenological research predominantly addresses the timing of switches between recurrent phases of organisms [
9]. In cool and temperate regions, to maximize growth, trees need to extend the growth period as long as possible while avoiding frost damage [
10]. It has been confirmed that bud burst is under strong temperature control, with leaves appearing earlier and growing faster during a warmer spring [
11]. In the fall, growth cessation and dormancy are in response to the shortening of the photoperiod [
12], however, low temperature also causes growth cessation [
13]. Plants, in general, will develop earlier in cities by a factor of days to weeks when compared to their rural settings. However, there is no clear association with autumn phenophases. Comparing the urban to rural trends of phenology allows for the evaluation of phenological trends in differing conditions and assessment of impacts of climate change [
6].
Earlier greening of land cover and later autumnal leaf fall may alter seasonal climate through biogeochemical processes and physical properties [
14]. Phenological changes will have varying impacts on albedo [
15] and surface temperature [
16]. In addition, the length of the growing season plays a key role in the amount of CO
2 in the atmosphere [
17]. Earlier spring onset leads to an increase in Gross Primary Production (GPP) and ecosystem respiration, with GPP having the larger increase resulting in a higher Net Ecosystem Productivity (NEP) [
18]. From a biological perspective, due to species responding differently to warming temperatures, mismatches in the phenology across trophic levels may already be occurring due to non-uniform shifting of co-occurring organisms, changing the biological communities [
19].
Satellite derived measurements of land surface reflectance have been used to study land surface phenology (LSP) over large areas (for example, [
20]). When applied to examine the length of growing season for urban and rural areas, it was found that the growth season was about 15 days longer in urban areas [
21]. However, this type of analysis has historically been restricted due to clouds, atmospheric conditions, and the infrequent satellite coverage [
22]. Removing these atmospheric effects has a relatively long history [
23]. However, even after this correction, the level of detail that urban maps provide is dependent on the spatial resolution of the sensor [
24] and the error in phenology detection increases with a reduction in temporal resolution [
25]. Therefore, both high temporal and high spatial satellite sensors each have complementary advantages when observing urban phenology.
With the goal to obtain more information than each sensor can independently provide [
26], a number of blending algorithms have been developed that focus on spatial and temporal dynamics [
27]. One such method is the Spatial and Temporal Adaptive Reflectance Fusion Model (STARFM). It was designed to predict surface reflectance with the spatial resolution of Landsat and the temporal resolution of the Moderate Resolution Imaging Spectroradiometer (MODIS) [
28]. STARFM is, as of 2013, widely used [
27]. Originally shown to capture phenology changes for forested or cropland areas, as well as a mixture of cropland, evergreen, and deciduous forest [
28], it has since been applied to observe, for example, community level phenology [
29], crop growth stages [
30], urban phenomena such as the land surface temperature (LST) of the urban heat island [
31], and the Normalized Difference Vegetation Index (NDVI) values of a semi-arid rangeland of northern Utah [
32].
This study aims to build on the existing research and use blended, also known as fused or synthetic, satellite imagery to examine how the built up environment affects LSP of developed land in a metropolitan area of Northern Utah. We derived 12 years of fused Landsat and MODIS reflectance data using the STARFM method with three different selection criteria for input data. Then, from the resulting fused datasets, we derived three time-series of a standard vegetation index for the study area to estimate phenology metrics and make comparisons of the urban and exurban area. Our study addressed three important research questions:
- (1)
In our study area, is there an observable difference in key phenological parameters (Start of Season (SOS), End of Season (EOS), and Length of Season (LOS)) between the urban and exurban areas when using fused imagery over several years and how sensitive those differences to base pair selection?
- (2)
If there is an observable difference in SOS, EOS, or LOS between the urban and exurban areas when using fused imagery, how often do these differences appear?
- (3)
Does the amount of impervious area at a location affect any observable differences in SOS, EOS, or LOS between the urban and exurban areas when using fused imagery?
3. Results
3.1. Accuracy Assessment of STARFM
Although our study area falls into the area of a previous study [
32] which validated STARFM’s ability to fuse images in our general location, a brief accuracy assessment was performed to ensure the heterogeneous urban area of our study did not significantly alter the accuracy of synthetic Landsat images. To assess the accuracy, we compared reflectance values of several Landsat surface reflectance images to the synthetic Landsat scene that was closest in date. The comparison was made by reporting both the R
2 values and the mean absolute difference (
Table 3). None of these Landsat images that were used in validation were used as a part of a base pair in STARFM. In addition, the scenes chosen represent both leaf-on conditions and leaf off conditions.
3.2. Descriptive Statistics
Since SOS dates for many observations occurred before 23 February 2000, when the MCD43A4 data first became available, we removed this year from further analyses of SOS and LOS. In addition, since the descriptive statistics placed the EOS dates for many observations after the last year (2011) of data, further analysis for both EOS and LOS did not include 2011. For each year, we sampled 48,942 exurban and 128,539 urban points, although the number of samples used for any given year varied due to no data values. The maximum number of no data values for all three base pair selection methods for the SOS was between 30,489 and 33,005 for the entire exurban area in and 91,482 and 116,354 for the urban areas.
The normalized data of the informal selection show an earlier SOS for urban areas by a factor of 2.21 to 6.73 days on average, depending on the class. The more developed classes had the biggest difference in SOS and the least developed classes along with all of the classes together had the smallest difference. Although the developed areas as a whole and individual classes both show a later EOS for the urban areas, the number of days between the mean EOS for the urban and the exurban area had a lower range that was from 0.04 to 0.71 days. Lastly, the difference in the average length of season was longer in urban areas and ranges from being 2.84 days for all developed areas up to 6.8 days longer for the most developed areas, with the difference increasing for each class with percent imperviousness (
Figure 6a). Standard deviations for the years were generally largest for the LOS parameter (
Table 4).
The normalized data of the 95% statistical selection threshold show an earlier SOS for urban areas by a factor of 1.05 to 3.56 days on average. Although the developed areas as a whole and the two classes of developed open space and developed medium intensity show a later EOS for the urban areas up to 1.25 days, the other two classes show a later EOS for the exurban area of 0.35 and 0.79 days. Again, the difference in the average length of season was longer in urban areas and ranges from being 0.78 days up to 4.32 days longer. The difference for each class does not consistently increase with percent imperviousness (
Figure 6b).
The normalized data of the 90% statistical selection threshold on STARFM version 1.2.1 also show an earlier SOS for all urban areas ranging from 0.28 to 3.55 days. All developed areas as a whole and all four classes show a later EOS for the urban areas ranging from 2.84 to 5.15 days. Again, the difference in the average length of season was longer in urban areas and ranges from being 3.22 and 7.02 days longer (
Figure 6c).
To see how consistent these overall findings were from year to year between classes, one can also look at the 11 years separately. This showed that the average urban SOS date was earlier for up to all of the 11 years for developed high intensity urban areas when using informal base pair selection criteria, but also earlier in only six years when using 90% selection criteria for developed low intensity areas. The average EOS date was later for the urban areas from as few as three out of the 11 years to as many as nine, with all three selection methods showing the least amount of years in the developed high intensity class. The average LOS was longer in the urban areas for 10 out of 10 years for developed open space with an informal selection and for only six years for various classes with both the 90% and 95% selection criteria (
Table 5).
3.3. Tests for Variance Equality and Normality
Based on the results of the F-tests, almost all land cover classes and the developed areas as a whole had statistically unequal variances between the urban and exurban areas. The exceptions to this were the informal selection developed medium intensity land cover class for the SOS and the 90% selection criteria’s LOS for all developed areas, developed open space and developed low intensity, where all
p-value were above 0.1. Visual inspection of all histograms showed an approximate normal distribution (see
Supplementary Materials for the histograms). This method was chosen over a statistical test due to small deviations from normality being marked as significant even though they should not affect parametric test results [
53].
3.4. Tests for Inequality of Phenological Parameters
RStudio performed a two-tailed and two-sample t-test for each parameter and each developed area as a whole as well as each land cover class separately to see if the mean of the urban area was statistically different from the mean of the exurban area. The t-test assumed unequal variances in all cases except when the F-test could not show an unequal variance. The tests showed that for the SOS, both the informal and 95% selection criteria show a statistical difference between the urban and exurban areas at the 99.9% level for all areas together and all classes separately, with all observed p-values of <2.2 × 10−16. However, after applying a Bonferroni correction, the 90% selection criteria only shows a 99.9% significant difference for all developed areas, developed medium intensity, and developed high intensity areas, with p-values of <2.2 × 10−16 for all developed areas and medium developed areas and 4.91 × 10−15 for developed high intensity areas. It also shows an insignificant difference in the SOS for both developed open space and developed low intensity with p-values of 0.0045 and 0.013, respectively.
The EOS results were all statistically significant at the 99.9% level, with all observed p-values of <2.2 × 10−16, for the 90% selection criteria. However, they were mixed for both the informal and 95% selection criteria. With the informal selection criteria, both developed areas as a whole and developed open space had an observed p-value of <2.2 × 10−16. After applying a Bonferroni correction for the remaining classes, the results show no statistical difference for EOS in the developed low intensity, developed medium intensity, and developed high intensity classes with observed p-values of 0.448, 0.011, and 0.107, respectively. With the 95% selection criteria, both developed low intensity and medium intensity were statistically significant with p-values of 6.5 × 10−10 and <2.2 × 10−16, respectively, and after the Bonferroni correction developed high intensity was significant at the 99% level with an observed p-value of 0.000012. There was no significance for the remaining classes after a Bonferroni correction with all developed areas having a p-value of 0.03 and developed open space having a p-value of 0.75.
The length of season showed the most consistent results with most classes for all methods having an observed p-value of <2.2 × 10−16 making them significant at the 99.9% level. The only exceptions to this were the developed high intensity results for both the 95% and 90% selection methods. The 90% selection criteria results were still significant at the 99.9% level but the observed p-value was 1.5 × 10−14. The 95% selection method still did show significance at the 99% level after a Bonferroni correction with an observed p-value of 0.000049.
4. Discussion
4.1. Start of Season
Our study analyzed the effects of UHI on phenology in urban areas, although we did not attempt to measure temperature for each land cover per se. Instead, we used percent imperviousness to understand the spatial variability of temperature in cities. Since UHI is the direct product of the transformation of natural landscapes into an impervious man-made urban texture [
3] and a strong linear relationship between the percent impervious surface area and land surface temperature has been shown [
54], it is reasonable to assume that the areas we defined as urban are warmer than the areas we defined as exurban. Based on previous findings suggesting that leaves appear earlier during a warmer spring [
11] and that leaves develop earlier in cities [
6], we reasonably expected our urbanized areas to exhibit an earlier SOS, which was supported by the results of this study, although not statistically when using STARFM v1.2.1 and a 90% selection criteria for all classes. Therefore, base pair selection and software version may make a difference when observing SOS of the urban environment. In addition, it has been shown that temporal gradients in leaf onset varies [
55], which coincides with our findings of a large standard deviation for SOS (
Table 4).
Interestingly, our results do not show a similar magnitude of differences in the SOS between classes throughout the urban and exurban area. This coincides with the case study by Melaas et al. [
56], which showed that the amount of impervious surface area in surrounding vegetation patches influences the timing of SOS. Although we do not show a clear trend with the difference in SOS growing as mean percent imperviousness does, for all three selection methods the difference in mean days becomes larger between urban and exurban areas for the two most developed classes when compared to the two least developed classes. In addition, out of all 11 years where SOS was calculated, all methods showed an earlier urban SOS for all classes more times than a later urban SOS. However, there is a range to the number of years where urban SOS is earlier. Based on this, there is not a uniform effect on the SOS throughout the entire urban area. In addition, it is reasonable to assume that we may lose some information if we generalize entire municipalities as having an earlier SOS. For this reason, having a spatial resolution finer than that of MODIS, such as that produced by fused data, would allow for a better understanding of the complex relationships between heterogeneous urban and exurban environments.
4.2. End of Season
The EOS results were not as conclusive as the SOS results, which is not surprising since previous research also found an unclear association between the urban-exurban differences and autumn phenophases [
6] and high variability in the difference [
57]. Even though the mean EOS was later for urban areas in 13 out of 15 of the stacked datasets, the statistical significance across all classes was present only when the 90% criteria was used. This also indicates that base pair and software selection may lead to different results than when looking at features or specific locations within the municipality. This supports previous findings that EOS in urban vegetation patches is influenced by the percent impervious surface of surrounding patches [
56].
There are two potential reasons for such findings of EOS inconsistency between base pair selection and software. First, there could be a consistent and large difference in the EOS, as the 90% selection criteria with STARFM v 1.2.1 showed, but our methods were too simplistic or introduced excessive errors so the difference was improperly observed with using an informal and 95% selection criteria and STARFM v1.1.2. Second, there are potentially other environmental factors that are similar between the urban and exurban area, which are the main cues for trees to start growth cessation. For example, Perry [
12] points to the shortening of the photoperiod. If this were true, we would not expect the UHI to change EOS. In addition, this would mean that the difference in EOS shown when using the 90% criteria is a result of observing a sample of the EOS under control of another environmental factor such as, for example, photoperiod.
4.3. Length of Season
Although LOS is the only phenological parameter that is consistently statistically significant, we are cautious of these results. Both the SOS and EOS contribute to the LOS, and under an informal and 95% selection criteria the most reliable contributor to the statistically longer growing season in the urban area is the earlier start in the spring. As shown in
Figure 6a,b, the difference in the SOS for the urban and exurban area is proportionally larger than the EOS, thus contributing more to the difference in the LOS. In addition, even without statistically different EOS dates for all of the classes with these two methods, the difference in the LOS is still statistically significant for all classes. However, when we look at LOS derived from the 90% criteria, the opposite is true that without all statistically different SOS dates, the difference in the LOS is still statistically significant for all classes. Therefore, since there is uncertainty with why LOS is longer in the urban areas, it is difficult to support that it is true.
Moreover, when looking at the mean LOS for each class, a similar trend appears as we see with the SOS, which is that there is a difference in the LOS across classes. Once again, this observation suggests to look at intra-city features and not just the urban area as a whole to be able to describe better the effects of the urban environment on phenology.
4.4. Study Assumptions and Potential Limitations
The method we used for normalization ensures that all data obtained are used. It also corrects for interannual shifts in phenological parameters by placing the event at each location in relation to the expected date of the same exurban event for the same year. This allows for the comparison of the difference in time between an event at each location and the timing of the corresponding expected exurban area’s event and not a comparison of the actual dates, which may not be comparable from one year to the next due to changing weather patterns. Thus, after normalization, one can compare the urban and exurban data to show the differences within each separate year and also among all years together. Importantly, the approach accounts for the uneven and inconsistent sample proportions between the urban and exurban environments from year to year. Simply normalizing according to the mean value of all urban and exurban pixels would pull the normalized value in different amounts, based on the proportional size, toward the sample with the larger size and not centering the normalized values at the same value from year to year. For example, if one year had one exurban point for every two urban points and the next year this ratio decreased to 1:4, the normalized values of the second year would be pulled closer to zero for the urban areas and further from zero for the exurban areas, making them incomparable.
However, an underlying assumption of this normalization method is that if a shift in dates does occur from one year to the next, the amount of the shift is the same for both the urban and exurban environment. If this assumption is violated, any disproportional interannual shifts would skew the results to either be over or under represented depending on if the urban sample size was large or small for that year.
A second underlying assumption of this study is that the expected vegetation type and management practices were similar between like land cover classes for both the urban and rural environments. For example, it was assumed that any influences a homeowner may have on phenology due to landscaping practices (i.e., watering the lawn in the spring) and vegetation transplanting (i.e., planting grass for a lawn) were just as likely to occur regardless if the individual lives in a rural and urban settings. This is an important assumption, as it has been observed that phenological traits can differ between vegetation types [
51]. However, making comparisons between the urban and rural environments of developed areas with the same land cover classes controlled for this, as opposed to comparing dissimilar land cover classes that may have dissimilar landscape management techniques.
The third assumption is that changes in NVDI values accurately represent changes in phenological parameters. The current study used changes in NDVI as a proxy for growing season parameters. However, it is not well understood how well these values, derived from fused imagery, correspond to the actual ground events of SOS, EOS, and LOS. The Collection 5 MODIS NBAR data used in this study were the 8-day overlapping product that is collected over 16 days. Depending on the data availability during the 16-day period especially around the SOS and EOS dates, we could miss or smooth the small changes from the multi-date data products.
The fourth limitation is that we did not attempt to resolve any potential errors with the NLCD classifications. Although overall accuracies were 79% and 78% for the previous Level II NLCD versions from 2001 and 2006, respectively, they did have difficulty distinguishing the context of grass. In relation to the land cover classes we looked at, the 2006 NLCD had a producer’s accuracy of 42%, 70%, 80%, and 26% for classes 21, 22, 23, and 24, respectively. The user’s accuracies for these classes were 52%, 59%, 69%, and 83%, respectively. The 2001 NLCD accuracy was similar (within 5%) to the 2006 accuracies except for the producer’s accuracy for class 24, which was 81% [
58].
The last assumption is that our sample was representative of the conditions of the entire urban and exurban areas. We had a very large sample, so this is reasonable to assume. However, the areas of no data predictions could be biased due to some uniform trait and therefor underrepresented. For example, the proportion of the sample remaining toward the center of the urban and exurban areas was similar for the informal and 95% selection criteria. However, for the 90% selection criteria the central areas were disproportionally under represented (
Table 6). Previous research [
21] has shown a relationship between the UHI and distance to urban core area and should be considered.
In addition, selecting Landsat images with less than 10% cloud cover as an initial screening process could potentially omit additional dates of clear Landsat images for the study area, thus reducing the number of base pairs used. Based on previous research [
59], the risk of omitting these scenes potentially leads to a less than optimal quality of fused image since there could have been a potential base date nearer to some of our predicted dates than what we used for prediction. Adding to this, using a visual inspection to remove Landsat images could introduce human error.
4.5. Future Work
Future research should attempt to identify the source of variation that caused inconsistent results, being cautious of how input data would affect their sample distribution and validity of predictions. Evaluating base pair selection should include not only ensuring a valid fused image, but also taking into account how the distribution will affect the location of no data and how this in turn affects TIMESAT. Since the difference in base pair selection method does not produce large differences in the validity of our fused images, how the data and parameters can affect the ability of TIMESAT to make predictions is important to consider.
The MODIS collection 6 now provides daily NBAR product at 500 m resolution which could be utilized in data fusion. The BRDF corrected daily surface reflectance is another option to use in the future. The daily MODIS products may be able to reduce uncertainties in extracting phenological parameters. This study used Landsat 5 TM data for data consistency. Landsat 7 ETM+ data even with the Scan-Line-Corrector errors can still be used to increase the frequency of high resolution data. Other recent high resolution data such as Landsat 8 and Sentinel-2 can be included in the future. The Landsat and MODIS pair images may be gap-filled first before data fusion to increase valid prediction. Different weights from original and the fused data sources may be considered similar to the strategy used in mapping crop phenology [
30].
The results of future work should also validate the predicted phenological dates derived from STARFM synthetic imagery against in situ observations of phenological parameters for the urban and exurban environments, preferably for a variety of land cover classes. In addition, our study was confined to a very specific environment and may not be generalizable to a variety of areas. Expanding the prediction to a variety of climates and cities would help to overcome this. Last, the value of using a fusion model over one source of satellite data to predict the differences in the urban and exurban growing seasons should be explored. Our next steps will be to attempt to model LSP parameters with high resolution/low frequency and low resolution/high frequency data alone. This will allow making conclusions about the amount of information gained from fusing the data in regards to observing the effects of the UHI on phenology.
Beyond lacking in comparisons to other data sources and locations, the current study did not aim to explore in depth how different land cover classes affect phenological events. Although this paper described the difference in mean urban and exurban values among land cover classes, it was limited to a simple descriptive analysis. In addition, it may be interesting to see if there is a shift in calendar dates of phenological events for both urban and exurban environments, rather than just look at the shift in the number of days of the normalized difference between the two as we did.
5. Conclusions
This study has uncertainties, but it offers encouragement that STARFM can be sensitive enough to detect the change in phenological parameters of vegetation that we would expect to see because of the UHI effect in urbanized landscapes. Although base pair selection for image fusion did not strongly affect the validity of fused images, it did have a noticeable effect on the phenology results. We suspect that this is in part due to the changing distribution of where no data are available for TIMESAT to fit. As the number of base pairs increases, so will the potential locations of no data from Landsat that is masked out from a prediction. As we saw with disproportional distribution of no data in one instance, these no data points might not be randomly distributed and therefore skew any results. We believe that this could have skewed the results for the 90% base pair selection because we not only added in the potential for an additional 5% unclear pixel coverage, but also due to the software change. STARFM v1.2.1 added in a strict validation that may potentially not allow a prediction to be made in areas where it did before, thus compromising a lesser sample size for an increase in overall prediction quality and not covering up any bias distribution in no data.
Our statistical results are mixed, but generally, they support the notion that the UHI leads to an early start of growing season (SOS) and an overall increase in the length of the growing season (LOS). Conversely, the results show that depending on the methods used to select base pairs and the software version used to make the predictions, the effects urban areas have on the timing of the end of the growing season (EOS) can fluctuate. By using the fusion method, we were able to work toward understanding some of the advantages and challenges of modeling the heterogeneous urban landscape while preserving temporal accuracy. Overall, it allowed us to not only see a difference in phenological parameters between urban and exurban areas, but also to see that regardless of fusion method, there is variability among phenological parameters according to the land cover class at 30 m resolution. Depending on the base pair selection and version of STARFM, the differences in mean earlier urban SOS between land cover classes within each selection criteria ranged from as many as 4.5 to as few as 2.5 days, the differences in mean EOS urban delay between land cover classes ranged from 0.67 to 2.3 days, and the differences in mean longer urban LOS ranged from 3.6 to 3.9 days. Without having the temporal and spatial resolution of a fused image, these intercity variations in phenological parameters might not be observable.
However, how well the changes detected correlate to actual in situ conditions and the actual timing of phenological events is unknown. Lastly, it has yet to be determined how much information that describes phenology is gained by fusing the data and to what extent errors are introduced into these estimations.