1. Introduction
As urban land in the United States continues to expand, urban forests will continue to play a substantial role in the overall health of urban and rural areas alike [
1,
2]. Urban forests, defined as individual or clusters of trees within or adjacent to an urban setting, provide several well-documented economic, ecological, and health benefits for residents of the urban environments [
3,
4]. For example, urban trees provide shade and mitigate the effects of the urban heat island effect [
5,
6]. They also combat particle air pollution and CO
2 emissions by sequestering carbon [
7,
8], are linked to positive effects in human psychology in urban environments [
9,
10,
11], and reduce stormwater runoff [
12,
13]. Despite their enormous benefits, urban tree canopy (UTC) coverage is in decline as considerable urban growth in some regions has necessitated the removal of urban forest, leaving some cities struggling with the urban heat island (UHI) effect and other challenges [
2]. Managing UTC necessitates accurate measurement and inventory of urban trees within a city, and many municipalities within the Unites States of America (USA) have created organizations or branches of government to oversee the changing urban landscapes [
14].
While tagging and inventorying trees in the field creates a valuable and accurate catalog, the process consumes significant time and monetary resources. Many have turned to less time-intensive methods, such as using remote sensing technologies, to provide a review of the urban tree status [
15,
16]. Remote sensing technologies, such as light detection and ranging (LiDAR), unoccupied aerial vehicles (UAVs), and aerial and satellite imagery, have been used to inventory individual trees or tree canopy coverage in towns and cities [
17,
18]. LiDAR data provide accurate representation of tree canopies, but can be costly and require significant computational resources for processing large datasets [
19,
20,
21]. UAV data are captured in very high spatial resolutions that is effective in identifying trees and tree canopy, but it is difficult to capture data large spatial extents, i.e., an entire municipality [
22,
23,
24]. Satellite and aerial imagery are being increasingly used in the context of mapping urban tree canopies because of the increasingly high spatial resolution and wide availability [
25,
26,
27,
28].
Recent advancements in computer science have increased the applicability of both machine learning and deep learning image classifiers for remote sensing applications [
29,
30,
31]. State-of-the-art machine and deep learning models are available to UTC managers to quickly obtain accurate information regarding tree canopy coverage and how the coverage is changing over time. Accuracy is of the utmost importance as these data are used for legislative decision making and funding [
14]. These models are now being offered as tools in accessible and easy-to-use ways through popular GIS software, such as ArcGIS Pro. Recently, scientists have found it possible to detect individual trees or calculate individual tree canopies within UTC coverage for large areal extents with both machine learning and deep learning classifiers and detectors. Yan et al. [
32] compared convolutional neural networks (CNNs), support vector machine (SVM) and random forests (RF) models for UTC mapping and found CNNs performed the best. However, they used expensive satellite imagery and investigated delineating individual tree canopies in a city instead of total UTC area within an urban environment. Lv et al. [
33] compared several CNNs, including one developed by their own team, but once again sought individual tree segmentation and failed to include machine learning classifiers in their comparison. Zamboni et al. [
34] likewise surveyed a large number of deep learning models for individual tree detection but did not use machine learning classifiers. Wang et al. [
35] compared a U-Net deep learning model to object-based classification and found U-Net to be a superior classifier for UTC coverage area, but again did not compare the results to machine learning classifiers. Despite the successes of machine and deep learning classifiers on their own, further investigation on the worth of deep learning and machine learning from a practical perspective is important. Deep learning classifiers can require more time to train and classify than machine learning classifiers, suggesting that small differences in overall accuracy may not be worth the longer processing times [
36]. Application-based studies, like [
37], have created a comprehensive set of UTC data for the entire state of Wisconsin using machine learning algorithms, but the results were not compared with deep learning algorithm results. In many of the application studies, multispectral satellite data or LiDAR data are combined with aerial imagery to enhance the classification [
27,
28,
38,
39]. Therefore, comparisons with less complex and still reliable machine learning classification models, like SVM and RF, are needed on simple optical imagery readily available from a management and policy-making perspective.
The present literature on urban tree canopy mapping and UHI have focused on highly populated urban centers at the cost of neglecting smaller or mid-sized urban centers. This study is the first of its kind to compare three machine and deep learning classifiers, U-Net deep learning, support vector machine, and random forests machine learning, for mapping UTC coverage in two cities of different size and shape in the USA from the perspective of an urban tree forestry manager. From this perspective, cost, computational time, and accuracy are the most important considerations. For this purpose, free National Agriculture Imagery Program (NAIP) imagery datasets are used to provide an alternative to the costly satellite, LiDAR, and UAV data that many others use for UTC managers. Commonly used ArcGIS software and desktop computer specifications are used for analysis to simulate the resources available to urban forestry managers. Following the comparison, a change detection analysis is presented as a case study to reveal where and how tree canopies in the two diverse cities of interest, Laurel, Mississippi (MS) and Georgetown, Texas (TX), USA, have changed over a 10-year period. Temperature maps derived from Landsat 9 thermal bands are also used to reveal the location of high temperatures in relation to tree canopy gain or loss. Such land surface temperature mapping and subsequent comparisons with tree canopy coverage is common among research projects as many seek to apply the technology to identify or solve many concerns in urban areas. In fact, several studies are now working to model changes of LST in the future using data from the past now that we have sufficiently large data histories [
40,
41]. Many are working hard on specific applications in large cities around the world, but few are investigating smaller cities [
42,
43,
44]. We aim to expand the literature in this area as well by looking at smaller regions and compare with deep learning data, which has not yet been conducted.
We hypothesize that deep learning, as available through accessible and widely used GIS software packages, will provide the best overall accuracy results and suggest that NAIP data alone is sufficient to provide accuracies similar to other studies that use multiple datasets. Furthermore, we suggest that tree canopy loss will be more prevalent over a 10-year period for Georgetown, TX because of its rapid growth and it will be visually correlated with high temperatures in the regions of canopy loss.
Section 2 of this article will introduce the study areas that make this study unique and discuss methods for classifying data and generating temperature maps.
Section 3 will showcase the results of this study, while
Section 4 will discuss the strengths of our approach as well as limitations. Finally,
Section 5 concludes by sharing the most important information.
4. Discussion
This study examined the effectiveness of two machine learning and one deep learning models adapted to remote sensing imagery classification for calculating UTC area within a small southern town and a quickly growing southern town in the United States of America. All three classifiers performed more quickly when processing the smaller town data from Laurel as the data size was smaller and more geographically compact. The overall accuracy for each classifier was highest for the small, more simple Laurel imagery, but for both Georgetown and Laurel, the U-Net classifier performed the best overall (89.8% and 91.4%, respectively), better than the SVM and RF classifiers. However, the SVM and RF classifiers were much faster than the U-Net classifiers in the ArcGIS Pro 3.1.3 setting. This may be caused by the computer hardware’s limitations.
The findings from the experiment reflect well among similar experiments performed with other classifiers for determining UTC area coverage (
Table 11). In their comparative study of different spatial scales for mapping accurate UTC using U-Net and object-based image analysis (OBIA), Wang et al. [
35] found their U-Net model implemented with Python outperformed OBIA in every measure, reaching a result of 99% overall accuracy. Using both Google Earth imagery and LiDAR datasets, Timilsina et al. [
38] obtained overall accuracies of 96% and 98% for 2005 and 2015, respectively. Most studies have relied solely on aerial or satellite imagery to detect and classify tree canopy and have resulted in high overall accuracies when processed with a deep learning classifier. The weakest results were still impressive because of the intent to not only classify tree canopy but the species of tree [
39]. Overall, the U-Net classification overall accuracy obtained in this study (89.8% and 91.4%) are comparable to other results presented in
Table 11. The SVM and RF classifiers, however, performed poorly compared to the studies presented in
Table 11. The SVM and RF classifiers repeatedly performed at about 70–80% overall accuracy, while the results shown below reach upwards of 94% [
27,
28,
52]. This reveals some potential improvements to be made to the SVM and RF methods presented here. Because SVM is known to run well on fewer training samples, reducing the input for training this classifier may have improved the results. Nevertheless, the results of this study suggest that free-to-use NAIP RGB imagery is an accessible and comparable resource to use for obtaining UTC area coverage over small and growing cities. The NAIP imagery and robust tools available in ArcGIS Pro, especially the U-Net deep learning classifier, are able to provide accurate results for urban forestry managers without needing programming experience.
The case study presented UTC change results between the 2012 and 2022/2023 NAIP imagery sets for each city and compared the spatial distribution of UTC change to the urban heat maps derived from the Landsat 9 thermal bands. The patterns found in large amounts of growth in Georgetown, TX, and to a smaller extent in Laurel, were similar to the areas of growth in Columbia, SC in [
52]. They are also consistent with findings by Tamaskani Esfehankalateh et al. [
6] and Loughner et al. [
5] that suggest a strong correlation between UTC and urban heat. Spatially, the city centers of both cities experienced the least change in UTC while the periphery experienced the most urban growth and UTC decline. This suggests a stable city, like Laurel, is likely to experience little change to the UHI effect within its borders as long as trees are protected like they are in Laurel. It also suggests that it is more difficult to mitigate UHI effect in the center of cities as there is less room for trees to be planted and grow once they are already removed for urban growth. Very little, if any, tree growth was reported in either city during the 10-year period. We also found that the differences between the temperatures in the areas tree canopy loss and the temperatures in the areas of no change in both cities were statistically significant, indicating the role UTC plays in mitigating heat. The temperatures of the tree canopy loss areas were higher than those throughout the rest of the city. Future policies should prevent significant loss of tree canopy within city limits to mitigate further heat increases.
Several limitations were encountered during the course of this study. NAIP data are collected at times and dates that are beyond the control of any application. For example, despite being collected during leaf on conditions, the window for data collection is from march until October in most southern US cities, meaning that each year may be more or less easily classified depending on the tree conditions and greenness. Additionally, technological limitations with the computer hardware can impact data classification times. More modern computers, supercomputers, and cloud computing with ArcGIS can remove much of the frustration with processing time that we encountered in this study. Future work should focus on adding additional deep learning models to compare with U-Net to determine if another model may perform faster and provide a more accurate classification. Studies focusing on the identification and counting of individual urban trees have assessed multiple deep learning models, but it has yet to be investigated for mapping UTC area coverage [
32,
33,
34,
65,
66,
67].