Classification of Transmission Line Corridor Tree Species Based on Drone Data and Machine Learning
Abstract
:1. Introduction
2. Materials and Methods
2.1. Study Area
2.2. Aerial Image and LiDAR Data
2.3. Methods
2.3.1. Data Preprocessing
2.3.2. Individual-Tree Canopy Segmentation
2.3.3. Feature Extraction
2.3.4. Feature Selection Based on the RF Algorithm
2.3.5. Tree Species Classification Based on Machine Learning
2.3.6. Accuracy Evaluation Indicators
3. Results
3.1. Optimized CHM Extraction Results
3.2. Individual Tree Segmentation Results
3.3. Feature Screening Results
3.4. Classification Results and Accuracy Evaluation of Individual Tree Species
3.5. Results Analysis
- (1)
- When using DOM only, scheme Ⅲ had the highest classification accuracy with an overall accuracy of 79,35%, Kappa coefficient of 0.71, and MAE of 0.29. After feature selection, the accuracy of both classifiers improved. The classification schemes with feature selection improved the accuracy of classification using RF and SVM by 5.16% and 1.93%, respectively, compared to the schemes without feature selection.
- (2)
- When using LiDAR only, none of the classification results of schemes Ⅴ–Ⅷ were very good, and none of the overall accuracies reached 55%. For this study area, the effect of using LiDAR only for tree species classification was not satisfactory.
- (3)
- When using the combination of DOM and LiDAR for classification, scheme 12 had the best classification results, with an overall accuracy of 85.16% and a Kappa coefficient of 0.79. The accuracy of classification using RF and SVM improved by 3.23% and 6.45%, respectively, after feature selection compared to that in the scheme without feature selection.
- (4)
- In terms of tree species, Paulownia was more affected by feature selection, and in most cases, PA, UA improved after feature selection. Oak and fir were more affected by feature selection when LiDAR and DOM were combined for classification, and there was a significant improvement in PA and UA. The classification accuracy of other tree species was not ideal due to more internal species, and it may be necessary to classify other tree species into several more detailed categories in order to improve the accuracy.
4. Discussion
4.1. The Impact of Feature Screening on Classification
4.2. The Impact of the Classification Algorithm on the Accuracy
4.3. Contribution of Different Features to Classification
4.4. Effect of Observation Season on the Classification Accuracy
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Chao, M. Research on Feature Extraction Method of Power Line Corridor Based on Multiple Remote Sensing Data; WuHan University: Wuhan, China, 2010. [Google Scholar]
- Hao, G.; Xuefeng, Z.; Zandong, Z.; Shengqiang, Z. Transmission line corridor scene classification based on high-resolution remote sensing images. J. Wuhan Univ. 2014, 47, 712–716. [Google Scholar]
- Chong, H.; Chenchen, Z.; Qingsheng, L.; He, L.; Xiaomei, Y.; Gaohuan, L. Refined identification of typical tropical plantation tree species based on multi-features of optical and radar images. For. Sci. 2021, 57, 80–91. [Google Scholar]
- Jiaqi, Y. Research on Stand Type Identification Based on Airborne Hyperspectral and Lidar Data; Northeast Forestry University: Harbin, China, 2021. [Google Scholar]
- Yinghui, Z.; Dali, Z.; Zhen, Z. Classification of single tree species based on nonparametric classification algorithm and multi-source remote sensing data. J. Nanjing For. Univ. 2019, 43, 103–112. [Google Scholar]
- Yufeng, J. Research on Interspecific Classification of Mangroves Based on High-Resolution Multi-Source Remote Sensing Images; Shandong Agricultural University: Taian, China, 2021. [Google Scholar]
- Rottensteiner, F.; Trinder, J.; Clode, S.; Kubik, K. Using the Dempster–Shafer method for the fusion of LIDAR data and multi-spectral images for building detection. Inf. Fusion 2005, 6, 283–300. [Google Scholar] [CrossRef]
- Chen, G.; Weng, Q.; Hay, G.J.; He, Y. Geographic object-based image analysis (GEOBIA): Emerging trends and future opportunities. Gisci. Remote Sens. 2018, 55, 159–182. [Google Scholar] [CrossRef]
- Franklin, S.E.; Ahmed, O.S.; Williams, G. Northern Conifer Forest Species Classification Using Multispectral Data Acquired from an Unmanned Aerial Vehicle. Photogramm. Eng. Remote Sens. 2018, 55, 159–182. [Google Scholar] [CrossRef]
- Sun, H.; Deng, T.; Yanchao, L.I. Image segmentation algorithm based on the improved watershed algorithm. J. Harbin Eng. Univ. 2014, 35, 857–864. [Google Scholar]
- Buddenbaum, H.; Schlerf, M.; Hill, J. Classification of coniferous tree species and age classes using hyperspectral data and geostatistical methods. Int. J. Remote Sens. 2005, 26, 5453–5465. [Google Scholar] [CrossRef]
- Franklin, S.E.; Ahmed, O.S. Deciduous tree species classification using object-based analysis and machine learning with unmanned aerial vehicle multispectral data. Int. J. Remote Sens. 2018, 39, 5236–5245. [Google Scholar] [CrossRef]
- Ahmed, O.S.; Shemrock, A.; Chabot, D.; Dillon, C.; Williams, G.; Wasson, R.; Franklin, S.E. Hierarchical land cover and vegetation classification using multispectral data acquired from an unmanned aerial vehicle. Int. J. Remote Sens. 2017, 38, 2037–2052. [Google Scholar] [CrossRef]
- Chan, C.W.; Paelinckx, D. Evaluation of Random Forest and Adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery. Remote Sens. Environ. 2008, 112, 2999–3011. [Google Scholar] [CrossRef]
- Puttonen, E.; Jaakkola, A.; Litkey, P.; Hyypp, J. Tree Classification with Fused Mobile Laser Scanning and Hyperspectral Data. Sensors 2011, 11, 5158–5182. [Google Scholar] [CrossRef] [PubMed]
- Yijun, L.; Yong, P.; Shengxi, L.; Wen, J.; Bowei, C.; Luxia, L. Merged Airborne LiDAR and Hyperspectral Data for Tree Species Classification in Puer’s Mountains Area. For. Sci. Res. 2016, 29, 407–412. [Google Scholar]
- Kou, W.; Dong, J.; Xiao, X.; Hernandez, A.J.; Qin, Y.; Zhang, G.; Chen, B.; Lu, N.; Doughty, R. Expansion dynamics of deciduous rubber plantations in Xishuangbanna, China during 2000–2010. Gisci. Remote Sens. 2018, 55, 905–925. [Google Scholar] [CrossRef]
- Qiong, W.; Ruofei, Z.; Wenji, Z.; Kai, S.; Liming, D. Land-cover classification using GF-2 images and airborne lidar data based on Random Forest. Int. J. Remote Sens. 2018, 40, 2410–2426. [Google Scholar]
- Xiaoqin, W.; Miaomiao, W.; Shaoqiang, W.; Yundong, W. Vegetation Information Extraction Based on UAV Remote Sensing in Visible Light Band. Chin. J. Agric. Eng. 2015, 31, 152–157. [Google Scholar]
- Hong, G.; Zhang, A.; Zhou, F.; Brisco, B. Integration of optical and synthetic aperture radar (SAR) images to differentiate grassland and alfalfa in Prairie area. Int. J. Appl. Earth Obs. Geoinf. 2014, 28, 12–19. [Google Scholar] [CrossRef]
- Vincent, L.; Soille, P.J. Watersheds in Digital Spaces. IEEE Trans. Pattern Anal. Mach. Intell. 1991, 13, 583–598. [Google Scholar] [CrossRef] [Green Version]
- Pal, M. Feature Selection for Classification of Hyperspectral Data by SVM. IEEE Trans. Geosci. Remote Sens. 2010, 48, 2297. [Google Scholar] [CrossRef] [Green Version]
- Mei, H.; Zhu, Y. K-anonymous feature optimization based on the importance of random forest features. Comput. Appl. Softw. 2020, 37, 266–270. [Google Scholar]
- Mountrakis, G.; Im, J.; Ogole, C. Support vector machines in remote sensing: A review. ISPRS J. Photogramm. 2011, 66, 247–259. [Google Scholar] [CrossRef]
- Congalton, R.G. A review of assessing the accuracy of classifications of remotely sensed data. Remote Sens. Environ. 1991, 37, 35–46. [Google Scholar] [CrossRef]
- Hs, A.; Eh, B.; Aeka, C.; Es, D.; Omeb, E. Deep Learning model and Classification Explainability of Renewable energy-driven Membrane Desalination System using Evaporative Cooler. Alex. Eng. J. 2022, 61, 10007–10024. [Google Scholar]
- Shams, M.Y.; Elzeki, O.M.; Abouelmagd, L.M.; Hassanien, A.E.; Salem, H. HANA: A Healthy Artificial Nutrition Analysis model during COVID-19 Pandemic. Comput. Biol. Med. 2021, 135, 104606. [Google Scholar] [CrossRef]
- Hs, A.; Aeka, B.; Es, C.; Omed, E. Predictive modelling for solar power-driven hybrid desalination system using artificial neural network regression with Adam optimization. Desalination 2022, 522, 115411. [Google Scholar]
- Pham, L.T.H.; Brabyn, L.; Ashraf, S. Combining QuickBird, LiDAR, and GIS topography indices to identify a single native tree species in a complex landscape using an object-based classification approach. Int. J. Appl. Earth Obs. Geoinf. 2016, 50, 187–197. [Google Scholar] [CrossRef]
- Liu, H. Classification of urban tree species using multi-features derived from four-season RedEdge-MX data. Comput. Electron. Agric. 2022, 194, 106794. [Google Scholar] [CrossRef]
- Mäyrä, J.; Keski-Saari, S.; Kivinen, S.; Tanhuanpää, T.; Hurskainen, P.; Kullberg, P.; Poikolainen, L.; Viinikka, A.; Tuominen, S.; Kumpula, T.; et al. Tree species classification from airborne hyperspectral and LiDAR data using 3D convolutional neural networks. Remote Sens. Environ. 2021, 256, 112322. [Google Scholar] [CrossRef]
- Zhou, X.; Zhang, X. Individual Tree Parameters Estimation for Plantation Forests Based on UAV Oblique Photography. IEEE Access 2020, 8, 96184–96198. [Google Scholar] [CrossRef]
DOM | LiDAR | ||
---|---|---|---|
Ground resolution | 0.1 m | Wavelength | 1064 nm |
Focal length | 35 mm | Laster beam divergence | 0.25 mrad |
Maximum point density | 93 pts/m2 | ||
Minimum point density | 0.6 pts/m2 |
Spectral Features | Feature Description | Symbolic Representation |
---|---|---|
Mean | Average pixel value of an object in a certain band | Rmean, Gmean, Bmean 1 |
Standard deviation | Degree of dispersion of the gray value of pixels in the object area | Rstd, Gstd, Bstd 2 |
Texture Features | Feature Description | Symbolic Representation |
---|---|---|
Homogeneity | Homogeneity of the image | Rhom3 (5,7,9,11), Ghom3 (5,7,9,11), Bhom3 (5,7,9,11) 3 |
Contrast | Quality of image sharpness and depth of texture grooves | Rcon3 (5,7,9,11), Gcon3 (5,7,9,11), Bcon3 (5,7,9,11) 3 |
Difference | Texture feature of the local image area | Rdis3 (5,7,9,11), Gdis3 (5,7,9,11), Bdis3 (5,7,9,11) 3 |
Information entropy | Randomness measure of all information | Rent3 (5,7,9,11), Gent3 (5,7,9,11), Bent3 (5,7,9,11) 3 |
Second order | Uniformity of gray distribution of image and thickness of texture | Rsec3 (5,7,9,11), Gsec3 (5,7,9,11), Bsec3 (5,7,9,11) 3 |
Correlation | Similarity of image gray levels | Rcor3 (5,7,9,11), Gcor3 (5,7,9,11), Bcor3 (5,7,9,11) 3 |
Geometric Features | Feature Description | Symbolic Representation |
---|---|---|
Area | Area of segmented object | Area |
Perimeter | Perimeter of segmented object | Perimeter |
Area perimeter ratio | Ratio of area of segmented object to perimeter | A_P |
Point-Cloud Features | Feature Description | Symbolic Representation |
---|---|---|
Cumulative height percentile | Calculation of cumulative height percentile at 10% interval and calculation of its values at 25% and 75% intervals | H1, H10, H20, H25, H30, H40, H50, H60, H70, H75, H80, H90, H99 4 |
Height percentile | Calculation of height percentile at 10% intervals and calculation of its values at 25% and 75% intervals | HP1, HP10, HP20, HP25, HP30, HP40, HP50, HP60, HP70, HP75, HP80, HP90, HP99 5 |
Cumulative intensity percentile | Calculation of cumulative echo intensity percentile at 10% interval and calculation of its values at 25% and 75% intervals | INT1, INT10, Int20, Int25, Int30, Int40, Int50, Int60, Int70, Int75, Int80, Int90, Int99 6 |
Intensity percentile | Calculation of the percentile of echo intensity at 10% interval and calculation of its values at 25% and 75% intervals | IntP1, IntP10, IntP20, IntP25, IntP30, IntP40, IntP50, IntP60, IntP70, IntP75, IntP80, IntP90, IntP99 7 |
Mean intensity | Mean intensity of all echoes | INTmean |
Intensity standard deviation | Intensity standard deviation of all echoes | INTstd |
Intensity variance | Intensity variance of all echoes | INTvar |
CHM Features | Feature Description | Symbolic Representation |
---|---|---|
Mean | Mean height of divided tree canopy | Hmean |
Maximum | Maximum height of divided tree canopy | Hmax |
Minimum | Minimum height of split canopy | Hmin |
Standard deviation | Standard deviation of height of divided tree canopy | Hstd |
Variance | Division of height variance of canopy | Hvar |
Slope | Division of the slope of the canopy | Hslope |
Scheme | Feature Select | Type of Data | Classifier |
---|---|---|---|
I | No | DOM | RF |
II | No | DOM | SVM |
III | Yes | DOM | RF |
IV | Yes | DOM | SVM |
V | No | LiDAR | RF |
VI | No | LiDAR | SVM |
VII | Yes | LiDAR | RF |
VIII | Yes | LiDAR | SVM |
IX | No | DOM, LiDAR | RF |
X | No | DOM, LiDAR | SVM |
XI | Yes | DOM, LiDAR | RF |
XII | Yes | DOM, LiDAR | SVM |
Evaluation Index | Calculation Formula | Indicator Description |
---|---|---|
user accuracy, UA | Ratio of number of samples correctly classified into category i to the total number of samples in category i in the classification result, which reflects the reliability of a certain category being correctly identified | |
producer accuracy, PA | Ratio of the number of correct classifications of a category to the total number of that category in the reference sample | |
overall accuracy, OA | Proportion of correctly classified samples to the total sample, reflecting the consistency between the classification results and the actual features | |
Kappa coefficient | A precision statistic used to determine the matching degree between the actual feature category and classification result, which can weaken the influence of sample selection on the accuracy verification | |
MAE | Measure of the difference between the predicted and actual values of the model. |
Scheme | Accuracy (%) | Paulownia | oak | fir | Other Tree Species | OA (%) | Kappa | MAE |
---|---|---|---|---|---|---|---|---|
I | PA | 70.00 | 87.50 | 66.67 | 69.44 | 74.19 | 0.66 | 0.39 |
UA | 77.78 | 76.36 | 72.34 | 71.43 | ||||
II | PA | 60.00 | 83.33 | 64.71 | 72.22 | 71.61 | 0.60 | 0.42 |
UA | 75.00 | 67.80 | 73.33 | 74.29 | ||||
III | PA | 80.00 | 89.58 | 64.71 | 77.78 | 79.35 | 0.71 | 0.29 |
UA | 88.89 | 74.14 | 78.26 | 84.85 | ||||
IV | PA | 55.00 | 85.42 | 70.59 | 75.00 | 73.54 | 0.63 | 0.41 |
UA | 84.62 | 77.36 | 71.43 | 67.50 | ||||
V | PA | 55.00 | 70.83 | 45.10 | 47.22 | 52.26 | 0.34 | 0.74 |
UA | 55.00 | 57.63 | 52.27 | 53.13 | ||||
VI | PA | 55.00 | 56.25 | 54.90 | 36.11 | 50.97 | 0.39 | 0.73 |
UA | 52.38 | 55.10 | 44.44 | 59.10 | ||||
VII | PA | 45.00 | 66.67 | 43.14 | 55.56 | 53.55 | 0.36 | 0.69 |
UA | 50.00 | 51.61 | 52.38 | 60.61 | ||||
VIII | PA | 35.00 | 58.70 | 45.10 | 77.78 | 54.84 | 0.39 | 0.71 |
UA | 58.33 | 58.70 | 56.10 | 56.00 | ||||
IX | PA | 80.00 | 83.33 | 76.47 | 80.56 | 80.00 | 0.73 | 0.36 |
UA | 72.73 | 78.43 | 73.58 | 100.00 | ||||
X | PA | 75.00 | 81.25 | 84.31 | 69.44 | 78.21 | 0.70 | 0.30 |
UA | 83.33 | 82.98 | 72.88 | 80.65 | ||||
XI | PA | 90.00 | 85.75 | 74.51 | 86.11 | 83.23 | 0.77 | 0.23 |
UA | 85.71 | 77.78 | 82.61 | 91.18 | ||||
XII | PA | 85.00 | 85.42 | 86.27 | 83.33 | 85.16 | 0.79 | 0.21 |
UA | 89.47 | 89.13 | 80.00 | 85.71 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, X.; Wang, R.; Chen, X.; Li, Y.; Duan, Y. Classification of Transmission Line Corridor Tree Species Based on Drone Data and Machine Learning. Sustainability 2022, 14, 8273. https://doi.org/10.3390/su14148273
Li X, Wang R, Chen X, Li Y, Duan Y. Classification of Transmission Line Corridor Tree Species Based on Drone Data and Machine Learning. Sustainability. 2022; 14(14):8273. https://doi.org/10.3390/su14148273
Chicago/Turabian StyleLi, Xiuting, Ruirui Wang, Xingwang Chen, Yiran Li, and Yunshan Duan. 2022. "Classification of Transmission Line Corridor Tree Species Based on Drone Data and Machine Learning" Sustainability 14, no. 14: 8273. https://doi.org/10.3390/su14148273