Evaluating Tree Species Mapping: Probability Sampling Validation of Pure and Mixed Species Classes Using Convolutional Neural Networks and Sentinel-2 Time Series
Abstract
1. Introduction
- (1)
- to expand the set of pure tree species classes by incorporating mixed classes—each consisting of two pairwise different species—and sparse classes with low canopy cover into training and validation;
- (2)
- to evaluate the generated tree species maps through a probability sample-based validation, with a specific emphasis on investigating spatial autocorrelation.
- the use of a dense time series from multi-annual S2 data developed by [36], providing phenology index data at the S2 pixel level that are consistent across S2 granules;
- the inclusion of mixed species classes in training and validation data, specific validation metrics for mixed and pure species classes, and training data synthesis;
- a hybrid neural network architecture tailored specifically for the combination of time series and non-time series features, based on [43];
- A spatial split and an independent probability sample-based validation including a comprehensive and innovative spatial autocorrelation analysis for both.
2. Materials and Methods
2.1. Study Area
2.2. Forest Area Map
2.3. S2 Phenology Features
2.4. Digital Terrain Model Features
2.5. Normalized Digital Surface Model
2.6. Definitions—Feature Stacks and Feature Space
2.7. Pure, Mixed, and Sparse Classes
2.8. Training Data Labeling
- spatial disjointedness from the NFI validation dataset (see Section 2.10);
- at least 90% target class composition;
- a minimum size of 3000 m2;
- for mixed classes, homogeneous mixing of both tree species;
- for sparse classes, homogeneous mixing of canopy cover and ground-level area.
2.9. Synthetic Training Data
2.10. NFI Validation Data
2.11. Investigating and Addressing Autocorrelation in Training and Validation Data
2.11.1. Clustered Spatial Splits
2.11.2. Buffered NFI Validation Data
- The NFI-VD plots were divided into 10 folds, denoted as , with classes almost evenly distributed amongst them. This division was accomplished by first determining the number of plots for each class . For each fold , random observational plots (not already assigned to another fold) were selected. If the class of a selected plot was not yet represented by more than in , the plot was added to the fold. Additionally, the other plots from the same NFI cluster (each comprising four NFI plots) were included in the fold.
- For each NFI-VD fold, training areas contained within any buffered NFI plot from the respective fold were eliminated. This process yielded 10 sets of training data, denoted as , each corresponding to an NFI validation data fold.
- For each set of training data , a model was trained.
- Subsequently, each model was validated using the corresponding NFI validation data fold .
- Finally, the validation results over all folds were evaluated.
2.12. Neural Network Architecture
2.13. Neural Network Training
2.14. Tree Species Map Validation
- Overall accuracy (OA) for the NFI-VAL and NFI-weighted OA (NFI-w-OA) during training (the holdout set OA was weighted by NFI-Class-Distribution).
- Overall misclassification score (OMS) for the NFI-VAL: A score calculated based on the severity, judged by the phenological similarity, of all misclassifications in the confusion matrix. A score of 1.00 is the best possible value and means that all predictions are correct. See Appendix B for details.
- Prediction in close phenological proximity (PCPP) for the NFI-VAL: Predictions where at least one of the involved classes is correctly predicted. Examples include pixels predicted as spruce–fir but validated as spruce, pixels predicted as spruce–larch but validated as larch–arolla pine, pixels predicted as pine–oak but validated as oak, and pixels predicted as spruce–beech but validated as spruce–deciduous.
- Deciduous and coniferous confusions (DCC) for the NFI-VAL: Confusions between pure coniferous and pure deciduous classes. Examples include pixels predicted as larch but validated as oak and pixels predicted as beech but validated as spruce–larch.
- Post hoc pure class overall accuracy (POA), determined by eliminating all non-pure-class entries from the confusion matrix.
- Post hoc mixed class overall accuracy (MOA), calculated by eliminating all non-mixed-class entries from the confusion matrix.
- Macro-averaged (each class was weighted equally) F1 score (MAF1).
- F1 scores, producer and user accuracies, and misclassification scores on a class level.
- Confusion matrices for splits as well as aggregated confusion matrices over multiple splits.
2.14.1. Random Holdout Set Validation
2.14.2. Clustered Spatial Split Validation
2.14.3. NFI Data Validation
2.15. Training Data and Model Configurations
- The no_syn model used the raw training data without any synthetic data for mixed or sparse classes.
- The res_xx_yy_yy models were built with a with xx filters in the first block of the residual network and yy filters in the second and third blocks. Additionally, the sizes of the trailing MLP layers were adjusted.
3. Results
3.1. Clustered Spatial Split Distance Analysis
3.2. NFI Validation Buffer Distance Analysis
3.3. Models
4. Discussion
4.1. Mixed Species Classes, Training Data Labeling, and Training Data Synthesis
4.2. Neural Network Architecture
4.3. Autocorrelation Analysis
4.4. Validation
4.5. Results and Model Performance
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
Name | Description |
---|---|
MPC_increm_abs * | Increment from one DOY to the next |
DEFOLIATION_doy | DOY in [245:330] where MPC_increm_abs is minimal |
DEFOLIATION_start | DOY of last local maximum before DEFOLIATION_doy |
DEFOLIATION_end | DOY of first local minimum after DEFOLIATION_doy that is below 25th-MPC-percentile |
DEFOLIATION_duration | DEFOLIATION_end-DEFOLIATION_start |
DEFOLIATION_doy_adj | Mean (DEFOLIATION_start, DEFOLIATION_end) |
DEFOLIATION_gradient_median | Median (MPC_increm_abs[DEFOLIATION_start, DEFOLIATION_end]) |
DEFOLIATION_gradient_min | Min(MPC_increm_abs[245:330]) |
GREENING_doy | DOY in [90:182] where MPC_increm_abs is maximal |
GREENING_start | DOY of last local minimum before GREENING_doy |
GREENING_end | First local maximum after GREENING_doy that is above 75th-MPC-percentile |
GREENING_doy_adj | Mean (GREENING_start, GREENING_end) |
GREENING_duration | GREENING_end-GREENING_start |
GREENING_gradient_median | Median(MPC_increm_abs[GREENING_start, GREENING_end]) |
GREENING_gradient_max | Max(MPC_increm_abs[90:182]) |
DP_max | Maximum value from data points |
DP_ampl | (DP_max-MOD_mean)/MOD_mean * 100 |
MOD_ALL_nDP | Number of data points before outlier filter |
MOD_MP_nDP | Number of data points used for modelling after filtering and modelling period adaption |
MOD_n_years | Number of years for modelling |
MOD_max | Maximum of MPC |
MOD_max_doy | DOY of maximum of MPC |
MOD_min | Minimum of MPC |
MOD_min_doy | DOY of minimum of MPC |
MOD_mean | Mean of MPC |
MOD_median | Median of MPC |
MOD_percx | x-th percentile of MPC |
MOD_range_max_min | MOD_max-MOD_min |
MOD_range_p75_p25 | MOD_perc75-MOD_perc25 |
MOD_range_p90_p20 | MOD_perc90-MOD_perc10 |
MOD_sd | Std (differences (model, data points)) in modeling period |
MOD_ampl_max | (MOD_max-MOD_mean)/MOD_mean * 100 |
MOD_ampl_p75 | (MOD_per75-MOD_mean)/MOD_mean * 100 |
MOD_ampl_p90 | (MOD_per90-MOD_mean)/MOD_mean * 100 |
MOD_dp_dev_all_abs | Median (diffs(model, data points)) in modeling period |
MOD_dp_dev_neg_abs | Median (non-positive differences (model, data points)) in modeling period |
MOD_dp_dev_pos_abs | Median (non-negative differences (model, data points)) in modeling period |
MOD_dp_dev_all_rel | Median (differences (model, data points)/MPC values * 100) in modeling period |
MOD_dp_dev_neg_rel | Median (non-positive differences (model, data points)/MPC values * 100) in modeling period |
MOD_dp_dev_pos_rel | Median (non-negative differences (model, data points)/MPC values * 100) in modeling period |
MTC | Second biggest number of days above 0.5 perc in a row |
MTC_startdoy | Start DOY of MTC |
PTA_x_firstreach | DOY when percentile x is reached for the first time |
PTA_x_lastpass | DOY when percentile x is passed from above for the last time |
PTA_x_n_above | Number of MPC values above percentile x |
PTA_x_n_transition | Number of times MPC values transition above percentile x |
PTA_x_value | Value of percentile x |
PTC | Maximum of number of days above 0.65 perc in a row |
PTC_startdoy | start DOY for PTC |
VP_start | PTA_0.6_firstreach |
VP_end | VP_start + LBG |
VPL | PTA_0.6_lastpass-PTA_0.6_firstreach |
VEGPERIOD_length | DEFOLIATION_doy-GREENING_doy |
VEGPERIOD_length_adj | DEFOLIATION_doy_adj-GREENING_doy_adj |
VA (Vegetation-Abundance-Index) | Mean (MPC values ≥ 0.5 Percentile) |
TD (Temporal-Dispersion) | (days above 0.75 percentile) * mean (MPC values above 0.75 perc-0.75) |
LGB (Length of growing biomass) | Number of days where MPC is above threshold (threshold = (0.95 perc + 0.05 perc)/2 |
Spruce | Spruce-Fir | Spruce-Larch | Spruce- Pine | Spruce- Arolla Pine | Larch | Larch- Arolla Pine | Pine | Spruce-Beech | Spruce- Other Deciduous | Larch- Other Deciduous | Pine- Oak | Pine- Other Deciduous | Beech | Oak | Other Deciduous | Mountain Pine | Green Alder | Low Vegetation | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Spruce | 8168 | 613 | 922 | 638 | 45 | 24 | 15 | 47 | 436 | 365 | 16 | 5 | 25 | 29 | 9 | 64 | 3 | 0 | 0 |
Spruce-fir | 34 | 222 | 14 | 5 | 0 | 0 | 0 | 0 | 18 | 13 | 0 | 0 | 0 | 1 | 0 | 3 | 0 | 0 | 0 |
Spruce-larch | 362 | 28 | 957 | 22 | 25 | 30 | 47 | 0 | 20 | 25 | 1 | 0 | 0 | 0 | 0 | 4 | 5 | 1 | 0 |
Spruce-pine | 25 | 8 | 4 | 186 | 0 | 0 | 0 | 2 | 6 | 6 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 |
Spruce-arolla pine | 33 | 0 | 13 | 0 | 17 | 0 | 4 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Larch | 100 | 13 | 176 | 5 | 16 | 208 | 54 | 3 | 44 | 21 | 12 | 1 | 5 | 14 | 2 | 11 | 9 | 6 | 0 |
Larch-arolla pine | 64 | 1 | 60 | 3 | 16 | 7 | 61 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 1 | 0 |
Pine | 91 | 11 | 52 | 328 | 1 | 5 | 5 | 308 | 26 | 33 | 5 | 21 | 57 | 6 | 5 | 12 | 2 | 0 | 0 |
Spruce-beech | 180 | 67 | 60 | 46 | 0 | 2 | 0 | 1 | 733 | 108 | 8 | 0 | 9 | 56 | 0 | 33 | 0 | 0 | 0 |
Spruce-other deciduous | 147 | 78 | 45 | 51 | 0 | 0 | 0 | 6 | 342 | 510 | 23 | 2 | 29 | 115 | 10 | 94 | 1 | 0 | 0 |
Larch-other deciduous | 37 | 7 | 26 | 10 | 0 | 3 | 0 | 0 | 143 | 29 | 41 | 2 | 3 | 71 | 2 | 30 | 2 | 1 | 0 |
Pine-oak | 1 | 0 | 0 | 6 | 0 | 0 | 0 | 5 | 7 | 8 | 0 | 40 | 9 | 0 | 1 | 3 | 0 | 0 | 0 |
Pine-other deciduous | 32 | 9 | 11 | 57 | 0 | 0 | 0 | 28 | 85 | 47 | 10 | 55 | 409 | 45 | 16 | 27 | 0 | 0 | 0 |
Beech | 16 | 4 | 3 | 3 | 0 | 0 | 0 | 0 | 181 | 16 | 34 | 1 | 9 | 1008 | 26 | 57 | 2 | 0 | 0 |
Oak | 2 | 0 | 1 | 2 | 0 | 0 | 0 | 2 | 19 | 24 | 11 | 21 | 8 | 54 | 375 | 97 | 0 | 0 | 0 |
Other deciduous | 99 | 22 | 40 | 29 | 0 | 6 | 0 | 6 | 325 | 142 | 44 | 33 | 56 | 292 | 84 | 1457 | 0 | 6 | 0 |
Mountain pine | 19 | 0 | 16 | 2 | 3 | 10 | 7 | 0 | 0 | 2 | 0 | 0 | 0 | 1 | 0 | 0 | 406 | 1 | 0 |
Green alder | 0 | 0 | 7 | 0 | 0 | 2 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 2 | 82 | 0 |
Low vegetation | 870 | 94 | 422 | 84 | 22 | 50 | 20 | 12 | 310 | 419 | 21 | 2 | 17 | 106 | 13 | 197 | 24 | 55 | 0 |
Accuracy Measure | Value |
---|---|
Overall accuracy [%] | 55.33 ±1.8 |
Post-hoc pure class accuracy [%] | 90.73 ± 1.3 |
Post-hoc mixed class accuracy [%] | 64.64 ± 4.2 |
Macro F1 score [%] | 42.6 ± 1.3 |
Overall misclassification score | 1.63 ± 0.04 |
Level 1 misclassifications [%] | 69.63 |
Up to Level 2 misclassifications [%] | 79.39 |
Up to Level 3 misclassifications [%] | 84.55 |
Level 4 misclassifications [%] | 3.95 |
Level 5 misclassifications [%] | 1.53 |
Level 0 misclassifications [%] | 9.97 |
Spruce | Spruce-Fir | Spruce-Larch | Spruce-Pine | Spruce-Arolla Pine | Larch | Larch-Arolla Pine | Pine | Spruce-Beech | Spruce-Other Deciduous | Larch-Other Deciduous | Pine-Oak | Pine-Other Deciduous | Beech | Oak | Other Deciduous | Mountain Pine | Green Alder | Low Vegetation | |
F1 scores [%] | 75 | 30 | 44 | 22 | 16 | 40 | 28 | 44 | 37 | 32 | 13 | 30 | 56 | 64 | 65 | 62 | 88 | 65 | 0 |
Misclassification scores | 1.30 | 1.74 | 1.54 | 1.67 | 2.08 | 2.04 | 2.41 | 2.02 | 2.12 | 1.97 | 3.28 | 2.43 | 2.00 | 1.98 | 2.10 | 2.06 | 1.28 | 1.42 | |
Producer accuracy [%] | 79.46 | 18.86 | 33.83 | 12.59 | 11.72 | 59.94 | 28.50 | 73.33 | 27.18 | 28.78 | 18.06 | 21.86 | 64.21 | 56.03 | 69.06 | 69.68 | 88.45 | 53.59 | |
User accuracy [%] | 71.50 | 71.61 | 62.67 | 77.82 | 24.64 | 29.71 | 27.85 | 31.82 | 56.25 | 35.10 | 10.07 | 50.00 | 49.22 | 74.12 | 60.88 | 55.17 | 86.94 | 83.67 | 0.00 |
Producer overall misclassification score | 1.19 | 1.98 | 1.57 | 2.01 | 2.14 | 1.46 | 2.04 | 1.65 | 2.38 | 1.73 | 2.84 | 2.69 | 1.73 | 2.24 | 1.94 | 1.57 | 1.16 | 1.08 | |
Producer Level 1 misclassifications [%] | 83.88 | 70.94 | 72.64 | 77.99 | 42.75 | 70.6 | 53.73 | 73.81 | 39.86 | 39.16 | 32.60 | 51.91 | 70.65 | 56.03 | 69.06 | 69.68 | 88.45 | 53.59 | |
Producer up to Level 2 misclassifications [%] | 87.06 | 86.32 | 80.49 | 90.65 | 71.03 | 71.46 | 77.56 | 81.67 | 64.44 | 70.25 | 57.71 | 74.86 | 88.55 | 59.14 | 69.24 | 76.9 | 88.45 | 53.59 | |
Producer up to Level 3 misclassifications [%] | 89.73 | 88.44 | 82.89 | 91.33 | 84.82 | 82.70 | 90.18 | 93.57 | 73.15 | 70.70 | 61.23 | 77.05 | 89.96 | 78.43 | 89.50 | 84.31 | 93.24 | 57.51 | |
Producer Level 4 misclassifications [%] | 0.68 | 1.36 | 0.39 | 0.68 | 0.00 | 0.58 | 0.00 | 1.67 | 15.35 | 5.64 | 29.52 | 21.86 | 7.38 | 12.84 | 5.16 | 1.72 | 0.65 | 0.65 | |
Producer Level 5 misclassifications [%] | 1.14 | 2.21 | 1.80 | 2.30 | 0.00 | 2.31 | 0.47 | 1.90 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.83 | 2.95 | 4.54 | 0.87 | 5.88 | |
Producer Level 0 misclassifications [%] | 8.46 | 7.99 | 14.92 | 5.69 | 15.17 | 14.41 | 9.35 | 2.86 | 11.49 | 23.65 | 9.25 | 1.09 | 2.67 | 5.89 | 2.39 | 9.42 | 5.23 | 35.95 | |
User overall misclassification score | 1.42 | 1.50 | 1.5 | 1.34 | 2.03 | 2.63 | 2.78 | 2.40 | 1.86 | 2.21 | 3.71 | 2.17 | 2.26 | 1.73 | 2.26 | 2.54 | 1.40 | 1.76 | 0.00 |
User Level 1 misclassifications [%] | 90.92 | 82.58 | 88.34 | 89.12 | 72.47 | 62.57 | 31.05 | 65.7 | 64.54 | 62.22 | 17.93 | 61.25 | 62.70 | 74.12 | 60.88 | 55.17 | 86.94 | 83.67 | 0.00 |
User up to Level 2 misclassifications [%] | 97.93 | 98.71 | 99.34 | 99.58 | 100.00 | 64.28 | 65.75 | 73.76 | 95.93 | 90.78 | 32.43 | 76.25 | 76.18 | 87.43 | 64.29 | 64.33 | 86.94 | 83.67 | 0.00 |
User up to Level 3 misclassifications [%] | 98.71 | 98.71 | 99.67 | 99.58 | 100.00 | 85.14 | 98.17 | 91.01 | 97.23 | 90.92 | 68.06 | 95.00 | 86.41 | 93.53 | 88.8 | 78.79 | 99.15 | 85.71 | 0.00 |
User Level 4 misclassifications [%] | 0.40 | 0.00 | 0.00 | 0.00 | 0.00 | 10.14 | 1.37 | 6.61 | 2.76 | 9.08 | 31.94 | 5.00 | 13.6 | 4.41 | 10.06 | 13.56 | 0.43 | 2.04 | 0.00 |
User Level 5 misclassifications [%] | 0.89 | 1.29 | 0.33 | 0.42 | 0 | 4.71 | 0.46 | 2.38 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 2.06 | 1.14 | 7.65 | 0.43 | 12.24 | 0.00 |
User Level 0 misclassifications [%] | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 100.00 |
Producer sum | 10,280 | 1177 | 2829 | 1477 | 145 | 347 | 214 | 420 | 2697 | 1772 | 227 | 183 | 637 | 1799 | 543 | 2 091 | 459 | 153 | 0 |
User sum | 11,424 | 310 | 1527 | 239 | 69 | 700 | 219 | 968 | 1303 | 1453 | 407 | 80 | 831 | 1360 | 616 | 2 641 | 467 | 98 | 2738 |
Appendix B
Misclassification Score
- Pure coniferous consisting of spruce, larch, white pine, black pine, and mountain pine.
- Pure deciduous consisting of beech, oak, green alder, and other deciduous.
- Mixed coniferous consisting of spruce–white pine, spruce–larch, spruce–fir, spruce–arolla pine, and larch-arolla pine.
- Mixed coniferous-deciduous consisting of spruce–beech, spruce–deciduous, white pine–oak, white pine–deciduous, black pine–deciduous, and larch-deciduous.
- Low vegetation.
- Level 1: An exact match between the predicted and the validated class.
- Level 2: Confusion within mixed coniferous or between pure coniferous and mixed coniferous, where one species of the predicted class matches the validation. Examples include predicted spruce–fir but validated as spruce and predicted spruce–larch but validated as larch-arolla pine.
- Level 3: Confusion between pure coniferous or pure deciduous and mixed coniferous-deciduous, between mixed coniferous and mixed coniferous-deciduous or within mixed coniferous-deciduous, where one species of the predicted class matches the validation. Examples include predicted spruce but validated as spruce–beech, predicted pine–oak but validated as oak, predicted spruce–white pine but validated as spruce–deciduous and predicted spruce–beech but validated as spruce–deciduous.
- Level 4: Confusion within pure or mixed coniferous, between pure and mixed coniferous, within pure deciduous, within mixed coniferous-deciduous, where no species of the predicted class matches the validation. Examples include predicted spruce but validated as larch, predicted larch-arolla pine but validated as spruce, predicted larch-arolla pine but validated as spruce–white pine, predicted beech but validated as oak and predicted spruce–beech but validated as black pine–deciduous.
- Level 5: Confusion between pure or mixed coniferous and mixed coniferous-deciduous or between pure deciduous and mixed coniferous-deciduous, where no species of the predicted class matches the validation. Examples include predicted larch-arolla pine but validated as spruce–oak and predicted oak but validated as spruce–deciduous.
- Level 6: Confusion between pure or mixed coniferous and pure deciduous. Examples include predicted larch but validated as oak and predicted beech but validated as spruce–larch.
- Level 0: Confusion between any meta-class and the low vegetation class received distinct handling because the low vegetation class was not included in the NFI-VD data. Therefore, only user confusions could be calculated for these cases.
References
- Baumbach, L.; Hickler, T.; Yousefpour, R.; Hanewinkel, M. High economic costs of reduced carbon sinks and declining biome stability in Central American forests. Nat. Commun. 2023, 14, 2043. [Google Scholar] [CrossRef] [PubMed]
- Berger, F.; Rey, F. Mountain Protection Forests against Natural Hazards and Risks: New French Developments by Integrating Forests in Risk Zoning. Nat. Hazards 2004, 33, 395–404. [Google Scholar] [CrossRef]
- Brang, P.; Schnenberger, W.; Ott, E.; Gardner, B. Forests as Protection from Natural Hazards. In The Forests Handbook; Evans, J., Ed.; Blackwell Science Ltd.: Oxford, UK, 2001; Volume 2, pp. 53–81. ISBN 978-0-470-75707-9. [Google Scholar] [CrossRef]
- Jim, C.Y.; Chen, W.Y. Assessing the ecosystem service of air pollutant removal by urban trees in Guangzhou (China). J. Environ. Manag. 2008, 88, 665–676. [Google Scholar] [CrossRef] [PubMed]
- Miller, D.C.; Hajjar, R. Forests as pathways to prosperity: Empirical insights and conceptual advances. World Dev. 2020, 125, 104647. [Google Scholar] [CrossRef]
- O’Brien, L.E.; Urbanek, R.E.; Gregory, J.D. Ecological functions and human benefits of urban forests. Urban For. Urban Green. 2022, 75, 127707. [Google Scholar] [CrossRef]
- Sander, H.; Polasky, S.; Haight, R.G. The value of urban tree cover: A hedonic property price model in Ramsey and Dakota Counties, Minnesota, USA. Ecol. Econ. 2010, 69, 1646–1656. [Google Scholar] [CrossRef]
- Teich, M.; Accastello, C.; Perzl, F.; Kleemayr, K. (Eds.) Protective Forests as Ecosystem-Based Solution for Disaster Risk Reduction (Eco-DRR); IntechOpen: London, UK, 2022; ISBN 978-1-83969-325-0. Available online: https://www.intechopen.com/books/10812 (accessed on 1 February 2024).
- Yang, J.; McBride, J.; Zhou, J.; Sun, Z. The urban forest in Beijing and its role in air pollution reduction. Urban For. Urban Green. 2005, 3, 65–78. [Google Scholar] [CrossRef]
- Patacca, M.; Lindner, M.; Lucas-Borja, M.E.; Cordonnier, T.; Fidej, G.; Gardiner, B.; Hauf, Y.; Jasinevičius, G.; Labonne, S.; Linkevičius, E.; et al. Significant increase in natural disturbance impacts on European forests since 1950. Glob. Chang. Biol. 2023, 29, 1359–1376. [Google Scholar] [CrossRef] [PubMed]
- Seidl, R.; Thom, D.; Kautz, M.; Martin-Benito, D.; Peltoniemi, M.; Vacchiano, G.; Wild, J.; Ascoli, D.; Petr, M.; Honkaniemi, J.; et al. Forest disturbances under climate change. Nat. Clim. Chang. 2017, 7, 395–402. [Google Scholar] [CrossRef]
- Lindner, M.; Maroschek, M.; Netherer, S.; Kremer, A.; Barbati, A.; Garcia-Gonzalo, J.; Seidl, R.; Delzon, S.; Corona, P.; Kolström, M.; et al. Climate change impacts, adaptive capacity, and vulnerability of European forest ecosystems. For. Ecol. Manag. 2010, 259, 698–709. [Google Scholar] [CrossRef]
- Allen, C.D.; Macalady, A.K.; Chenchouni, H.; Bachelet, D.; McDowell, N.; Vennetier, M.; Kitzberger, T.; Rigling, A.; Breshears, D.D.; Hogg, E.T.; et al. A global overview of drought and heat-induced tree mortality reveals emerging climate change risks for forests. For. Ecol. Manag. 2010, 259, 660–684. [Google Scholar] [CrossRef]
- Hallas, T.; Steyrer, G.; Laaha, G.; Hoch, G. Two unprecedented outbreaks of the European spruce bark beetle, Ips typographus L. (Col., Scolytinae) in Austria since 2015: Different causes and different impacts on forests. Cent. Eur. For. J. 2024, 70, 1–12. [Google Scholar] [CrossRef]
- Hlásny, T.; König, L.; Krokene, P.; Lindner, M.; Montagné-Huck, C.; Müller, J.; Qin, H.; Raffa, K.F.; Schelhaas, M.-J.; Svoboda, M.; et al. Bark Beetle Outbreaks in Europe: State of Knowledge and Ways Forward for Management. Curr. For. Rep. 2021, 7, 138–165. [Google Scholar] [CrossRef]
- Kautz, M.; Meddens, A.J.H.; Hall, R.J.; Arneth, A. Biotic disturbances in Northern Hemisphere forests—A synthesis of recent data, uncertainties and implications for forest monitoring and modelling. Glob. Ecol. Biogeogr. 2017, 26, 533–552. [Google Scholar] [CrossRef]
- Ritzer, E.; Schebeck, M.; Kirisits, T. The pine pathogen Diplodia sapinea is associated with the death of large Douglas fir trees. For. Pathol. 2023, 53, e12823. [Google Scholar] [CrossRef]
- Fassnacht, F.E.; Latifi, H.; Stereńczak, K.; Modzelewska, A.; Lefsky, M.; Waser, L.T.; Straub, C.; Ghosh, A. Review of studies on tree species classification from remotely sensed data. Remote Sens. Environ. 2016, 186, 64–87. [Google Scholar] [CrossRef]
- Hallas, T.; Netherer, S.; Pennerstorfer, J.; Karel, S.; Schadauer, T.; Löw, M.; Baier, P.; Bauerhansl, C.; Kessler, D.; Englisch, M.; et al. The Bark Beetle Dashboard—Towards a Holistic Risk Assessment of Ips typographus. 2024. Available online: https://rgdoi.net/10.13140/RG.2.2.11420.09603 (accessed on 23 July 2024).
- Immitzer, M.; Vuolo, F.; Atzberger, C. First Experience with Sentinel-2 Data for Crop and Tree Species Classifications in Central Europe. Remote Sens. 2016, 8, 166. [Google Scholar] [CrossRef]
- Bolyn, C.; Michez, A.; Gaucher, P.; Lejeune, P.; Bonnet, S. Forest mapping and species composition using supervised per pixel classification of Sentinel-2 imagery. BASE 2018, 22, 172–187. [Google Scholar] [CrossRef]
- Persson, M.; Lindberg, E.; Reese, H. Tree Species Classification with Multi-Temporal Sentinel-2 Data. Remote Sens. 2018, 10, 1794. [Google Scholar] [CrossRef]
- Puletti, N.; Chianucci, F.; Castaldi, C. Use of Sentinel-2 for forest classification in Mediterranean environments. Ann. Silvic. Res. 2018, 42, 32–38. [Google Scholar] [CrossRef]
- Wessel, M.; Brandmeier, M.; Tiede, D. Evaluation of Different Machine Learning Algorithms for Scalable Classification of Tree Types and Tree Species Based on Sentinel-2 Data. Remote Sens. 2018, 10, 1419. [Google Scholar] [CrossRef]
- Grabska, E.; Hostert, P.; Pflugmacher, D.; Ostapowicz, K. Forest Stand Species Mapping Using the Sentinel-2 Time Series. Remote Sens. 2019, 11, 1197. [Google Scholar] [CrossRef]
- Hościło, A.; Lewandowska, A. Mapping Forest Type and Tree Species on a Regional Scale Using Multi-Temporal Sentinel-2 Data. Remote Sens. 2019, 11, 929. [Google Scholar] [CrossRef]
- Immitzer, M.; Neuwirth, M.; Böck, S.; Brenner, H.; Vuolo, F.; Atzberger, C. Optimal Input Features for Tree Species Classification in Central Europe Based on Multi-Temporal Sentinel-2 Data. Remote Sens. 2019, 11, 2599. [Google Scholar] [CrossRef]
- Grabska, E.; Frantz, D.; Ostapowicz, K. Evaluation of machine learning algorithms for forest stand species mapping using Sentinel-2 imagery and environmental data in the Polish Carpathians. Remote Sens. Environ. 2020, 251, 112103. [Google Scholar] [CrossRef]
- Bjerreskov, K.S.; Nord-Larsen, T.; Fensholt, R. Classification of Nemoral Forests with Fusion of Multi-Temporal Sentinel-1 and 2 Data. Remote Sens. 2021, 13, 950. [Google Scholar] [CrossRef]
- Kollert, A.; Bremer, M.; Löw, M.; Rutzinger, M. Exploring the potential of land surface phenology and seasonal cloud free composites of one year of Sentinel-2 imagery for tree species mapping in a mountainous region. Int. J. Appl. Earth Obs. Geoinf. 2021, 94, 102208. [Google Scholar] [CrossRef]
- Xi, Y.; Ren, C.; Tian, Q.; Ren, Y.; Dong, X.; Zhang, Z. Exploitation of Time Series Sentinel-2 Data and Different Machine Learning Algorithms for Detailed Tree Species Classification. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 7589–7603. [Google Scholar] [CrossRef]
- Zagajewski, B.; Kluczek, M.; Raczko, E.; Njegovec, A.; Dabija, A.; Kycko, M. Comparison of Random Forest, Support Vector Machines, and Neural Networks for Post-Disaster Forest Species Mapping of the Krkonoše/Karkonosze Transboundary Biosphere Reserve. Remote Sens. 2021, 13, 2581. [Google Scholar] [CrossRef]
- Hemmerling, J.; Pflugmacher, D.; Hostert, P. Mapping temperate forest tree species using dense Sentinel-2 time series. Remote Sens. Environ. 2021, 267, 112743. [Google Scholar] [CrossRef]
- Lechner, M.; Dostálová, A.; Hollaus, M.; Atzberger, C.; Immitzer, M. Combination of Sentinel-1 and Sentinel-2 Data for Tree Species Classification in a Central European Biosphere Reserve. Remote Sens. 2022, 14, 2687. [Google Scholar] [CrossRef]
- Delwart, S. ESA SENTINEL-2 User Handbook. 2015. Available online: https://sentinels.copernicus.eu/documents/247904/685211/Sentinel-2_User_Handbook (accessed on 12 February 2024).
- Löw, M.; Koukal, T. Phenology Modelling and Forest Disturbance Mapping with Sentinel-2 Time Series in Austria. Remote Sens. 2020, 12, 4191. [Google Scholar] [CrossRef]
- Bolyn, C.; Lejeune, P.; Michez, A.; Latte, N. Mapping tree species proportions from satellite imagery using spectral–spatial deep learning. Remote Sens. Environ. 2022, 280, 113205. [Google Scholar] [CrossRef]
- Waser, L.T.; Rüetschi, M.; Psomas, A.; Small, D.; Rehush, N. Mapping dominant leaf type based on combined Sentinel-1/-2 data—Challenges for mountainous countries. ISPRS J. Photogramm. Remote Sens. 2021, 180, 209–226. [Google Scholar] [CrossRef]
- Roberts, D.R.; Bahn, V.; Ciuti, S.; Boyce, M.S.; Elith, J.; Guillera-Arroita, G.; Hauenstein, S.; Lahoz-Monfort, J.J.; Schröder, B.; Thuiller, W.; et al. Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure. Ecography 2017, 40, 913–929. [Google Scholar] [CrossRef]
- Ploton, P.; Mortier, F.; Réjou-Méchain, M.; Barbier, N.; Picard, N.; Rossi, V.; Dormann, C.; Cornu, G.; Viennois, G.; Bayol, N.; et al. Spatial validation reveals poor predictive performance of large-scale ecological mapping models. Nat. Commun. 2020, 11, 4540. [Google Scholar] [CrossRef] [PubMed]
- Wadoux, A.M.J.-C.; Heuvelink, G.B.M.; De Bruin, S.; Brus, D.J. Spatial cross-validation is not the right way to evaluate map accuracy. Ecol. Model. 2021, 457, 109692. [Google Scholar] [CrossRef]
- Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Medical Image Computing and Computer-Assisted Intervention—MICCAI; Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2015; Volume 9351, pp. 234–241. [Google Scholar] [CrossRef]
- Ismail Fawaz, H.; Forestier, G.; Weber, J.; Idoumghar, L.; Muller, P.-A. Deep learning for time series classification: A review. Data Min. Knowl. Discov. 2019, 33, 917–963. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 770–778. [Google Scholar] [CrossRef]
- Climate Austria: Average Temperature, Weather by Month & Weather for Austria. Available online: https://en.climate-data.org/europe/austria-4/?utm_content=cmp-true (accessed on 12 February 2024).
- Klimamittel—ZAMG. Available online: https://www.zamg.ac.at/cms/de/klima/klimauebersichten/klimamittel-1971-2000 (accessed on 12 February 2024).
- Zampieri, M.; Scoccimarro, E.; Gualdi, S. Atlantic influence on spring snowfall over the Alps in the past 150 years. Environ. Res. Lett. 2013, 8, 034026. [Google Scholar] [CrossRef]
- Savitzky, A.; Golay, M.J.E. Smoothing and Differentiation of Data by Simplified Least Squares Procedures. Anal. Chem. 1964, 36, 1627–1639. [Google Scholar] [CrossRef]
- Xue, J.; Su, B. Significant Remote Sensing Vegetation Indices: A Review of Developments and Applications. J. Sens. 2017, 2017, 1353691. [Google Scholar] [CrossRef]
- Kaufman, Y.J.; Tanre, D. Atmospherically resistant vegetation index (ARVI) for EOS-MODIS. IEEE Trans. Geosci. Remote Sens. 1992, 30, 261–270. [Google Scholar] [CrossRef]
- Ahamed, T.; Tian, L.; Zhang, Y.; Ting, K.C. A review of remote sensing methods for biomass feedstock production. Biomass Bioenergy 2011, 35, 2455–2469. [Google Scholar] [CrossRef]
- Qiu, B.; Zou, F.; Chen, C.; Tang, Z.; Zhong, J.; Yan, X. Automatic mapping afforestation, cropland reclamation and variations in cropping intensity in central east China during 2001–2016. Ecol. Indic. 2018, 91, 490–502. [Google Scholar] [CrossRef]
- Mandlburger, G.; Wenzel, K.; Spitzer, A.; Haala, N.; Glira, P.; Pfeifer, N. Improved topographic models via concurrent airborne lidar anddense image matching. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2017, IV-2/W4, 259–266. [Google Scholar] [CrossRef]
- Trimble Inpho|Office Software. Available online: https://geospatial.trimble.com/products/software/trimble-inpho (accessed on 25 January 2024).
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic Minority Over-sampling Technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
- Gschwantner, T.; Berger, A.; Büchsenmeister, R.; Hauk, E. Austria. In National Forest Inventories; Vidal, C., Alberdi, I.A., Hernández Mateo, L., Redmond, J.J., Eds.; Springer International Publishing: Cham, Switzerland, 2016; pp. 135–157. [Google Scholar] [CrossRef]
- Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; Adaptive Computation and Machine Learning; The MIT Press: Cambridge, MA, USA, 2016; 775p. [Google Scholar]
- Maas, A.L.; Hannun, A.Y.; Ng, A.Y. Rectifier Nonlinearities Improve Neural Network Acoustic Models. In Proceedings of the 30th International Conference on Machine Learning, Atlanta, GA, USA, 17–19 June 2013. [Google Scholar]
- Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
- Austrian National Forest Inventory—Tree Species Map. 2024. Available online: https://www.waldinventur.at/?x=1486825&y=6059660&z=7.75968&r=0&l=1111#/map/1/mBaumartenkarte/Bundesland/erg9 (accessed on 25 January 2024).
- Figueira, A.; Vaz, B. Survey on Synthetic Data Generation, Evaluation Methods and GANs. Mathematics 2022, 10, 2733. [Google Scholar] [CrossRef]
- S. Clerc & MPC Team. L1C Data Quality Report. 2022. Available online: https://sentinel.esa.int/documents/247904/685211/Sentinel-2_L1C_Data_Quality_Report (accessed on 20 June 2024).
Spectral Indices | Acronym | Formula | References |
---|---|---|---|
Atmospherically Resistant Vegetation Index | ARVI | [49,50] | |
Band 8 Near Infra-Red | BNIR | Developed within this study | |
Dark Area Vegetation Near Infra-Red | DAVNIR | Developed within this study | |
Green Normalized Difference Vegetation Index | GNDVI | [51] | |
Green Share | GREEN_SHARE | Developed within this study | |
Visual Reflectance Absence Index | VRAI | Developed within this study |
Class | Number S2 Pixels | Area (km2) | Number Areas |
---|---|---|---|
Spruce | 62,066 | 6.2 | 391 |
Spruce–fir | 12,590 | 1.3 | 98 |
Spruce–larch | 103,181 | 10.3 | 341 |
Spruce–white pine | 26,933 | 2.7 | 180 |
Spruce–arolla pine | 4129 | 0.4 | 55 |
Spruce–beech | 21,918 | 2.2 | 201 |
Spruce–deciduous | 27,411 | 2.7 | 148 |
Spruce sparse | 7426 | 0.7 | 86 |
Larch | 24,193 | 2.4 | 237 |
Larch–arolla pine | 16,885 | 1.7 | 91 |
Larch–deciduous | 10,269 | 1.0 | 102 |
Larch sparse | 11,811 | 1.2 | 129 |
White pine | 25,643 | 2.6 | 222 |
White pine–oak | 9763 | 1.0 | 27 |
White pine–deciduous | 22,315 | 2.2 | 113 |
White pine sparse | 8972 | 0.9 | 20 |
Black pine | 27,639 | 2.8 | 43 |
Black pine–deciduous | 8537 | 0.9 | 48 |
Mountain pine | 8274 | 0.8 | 205 |
Beech | 16,729 | 1.7 | 115 |
Oak | 31,917 | 3.2 | 104 |
Green alder | 1310 | 0.1 | 44 |
Other deciduous | 40,052 | 4.0 | 138 |
Deciduous sparse | 6895 | 0.7 | 63 |
Low vegetation | 31,160 | 3.1 | 404 |
Sum | 568,018 | 56.8 | 3605 |
Model | Resnet Blocks Filters | MLP Layers (in/out Dimensions) | Parameters |
---|---|---|---|
res_64_128_128 | 64, 128, 128 | 821, 410, 205,102, 51, 25 | 955,324 |
base | 16, 32, 32 | 725, 362, 181, 90, 45, 25 | 344,140 |
res_8_16_16 | 8, 16, 16 | 709, 177, 44, 25 | 143,445 |
res_8_16_16s | 8, 16, 16 | 709, 70, 25 | 60,168 |
no_syn | 16, 32, 32 | 725, 362, 181, 90, 45, 25 | 383,584 |
Model | Number of Parameters | NFI-w-OA [%] ± std | MAF1 [%] ± std |
---|---|---|---|
base | 344,140 | 73.8 ± 5.4 | 55.0 ± 3.5 |
res_64_128_128 | 955,324 | 73.3 ± 5.5 | 55.1 ± 3.4 |
res_8_16_16 | 143,445 | 72.4 ± 5.3 | 54.3 ± 3.3 |
res_8_16_16s | 60,168 | 72.0 ± 5.2 | 54.4 ± 3.4 |
no_syn | 383,584 | 63.8 ± 6.7 | 49.8 ± 4.0 |
Model | OA [%] ± std | MAF1 [%] ± std | OMS ± std | PCPP [%] ± std | DCC [%] ± std | POA [%] ± std | MOA [%] ± std |
---|---|---|---|---|---|---|---|
base | 55.3 ± 1.8 | 42.0 ± 1.3 | 1.63 ± 0.04 | 79.4 ± 1.4 | 1.5 ± 0.3 | 90.7 ± 1.3 | 64.6 ± 4.2 |
res_8_16_16 | 55.1 ± 2.1 | 42.2 ± 1.8 | 1.66 ± 0.04 | 79.4 ± 1.4 | 1.7 ± 0.4 | 89.7 ± 1.6 | 62.7 ± 3.8 |
res_8_16_16s | 54.8 ± 1.5 | 41.5 ± 1.5 | 1.68 ± 0.01 | 79.3 ± 1.2 | 1.7 ± 0.4 | 89.5 ± 0.9 | 62.1 ± 2.7 |
no_syn | 47.7 ± 1.1 | 40.6 ± 1.5 | 1.84 ± 0.03 | 81.4 ± 1.1 | 1.2 ± 0.3 | 89.8 ± 1.4 | 58.4 ± 3.2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Schadauer, T.; Karel, S.; Loew, M.; Knieling, U.; Kopecky, K.; Bauerhansl, C.; Berger, A.; Graeber, S.; Winiwarter, L. Evaluating Tree Species Mapping: Probability Sampling Validation of Pure and Mixed Species Classes Using Convolutional Neural Networks and Sentinel-2 Time Series. Remote Sens. 2024, 16, 2887. https://doi.org/10.3390/rs16162887
Schadauer T, Karel S, Loew M, Knieling U, Kopecky K, Bauerhansl C, Berger A, Graeber S, Winiwarter L. Evaluating Tree Species Mapping: Probability Sampling Validation of Pure and Mixed Species Classes Using Convolutional Neural Networks and Sentinel-2 Time Series. Remote Sensing. 2024; 16(16):2887. https://doi.org/10.3390/rs16162887
Chicago/Turabian StyleSchadauer, Tobias, Susanne Karel, Markus Loew, Ursula Knieling, Kevin Kopecky, Christoph Bauerhansl, Ambros Berger, Stephan Graeber, and Lukas Winiwarter. 2024. "Evaluating Tree Species Mapping: Probability Sampling Validation of Pure and Mixed Species Classes Using Convolutional Neural Networks and Sentinel-2 Time Series" Remote Sensing 16, no. 16: 2887. https://doi.org/10.3390/rs16162887
APA StyleSchadauer, T., Karel, S., Loew, M., Knieling, U., Kopecky, K., Bauerhansl, C., Berger, A., Graeber, S., & Winiwarter, L. (2024). Evaluating Tree Species Mapping: Probability Sampling Validation of Pure and Mixed Species Classes Using Convolutional Neural Networks and Sentinel-2 Time Series. Remote Sensing, 16(16), 2887. https://doi.org/10.3390/rs16162887