Regional Forest Carbon Stock Estimation Based on Multi-Source Data and Machine Learning Algorithms
Abstract
:1. Introduction
2. Materials and Methods
2.1. Introduction to the Research Area
2.2. Research Data and Methods
2.2.1. Research Data
2.2.2. Calculation of the Carbon Stock
- : biomass of tree species i, in tons of dry weight;
- : standing timber volume of tree species i, measured in cubic meters per plant;
- : basic wood density of tree species i, in tons of dry weight per cubic meter;
- : aboveground biomass expansion factor of tree species i, a dimensionless parameter;
- : the belowground-to-aboveground biomass ratio for tree species i;
- : conversion and expansion factor of the biomass of tree species i, in tons of dry weight per cubic meter;
- : number of trees of tree species i, expressed as the number of trees per hectare;
- A: the area of the corresponding sub-compartments, in hectares.
2.3. Extraction of Independent Variable Factors
2.3.1. Optical Remote Sensing Factors
2.3.2. Extraction of Dual-Polarization Texture Features from Radar Backscattering Coefficients
2.3.3. Independent Variable Factors from Ground Data
2.3.4. Data Integration
2.4. Methods
2.4.1. XGBoost
2.4.2. RF
2.4.3. LightGBM
2.4.4. Lasso
2.4.5. RFE
2.5. Performance Indicators
3. Results
3.1. Screening for Independent Variable Factors
3.2. Results Analysis
3.2.1. Evaluation of Data Schemes A and B
3.2.2. Evaluation of Data Schemes C and D
4. Discussion
4.1. Main Findings and Comparison with Previous Research
4.2. Strengths and Limitations of This Study
- (1)
- In the feature selection process, although the models built using Lasso and recursive feature elimination (RFE) methods show no significant difference in accuracy, the features selected by the two methods differ substantially. Lasso regression achieves feature selection and compression through L1 regularization. The selected features generally exhibit a relatively balanced importance, with most coming from remote sensing data. This suggests that, when forest background information is limited, the model constructed using Lasso feature selection can better leverage the information in remote sensing data, providing a more robust prediction performance. In contrast, the RFE method typically results in a more unbalanced distribution of feature importance by recursively evaluating features. During the RFE selection process, features related to forest background information are often assigned higher importance. This suggests that, in situations with abundant background information, especially those containing more forest-related features, the features selected by the RFE method better reflect the data’s underlying structure, enhancing the model’s adaptability. Although the differences in features selected by the two methods did not result in significant performance disparities, their differing feature selections define their respective applicable scenarios. The Lasso method is more suitable for situations dominated by remote sensing data and limited background information, while the RFE method performs better in scenarios with abundant background information, particularly when forest-related features dominate. Therefore, the selection of an appropriate feature selection method should depend on specific application requirements and data characteristics to maximize the model’s predictive ability and scope of application.
- (2)
- The inclusion of four ecological information features —namely, “community type”, “primary tree species”, “slope aspect”, and “humus thickness”—significantly improved model performance. These ecological information features enriched the model’s ecological background information and detailed microenvironmental data, improving its ability to describe soil–vegetation interactions. Specifically, information on “community type” and “primary tree species” enabled the model to assess vegetation adaptability to the environment, “slope aspect” influenced the microclimate factors, and “humus thickness” was correlated with soil fertility and water retention capacity. These variables optimized the model’s ability to capture complex ecological relationships, thereby enhancing prediction accuracy and generalization. Compared to scheme A without ecological information features, scheme B with ecological information features demonstrated significant improvement in model performance, with the R2 increasing by 2.70%–4.00% and MSE, RMSE, MAE, rRMSE, and MAPE decreasing by 7.92%–12.37%, 3.00%–7.07%, 4.76%–7.94%, 3.87%–6.70%, and 8.54%–11.91%, respectively. These error indicators improve with the inclusion of features, indicating that the model performs better with additional input variables.
- (3)
- LightGBM improves the feature splitting efficiency significantly through a histogram-based bucketing algorithm and leaf growth strategy, enabling the effective processing of large-scale, high-dimensional data via parallelization and memory optimization. LightGBM’s native support for ecological information features eliminates the computational burden of one-hot encoding, further improving the resource utilization efficiency. Additionally, LightGBM offers a range of regularization parameters that effectively control model complexity and prevent overfitting, ensuring robust generalization capabilities and maintaining computational speed. These features position LightGBM as a superior choice over RF and XGBoost in various performance metrics. Under scheme B, the LightGBM model attained the following performance metrics: R2 = 0.78, MSE = 0.85 t/ha, RMSE = 0.92, MAE = 0.58 t/ha, rRMSE = 41.37%, and MAPE = 30.72%, underscoring its high practical value and significance in FCS estimation.
- (4)
- Although the improvement in R2 and error indices in this study is modest, these changes may still have a meaningful impact on practical applications, particularly in scenarios with large data volumes or high prediction accuracy requirements. Even a modest performance improvement may enhance the application value and prediction accuracy of the model in practical problems, such as carbon stock estimation, and holds practical significance for resource management and policymaking. Additionally, although the improvement in model performance from adding ecological information features is limited in this study area, considering the ecological and environmental differences across regions, more substantial improvements may be achieved in other study areas in the future. Therefore, future studies can further explore the potential of incorporating ecological features under different regional conditions, which may positively influence the accuracy of carbon stock models. In models with already high accuracy, small performance improvements are often more challenging; however, these improvements, although modest, reflect the subtle progress made by the algorithm in handling complex data. This modest improvement can be seen as a process of further adapting the model to the complexity of real-world data. Although the improvement is limited, it still demonstrates the potential for model optimization.
- (1)
- Although this study demonstrates that the model performs well in Qingyuan County, its applicability remains limited. First, this study is based solely on the forest types and climate conditions of Qingyuan County and does not assess the model’s performance in other regions. As a result, it is unclear whether the model can perform similarly in regions with differing forest types, climate conditions, and ecological environments. In the future, as more regional data become available, it will be essential to further validate and refine the model to ensure its adaptability and generalization ability.
- (2)
- The accuracy of FCS estimation can be significantly improved by utilizing texture features derived from remote sensing images. In this study, texture features were only extracted from radar remote sensing images, and the exploration of different window sizes or asynchronous lengths was not undertaken. Exploring different window sizes, asynchronous lengths, and multiple band combinations to extract texture features from both optical and radar remote sensing images may offer valuable insights into how texture features enhance carbon stock estimation accuracy [64].
- (3)
- The remote sensing images analyzed in this study were largely captured in November and December, which may not coincide with the period of active tree growth. In autumn and winter, certain tree species may enter a dormant state, displaying yellowing or leaf fall. Consequently, the vegetation information captured, especially by optical imagery, might not accurately reflect the actual state of the trees, leading to potentially diminishing the model’s estimation accuracy. The acquisition of remote sensing images that align with the tree growth period in the future could lead to enhanced estimation accuracy [65].
- (4)
- This study relies on remote sensing images from just one temporal phase. Access to multi-temporal remote sensing data would enhance the model’s temporal and spatial resolution, as well as its estimation accuracy, by capturing seasonal and interannual dynamics of vegetation and its response to disturbances such as fire, pests, diseases, and logging. Such data would not only facilitate a detailed characterization of the spatiotemporal variability of carbon stock but also reveal long-term trends in carbon stock and its sensitivity to climate and land use changes, thereby providing a robust scientific foundation for carbon cycle research and climate policy formulation [66].
- (5)
- The accuracy in classification was a key factor in this study when forest composition, primary tree species, humus thickness, and slope aspect were applied as classification features for FCS estimation. The classification method chosen has a direct impact on the estimation of forest types and tree species distribution, which, in turn, affects the calculation of carbon stocks. If the classification method is inaccurate, it may lead to incorrect classification of forest types, resulting in the overestimation or underestimation of carbon stocks. For example, if the tree species or forest composition in certain areas are incorrectly classified, it may affect the regional distribution of carbon stocks, thereby influencing the overall carbon stock estimation.
- (6)
- In this study, we applied standard atmospheric correction methods tailored to each satellite data: Sen2Cor for Sentinel-2 (S2) and LaSRC for Landsat 8 (L8). However, the use of different correction methods may lead to differences in surface reflectivity, thus affecting the accuracy of carbon stock estimates. We plan to explore a unified atmospheric correction method or conduct more comparative experiments in future studies to improve the consistency and accuracy of the results.
5. Conclusions
- (1)
- The integration of ecological information features, such as forest composition, primary tree species, humus thickness, and slope direction, into the model significantly enhances the estimation accuracy and notably improves the overall model performance.
- (2)
- By retaining key features, the RFE algorithm efficiently reduces the number of independent variables, which accelerates the model training and boosts its generalization ability.
- (3)
- Among the three models—XGBoost, RF, and LightGBM—the LightGBM algorithm exhibits superior performance in estimating FCS.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A
Band Number | Name | Wavelength Range (µm) | Spatial Resolution (m) | Main Applications |
---|---|---|---|---|
B1 | Aerosol | 0.433–0.453 | 30 | Atmospheric correction, shallow water and coastal monitoring |
B2 | Blue | 0.450–0.515 | 30 | Water monitoring, vegetation health, soil/water comparisons |
B3 | Green | 0.525–0.600 | 30 | Vegetation health analysis, agriculture and forest monitoring |
B4 | Red | 0.630–0.680 | 30 | Vegetation analysis (NDVI calculation), land cover classification |
B5 | NIR | 0.845–0.885 | 30 | Vegetation analysis, land cover monitoring, water body boundary identification |
B6 | SWIR 1 | 1.560–1.660 | 30 | Soil and vegetation moisture content, farmland irrigation monitoring |
B7 | SWIR 2 | 2.100–2.300 | 30 | Geological feature analysis, mineral exploration, vegetation pressure |
B8 | Panchromatic | 0.500–0.680 | 15 | High-resolution image fusion and linear feature extraction |
B9 | Cirrus | 1.360–1.390 | 30 | Thin cloud detection |
B10 | TIRS 1 | 10.60–11.19 | 100 | Surface temperature monitoring, thermal characteristics analysis |
B11 | TIRS 2 | 11.50–12.51 | 100 | Surface temperature monitoring, thermal activity analysis |
References
- International Energy Agency. Net Zero by 2050. 2021. Available online: https://www.iea.org/reports/net-zero-by-2050 (accessed on 5 October 2024).
- Sepehriar, A.; Eslamipoor, R. An Economical Single-Vendor Single-Buyer Framework for Carbon Emission Policies. J. Bus. Econ. 2024, 94, 927–945. [Google Scholar] [CrossRef]
- Eslamipoor, R.; Sepehriar, A. Enhancing Supply Chain Relationships in the Circular Economy: Strategies for a Green Centralized Supply Chain with Deteriorating Products. J. Environ. Manag. 2024, 367, 121738. [Google Scholar] [CrossRef]
- Pregitzer, K.S.; Euskirchen, E.S. Carbon Cycling and Storage in World Forests: Biome Patterns Related to Forest Age. Glob. Change Biol. 2004, 10, 2052–2077. [Google Scholar] [CrossRef]
- Malhi, Y.; Meir, P.; Brown, S. Forests, Carbon and Global Climate. Philos. Trans. R. Soc. Lond. Ser. Math. Phys. Eng. Sci. 2002, 360, 1567–1591. [Google Scholar] [CrossRef]
- Bustamante, M.M.C.; Roitman, I.; Aide, T.M.; Alencar, A.; Anderson, L.O.; Aragão, L.; Asner, G.P.; Barlow, J.; Berenguer, E.; Chambers, J.; et al. Toward an Integrated Monitoring Framework to Assess the Effects of Tropical Forest Degradation and Recovery on Carbon Stocks and Biodiversity. Glob. Change Biol. 2016, 22, 92–109. [Google Scholar] [CrossRef]
- Pan, Y.; Birdsey, R.A.; Fang, J.; Houghton, R.; Kauppi, P.E.; Kurz, W.A.; Phillips, O.L.; Shvidenko, A.; Lewis, S.L.; Canadell, J.G.; et al. A Large and Persistent Carbon Sink in the World’s Forests. Science 2011, 333, 988–993. [Google Scholar] [CrossRef] [PubMed]
- Chave, J.; Andalo, C.; Brown, S.; Cairns, M.A.; Chambers, J.Q.; Eamus, D.; Fölster, H.; Fromard, F.; Higuchi, N.; Kira, T.; et al. Tree Allometry and Improved Estimation of Carbon Stocks and Balance in Tropical Forests. Oecologia 2005, 145, 87–99. [Google Scholar] [CrossRef]
- Santoro, M.; Cartus, O.; Carvalhais, N.; Rozendaal, D.M.A.; Avitabile, V.; Araza, A.; de Bruin, S.; Herold, M.; Quegan, S.; Rodríguez-Veiga, P.; et al. The Global Forest Above-Ground Biomass Pool for 2010 Estimated from High-Resolution Satellite Observations. Earth Syst. Sci. Data 2021, 13, 3927–3950. [Google Scholar] [CrossRef]
- Harris, N.L.; Gibbs, D.A.; Baccini, A.; Birdsey, R.A.; de Bruin, S.; Farina, M.; Fatoyinbo, L.; Hansen, M.C.; Herold, M.; Houghton, R.A.; et al. Global Maps of Twenty-First Century Forest Carbon Fluxes. Nat. Clim. Change 2021, 11, 234–240. [Google Scholar] [CrossRef]
- Dube, T.; Mutanga, O. Investigating the Robustness of the New Landsat-8 Operational Land Imager Derived Texture Metrics in Estimating Plantation Forest Aboveground Biomass in Resource Constrained Areas. ISPRS J. Photogramm. Remote Sens. 2015, 108, 12–32. [Google Scholar] [CrossRef]
- Thenkabail, P.S.; Smith, R.B.; De Pauw, E. Hyperspectral Vegetation Indices and Their Relationships with Agricultural Crop Characteristics. Remote Sens. Environ. 2000, 71, 158–182. [Google Scholar] [CrossRef]
- Vorovencii, I. Assessing Various Scenarios of Multitemporal Sentinel-2 Imagery, Topographic Data, Texture Features, and Machine Learning Algorithms for Tree Species Identification. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 15373–15392. [Google Scholar] [CrossRef]
- Whitcraft, A.K.; Vermote, E.F.; Becker-Reshef, I.; Justice, C.O. Cloud Cover throughout the Agricultural Growing Season: Impacts on Passive Optical Earth Observations. Remote Sens. Environ. 2015, 156, 438–447. [Google Scholar] [CrossRef]
- Hirschmugl, M.; Deutscher, J.; Sobe, C.; Bouvet, A.; Mermoz, S.; Schardt, M. Use of SAR and Optical Time Series for Tropical Forest Disturbance Mapping. Remote Sens. 2020, 12, 727. [Google Scholar] [CrossRef]
- Vatandaşlar, C.; Abdikan, S. Carbon Stock Estimation by Dual-Polarized Synthetic Aperture Radar (SAR) and Forest Inventory Data in a Mediterranean Forest Landscape. J. For. Res. 2022, 33, 827–838. [Google Scholar] [CrossRef]
- Zhang, F.; Tian, X.; Zhang, H.; Jiang, M. Estimation of Aboveground Carbon Density of Forests Using Deep Learning and Multisource Remote Sensing. Remote Sens. 2022, 14, 3022. [Google Scholar] [CrossRef]
- Zhang, Y.; He, B.; Chen, R.; Zhang, H.; Fan, C.; Yin, J.; Li, Y. The Potential of Optical and SAR Time-Series Data for the Improvement of Aboveground Biomass Carbon Estimation in Southwestern China’s Evergreen Coniferous Forests. GIScience Remote Sens. 2024, 61, 2345438. [Google Scholar] [CrossRef]
- David, R.M.; Rosser, N.J.; Donoghue, D.N.M. Improving above Ground Biomass Estimates of Southern Africa Dryland Forests by Combining Sentinel-1 SAR and Sentinel-2 Multispectral Imagery. Remote Sens. Environ. 2022, 282, 113232. [Google Scholar] [CrossRef]
- Luo, Z.; Viscarra-Rossel, R.A.; Qian, T. Similar Importance of Edaphic and Climatic Factors for Controlling Soil Organic Carbon Stocks of the World. Biogeosciences 2021, 18, 2063–2073. [Google Scholar] [CrossRef]
- Hofhansl, F.; Chacón-Madrigal, E.; Fuchslueger, L.; Jenking, D.; Morera-Beita, A.; Plutzar, C.; Silla, F.; Andersen, K.M.; Buchs, D.M.; Dullinger, S.; et al. Climatic and Edaphic Controls over Tropical Forest Diversity and Vegetation Carbon Storage. Sci. Rep. 2020, 10, 5066. [Google Scholar] [CrossRef]
- Zhang, S.; Fang, Y.; Luo, Y.; Li, Y.; Ge, T.; Wang, Y.; Wang, H.; Yu, B.; Song, X.; Chen, J.; et al. Linking Soil Carbon Availability, Microbial Community Composition and Enzyme Activities to Organic Carbon Mineralization of a Bamboo Forest Soil Amended with Pyrogenic and Fresh Organic Matter. Sci. Total Environ. 2021, 801, 149717. [Google Scholar] [CrossRef] [PubMed]
- Qin, Y.; Feng, Q.; Holden, N.M.; Cao, J. Variation in Soil Organic Carbon by Slope Aspect in the Middle of the Qilian Mountains in the Upper Heihe River Basin, China. CATENA 2016, 147, 308–314. [Google Scholar] [CrossRef]
- Zhang, X.; Adamowski, J.F.; Liu, C.; Zhou, J.; Zhu, G.; Dong, X.; Cao, J.; Feng, Q. Which Slope Aspect and Gradient Provides the Best Afforestation-Driven Soil Carbon Sequestration on the China’s Loess Plateau? Ecol. Eng. 2020, 147, 105782. [Google Scholar] [CrossRef]
- Poorter, L.; van der Sande, M.T.; Thompson, J.; Arets, E.J.M.M.; Alarcón, A.; Álvarez-Sánchez, J.; Ascarrunz, N.; Balvanera, P.; Barajas-Guzmán, G.; Boit, A.; et al. Diversity Enhances Carbon Storage in Tropical Forests. Glob. Ecol. Biogeogr. 2015, 24, 1314–1328. [Google Scholar] [CrossRef]
- Vesterdal, L.; Clarke, N.; Sigurdsson, B.D.; Gundersen, P. Do Tree Species Influence Soil Carbon Stocks in Temperate and Boreal Forests? For. Ecol. Manag. 2013, 309, 4–18. [Google Scholar] [CrossRef]
- Ma, S.-H.; Eziz, A.; Tian, D.; Yan, Z.-B.; Cai, Q.; Jiang, M.-W.; Ji, C.-J.; Fang, J.-Y. Size- and Age-Dependent Increases in Tree Stem Carbon Concentration: Implications for Forest Carbon Stock Estimations. J. Plant Ecol. 2020, 13, 233–240. [Google Scholar] [CrossRef]
- Augusto, L.; Boča, A. Tree Functional Traits, Forest Biomass, and Tree Species Diversity Interact with Site Properties to Drive Forest Soil Carbon. Nat. Commun. 2022, 13, 1097. [Google Scholar] [CrossRef] [PubMed]
- Pham, T.D.; Yokoya, N.; Nguyen, T.T.T.; Le, N.N.; Ha, N.T.; Xia, J.; Takeuchi, W.; Pham, T.D. Improvement of Mangrove Soil Carbon Stocks Estimation in North Vietnam Using Sentinel-2 Data and Machine Learning Approach. GIScience Remote Sens. 2021, 58, 68–87. [Google Scholar] [CrossRef]
- Singh, C.; Karan, S.K.; Sardar, P.; Samadder, S.R. Remote Sensing-Based Biomass Estimation of Dry Deciduous Tropical Forest Using Machine Learning and Ensemble Analysis. J. Environ. Manag. 2022, 308, 114639. [Google Scholar] [CrossRef] [PubMed]
- Chen, Q.; Zhou, W.; Shi, W. Estimation of Soil Organic Carbon Density on the Qinghai–Tibet Plateau Using a Machine Learning Model Driven by Multisource Remote Sensing. Remote Sens. 2024, 16, 3006. [Google Scholar] [CrossRef]
- Bui, Q.-T.; Pham, Q.-T.; Pham, V.-M.; Tran, V.-T.; Nguyen, D.-H.; Nguyen, Q.-H.; Nguyen, H.-D.; Do, N.T.; Vu, V.-M. Hybrid Machine Learning Models for Aboveground Biomass Estimations. Ecol. Inform. 2024, 79, 102421. [Google Scholar] [CrossRef]
- Huang, L.; Huang, Z.; Zhou, W.; Wu, S.; Li, X.; Mao, F.; Song, M.; Zhao, Y.; Lv, L.; Yu, J.; et al. Landsat-Based Spatiotemporal Estimation of Subtropical Forest Aboveground Carbon Storage Using Machine Learning Algorithms with Hyperparameter Tuning. Front. Plant Sci. 2024, 15, 1421567. [Google Scholar] [CrossRef] [PubMed]
- Zhou, R.; Wu, D.; Fang, L.; Xu, A.; Lou, X. A Levenberg–Marquardt Backpropagation Neural Network for Predicting Forest Growing Stock Based on the Least-Squares Equation Fitting Parameters. Forests 2018, 9, 757. [Google Scholar] [CrossRef]
- Basler, D.; Körner, C. Photoperiod and Temperature Responses of Bud Swelling and Bud Burst in Four Temperate Forest Tree Species. Tree Physiol. 2014, 34, 377–388. [Google Scholar] [CrossRef]
- Bai, C.; Zhao, W.; Klisz, M.; Rossi, S.; Shen, W.; Guo, X. Growth Rate and Not Growing Season Explains the Increased Productivity of Masson Pine in Mixed Stands. Plants 2025, 14, 313. [Google Scholar] [CrossRef]
- Malhi, R.K.M.; Anand, A.; Srivastava, P.K.; Chaudhary, S.K.; Pandey, M.K.; Behera, M.D.; Kumar, A.; Singh, P.; Sandhya Kiran, G. Synergistic Evaluation of Sentinel 1 and 2 for Biomass Estimation in a Tropical Forest of India. Adv. Space Res. 2022, 69, 1752–1767. [Google Scholar] [CrossRef]
- State Forestry Administration of China. Guidelines on Carbon Accounting and Monitoring for Afforestation Project (LY/T 2253-2014); State Forestry Administration of China: Beijing, China, 2014.
- Mouret, F.; Albughdadi, M.; Duthoit, S.; Kouamé, D.; Rieu, G.; Tourneret, J.-Y. Reconstruction of Sentinel-2 Derived Time Series Using Robust Gaussian Mixture Models—Application to the Detection of Anomalous Crop Development. Comput. Electron. Agric. 2022, 198, 106983. [Google Scholar] [CrossRef]
- Zhang, L.; Shao, Z.; Liu, J.; Cheng, Q. Deep Learning Based Retrieval of Forest Aboveground Biomass from Combined LiDAR and Landsat 8 Data. Remote Sens. 2019, 11, 1459. [Google Scholar] [CrossRef]
- Fang, G.; Xu, H.; Yang, S.-I.; Lou, X.; Fang, L. Synergistic Use of Sentinel-1, Sentinel-2, and Landsat 8 in Predicting Forest Variables. Ecol. Indic. 2023, 151, 110296. [Google Scholar] [CrossRef]
- Zhou, R.; Wu, D.; Zhou, R.; Fang, L.; Zheng, X.; Lou, X. Estimation of DBH at Forest Stand Level Based on Multi-Parameters and Generalized Regression Neural Network. Forests 2019, 10, 778. [Google Scholar] [CrossRef]
- Huete, A.R. A Soil-Adjusted Vegetation Index (SAVI). Remote Sens. Environ. 1988, 25, 295–309. [Google Scholar] [CrossRef]
- Jordan, C.F. Derivation of Leaf-Area Index from Quality of Light on the Forest Floor. Ecology 1969, 50, 663–666. [Google Scholar] [CrossRef]
- Goel, N.S.; Qin, W. Influences of Canopy Architecture on Relationships between Various Vegetation Indices and LAI and Fpar: A Computer Simulation. Remote Sens. Rev. 1994, 10, 309–347. [Google Scholar] [CrossRef]
- Wang, Q.; Moreno-Martínez, Á.; Muñoz-Marí, J.; Campos-Taberner, M.; Camps-Valls, G. Estimation of Vegetation Traits with Kernel NDVI. ISPRS J. Photogramm. Remote Sens. 2023, 195, 408–417. [Google Scholar] [CrossRef]
- Sims, D.A.; Gamon, J.A. Relationships between Leaf Pigment Content and Spectral Reflectance across a Wide Range of Species, Leaf Structures and Developmental Stages. Remote Sens. Environ. 2002, 81, 337–354. [Google Scholar] [CrossRef]
- Hardisky, M.A.; Daiber, F.C.; Roman, C.T.; Klemas, V. Remote Sensing of Biomass and Annual Net Aerial Primary Productivity of a Salt Marsh. Remote Sens. Environ. 1984, 16, 91–106. [Google Scholar] [CrossRef]
- Cao, R.; Feng, Y.; Liu, X.; Shen, M.; Zhou, J. Uncertainty of Vegetation Green-Up Date Estimated from Vegetation Indices Due to Snowmelt at Northern Middle and High Latitudes. Remote Sens. 2020, 12, 190. [Google Scholar] [CrossRef]
- Xiao, Y.; Zhang, J.; Cui, T.; Gong, J.; Liu, R.; Chen, X.; Liang, X. Remote Sensing Estimation of the Biomass of Floating Ulva Prolifera and Analysis of the Main Factors Driving the Interannual Variability of the Biomass in the Yellow Sea. Mar. Pollut. Bull. 2019, 140, 330–340. [Google Scholar] [CrossRef]
- Adamu, B.; Ibrahim, S.; Rasul, A.; Whanda, S.J.; Headboy, P.; Muhammed, I.; Maiha, I.A. Evaluating the Accuracy of Spectral Indices from Sentinel-2 Data for Estimating Forest Biomass in Urban Areas of the Tropical Savanna. Remote Sens. Appl. Soc. Environ. 2021, 22, 100484. [Google Scholar] [CrossRef]
- Cao, L. Estimation of Forest Stock Volume in Yanqing District Based on Sentinel-2 Images; Beijing Forestry University: Beijing, China, 2019. [Google Scholar]
- Gitelson, A.A.; Merzlyak, M.N. Remote Estimation of Chlorophyll Content in Higher Plant Leaves. Int. J. Remote Sens. 1997, 18, 2691–2697. [Google Scholar] [CrossRef]
- He, Y.; Yin, H.; Chen, Y.; Xiang, R.; Zhang, Z.; Chen, H. Soil Salinity Estimation Based on Sentinel-1/2 Texture Features and Machine Learning. IEEE Sens. J. 2024, 24, 15302–15310. [Google Scholar] [CrossRef]
- Georgopoulos, N.; Gitas, I.Z.; Stefanidou, A.; Korhonen, L.; Stavrakoudis, D. Estimation of Individual Tree Stem Biomass in an Uneven-Aged Structured Coniferous Forest Using Multispectral LiDAR Data. Remote Sens. 2021, 13, 4827. [Google Scholar] [CrossRef]
- Dar, A.A.; Parthasarathy, N. Patterns and Drivers of Tree Carbon Stocks in Kashmir Himalayan Forests: Implications for Climate Change Mitigation. Ecol. Process. 2022, 11, 58. [Google Scholar] [CrossRef]
- Dar, A.A.; Parthasarathy, N. Ecological Drivers of Soil Carbon in Kashmir Himalayan Forests: Application of Machine Learning Combined with Structural Equation Modelling. J. Environ. Manag. 2023, 330, 117147. [Google Scholar] [CrossRef] [PubMed]
- Huang, J.; Wu, D.; Fang, L. Identification of sub-compartment forest type based on multi-source data and three-tier models. J. Nanjing For. Univ. (Nat. Sci. Ed.) 2022, 46, 69. [Google Scholar] [CrossRef]
- Illarionova, S.; Tregubova, P.; Shukhratov, I.; Shadrin, D.; Efimov, A.; Burnaev, E. Advancing Forest Carbon Stocks’ Mapping Using a Hierarchical Approach with Machine Learning and Satellite Imagery. Sci. Rep. 2024, 14, 21032. [Google Scholar] [CrossRef]
- Cook-Patton, S.C.; Leavitt, S.M.; Gibbs, D.; Harris, N.L.; Lister, K.; Anderson-Teixeira, K.J.; Briggs, R.D.; Chazdon, R.L.; Crowther, T.W.; Ellis, P.W.; et al. Mapping Carbon Accumulation Potential from Global Natural Forest Regrowth. Nature 2020, 585, 545–550. [Google Scholar] [CrossRef]
- Wei, G.; Li, M.; Quan, Y.; Wang, B.; Liu, J.; Ming, L. Geographically Weighted Random Forest Approach to Predict Forest Carbon Storage by Remote Sensing in Heilongjiang. J. Cent. South Univ. Forestry Technol. 2024, 44, 64–76. [Google Scholar] [CrossRef]
- He, C.-R.; Pang, L.-F.; Tan, B.-X.; Huang, Y.-F.; Sun, X.-X. Remote Sensing Based Monitoring of Forest Aboveground Carbon Storage in Beijing. J. Northwest Forestry Univ. 2024, 39, 162–170. [Google Scholar] [CrossRef]
- Zou, W.; Chen, C.; Huang, L.; Song, M.; Li, X.; Du, H. Geographic Weighted Regression Model Combined with Remote Sensing for Estimating Forest Aboveground Carbon Storage of Songyang County. Forest Res. Manag. 2023, 28, 132–140. [Google Scholar]
- Duan, M.; Zhang, X. Using Remote Sensing to Identify Soil Types Based on Multiscale Image Texture Features. Comput. Electron. Agric. 2021, 187, 106272. [Google Scholar] [CrossRef]
- Shi, S.; Zhao, P.; Zhou, M.; Yang, X. Biomass and carbon storage of the secondary forest (Populus davidiana) at different stand growing stages in southern Daxinganling temperate zone. Ecol. Environ. 2012, 21, 428–433. [Google Scholar]
- Dahhani, S.; Raji, M.; Bouslihim, Y. Synergistic Use of Multi-Temporal Radar and Optical Remote Sensing for Soil Organic Carbon Prediction. Remote Sens. 2024, 16, 1871. [Google Scholar] [CrossRef]
Types | Satellite | Date | Product Level |
---|---|---|---|
Optical remote sensing | Sentinel-2A | 25 December 2017, 1 scene | L2A |
Landsat 8 | 3 November 2017, 1 scene | L1TP | |
Radar remote sensing | Sentinel-1A | 10 December 2017, 1 scene | IW GRD |
Species | Model | Reference |
---|---|---|
Pinus massoniana Lamb | C = V × 0.380 × 1.472 × 0.508 × 1.187 × N × A | [38] |
Secondary Pinus species | C = V × 0.424 × 1.631 × 0.496 × 1.206 × N × A | |
Abies | C = V × 0.307 × 1.634 × 0.508 × 1.246 × N × A | |
Quercus | C = V × 0.676 × 1.355 × 0.499 × 1.292 × N × A | |
Betula | C = V × 0.541 × 1.424 × 0.502 × 1.248 × N × A | |
Liquidambar | C = V × 0.598 × 1.765 × 0.480 × 1.398 × N × A | |
Hard broadleaf | C = V × 0.598 × 1.674 × 0.496 × 1.261 × N × A | |
Soft broadleaf | C = V × 0.443 × 1.586 × 0.486 × 1.289 × N × A | |
Mixed coniferous-broadleaf forest | C = V × 1.514 × 0.482 × 0.5 × 1.289 × N × A |
No. | Variable Name | Formula | Reference |
---|---|---|---|
1 | Soil Adjusted Vegetation Index (SAVI) | SAVI = ((B8 − B4)/(B8 + B4 + L)) × 1.5 | [43] |
2 | Ratio Vegetation Index (RVI) | RVI = B8/B4 | [44] |
3 | Nonlinear Index (NLI) | NLI = ((B8 × B8) − B4)/((B8 × B8) + B4) | [45] |
4 | Normalized Difference Vegetation Index (NDVI) | NDVI = (B8 − B4)/(B8 + B4) | [46] |
5 | Modified Normalized Difference Vegetation Index (mNDVI) | mNDVI = (B8 − B4)/(B8 + B4−2 × B2) | [47] |
6 | Normalized Difference Infrared Index (NDII) | NDII = (B8 − B11)/(B8 + B11) | [48] |
7 | Normalized Difference Green Index (NDGI) | NDGI = (B3 − B4)/(B3 + B4) | [49] |
8 | Enhanced Vegetation Index (EVI) | EVI = 2.5 × (B8 − B4)/(B8 + 6 × B4−7.5 × B2 + 1) | [50] |
9 | Difference Vegetation Index (DVI) | DVI = B8 − B4 | [51] |
10 | RedEdge Ratio Vegetation Index (RVIre) | RVIre = B8/B5 | [52] |
11 | RedEdge1 Normalized Difference Vegetation Index (NDVIre1) | NDVIre1 = (B8 − B5)/(B8 + B5) | [53] |
12 | RedEdge2 Normalized Difference Vegetation Index (NDVIre2) | NDVIre2 = (B8 − B6)/(B8 + B6) | [53] |
13 | Modified RedEdge Normalized Difference Vegetation Index (mNDVIre) | mNDVIre = (B8 − B5)/(B8 + B5-2 × B2) | [47] |
14 | RedEdge Nonlinear index (NLIre) | NLIre = ((B8 × B8) − B5)/((B8 × B8) + B5) | [52] |
No. | Factor Name | Source of Data | Types of Factors |
---|---|---|---|
1–19 | Band reflectance | Optical Remote Sensing | Independent Variable Factors |
20–33 | Refer to Table 3 | Vegetation indexes | |
34–35 36–37 | Mean | Radar Remote Sensing | |
Variance | |||
38–39 | Homogeneity | ||
40–41 | Contrast | ||
42–43 | Dissimilarity | ||
44–45 | Entropy | ||
46–47 | Angular second moment | ||
48–49 | Correlation | ||
50 | ELEVATION | Digital Elevation Model | |
51 | SLOPE | ||
52 | ASPECT | ||
53 | Land Type | Inventory Data used in Forest Management and Planning | |
54 | Landforms | ||
55 | Soil Type | ||
56 | Soil Thickness | ||
57 | Slope Position | ||
58 | Vegetation Coverage | ||
59 | Tree Age | ||
60 | Canopy Density | ||
61 | Forest Composition | Inventory Data used in Forest Management and Planning | Ecological information features |
62 | Primary Tree Species | ||
63 | Humus Thickness | ||
64 | Aspect Direction |
Data Scheme | Feature Selection Method | Ecological Information Features | Total Number of Initial Features |
---|---|---|---|
A | RFE | Did not add | 60 |
B | RFE | Added | 64 |
C | Lasso | Did not add | 60 |
D | Lasso | Added | 64 |
Data Scheme | A | B | C | D | |
---|---|---|---|---|---|
XGBoost | MSE | 1.01 | 0.90 | 1.04 | 0.95 |
RMSE (t/ha) | 1.01 | 0.95 | 1.02 | 0.97 | |
MAE (t/ha) | 0.65 | 0.61 | 0.66 | 0.63 | |
R2 | 0.74 | 0.77 | 0.73 | 0.75 | |
rRMSE (%) | 45.29 | 42.66 | 45.90 | 43.79 | |
MAPE (%) | 39.29 | 34.61 | 40.01 | 38.11 | |
RF | MSE | 1.01 | 0.93 | 1.02 | 0.94 |
RMSE (t/ha) | 1.00 | 0.97 | 1.01 | 0.97 | |
MAE (t/ha) | 0.63 | 0.60 | 0.64 | 0.60 | |
R2 | 0.74 | 0.76 | 0.73 | 0.76 | |
rRMSE (%) | 45.17 | 43.42 | 45.45 | 43.54 | |
MAPE (%) | 33.73 | 30.85 | 34.11 | 30.77 | |
LightGBM | MSE | 0.97 | 0.85 | 0.99 | 0.88 |
RMSE (t/ha) | 0.99 | 0.92 | 1.00 | 0.94 | |
MAE (t/ha) | 0.63 | 0.58 | 0.64 | 0.59 | |
R2 | 0.75 | 0.78 | 0.74 | 0.77 | |
rRMSE (%) | 44.34 | 41.37 | 44.79 | 42.11 | |
MAPE (%) | 34.77 | 30.72 | 34.74 | 31.19 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zheng, M.; Wen, Q.; Xu, F.; Wu, D. Regional Forest Carbon Stock Estimation Based on Multi-Source Data and Machine Learning Algorithms. Forests 2025, 16, 420. https://doi.org/10.3390/f16030420
Zheng M, Wen Q, Xu F, Wu D. Regional Forest Carbon Stock Estimation Based on Multi-Source Data and Machine Learning Algorithms. Forests. 2025; 16(3):420. https://doi.org/10.3390/f16030420
Chicago/Turabian StyleZheng, Mingwei, Qingqing Wen, Fengya Xu, and Dasheng Wu. 2025. "Regional Forest Carbon Stock Estimation Based on Multi-Source Data and Machine Learning Algorithms" Forests 16, no. 3: 420. https://doi.org/10.3390/f16030420
APA StyleZheng, M., Wen, Q., Xu, F., & Wu, D. (2025). Regional Forest Carbon Stock Estimation Based on Multi-Source Data and Machine Learning Algorithms. Forests, 16(3), 420. https://doi.org/10.3390/f16030420