Next Article in Journal
Thermal Infrared UAV Applications for Spatially Explicit Wildlife Occupancy Modeling
Previous Article in Journal
Strategic Planning for Urban Sustainability
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimating Biomass in Eucalyptus globulus and Pinus pinaster Forests Using UAV-Based LiDAR in Central and Northern Portugal

by
Leilson Ferreira
1,2,
André Salgado de Andrade Sandim
1,3,*,
Dalila Araújo Lopes
1,3,
Joaquim João Sousa
4,5,
Domingos Manuel Mendes Lopes
1,3,
Maria Emília Calvão Moreira Silva
1,3,6 and
Luís Pádua
1,4,6
1
Centre for the Research and Technology of Agro-Environmental and Biological Sciences, University of Trás-os-Montes e Alto Douro, 5000-801 Vila Real, Portugal
2
Department of Agronomy, School of Agrarian and Veterinary Sciences, University of Trás-os-Montes e Alto Douro, 5000-801 Vila Real, Portugal
3
Department of Forest Sciences and Landscape Architecture, University of Trás-os-Montes e Alto Douro, 5000-801 Vila Real, Portugal
4
Engineering Department, School of Science and Technology, University of Trás-os-Montes e Alto Douro, 5000-801 Vila Real, Portugal
5
Centre for Robotics in Industry and Intelligent Systems (CRIIS), Institute for Systems and Computer Engineering, Technology and Science (INESC-TEC), 4200-465 Porto, Portugal
6
Institute for Innovation, Capacity Building and Sustainability of Agri-Food Production, University of Trás-os-Montes e Alto Douro, 5000-801 Vila Real, Portugal
*
Author to whom correspondence should be addressed.
Land 2025, 14(7), 1460; https://doi.org/10.3390/land14071460 (registering DOI)
Submission received: 3 June 2025 / Revised: 10 July 2025 / Accepted: 12 July 2025 / Published: 14 July 2025

Abstract

Accurate biomass estimation is important for forest management and climate change mitigation. This study evaluates the potential of using LiDAR (Light Detection and Ranging) data, acquired through Unmanned Aerial Vehicles (UAVs), for estimating above-ground and total biomass in Eucalyptus globulus and Pinus pinaster stands in central and northern Portugal. The acquired LiDAR point clouds were processed to extract structural metrics such as canopy height, crown area, canopy density, and volume. A multistep variable selection procedure was applied to reduce collinearity and select the most informative predictors. Multiple linear regression (MLR) models were developed and validated using field inventory data. Random Forest (RF) models were also tested for E. globulus, enabling a comparative evaluation between parametric and machine learning regression models. The results show that the 25th height percentile, canopy cover density at two meters, and height variance demonstrated an accurate biomass estimation for E. globulus, with coefficients of determination (R2) varying between 0.86 for MLR and 0.90 for RF. Although RF demonstrated a similar predictive performance, MLR presented advantages in terms of interpretability and computational efficiency. For P. pinaster, only MLR was applied due to the limited number of field data, yet R2 exceeded 0.80. Although absolute errors were higher for Pinus pinaster due to greater biomass variability, relative performance remained consistent across species. The results demonstrate the feasibility and efficiency of UAV LiDAR point cloud data for stand-level biomass estimation, providing simple and effective models for biomass estimation in these two species.

1. Introduction

Forest inventory is a fundamental approach for acquiring data in forested areas. By collecting dendrometric variables from sampled trees, parameters such as wood volume, woody biomass, carbon content, tree dimensions, and spatial distribution can be determined [1]. These data support forest management decisions aligned with predefined objectives. Beyond volumetric assessments, forest inventories can provide information relevant to forest management, biodiversity, carbon cycling, and interactions with climate and soil conditions [2,3,4]. However, traditional forest inventory methods require extensive fieldwork, involving the measurement of individual trees across multiple plots. This process is time-consuming, requires trained professionals, incurs high costs, and often demands access to remote and complex terrain [5].
Over time, inventory techniques have evolved from visual estimations in the 18th century to modern methods incorporating remote sensing technologies, including satellites, manned aircraft, and unmanned aerial vehicles (UAVs) [6]. These platforms support different types of sensors, depending on their payload capacity [7,8]. In recent years, improvements in spatial resolution have increased the relevance of airborne platforms, particularly due to the development of high-resolution light detection and ranging (LiDAR) systems capable of capturing dense point clouds at low flight speeds [9,10]. LiDAR sensors have proven their effectiveness for capturing data on forest structure and composition [11]. They operate on laser emitters and receivers, inertial navigation systems (INSs), and the Global Navigation Satellite System (GNSS) to ensure accurate capture of multiple return signals, even under dense canopies [12]. The resulting three-dimensional point clouds represent vegetation structure, enabling measurements such as tree height and canopy density [13,14]. Initially deployed to manned aircraft [15], LiDAR sensors are now increasingly used on UAVs, improving the flexibility and efficiency of spatial data acquisition [16]. Despite the high initial acquisition costs and the need for technical expertise [17], LiDAR-based data can be more cost-effective than traditional sampling methods, offering significant financial advantages [18,19]. Additionally, optimizing ground sample sizes allows UAV-LiDAR applications with minimal field effort, while increasing the surveyed area, maintaining accuracy, and reducing costs [20].
LiDAR has been used in forest assessment, demonstrating its capability to estimate, model, and characterize forest structures and ecological parameters [21,22,23]. LiDAR-based inventory systems support rapid evaluations of forest metrics, including canopy structure, tree height, wood volume, biomass, carbon stocks, and geographic positioning [24,25]. Dominant tree height models generated by LiDAR correlate with field measurements [26]. Moreover, combining UAV-LiDAR data with tools such as dendrochronology techniques and thinning simulators expands its utility for monitoring forest growth and structural dynamics [27]. The use of UAV-LiDAR has demonstrated effectiveness for detecting and measuring individual trees. For instance, Rodríguez-Puerta et al. [28] demonstrated the effectiveness of UAV-based LiDAR data for identifying individual trees and measuring height in young forests. This approach has been further refined by Pirotti [29], who applied template matching to extract tree height and position from LiDAR-derived canopy height models (CHMs) in Pinus pinaster Ait. stands. Picos et al. [30] used UAV-LiDAR and CHMs to detect individual trees in Eucalyptus plantations, while Solares-Canal et al. [31] explored the use of portable LiDAR for tree identification and segmentation in Pinus species. UAV-LiDAR integration has also shown utility in fuel strip planning and management, contributing to fire prevention strategies based on fuel distribution [32,33]. Fernández-Guisuraga et al. [32] highlighted the importance of above-ground biomass, estimated from LiDAR data, spectral, and field inventory data, as a determining factor of fire severity in P. pinaster ecosystems. Míguez and Fernández [34] combined normalized difference vegetation index (NDVI) and high-density LiDAR data to assess the natural regeneration of P. pinaster following a high-severity fire disturbance. LiDAR applications also extend to mapping and monitoring forest disturbances such as pests and diseases. Yu et al. [35] proposed a combined approach using UAV hyperspectral and LiDAR data to detect diseases such as pine wilt disease using machine learning algorithms at an individual tree level.
Forest biomass is a key indicator of ecosystem productivity and forest development [36]. Accurate biomass estimation is critical for assessing climate change impacts, particularly in relation to carbon sequestration and storage [37,38]. LiDAR point clouds can be used to estimate biomass based on dendrometric variables [39,40] and are increasingly integrated into forest management strategies, such as monitoring selective logging impacts on biomass and carbon [41]. Biomass estimates from LiDAR data have matched those from traditional inventories in both Eucalyptus plantations [42] and complex ecosystems like the Amazon rainforest [18,43]. When validated with field inventories, UAV-LiDAR showed promising results for developing biomass estimation models in tropical and subtropical forests [44,45], showing better accuracy when compared with optical remote sensing techniques [46]. Goldberg et al. [47] used airborne LiDAR data for carbon stock quantification in northern Australia’s savannas, focusing on Eucalyptus miniata A.Cunn. and Eucalyptus tetrodonta F.Muell. Similarly, Pinedo et al. [48] used UAV-LiDAR to estimate tree height and above-ground biomass in Eucalyptus globulus Labill. plantations, highlighting the technology’s potential for monitoring forest resources. Additionally, Lu et al. [38] explored the estimation of aboveground biomass in Robinia pseudoacacia L. forests using point clouds acquired through UAV and backpack LiDAR. Furthermore, Torre-Tojal [49] used random forest algorithms to estimate the above-ground biomass of Pinus radiata D.Don based on LiDAR data, demonstrating the versatility of LiDAR technology in forest biomass assessment.
Modeling approaches, including machine learning regressors, have been employed in above-ground biomass (AGB) estimation, often providing more accuracy in comparison with traditional linear models [46,50]. However, stepwise linear regression is still being adopted in the estimation of forest biomass [13,51,52,53], enabling variable selection to optimize model performance while preserving accuracy [52,53,54]. Liu et al. [55] reported that machine learning algorithms applied to UAV stereo imagery have achieved higher accuracy in eucalyptus plantations. The integration of LiDAR data with multispectral images and machine learning has shown significant potential. Rodríguez-Puerta et al. [56] compared machine learning algorithms for fuel strip planning in urban-forest interface using a combination of Sentinel-2 with UAV LiDAR and airborne laser scanning (ALS) data. A random forest model was the most reliable, with an overall accuracy of 90.66 per cent in the training set and 91.80 per cent in the test set. Stepwise linear regression has also shown high accuracy in pine forests [57,58]. However, model performance varies by forest type due to species-specific growth and distribution patterns. For example, pine stands often show greater quantitative errors, due to higher biomass variability [59,60]. Plantation age also influences the estimation of parameters derived from LiDAR data [28,34] and the detection of individual trees [61].
Another important factor is the number of metrics that can be derived from UAV-based LiDAR data, many of which correlate with field-based AGB measurements. These metrics include height amplitude, mean canopy height, canopy coverage, and canopy density [62], with mean canopy height showing a strong correlation with AGB [63]. LiDAR-based height percentiles have also proven to be accurate AGB predictors [44,50]. Simplified models based on these parameters show high coefficients of determination (R2) and relatively low errors, supporting their applicability for biomass estimation. Yan et al. [64] identified mean tree height as the most influential parameter in univariate regression models. In eucalyptus plantations, tree height and canopy depth were found to be the most effective predictors of trunk volume using a principal components analysis [65]. Leite et al. [66] evaluated both area-based and individual-tree-based methods, incorporating central tendency and distribution metrics. Similarly, Liu et al. [67] extracted multiple LiDAR-derived metrics for Larix olgensis A.Henry, including total tree height, crown width, and crown area across height percentiles.
As modeling approaches evolve, their integration with UAV-LiDAR is expected to further improve AGB and total biomass (TB) estimation for different species, serving as a precise and cost-effective tool for carbon monitoring and sustainable forest management [36,68,69]. This is especially relevant in countries with poor forestry management and highly variable and fragmented landscapes, such as in Portugal. In this context, Portuguese forestry is predominantly characterized by a distribution of P. pinaster (maritime pine) and E. globulus [70]. These species are widespread due to their adaptability to local conditions. However, the terrain across Portugal is variable, with steep slopes, rugged landscapes, and varied soil types presenting a significant management challenge [71]. Forest management has historically been sporadic and uncontrolled, leading to high tree densities, particularly of fast-growing stands of P. pinaster and E. globulus [72]. High tree density combined with the challenging terrain amplifies the susceptibility of these forests to fire disturbances [73]. The dense undergrowth, along with the flammability of both P. pinaster and E. globulus, creates an environment where fires can spread rapidly and uncontrollably. These conditions are aggravated by periods of drought and extreme weather, which have become more frequent due to climate change and socioeconomic factors [74,75].
Addressing these issues requires sustainable forest management practices that balance ecological and economic concerns [76]. This study aims to support forest sustainability by exploring the use of UAV-based lidar data to model biomass in P. pinaster and E. globulus stands. To meet this goal, biomass estimates derived from traditional forest inventory and UAV-based LiDAR data were acquired in the central and northern regions of the country. This allows for the evaluation of the accuracy of high-spatial-resolution remote sensing approaches for improving AGB and TB estimation in these two species under Portuguese forestry conditions.

2. Materials and Methods

2.1. Study Areas

The areas used in this study are distributed across different regions of mainland Portugal (Figure 1). These areas correspond to forest stands of E. globulus and P. pinaster, spanning from north to south. E. globulus stands are located in three areas: Parque das Serras do Porto (Figure 1A) within the municipalities of Paredes and Valongo, in the northern region; Serra da Lousã, within the municipalities of Góis (Figure 1B), in the central region; and in the municipality of Nisa in the Alentejo, southern region (Figure 1C). The P. pinaster stands are also located in Serra da Lousã, within the municipalities of Góis and Castanheira de Pêra (Figure 1B) and Ribeira de Pena (Figure 1D), in the north of Portugal.

2.2. Forest Inventory

The sampling procedure was carried out using two methods: field-based forest inventory and UAV-based LiDAR acquisition. Field data were collected between January 2022 and March 2023. Circular sampling plots of 500 m2 were established on a horizontal plane in stands P. pinaster and E. globulus stands.
Plots on flat terrain (slope < 5°) are not corrected for slope, with a fixed radius of 12.62 m. In sloped areas, inclination was measured with a hypsometer to determine the corrected radius required to maintain a plot area projected on the horizontal plane. This corrected radius is obtained by
c r = r cos α
where c r is the corrected radius, r the radius at the horizontal plane, and α the terrain inclination (°).
Trees higher than two meters were measured unless the stand was young (50% of trees < 1.3 m), in which all trees were measured. A GNSS receiver was used to record the central coordinates of each sample plot.
In each plot, all trees were measured and identified. The recorded variables included species, diameter at breast height (DBH), total height (Ht), crown base height (Hc), and crown projection diameters in the north–south (CDNS) and also east–west (CDEW).
The dominant height (Hdom) is defined as the average height of the 100 trees with the largest DBH per hectare. In a 500 m2 sampling plot, this corresponds to an average height of the five trees with the largest DBH. The Hdom was calculated for each sampling plot and was used to estimate partial biomass for each tree. Tree-level partial biomass was estimated using established equations based on DBH or DBH and Hdom. TB per sampling plot was then extrapolated to a per-hectare basis. Biomass was calculated using equations from the 6th Portuguese National Forest inventory, based on studies developed by Tomé et al. [77], Faias et al. [78], and Soares and Tomé [79], as presented in Table 1.

2.3. LiDAR Data Acquisition and Processing

The LIDAR data were acquired by a third party, covering all study areas. Aerial surveys were conducted using UAVs equipped with LIDAR sensors to obtain georeferenced point clouds. In Parque das Serras do Porto (Figure 1A), a Matrice 600 hexacopter UAV (SZ DJI Technology Co., Ltd., Shenzhen, China) with the Velodyne VLP16 (Velodyne Lidar, Inc., San Jose, CA, USA) sensor was used. In the other study areas, the quadcopter Matrice 300 RTK (DJI, Shenzhen, China) equipped with a Mapper 3+ (YellowScan®, Saint-Clément-de-Rivière, France) was used. Flight surveys were conducted at 40 m above the ground level, at an average speed of 5 m s−1 with an average point density greater than 50 points per m2. Each point cloud was delivered in LAS (LASer format) and divided into multiple files. The data from each area were merged for analysis.
To extract parameters for each stand, the LIDAR point clouds were processed in three stages: (1) pre-processing, which included merging LAS files, removing noise, and clipping sampling plots; (2) topographic normalization, where the ground was segmented, elevation normalized, and points selected according to a height threshold; and (3) parameter extraction, where different metrics are derived from the normalized point clouds. The steps of this methodology are further described in the following subsections.

2.3.1. Point Cloud Pre-Processing

Since the point clouds may contain non-representative points, a point cloud cleaning operation can be performed to remove duplicate or isolated points corresponding to noise. The Statistical Outlier Removal (SOR) filter in CloudCompare software (version 2.12.1) was used for this purpose. This method calculates the average distance from each point to its k nearest neighbors. Points with distances greater than the average plus a certain number of standard deviations were removed. The inputs for the SOR filter were six points, mean distance estimation, and a value of one for the standard deviation multiplier threshold (nSigma). As a result, a new point cloud was generated as the output of the SOR method, where noise was excluded, improving point distribution and reducing the total number of points.
Next, circular plots were extracted from the point clouds, corresponding to the sampled areas. This step was performed in QGIS by buffering the center of each plot to generate a circumference and clipping the point cloud using the “lasclip” function from LAStools (rapidlasso GmbH, Gilching, Germany). After clipping, additional outliers, such as power lines, were manually removed from some point clouds using CloudCompare.

2.3.2. Terrain Normalization

After clipping and cleaning, topographic normalization was performed. This step adjusts point elevations (Z-axis) to represent height above ground, enabling comparison across plots, as shown in the example presented in Figure 2. To achieve this, ground points must be distinguished from the remaining points. Ground segmentation was performed using the Simple Morphological Filter (SMRF) algorithm [80], which includes creating a surface map with the minimum elevation from the minimum elevation data, segmenting ground elements, and separating ground points from the original point cloud (Figure 2b). This step was applied using MATLAB (R2024a Update 1).
The normalization removes the effect of topographic variation on vegetation points. Using the ground points identified from the SMRF algorithm, an interpolation was performed to estimate the ground elevation (Z) for each point. The normalized height was then calculated by subtracting the interpolated ground elevation from each point’s original elevation (Figure 2c). Following point cloud normalization, points below a certain height threshold were removed to exclude vegetation not relevant to tree biomass estimation. This step is carried out to avoid the influence of points that belong to shrub vegetation, which could distort the distribution of the extracted metrics; thus, most points corresponding to tree structures are retained for analysis. Next, a triangulation of the normalized and filtered point clouds was performed to enable volume extraction. This was achieved using the alphaShape algorithm [81], which creates a triangle mesh based on the 3D coordinates (X, Y, Z) of the points and a radius parameter (α) that defines point connectivity; an α of 0.25 was used in this study.

2.3.3. Stand-Level Parameters Extraction

After filtering and normalizing point cloud height, a series of parameters were extracted for each plot analyzed in the scope of this study. These parameters are based on various height percentiles, excluding points below the threshold defined in the previous step (2 m). Three groups of parameters were extracted based on different height percentiles: height, crown structure (density and projected area), and dispersion. A detailed description of the variables for each group of parameters is presented in Table 2.
Height parameters are obtained by measuring the 25th, 40th, 50th, 75th, 90th, and 95th percentiles, along with the mean and maximum height, as illustrated in Figure 3. Structure parameters (crown density, projected crown area, and volume) were calculated using points located above 2 m (minimum value) and were computed for each percentile and mean height. Crown density represents the amount of the forest covered by the vertical projection of tree crowns [82], calculated as the proportion of vegetation points relative to the analyzed area. The projected crown area corresponds to the vertical projection of the tree crowns relative to the analyzed plot area and was normalized to hectare units. The volume, also converted to m3 ha−1, was estimated from the triangulated surface. Height dispersion was evaluated through variance, standard deviation, and range.

2.4. Data and Statistical Analysis

After extracting geometric and dispersion parameters, plots were grouped by species (E. globulus or P. pinaster). Each dataset was pre-analyzed to remove plots with insufficient points or inconsistent data. By using the predictor variables based on the extracted parameters (Section 2.3.3), two response variables were modeled, TB and AGB, both expressed in megagrams per hectare (Mg ha−1). The dataset of each species was randomly split into train (70%) and test (30%) subsets, allowing the validation on data not used during model training.

2.4.1. Feature Selection

The assumptions of normality and homogeneity of variances were tested using the Shapiro–Wilk and Levene’s tests, respectively, in SPSS Statistics 26 (IBM Corp., Armonk, NY, USA) [83]. As most variables violated parametric assumptions (p < 0.05), Spearman’s rank correlation coefficient was employed to assess the strength and direction of monotonic associations between predictor and response variables [84]. All correlations were interpreted within a 95% confidence interval.
To address multicollinearity, a known limitation in regression modeling where predictors can show strong interdependencies, a preliminary variable screening was conducted. Multicollinearity inflates the variance of coefficient estimates, reduces model interpretability, and undermines predictive accuracy [85]. Given the large initial set of predictors (n = 35) and most extracted parameters based on height percentiles, the likelihood of intercorrelation was high.
The selection process began with Spearman’s rank correlation analysis to identify variables moderately to strongly associated with the response variable. To reduce redundancy among predictors, pairwise correlations above 0.95 were removed. Next, variables with high intercorrelation were screened using the variance inflation factor (VIF). The process begins by fitting a linear model with all input variables and calculating initial VIF values (VIF1) for each predictor. Then the variable with the highest VIF value is removed if this value exceeds the threshold that was set to five [86]. This elimination triggers a recalculation of VIF values for the remaining variables in the updated model, as the removal of one predictor alters the collinearity structure. This process is repeated in a stepwise manner until all remaining variables demonstrate VIF values below the threshold, obtaining the final VIF values (VIF2). Subsequently, stepwise regression was applied in the variables selected from VIF2 using the Akaike information criterion (AIC) as the selection criterion to determine optimal predictor subsets for modeling AGB and TB, for each species separately [87]. This iterative procedure balances model simplicity and predictive performance by adding or removing variables based on their statistical contribution [88]. In this step, a forward selection was implemented by adding one variable at a time.

2.4.2. Above-Ground Biomass and Total Biomass Modeling

Predictive modeling was performed using Multiple Linear Regression (MLR) and Random Forest (RF). The trained regression models were fitted using the selected metrics (Section 2.4.1). MLR is an extension of simple linear regression that incorporates multiple predictor variables into a single multivariate framework, allowing the analysis of complex relationships between the response and explanatory variables [89]. RF is an ensemble learning algorithm that constructs a large number of decision trees during training and combines their outputs to improve predictive accuracy and control overfitting. The method’s strength is based on its use of bootstrap aggregation (bagging) and random feature selection at each split, which improves model robustness and reduces variance, particularly in high-dimensional datasets [90,91,92].
MLR was implemented and analyzed using the R statistical software (version 4.3.2; R Core Team, 2023) [93]. RF regression models [90] were implemented using the randomForest [94] and caret [95] packages in R, with k-fold cross-validation (k = 5) to ensure model generalizability and minimize overfitting. Hyperparameter tuning was performed using a grid search approach, and model training was conducted to identify the optimal configuration. Specifically, the number of trees was set to 500, and the number of variables randomly selected at each split (mtry) ranged from 2 to 6. Due to the limited number of P. pinaster plots (n = 15), only MLR was applied as an exploratory approach. Machine learning models such as random forest typically require larger datasets to avoid overfitting and ensure generalizability [96]. Therefore, and in accordance with literature that discourages the use of ML models under such conditions [97], MLR was employed for this dataset, while MLR and RF were both applied for E. globulus data.
Model performance was evaluated using three commonly adopted regression metrics: the coefficient of determination (R2), root mean square error (RMSE), and mean absolute error (MAE). These metrics were selected to provide a complementary understanding into the models’ predictive accuracy and robustness, by quantifying the proportion of variance in the dependent variable explained by the predictors for model fit (R2), and by offering an evaluation of model accuracy (RMSE and MAE), with RMSE penalizing large discrepancies more heavily and MAE providing a straightforward interpretation of average error [98]. These performance metrics were computed for training and test datasets, as well as for the full dataset to support final model evaluation.

3. Results

3.1. Data Characterization and Analysis

Table 3 summarizes the variables and estimated metrics for biomass estimation. Among the two studied species, P. pinaster stands showed the highest mean TB (416.3 Mg ha−1), while the lowest TB was observed in an E. globulus stand (3.4 Mg ha−1). The lowest TB in a P. pinaster stand was 113 Mg ha−1. Maximum TB values were 680.8 Mg ha−1 for P. pinaster and 107.7 Mg ha−1 for E. globulus. Similar trends are observed for AGB, with the highest mean value observed in P. pinaster stands (403.6 Mg ha−1). The minimum AGB was observed in an E. globulus stand (2.7 Mg ha−1), while the minimum in P. pinaster was 105.3 Mg ha−1. The maximum AGB values reached 664.2 Mg ha−1 for P. pinaster and 86.3 Mg ha−1 for E. globulus. Regarding stand-level canopy structure metrics, P. pinaster stands show a higher mean maximum height (18.7 m) compared with E. globulus stands (14.9 m). Height distribution in P. pinaster stands ranges from 9.7 m to 26 m, while E. globulus heights range from 6.6 m to 21.7 m. Crown density also follows this trend, P. pinaster stands have a mean density of 37.6%, ranging between 2.9% and 67.9%, while E. globulus stands have a mean density of 20.7%, reaching a maximum of 45.2% and a minimum of 1.1%. The crown projection area is higher in P. pinaster stands (5059.2 m2 ha−1) when compared with the E. globulus stands (3318.1 m2 ha−1). Volume values show smaller differences between species. The mean volume is 3921.9 m3 ha−1 in P. pinaster stands and 3198.5 m3 ha−1 in E. globulus stands. The minimum and maximum values were both observed in E. globulus stands, ranging from 7 m3 ha−1 to 13,602.8 m3 ha−1.
The correlation between each individual metric estimated from the LiDAR point cloud data with AGB and TB is presented in Table 4. In E. globulus stands, correlation coefficients for height metrics range from r = 0.85 for maximum height to r = 0.95 for mean height. Among dispersion metrics, height amplitude shows the highest correlation (r = 0.86), while height variance and standard deviation were similar (r = 0.54). Canopy cover density (CCD) at the 90th percentile had the highest correlation for this metric type (r = 0.61), while density at the minimum height above 2 m showed the weakest correlation (r = 0.51). For the crown projection area, the highest correlation was achieved when considering points above 2 m (r = 0.56). Volume-related metrics showed the weakest correlations overall, with values ranging from 0.30 for minimum height to 0.51 at the 95th height percentile. As for P. pinaster stands (Table 4), height appears as the most strongly correlated parameter, ranging from r = 0.90 and 0.92 (maximum height in AGB and TB, respectively) to r = 0.94 and 0.96 (mean height and height at 40th and 50th percentiles). Among dispersion metrics, amplitude showed a high correlation (r = 0.92 for AGB and r = 0.90 for TB), while variance and standard deviation were below 0.5. For canopy density, the highest correlation was observed at the minimum height (2 m) with r = 0.52 for AGB and r = 0.53 for TB, whereas the lowest was at the 95th percentile (r = 0.16 and 0.20). Crown area above 2 m showed the strongest correlation in this category, while crown area at the 90th and 95th percentiles was below 0.3. As for volume metrics, the weakest was observed for volume at the 90th percentile, while the mean volume showed the highest correlation values in this type of metric.
After filtering for multicollinearity, several variables were excluded using Spearman’s correlation values. The variables that were used in the VIF analysis varied across the two species (Table 4). For E. globulus, the initial set of variables used were height variance, the 25th percentile of height, maximum height, CCD at the minimum height above two meters and at the 90th and 95th height percentiles, crown area at the 95th percentile of height, and volume at the median height. After the VIF filtering procedure, maximum height and CCD at the 90th percentile of height were excluded. As for P. pinaster, the height variance, maximum height, CCD and projected crown area at the minimum height, crown area at the 95th percentile of height, and volume at the 25th percentile of height were used in the VIF procedure, with the final set of variables being composed by height variance, maximum height, and volume at the 25th percentile of height.

3.2. Above Ground and Total Biomass Estimation

Regression models for AGB and TB estimation were fitted using the selected metrics extracted from the LiDAR point clouds of the plots of each species. The results obtained during training were satisfactory, with the models explaining at least 89% of the variance in the dependent variables. The best performance was observed for the E. globulus stands, while the highest estimation errors occurred for P. pinaster.

3.2.1. Biomass Estimation of Eucalyptus globulus Stands

The regression models for E. globulus were developed using stepwise linear regression and are presented in (2) and (3). After applying MLR, the best predictive performance for both AGB and TB at the stand level was achieved using the height at 25th percentile (HP25), CCD at two meters height (CCDmin), and height variance (Hvar).
A G B = 24.94 + 5.46 H P 25 + 1.59 H v a r + 24.21 C C D m i n
T B = 31.14 + 6.82 H P 25 + 1.99 H v a r + 30.23 C C D m i n ,
The final models provided an R2 of 0.89 for both AGB and TB, with an RMSE of 6.7 Mg ha−1 and 8.37 Mg ha−1 and a MAE of 5.36 Mg ha−1 and 6.69 Mg ha−1, respectively (Table 5).
When applied to the test dataset (n = 12), the model performance decreased slightly but remained robust, with R2 = 0.73, RMSE = 10.49 Mg ha−1 and 13.1 Mg ha−1, and MAE = 8.03 Mg ha−1 and 10.03 Mg ha−1 for AGB and TB, respectively. When the model was applied to all E. globulus plots, the overall performance was R2 = 0.85, RMSE = 7.9 Mg ha−1 and 9.9 Mg ha−1, and MAE = 6.1 Mg ha−1 and 7.6 Mg ha−1, for AGB and TB, respectively. The predicted values showed a strong correlation with those derived from the field inventory (Figure 4), with estimations ranging from 0.0 to 28.5 Mg ha−1 for TB (Figure 4c) and from 0.0 to 22.8 Mg ha−1 for AGB (Figure 4d)and 18.8 Mg ha−1.
When applying RF, HP25 demonstrates to be the dominant predictor with feature importance of 60.4% and 62.7% for TB and AGB, respectively, followed by CCDmin (20.6% and 19.0%) and Hvar (19.0% and 18.3%). The RF models showed a strong predictive capability for both TB and AGB at the stand level (Table 5). For AGB, the model achieved an R2 of 0.96 on the training set, with an RMSE of 4.69 Mg ha−1 and a MAE of 3.61 Mg ha−1. For TB, the training performance was similarly high (R2 = 0.96; RMSE = 5.71 Mg ha−1; MAE = 4.43 Mg ha−1). When tested on independent data (n = 12), performance decreased but remained robust, with R2 values of 0.72 and 0.73, RMSEs of 10.42 Mg ha−1 and 12.92 Mg ha−1, and MAEs of 7.95 Mg ha−1 and 9.76 Mg ha−1 for AGB and TB, respectively. On the full dataset, the overall performance resulted in R2 values of 0.90 for AGB and TB, with RMSE of 6.53 and 8.25 Mg ha−1 and MAE of 4.76 and 5.71 Mg ha−1, respectively. The predicted values were strongly correlated with measured biomass (Figure 5), with residuals ranging from 0.0 to 28.2 Mg ha−1 for TB (Figure 5c) and from 0.3 to 21.8 Mg ha−1 for AGB (Figure 5d).

3.2.2. Biomass Estimation of Pinus pinaster Stands

The regression models for P. pinaster are shown in (4) and (5) for AGB and TB, respectively. The initial variable selection included height amplitude (above 2 m), mean height, crown area, and canopy density at 2 m. The variable selection included the maximum height (Hmax) and stand-level volume per hectare at the 25th percentile of height (VP25) as the most effective predictors for both AGB and TB in P. pinaster.
A G B = 299.6 + 33.65 H m a x + 0.0172 V P 25
T B = 297.7 + 34.12 H m a x + 0.0176 V P 25 ,
The AGB model obtained an R2 of 0.91, RMSE = 56.43 Mg ha−1, and MAE = 51.8 Mg ha−1. The TB model achieved an R2 of 0.91, RMSE = 58.53 Mg ha−1, and MAE = 53.31 Mg ha−1. The performance in the test set was similar for R2 (0.91) and slightly lower for the other metrics, with RMSE = 89.1 Mg ha−1 and 91.59 Mg ha−1 and MAE = 79.37 Mg ha−1 and 81.33 Mg ha−1 for AGB and TB, respectively. These results are presented in Table 6.
When applied to all P. pinaster plots, the models achieved R2 = 0.88 for AGB (RMSE = 69.06 Mg ha−1, MAE = 60.99 Mg ha−1) and R2 = 0.88 for TB (RMSE = 71.27 Mg ha−1, MAE = 62.65 Mg ha−1). However, when compared with E. globulus (Figure 4 and Figure 5), the estimation errors were higher, ranging from 12.5 to 140.1 Mg ha−1 for TB (Figure 6c) and from 12.5 to 138.4 Mg ha−1 for AGB (Figure 6d).

4. Discussion

Accurate biomass estimation is an important component of forest monitoring, directly contributing to carbon accounting and sustainable forest management. In this study, MLR and RF models were developed using UAV-based LiDAR data to estimate AGB and TB in E. globulus, while only MLR was applied to P. pinaster due to limited sample size. MLR is a widely used parametric approach that allows for the selection of the most relevant predictors, which enables interpretable models, efficient computation, and reduced model complexity, but it requires linear relationships and careful multicollinearity control [54]. In contrast, machine learning models such as RF can capture non-linear interactions and tolerate correlated variables but often lack interpretability and require larger datasets. Both of these models are widely used in remote sensing applications for forest biomass estimation [52,53].
To reduce the complexity of the model and improve the accuracy of predictions, a minimal number of parameters needs to be selected through feature selection techniques [99]. The complexity of this task is amplified when there is a high degree of correlation between the predictors, as in the case of this study (Table 4). Therefore, a multistep feature selection process was implemented, which specifically includes Spearman correlation filtering, VIF, and stepwise regression. This filtering approach reduced the initial set of variables to three to four predictors, explaining over 89% of the variance in the training dataset. Such variable selection is essential in remote sensing, where metrics often show high collinearity.
For E. globulus, both MLR and RF performed well. MLR achieved R2 = 0.89 in training and R2 = 0.73 in testing (Table 5), with RMSE ranging from 6.70 to 13.10 Mg ha−1. RF obtained a better performance in training when compared with MLR (R2 = 0.96), but both models showed comparable results in the test set (R2 = 0.72 for AGB using RF and R2 = 0.73 for the cases). The reduction from training to test sets is in line with the anticipated model behavior in terms of generalization, as biomass variation is typically influenced by factors specific to the location, environmental heterogeneity, and stand history [100]. On the full dataset (Figure 4 and Figure 5), RF achieved R2 = 0.90 with RMSE of 6.53–8.25 Mg ha−1, while MLR achieved R2 = 0.86 with RMSE of 7.5–9.37 Mg ha−1. These results confirm that canopy height-based metrics extracted from UAV-LiDAR are reliable predictors of biomass in eucalyptus plantations.
The MLR model performed well in P. pinaster with R2 = 0.91 in both training and testing (Table 6), but absolute errors were higher (RMSE up to 91.59 Mg ha−1 for TB in the test set), which may be related to the wider biomass range (113–681 Mg ha−1). On the full dataset (Figure 6), the R2 performance was high at 0.88, with an RMSE of 69.06–71.27 Mg ha−1. Despite the differences in the absolute error, relative errors in the training set between species were comparable (14–17%), indicating a good model generalization across species with different structural complexity [59,60]. The exclusive use of MLR for P. pinaster is justified by the limited number of field samples, a common limitation in forest biomass studies [41,43]. It is an established fact that machine learning (ML) typically requires large datasets to reliably capture non-linear interactions [101]. However, it must be noted that this is not always feasible due to the cost and logistical complexity of field sampling [18]. Consequently, MLR was determined to be a more suitable approach to be reported, highlighting the technical merits of field data collection. As observed in Liu et al. [102], even when the number of predictors is reduced, MLR can produce reliable outputs if linear relationships are evident.
In regard to selected model variables, in E. globulus, the most significant predictors were identified as HP25, CCDmin, and Hvar. These features characterize vertical structure and canopy heterogeneity, which are ecologically relevant indicators of biomass accumulation. Lower height percentiles, such as HP25, are capable of capturing understory dynamics, while CCDmin and Hvar provide information on foliage distribution and spatial irregularity. For P. pinaster, Hmax and VP25 were identified as suitable predictors, highlighting the influence of upper canopy development and low-end volume distribution in biomass differentiation. These results are in line with Domingo et al. [58], who used low-density LiDAR data to estimate TB in Pinus halepensis, achieving comparable results (R2 = 0.87) when using, among others, elevation variance and HP25. Yan et al. [64] and Pinedo et al. [48] found that mean canopy height was a dominant predictor for AGB in Eucalyptus, achieving R2 of 0.87 and 0.68, respectively, for AGB estimation. This study obtained similar or better performance, demonstrating the relevance of vertical structure metrics in biomass estimation.
When compared with other remote sensing approaches, the models in this study generally obtained a higher accuracy. Fernández-Guisuraga et al. [32] reported R2 values of 0.72 and 0.68 for TB and AGB in P. pinaster using ALS and Landsat data, while Li et al. [53] achieved R2 = 0.41 using Landsat 8 imagery in a subtropical pine forest. Maesano et al. [103] developed a stepwise regression model combining UAV-RGB and LiDAR data in Pinus nigra, achieving R2 = 0.81 and RMSE = 45.5 Mg ha−1, a comparable R2 but with lower error than our P. pinaster results, which were impacted by greater biomass variability.
While RF showed better performance than MLR in terms of overall fit, the difference in predictive performance on test data was small. Nevertheless, the consistency in performance across datasets is indicative of the reliability of the UAV-based point cloud canopy metrics used for biomass estimation at the stand level. Despite its linear assumptions and simpler structure [101], the MLR model demonstrated a performance comparable to the complex RF model [90]. Other studies demonstrated similar findings with parametric models to match or surpass machine learning techniques, provided that multicollinearity is effectively mitigated and variable selection is conducted with rigor [104,105,106]. An important strength of the MLR models is their simplicity, especially where decision-makers require interpretability and reproducibility [58]. The linear nature of MLR models facilitates the interpretation of the individual impact of each explanatory variable on biomass estimates, which is advantageous for operational applications. For instance, in E. globulus, the positive coefficients for HP25 and CCDmin suggest that both mid-canopy height and lower canopy density are positively related to biomass accumulation, which aligns with ecological expectations. Similarly, for P. pinaster, Hmax and VP25 provide interpretable associations to canopy dominance and volume distribution within plots. These relationships are transparent and reproducible, offering information for forest managers and supporting decision-making in resource planning. In contrast, Random Forest-based models, on the other hand, function as “black boxes”, making it difficult to explicitly analyze the role of each variable in the final prediction, which may limit their acceptance in contexts where interpretability is essential, with higher computational cost [107]. Although RF achieved slightly better accuracy, its lack of transparency can limit its adoption in operational forestry. In future work, model-agnostic interpretability tools such as SHAP (SHapley Additive exPlanations) values [108] could be integrated to improve the transparency of RF models and better interconnect variable influence to stakeholders.
One limitation of both modeling approaches is their current restriction to stand-level biomass estimation. However, with appropriate data density and tree segmentation, the methodology could be adapted for individual tree biomass estimation, enabling a refinement of forest inventories, supporting species-level management, and improving biomass mapping in mixed and heterogeneous landscapes. Moreover, the use of other data sources should be considered for data fusion. Li et al. [36] used multispectral and LiDAR data to compare the performance of multiple stepwise regression, random forest, support vector machine, and decision tree algorithms for E. globulus biomass estimation. The random forest achieved the best performance with R2 = 0.94 (training) and R2 = 0.87 (test) and lower RMSE values. Lu et al. [38] observed a similar result when combining UAV and backpack LiDAR point clouds in an R. pseudoacacia forest, achieving a higher accuracy using random forest when compared with multiple regression (R2 > 0.90). Despite the potential of advanced methods, the simplicity of the MLR models presented in this study provides several practical advantages. These models can be applied to large areas, require lower computational requirements when compared with machine learning approaches, reduce fieldwork, and enable cost-effective, non-destructive biomass monitoring. Moreover, the performance was equally satisfactory when all the available data were used (Figure 4 and Figure 6), suggesting that these models maintain their generalization capacity [109].

5. Conclusions

This study demonstrated the application of UAV-based LiDAR point clouds for estimating above-ground and total biomass in E. globulus and P. pinaster stands in central and northern Portugal. The resulting models performed well, with high coefficients of determination and relatively low estimation errors, even when applied to independent validation datasets. The simplicity of the final models, requiring few predictor variables, highlights the practical utility of UAV-LiDAR data in forest biomass estimation. The methodology presented offers an efficient, cost-effective, and non-destructive alternative for large-scale forest monitoring, with potential for operational application. Despite higher absolute errors in P. pinaster plots, the relative performance was consistent across species, suggesting good generalizability of the methodology. The methodology applied in this study can serve as a non-destructive forest biomass assessment approach, with applications in forest inventories, carbon stock monitoring, and landscape-scale forest management planning. Moreover, this methodology can be evaluated in other species with economic value and different canopy structures, such as Quercus suber.
Future research should focus on expanding the dataset to include a greater number and diversity of plots, stratified by species, age class, and stand structure. This would improve model calibration and support more generalizable biomass estimates. The methodology used in this study can also be adapted, with additional data and segmentation techniques, to estimate biomass at the individual tree level. Integrating tree-level structural metrics from UAV-based LiDAR data with species classification derived from multispectral or hyperspectral data would support the development of species-specific, tree-level biomass models. These approaches could be particularly useful in mixed or structurally complex stands. Furthermore, combining structural and spectral information has the potential to improve biomass predictions by capturing complementary information related to canopy architecture, foliage distribution, and physiological traits. Future studies should also prioritize the interpretability of machine learning models, especially when they are used in applied forestry contexts. This will allow practitioners to better understand the underlying ecological drivers of biomass variability and improve trust in model-based decision-making.

Author Contributions

Conceptualization, L.P., D.M.M.L. and M.E.C.M.S.; methodology, L.F., D.M.M.L. and L.P.; software, L.F. and L.P.; validation, L.F., D.M.M.L. and L.P.; formal analysis, L.F., D.M.M.L. and L.P.; investigation, L.F., L.P. and A.S.d.A.S.; resources, A.S.d.A.S., D.M.M.L. and M.E.C.M.S.; data curation, L.F., A.S.d.A.S., D.M.M.L. and L.P.; writing—original draft preparation, L.F., A.S.d.A.S., D.A.L. and L.P.; writing—review and editing, A.S.d.A.S., D.A.L., J.J.S., D.M.M.L., M.E.C.M.S. and L.P.; visualization, L.F., A.S.d.A.S., D.A.L. and L.P.; supervision, D.M.M.L., M.E.C.M.S. and L.P.; project administration, D.M.M.L. and M.E.C.M.S.; funding acquisition, M.E.C.M.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by National Funds by FCT—Portuguese Foundation for Science and Technology, under the projects UID/04033/2023: Centre for the Research and Technology of Agro-Environmental and Biological Sciences and LA/P/0126/2020 (https://doi.org/10.54499/LA/P/0126/2020).

Data Availability Statement

The original contributions presented in this study are included in the article; further inquiries can be directed to the corresponding author.

Acknowledgments

The authors would like to acknowledge the Portuguese Foundation for Science and Technology (FCT) for financial support through national funds under the projects UID/04033/2023 and LA/P/0126/2020 (https://doi.org/10.54499/LA/P/0126/2020).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Floriano, E.P. Inventário Florestal, 2nd ed.; Pagel: Rio Largo, Brazil, 2021; ISBN 9786500261325. [Google Scholar]
  2. Akwaji, P.I.; Onah, D.O. Forest Inventory for Sustainable Forest Management in Cross River State, Nigeria. Sci. Rep. Life Sci. 2023, 4, 38–52. [Google Scholar]
  3. Heym, M.; Uhl, E.; Moshammer, R.; Dieler, J.; Stimm, K.; Pretzsch, H. Utilising Forest Inventory Data for Biodiversity Assessment. Ecol. Indic. 2021, 121, 107196. [Google Scholar] [CrossRef]
  4. He, X.; Lei, X.; Zeng, W.; Feng, L.; Zhou, C.; Wu, B. Quantifying the Effects of Stand and Climate Variables on Biomass of Larch Plantations Using Random Forests and National Forest Inventory Data in North and Northeast China. Sustainability 2022, 14, 5580. [Google Scholar] [CrossRef]
  5. da Silva, B.R.F.; Ucella-Filho, J.G.M.; Bispo, P.d.C.; Elera-Gonzales, D.G.; Silva, E.A.; Ferreira, R.L.C. Using Drones for Dendrometric Estimations in Forests: A Bibliometric Analysis. Forests 2024, 15, 1993. [Google Scholar] [CrossRef]
  6. Colomina, I.; Molina, P. Unmanned Aerial Systems for Photogrammetry and Remote Sensing: A Review. ISPRS J. Photogramm. Remote Sens. 2014, 92, 79–97. [Google Scholar] [CrossRef]
  7. Torresan, C.; Berton, A.; Carotenuto, F.; Gennaro, S.F.D.; Gioli, B.; Matese, A.; Miglietta, F.; Vagnoli, C.; Zaldei, A.; Wallace, L. Forestry Applications of UAVs in Europe: A Review. Int. J. Remote Sens. 2017, 38, 2427–2447. [Google Scholar] [CrossRef]
  8. Ferreira, L.; Sousa, J.J.; Lourenço, J.M.; Peres, E.; Morais, R.; Pádua, L. Comparative Analysis of TLS and UAV Sensors for Estimation of Grapevine Geometric Parameters. Sensors 2024, 24, 5183. [Google Scholar] [CrossRef] [PubMed]
  9. Andersen, H.-E.; Mcgaughey, R.J.; Reutebuch, S.E. Assessing the Influence of Flight Parameters, Interferometric Processing, Slope and Canopy Density on the Accuracy of X-band IFSAR-derived Forest Canopy Height Models. Int. J. Remote Sens. 2008, 29, 1495–1510. [Google Scholar] [CrossRef]
  10. Mondello, C.; Hepner, G.; Williamson, R.A. 10-Year Industry Forecast: Phases I-III—Study Documentation. Photogramm. Eng. Remote Sens. 2004, 70, 5–58. [Google Scholar]
  11. Bergen, K.M.; Goetz, S.J.; Dubayah, R.O.; Henebry, G.M.; Hunsaker, C.T.; Imhoff, M.L.; Nelson, R.F.; Parker, G.G.; Radeloff, V.C. Remote Sensing of Vegetation 3-D Structure for Biodiversity and Habitat: Review and Implications for Lidar and Radar Spaceborne Missions. J. Geophys. Res. Biogeosci. 2009, 114, 1–13. [Google Scholar] [CrossRef]
  12. Pilarska, M.; Ostrowski, W.; Bakuła, K.; Górski, K.; Kurczyński, Z. THE POTENTIAL OF LIGHT LASER SCANNERS DEVELOPED FOR UNMANNED AERIAL VEHICLES—THE REVIEW AND ACCURACY. ISPRS—Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2016, XLII-2/W2, 87–95. [Google Scholar] [CrossRef]
  13. Almeida, D.R.A.; Stark, S.C.; Chazdon, R.; Nelson, B.W.; Cesar, R.G.; Meli, P.; Gorgens, E.B.; Duarte, M.M.; Valbuena, R.; Moreno, V.S.; et al. The Effectiveness of Lidar Remote Sensing for Monitoring Forest Cover Attributes and Landscape Restoration. For. Ecol. Manag. 2019, 438, 34–43. [Google Scholar] [CrossRef]
  14. Vazirabad, Y.F.; Karslioglu, M.O. Lidar for Biomass Estimation. In Biomass—Detection, Production and Usage; InTech: London, UK, 2011. [Google Scholar] [CrossRef]
  15. Baltsavias, E.P. A Comparison between Photogrammetry and Laser Scanning. ISPRS J. Photogramm. Remote Sens. 1999, 54, 83–94. [Google Scholar] [CrossRef]
  16. Habib, A.; Ghanma, M.; Tait, M. Integration of LIDAR and Photogrammetry for Close Range Applications. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2004, 35, 1045–1050. [Google Scholar]
  17. Barros, Q.S.; d’ Oliveira, M.V.N.; da Silva, E.F.; Görgens, E.B.; de Mendonça, A.R.; da Silva, G.F.; Reis, C.R.; Gomes, L.F.; de Carvalho, A.L.; de Oliveira, E.K.B.; et al. Indicators for Monitoring Reduced Impact Logging in the Brazilian Amazon Derived from Airborne Laser Scanning Technology. Ecol. Inform. 2024, 82, 102654. [Google Scholar] [CrossRef]
  18. Ferreira, L.; Bias, E.d.S.; Barros, Q.S.; Pádua, L.; Matricardi, E.A.T.; Sousa, J.J. Assessing the Impacts of Selective Logging on the Forest Understory in the Amazon Using Airborne LiDAR. Forests 2025, 16, 130. [Google Scholar] [CrossRef]
  19. Papa, D.d.A.; de Almeida, D.R.A.; Silva, C.A.; Figueiredo, E.O.; Stark, S.C.; Valbuena, R.; Rodriguez, L.C.E.; d’ Oliveira, M.V.N. Evaluating Tropical Forest Classification and Field Sampling Stratification from Lidar to Reduce Effort and Enable Landscape Monitoring. For. Ecol. Manag. 2020, 457, 117634. [Google Scholar] [CrossRef]
  20. Dutcă, I.; Mather, R.; Ioraș, F. Sampling Trees to Develop Allometric Biomass Models: How Does Tree Selection Affect Model Prediction Accuracy and Precision? Ecol. Indic. 2020, 117, 106553. [Google Scholar] [CrossRef]
  21. Almeida, D.R.A.D.; Nelson, B.W.; Schietti, J.; Gorgens, E.B.; Resende, A.F.; Stark, S.C.; Valbuena, R. Contrasting Fire Damage and Fire Susceptibility between Seasonally Flooded Forest and Upland Forest in the Central Amazon Using Portable Profiling LiDAR. Remote Sens. Environ. 2016, 184, 153–160. [Google Scholar] [CrossRef]
  22. Manzanera, J.A.; García-Abril, A.; Pascual, C.; Tejera, R.; Martín-Fernández, S.; Tokola, T.; Valbuena, R. Fusion of Airborne LiDAR and Multispectral Sensors Reveals Synergic Capabilities in Forest Structure Characterization. GIScience Remote Sens. 2016, 53, 723–738. [Google Scholar] [CrossRef]
  23. Maltamo, M.; Mehtätalo, L.; Valbuena, R.; Vauhkonen, J.; Packalen, P. Airborne Laser Scanning for Tree Diameter Distribution Modelling: A Comparison of Different Modelling Alternatives in a Tropical Single-Species Plantation. For. Int. J. For. Res. 2018, 91, 121–131. [Google Scholar] [CrossRef]
  24. Chen, Q.; Gao, T.; Zhu, J.; Wu, F.; Li, X.; Lu, D.; Yu, F. Individual Tree Segmentation and Tree Height Estimation Using Leaf-Off and Leaf-On UAV-LiDAR Data in Dense Deciduous Forests. Remote Sens. 2022, 14, 2787. [Google Scholar] [CrossRef]
  25. Meyer, V.; Saatchi, S.; Clark, D.B.; Keller, M.; Vincent, G.; Ferraz, A.; Espírito-Santo, F.; d’Oliveira, M.V.N.; Kaki, D.; Chave, J. Canopy Area of Large Trees Explains Aboveground Biomass Variations across Neotropical Forest Landscapes. Biogeosciences 2018, 15, 3377–3390. [Google Scholar] [CrossRef]
  26. Ribas-Costa, V.A.; Gastón, A.; Cook, R.L. Modeling Dominant Height with USGS 3DEP LiDAR to Determine Site Index in Even-Aged Loblolly Pine (Pinus taeda L.) Plantations in the Southeastern US. For. Int. J. For. Res. 2024, cpae034. [Google Scholar] [CrossRef]
  27. Navarro-Cerrillo, R.M.; Padrón Cedrés, E.; Cachinero-Vivar, A.M.; Valeriano, C.; Camarero, J.J. Integrating Dendrochronological and LiDAR Data to Improve Management of Pinus Canariensis Forests under Different Thinning and Climatic Scenarios. Remote Sens. 2024, 16, 850. [Google Scholar] [CrossRef]
  28. Rodríguez-Puerta, F.; Gómez-García, E.; Martín-García, S.; Pérez-Rodríguez, F.; Prada, E. UAV-Based LiDAR Scanning for Individual Tree Detection and Height Measurement in Young Forest Permanent Trials. Remote Sens. 2022, 14, 170. [Google Scholar] [CrossRef]
  29. Pirotti, F. Assessing a Template Matching Approach for Tree Height and Position Extraction from Lidar-Derived Canopy Height Models of Pinus pinaster Stands. Forests 2010, 1, 194–208. [Google Scholar] [CrossRef]
  30. Picos, J.; Bastos, G.; Míguez, D.; Alonso, L.; Armesto, J. Individual Tree Detection in a Eucalyptus Plantation Using Unmanned Aerial Vehicle (UAV)-LiDAR. Remote Sens. 2020, 12, 885. [Google Scholar] [CrossRef]
  31. Solares-Canal, A.; Alonso, L.; Picos, J.; Armesto, J. Individual Tree Identification and Segmentation in Pinus Spp. Stands through Portable LiDAR. Forests 2024, 15, 1133. [Google Scholar] [CrossRef]
  32. Fernández-Guisuraga, J.M.; Suárez-Seoane, S.; Fernandes, P.M.; Fernández-García, V.; Fernández-Manso, A.; Quintano, C.; Calvo, L. Pre-Fire Aboveground Biomass, Estimated from LiDAR, Spectral and Field Inventory Data, as a Major Driver of Burn Severity in Maritime Pine (Pinus pinaster) Ecosystems. For. Ecosyst. 2022, 9, 100022. [Google Scholar] [CrossRef]
  33. Rocha, K.D.; Silva, C.A.; Cosenza, D.N.; Mohan, M.; Klauberg, C.; Schlickmann, M.B.; Xia, J.; Leite, R.V.; Almeida, D.R.A.D.; Atkins, J.W.; et al. Crown-Level Structure and Fuel Load Characterization from Airborne and Terrestrial Laser Scanning in a Longleaf Pine (Pinus Palustris Mill.) Forest Ecosystem. Remote Sens. 2023, 15, 1002. [Google Scholar] [CrossRef]
  34. Míguez, C.; Fernández, C. Evaluating the Combined Use of the NDVI and High-Density Lidar Data to Assess the Natural Regeneration of P. Pinaster after a High-Severity Fire in NW Spain. Remote Sens. 2023, 15, 1634. [Google Scholar] [CrossRef]
  35. Yu, R.; Luo, Y.; Zhou, Q.; Zhang, X.; Wu, D.; Ren, L. A Machine Learning Algorithm to Detect Pine Wilt Disease Using UAV-Based Hyperspectral Imagery and LiDAR Data at the Tree Level. Int. J. Appl. Earth Obs. Geoinf. 2021, 101, 102363. [Google Scholar] [CrossRef]
  36. Li, Y.; Wang, R.; Shi, W.; Yu, Q.; Li, X.; Chen, X. Research on Accurate Estimation Method of Eucalyptus Biomass Based on Airborne LiDAR Data and Aerial Images. Sustainability 2022, 14, 10576. [Google Scholar] [CrossRef]
  37. Gibbs, H.K.; Brown, S.; Niles, J.O.; Foley, J.A. Monitoring and Estimating Tropical Forest Carbon Stocks: Making REDD a Reality. Environ. Res. Lett. 2007, 2, 045023. [Google Scholar] [CrossRef]
  38. Lu, J.; Wang, H.; Qin, S.; Cao, L.; Pu, R.; Li, G.; Sun, J. Estimation of Aboveground Biomass of Robinia Pseudoacacia Forest in the Yellow River Delta Based on UAV and Backpack LiDAR Point Clouds. Int. J. Appl. Earth Obs. Geoinf. 2020, 86, 102014. [Google Scholar] [CrossRef]
  39. Silva, A.G.P.; Görgens, E.B.; Campoe, O.C.; Alvares, C.A.; Stape, J.L.; Rodriguez, L.C.E. Assessing Biomass Based on Canopy Height Profiles Using Airborne Laser Scanning Data in Eucalypt Plantations. Sci. Agric. 2015, 72, 504–512. [Google Scholar] [CrossRef]
  40. Silva, C.A.; Saatchi, S.; Garcia, M.; Labriere, N.; Klauberg, C.; Ferraz, A.; Meyer, V.; Jeffery, K.J.; Abernethy, K.; White, L.; et al. Comparison of Small- and Large-Footprint Lidar Characterization of Tropical Forest Aboveground Structure and Biomass: A Case Study From Central Gabon. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 3512–3526. [Google Scholar] [CrossRef]
  41. Gomes, L.F.; Brites, R.S.; Locks, C.J.; Anjos, R.R. dos Estimativas das Alterações na Biomassa Florestal Utilizando LiDAR em Área de Manejo Florestal Sustentável na Amazônia Sul-Ocidental. Anuário Inst. Geociênc. 2020, 43, 260–268. [Google Scholar] [CrossRef]
  42. Nobre, S.; Silva, E.A. Estimativa de variáveis dendrométricas em plantios experimentais de Eucalyptus sp. utilizando imagens de VANT. Geogr. Ensino Pesqui. 2024, 28, e84504. [Google Scholar] [CrossRef]
  43. Rex, F.E.; Corte, A.P.D.; Silva, C.A.; Machado, S.d.A.; Sanquetta, C.R. Dynamics of Above-Ground Biomass in the Brazilian Amazon Using LiDAR Data. Anuário Inst. Geociênc. 2020, 43, 228–238. [Google Scholar] [CrossRef]
  44. Kc, Y.B.; Liu, Q.; Saud, P.; Gaire, D.; Adhikari, H. Estimation of Above-Ground Forest Biomass in Nepal by the Use of Airborne LiDAR, and Forest Inventory Data. Land 2024, 13, 213. [Google Scholar] [CrossRef]
  45. Chan, E.P.Y.; Fung, T.; Wong, F.K.K. Estimating Above-Ground Biomass of Subtropical Forest Using Airborne LiDAR in Hong Kong. Sci. Rep. 2021, 11, 1751. [Google Scholar] [CrossRef] [PubMed]
  46. Gao, L.; Zhang, X. Above-Ground Biomass Estimation of Plantation with Complex Forest Stand Structure Using Multiple Features from Airborne Laser Scanning Point Cloud Data. Forests 2021, 12, 1713. [Google Scholar] [CrossRef]
  47. Goldbergs, G.; Levick, S.R.; Lawes, M.; Edwards, A. Hierarchical Integration of Individual Tree and Area-Based Approaches for Savanna Biomass Uncertainty Estimation from Airborne LiDAR. Remote Sens. Environ. 2018, 205, 141–150. [Google Scholar] [CrossRef]
  48. Pinedo, L.E.; Ortega Quispe, K.; Ccopi Trucios, D.; Urquizo Barrera, J.; Rios Chavarría, C.; Pizarro Carcausto, S.; Matos Calderon, D.; Patricio Rosales, S.; Rodríguez Cerrón, M.; Ore Aquino, Z.; et al. Estimation of Height and Aerial Biomass in Eucalyptus Globulus Plantations Using UAV-LiDAR. Trees For. People 2025, 19, 100763. [Google Scholar] [CrossRef]
  49. Torre-Tojal, L.; Bastarrika, A.; Boyano, A.; Lopez-Guede, J.M.; Graña, M. Above-Ground Biomass Estimation from LiDAR Data Using Random Forest Algorithms. J. Comput. Sci. 2022, 58, 101517. [Google Scholar] [CrossRef]
  50. Wang, F.; Sun, Y.; Jia, W.; Zhu, W.; Li, D.; Zhang, X.; Tang, Y.; Guo, H. Development of Estimation Models for Individual Tree Aboveground Biomass Based on TLS-Derived Parameters. Forests 2023, 14, 351. [Google Scholar] [CrossRef]
  51. de Oliveira, C.P.; Ferreira, R.L.C.; da Silva, J.A.A.; de Lima, R.B.; Silva, E.A.; da Silva, A.F.; de Lucena, J.D.S.; dos Santos, N.A.T.; Lopes, I.J.C.; Pessoa, M.M.d.L.; et al. Modeling and Spatialization of Biomass and Carbon Stock Using LiDAR Metrics in Tropical Dry Forest, Brazil. Forests 2021, 12, 473. [Google Scholar] [CrossRef]
  52. Lu, D.; Chen, Q.; Wang, G.; Moran, E.; Batistella, M.; Zhang, M.; Vaglio Laurin, G.; Saah, D. Aboveground Forest Biomass Estimation with Landsat and LiDAR Data and Uncertainty Analysis of the Estimates. Int. J. For. Res. 2012, 2012, 436537. [Google Scholar] [CrossRef]
  53. Li, C.; Li, Y.; Li, M. Improving Forest Aboveground Biomass (AGB) Estimation by Incorporating Crown Density and Using Landsat 8 OLI Images of a Subtropical Forest in Western Hunan in Central China. Forests 2019, 10, 104. [Google Scholar] [CrossRef]
  54. Fadil, S.; Sebari, I.; Ajerame, M.M.; Ajeddour, R.; El Maghraoui, I.; Ait El Kadi, K.; Zefri, Y.; Jabrane, M. An Integrating Framework for Biomass and Carbon Stock Spatialization and Dynamics Assessment Using Unmanned Aerial Vehicle LiDAR (LiDAR UAV) Data, Landsat Imagery, and Forest Survey Data in the Mediterranean Cork Oak Forest of Maamora. Land 2024, 13, 688. [Google Scholar] [CrossRef]
  55. Liu, Y.; Lei, P.; You, Q.; Tang, X.; Lai, X.; Chen, J.; You, H. Individual Tree Aboveground Biomass Estimation Based on UAV Stereo Images in a Eucalyptus Plantation. Forests 2023, 14, 1748. [Google Scholar] [CrossRef]
  56. Rodríguez-Puerta, F.; Alonso Ponce, R.; Pérez-Rodríguez, F.; Águeda, B.; Martín-García, S.; Martínez-Rodrigo, R.; Lizarralde, I. Comparison of Machine Learning Algorithms for Wildland-Urban Interface Fuelbreak Planning Integrating ALS and UAV-Borne LiDAR Data and Multispectral Images. Drones 2020, 4, 21. [Google Scholar] [CrossRef]
  57. Torre, L. Biomass Estimation Using LiDAR Data. Int. J. Sustain. Energy Plan. Manag. 2018, 17, 79–90. [Google Scholar] [CrossRef]
  58. Domingo, D.; Lamelas, M.; Montealegre, A.; García-Martín, A.; De La Riva, J. Estimation of Total Biomass in Aleppo Pine Forest Stands Applying Parametric and Nonparametric Methods to Low-Density Airborne Laser Scanning Data. Forests 2018, 9, 158. [Google Scholar] [CrossRef]
  59. O’Sullivan, K.S.W.; Vilà-Cabrera, A.; Chen, J.-C.; Greenwood, S.; Chang, C.-H.; Jump, A.S. High Intraspecific Trait Variation Results in a Resource Allocation Spectrum of a Subtropical Pine across an Elevational Gradient. J. Biogeogr. 2022, 49, 668–681. [Google Scholar] [CrossRef]
  60. Freeman, J.S.; Marques, C.M.P.; Carocha, V.; Borralho, N.; Potts, B.M.; Vaillancourt, R.E. Origins and Diversity of the Portuguese Landrace of Eucalyptus Globulus. Ann. For. Sci. 2007, 64, 639–647. [Google Scholar] [CrossRef]
  61. de Oliveira, L.T.; de Carvalho, L.M.T.; Ferreira, M.Z.; Oliveira, T.C.d.A.; Batista, V.T.F.P. Influência da idade na contagem de árvores de Eucalyptus sp. com dados de lidar. CERNE 2014, 20, 557–565. [Google Scholar] [CrossRef]
  62. Sheridan, R.D.; Popescu, S.C.; Gatziolis, D.; Morgan, C.L.S.; Ku, N.-W. Modeling Forest Aboveground Biomass and Volume Using Airborne LiDAR Metrics and Forest Inventory and Analysis Data in the Pacific Northwest. Remote Sens. 2015, 7, 229–255. [Google Scholar] [CrossRef]
  63. Borsah, A.A.; Nazeer, M.; Wong, M.S. LIDAR-Based Forest Biomass Remote Sensing: A Review of Metrics, Methods, and Assessment Criteria for the Selection of Allometric Equations. Forests 2023, 14, 2095. [Google Scholar] [CrossRef]
  64. Yan, Y.; Lei, J.; Huang, Y. Forest Aboveground Biomass Estimation Based on Unmanned Aerial Vehicle–Light Detection and Ranging and Machine Learning. Sensors 2024, 24, 7071. [Google Scholar] [CrossRef]
  65. Silva, C.A.; Klauberg, C.; Hudak, A.T.; Vierling, L.A.; Liesenberg, V.; e Carvalho, S.P.C.; Rodriguez, L.C.E. A Principal Component Approach for Predicting the Stem Volume in Eucalyptus Plantations in Brazil Using Airborne LiDAR Data. For. Int. J. For. Res. 2016, 89, 422–433. [Google Scholar] [CrossRef]
  66. Leite, R.V.; do Amaral, C.H.; Pires, R.d.P.; Silva, C.A.; Soares, C.P.B.; Macedo, R.P.; da Silva, A.A.L.; Broadbent, E.N.; Mohan, M.; Leite, H.G. Estimating Stem Volume in Eucalyptus Plantations Using Airborne LiDAR: A Comparison of Area- and Individual Tree-Based Approaches. Remote Sens. 2020, 12, 1513. [Google Scholar] [CrossRef]
  67. Liu, X.; Hao, Y.; Widagdo, F.R.A.; Xie, L.; Dong, L.; Li, F. Predicting Height to Crown Base of Larix Olgensis in Northeast China Using UAV-LiDAR Data and Nonlinear Mixed Effects Models. Remote Sens. 2021, 13, 1834. [Google Scholar] [CrossRef]
  68. Magalhaes Albuquerque, A.; Debiasi, P.; Lourenço de Lima, T.V.; Hirokawa Higa, G.T.; Pistori, H.; Ferraço Scolforo, H.; Ferreira Silva, T.C.; de Andrade Porto, J.V.; Stape, J.L. Qualitative Forest Inventory in Eucalyptus Plantations Using Unmanned Aerial Vehicles, Multispectral Sensors, and Deep Learning. IEEE Geosci. Remote Sens. Lett. 2024, 21, 1–5. [Google Scholar] [CrossRef]
  69. Talebiesfandarani, S.; Shamsoddini, A. Global-Scale Biomass Estimation Based on Machine Learning and Deep Learning Methods. Remote Sens. Appl. Soc. Environ. 2022, 28, 100868. [Google Scholar] [CrossRef]
  70. Fernandes, P.A.T. Natural Regeneration and Expansion Capacity of Forestry Species: Pinus pinaster and Eucalyptus Globulus. Ph.D. Thesis, Universidade de Lisboa, Lisboa, Portugal, 2016. [Google Scholar]
  71. Martins, M.A.S.; Parente, J.; Keizer, J.J.; Vieira, D.C.S. What Is the Extension of Bench Terrace Construction for Forest Plantations? The Case of North Central Portugal. Land Degrad. Dev. 2023, 34, 5180–5190. [Google Scholar] [CrossRef]
  72. Maia, P.; Keizer, J.; Vasques, A.; Abrantes, N.; Roxo, L.; Fernandes, P.; Ferreira, A.; Moreira, F. Post-Fire Plant Diversity and Abundance in Pine and Eucalypt Stands in Portugal: Effects of Biogeography, Topography, Forest Type and Post-Fire Management. For. Ecol. Manag. 2014, 334, 154–162. [Google Scholar] [CrossRef]
  73. Ferreira-Leite, F.; Ganho, N.; Bento-Gonçalves, A.; Botelho, F. Iberian Atmospheric Dynamics and Large Forest Fires in Mainland Portugal. Agric. For. Meteorol. 2017, 247, 551–559. [Google Scholar] [CrossRef]
  74. Costa, L.; Thonicke, K.; Poulter, B.; Badeck, F.-W. Sensitivity of Portuguese Forest Fires to Climatic, Human, and Landscape Variables: Subnational Differences between Fire Drivers in Extreme Fire Years and Decadal Averages. Reg. Environ. Change 2011, 11, 543–551. [Google Scholar] [CrossRef]
  75. Pedersen, J.S.T.; Dias, L.F.; Kok, K.; van Vuuren, D.; Soares, P.M.; Santos, F.D.; Azevedo, J.C. Increased Policy Ambition Is Needed to Avoid the Effects of Climate Change and Reach Carbon Removal Targets in Portugal. Reg. Environ. Change 2024, 24, 62. [Google Scholar] [CrossRef]
  76. Valente, S.; Coelho, C.; Ribeiro, C.; Marsh, G. Sustainable Forest Management in Portugal: Transition from Global Policies to Local Participatory Strategies. Int. For. Rev. 2015, 17, 368–383. [Google Scholar] [CrossRef]
  77. Tomé, M.; Barreiro, S.; Paulo, J.A.; Faias, S.P. Seleção de Equações Para Estimação de Variáveis Da Árvore Em Inventários Florestais a Realizar Em Portugal. 2007. Available online: https://www.isa.ulisboa.pt/cef/forchange/fctools/sites/default/files/pub/docs/equacoes-if_em_portugal.pdf (accessed on 30 May 2025).
  78. Faias, S.; Morais, P.; Dias, S.; Morão, S.; Salas, R.; Tomé, M.; Ôchoa, P. Uma Rede Europeia de Zonas Piloto Para Avaliação de Critérios e Indicadores de Gestão Florestal Sustentável–Relatório Final Do Projecto FORSEE. 2007. Available online: https://repositorio.ulisboa.pt/bitstream/10400.5/1763/1/REP-RFP1_2007_relatorio_FORSEE.pdf (accessed on 30 May 2025).
  79. Soares, P.; Tomé, M. Analysis of the Effectiveness of Biomass Expansion Factors to Estimate Stand Biomass. In Proceedings of the International Conference on Modeling Forest Production, Vienna, Austria, 9–22 April 2004; pp. 368–374. [Google Scholar]
  80. Pingel, T.J.; Clarke, K.C.; McBride, W.A. An Improved Simple Morphological Filter for the Terrain Classification of Airborne LIDAR Data. ISPRS J. Photogramm. Remote Sens. 2013, 77, 21–30. [Google Scholar] [CrossRef]
  81. Edelsbrunner, H.; Mücke, E.P. Three-Dimensional Alpha Shapes. ACM Trans. Graph. 1994, 13, 43–72. [Google Scholar] [CrossRef]
  82. Ma, Q.; Su, Y.; Guo, Q. Comparison of Canopy Cover Estimations From Airborne LiDAR, Aerial Imagery, and Satellite Imagery. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 4225–4236. [Google Scholar] [CrossRef]
  83. Field, A. Discovering Statistics Using IBM SPSS Statistics; SAGE: Singapore, 2013; ISBN 978-1-4462-7458-3. [Google Scholar]
  84. Mukaka, M. Statistics Corner: A Guide to Appropriate Use of Correlation Coefficient in Medical Research. Malawi Med. J. 2012, 24, 69–71. [Google Scholar] [PubMed]
  85. Graham, M.H. Confronting Multicollinearity in Ecological Multiple Regression. Ecology 2003, 84, 2809–2815. [Google Scholar] [CrossRef]
  86. Zuur, A.F. Mixed Effects Models and Extensions in Ecology with R. Available online: https://biblioteca.uazuay.edu.ec/buscar/item/74750 (accessed on 30 June 2025).
  87. Venables, W.N.; Ripley, B.D. Modern Applied Statistics with S; Statistics and Computing; Springer: New York, NY, USA, 2002; ISBN 978-1-4419-3008-8. [Google Scholar]
  88. Sauerbrei, W.; Royston, P.; Binder, H. Selection of Important Variables and Determination of Functional Form for Continuous Predictors in Multivariable Model Building. Stat. Med. 2007, 26, 5512–5528. [Google Scholar] [CrossRef]
  89. Montgomery, D.C.; Peck, E.A.; Vining, G.G. Introduction to Linear Regression Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2021; ISBN 978-1-119-57875-8. [Google Scholar]
  90. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  91. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning; Springer Series in Statistics; Springer: New York, NY, USA, 2009; ISBN 978-0-387-84857-0. [Google Scholar]
  92. Kuhn, M.; Johnson, K. Applied Predictive Modeling; Springer: New York, NY, USA, 2013; ISBN 978-1-4614-6848-6. [Google Scholar]
  93. R Core Team, R: The R Project for Statistical Computing. Available online: https://www.r-project.org/ (accessed on 27 October 2024).
  94. Breiman, L.; Cutler, A.; Liaw, A.; Wiener, M. R randomForest: Breiman and Cutlers Random Forests for Classification and Regression 2024. Available online: https://cran.r-project.org/web/packages/randomForest/index.html (accessed on 1 July 2025).
  95. Kuhn, M.; Wing, J.; Weston, S.; Williams, A.; Keefer, C.; Engelhardt, A.; Cooper, T.; Mayer, Z.; Kenke, B.; R Core Team; et al. Caret: Classification and Regression Training. 2022. Available online: https://cran.r-project.org/web/packages/caret/index.html (accessed on 13 July 2025).
  96. Aliferis, C.; Simon, G. Overfitting, Underfitting and General Model Overconfidence and Under-Performance Pitfalls and Best Practices in Machine Learning and AI. In Artificial Intelligence and Machine Learning in Health Care and Medical Sciences: Best Practices and Pitfalls; Simon, G.J., Aliferis, C., Eds.; Springer International Publishing: Cham, Switzerland, 2024; pp. 477–524. ISBN 978-3-031-39355-6. [Google Scholar]
  97. James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning: With Applications in R; Springer Texts in Statistics; Springer US: New York, NY, USA, 2021; ISBN 978-1-07-161417-4. [Google Scholar]
  98. Willmott, C.J.; Matsuura, K. Advantages of the Mean Absolute Error (MAE) over the Root Mean Square Error (RMSE) in Assessing Average Model Performance. Clim. Res. 2005, 30, 79–82. [Google Scholar] [CrossRef]
  99. Gregorutti, B.; Michel, B.; Saint-Pierre, P. Correlation and Variable Importance in Random Forests. Stat. Comput. 2017, 27, 659–678. [Google Scholar] [CrossRef]
  100. Puliti, S.; Breidenbach, J.; Schumacher, J.; Hauglin, M.; Klingenberg, T.F.; Astrup, R. Above-Ground Biomass Change Estimation Using National Forest Inventory Data with Sentinel-2 and Landsat. Remote Sens. Environ. 2021, 265, 112644. [Google Scholar] [CrossRef]
  101. da Silva, V.S.; Silva, C.A.; Mohan, M.; Cardil, A.; Rex, F.E.; Loureiro, G.H.; de Almeida, D.R.A.; Broadbent, E.N.; Gorgens, E.B.; Dalla Corte, A.P.; et al. Combined Impact of Sample Size and Modeling Approaches for Predicting Stem Volume in Eucalyptus Spp. Forest Plantations Using Field and LiDAR Data. Remote Sens. 2020, 12, 1438. [Google Scholar] [CrossRef]
  102. Liu, M.; Hu, S.; Ge, Y.; Heuvelink, G.B.M.; Ren, Z.; Huang, X. Using Multiple Linear Regression and Random Forests to Identify Spatial Poverty Determinants in Rural China. Spat. Stat. 2021, 42, 100461. [Google Scholar] [CrossRef]
  103. Maesano, M.; Santopuoli, G.; Moresi, F.; Matteucci, G.; Lasserre, B.; Scarascia Mugnozza, G. Above Ground Biomass Estimation from UAV High Resolution RGB Images and LiDAR Data in a Pine Forest in Southern Italy. iForest 2022, 15, 451–457. [Google Scholar] [CrossRef]
  104. Domingo, D.; Lamelas-Gracia, M.T.; Montealegre-Gracia, A.L.; de la Riva-Fernández, J. Comparison of Regression Models to Estimate Biomass Losses and CO2 Emissions Using Low-Density Airborne Laser Scanning Data in a Burnt Aleppo Pine Forest. Eur. J. Remote Sens. 2017, 50, 384–396. [Google Scholar] [CrossRef]
  105. Zhao, K.; Popescu, S.; Nelson, R. Lidar Remote Sensing of Forest Biomass: A Scale-Invariant Estimation Approach Using Airborne Lasers. Remote Sens. Environ. 2009, 113, 182–196. [Google Scholar] [CrossRef]
  106. Mascaro, J.; Asner, G.P.; Muller-Landau, H.C.; van Breugel, M.; Hall, J.; Dahlin, K. Controls over Aboveground Forest Carbon Density on Barro Colorado Island, Panama. Biogeosciences 2011, 8, 1615–1629. [Google Scholar] [CrossRef]
  107. Smith, P.F.; Ganesh, S.; Liu, P. A Comparison of Random Forest Regression and Multiple Linear Regression for Prediction in Neuroscience. J. Neurosci. Methods 2013, 220, 85–91. [Google Scholar] [CrossRef]
  108. Lundberg, S.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions 2017. Available online: https://proceedings.neurips.cc/paper_files/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf (accessed on 30 May 2025).
  109. Chai, T.; Draxler, R.R. Root Mean Square Error (RMSE) or Mean Absolute Error (MAE)? Geosci. Model Dev. Discuss. 2014, 7, 1525–1534. [Google Scholar] [CrossRef]
Figure 1. Location of the study areas in mainland Portugal and respective sample stands. Images from Google Earth (accessed on 20 February 2025) and obtained from the National Geographic Information System (SNIG), OrtoSat true color image of mainland Portugal 2023.
Figure 1. Location of the study areas in mainland Portugal and respective sample stands. Images from Google Earth (accessed on 20 February 2025) and obtained from the National Geographic Information System (SNIG), OrtoSat true color image of mainland Portugal 2023.
Land 14 01460 g001
Figure 2. Terrain normalization process applied to a point cloud from one of the surveyed stands: (a) original point cloud; (b) classified ground points (green) and nonground points (purple); (c) normalized point cloud. Colors represent the altitude/height of each point.
Figure 2. Terrain normalization process applied to a point cloud from one of the surveyed stands: (a) original point cloud; (b) classified ground points (green) and nonground points (purple); (c) normalized point cloud. Colors represent the altitude/height of each point.
Land 14 01460 g002
Figure 3. Visual representation of height percentiles and excluded points (below two meters) in a Eucalyptus globulus stand. Transition areas between each color represents a height value: black—red, minimum height above two meters; red—green, height at the 25th percentile; green—dark blue, height at the 40th percentile; dark blue—light blue, mean height; light blue—purple, height at the 50th percentile; purple—yellow, height at the 75th percentile; yellow—brown: height at the 90th percentile; brown—green: height at the 95th percentile.
Figure 3. Visual representation of height percentiles and excluded points (below two meters) in a Eucalyptus globulus stand. Transition areas between each color represents a height value: black—red, minimum height above two meters; red—green, height at the 25th percentile; green—dark blue, height at the 40th percentile; dark blue—light blue, mean height; light blue—purple, height at the 50th percentile; purple—yellow, height at the 75th percentile; yellow—brown: height at the 90th percentile; brown—green: height at the 95th percentile.
Land 14 01460 g003
Figure 4. Observed and predicted total biomass (a,b) above ground biomass (AGB) and their respective residual plots (c,d) for Eucalyptus globulus data using multiple linear regression.
Figure 4. Observed and predicted total biomass (a,b) above ground biomass (AGB) and their respective residual plots (c,d) for Eucalyptus globulus data using multiple linear regression.
Land 14 01460 g004
Figure 5. Observed and predicted total biomass (a,b) above ground biomass (AGB) and their respective residual plots (c,d) for Eucalyptus globulus data using a random forest.
Figure 5. Observed and predicted total biomass (a,b) above ground biomass (AGB) and their respective residual plots (c,d) for Eucalyptus globulus data using a random forest.
Land 14 01460 g005
Figure 6. Observed and predicted total biomass (a,b) above ground biomass (AGB) and their respective residual plots (c,d) for Pinus pinaster data using multiple linear regression.
Figure 6. Observed and predicted total biomass (a,b) above ground biomass (AGB) and their respective residual plots (c,d) for Pinus pinaster data using multiple linear regression.
Land 14 01460 g006
Table 1. Equations for calculating Pinus pinaster and Eucalyptus globulus biomass per compartment. AGB: above ground biomass; TB: total biomass; DBH: diameter at breast height; Ht: total tree height.
Table 1. Equations for calculating Pinus pinaster and Eucalyptus globulus biomass per compartment. AGB: above ground biomass; TB: total biomass; DBH: diameter at breast height; Ht: total tree height.
CompartmentPinus pinasterEucalyptus globulus
Trunk w s = 0.0146 d b h 1.94687 h t 1.106577 w s = 0.009964 d b h 1.780459 h t 1.369618 , if h d o m > 10.7100
w s = 0.009964 d b h β 2 h t 1.369618 , if h d o m ≤ 10.7100
β 2 = h d o m 0.70909 + 0.627861 h d o m
Bark w b = 0.0114 d b h 1.8728 h t 0.6694 w b = 0.000594 d b h 2.3794759 h t 1.0849888 , if h d o m > 18.2691
w b = 0.000594 d b h β 2 h t 1.0849888 , if h d o m ≤ 18.2691
β 2 = h d o m 0.69951 + 0.45855 h d o m
Branches w b r = 0.00308 d b h 2.757606 h t d b h 0.39381 w b r = 0.095603 d b h 1.674653 h t d b h 0.85073
Foliage w l = 0.09980 d b h 1.392518 h t d b h 0.71962
AGB w a = w s + w b r + w l w a = w s + w b r + w l
Root w r = 0.2756 w a w r = 0.2487 w a
TB w t = w a + w r w t = w a + w r
Table 2. UAV-based LiDAR point cloud metrics derived for each stand.
Table 2. UAV-based LiDAR point cloud metrics derived for each stand.
VariablesDescription
H v a r , H s t d , H a m p Height dispersion metrics: variance, standard deviation, and amplitude (max. − min.)
H P 25 , H P 40 , H P 50 , H m e a n , H P 75 , H P 90 , H P 95 , H m a x Height percentiles and statistics at 25th, 40th, 50th (median), mean, 75th, 90th, 95th, and maximum height
C C D m i n , C C D P 25 , C C D P 40 , C C D P 50 , C C D m e a n , C C D P 75 , C C D P 90 , C C D P 95 Proportion of area covered by the vertical projection of the tree canopy above a given at different percentiles
A m i n , A P 25 , A P 40 , A P 50 , A m e a n , A P 75 , A P 90 , A P 95 Horizontal area covered by canopy at different height percentiles
V m i n , V P 25 , V P 40 , V P 50 , V m e a n , V P 75 , V P 90 , V P 95 Estimated canopy volume, computed at different height percentiles
Table 3. Distribution of metrics for the evaluated Eucalyptus globulus and Pinus pinaster stands. SD: standard deviation; P25: value at the 25th percentile; P75: value at the 75th percentile; TB: total biomass; AGB: above ground biomass.
Table 3. Distribution of metrics for the evaluated Eucalyptus globulus and Pinus pinaster stands. SD: standard deviation; P25: value at the 25th percentile; P75: value at the 75th percentile; TB: total biomass; AGB: above ground biomass.
ParameterSpeciesMean ± SDMin.P25MedianP75Max.
TB
(Mg ha−1)
E. globulus42.14 ± 25.213.4124.3640.0959.83107.72
P. pinaster416.30 ± 189.23112.97289.10452.01576.60680.82
AGB
(Mg ha−1)
E. globulus33.74 ± 20.192.7319.5132.1047.9186.26
P. pinaster403.62 ± 185.79105.34279.57440.67561.46664.17
Max. height (m)E. globulus14.93 ± 3.536.6112.4714.4817.3521.65
P. pinaster18.69 ± 4.789.6615.9018.6622.6625.99
Canopy
density (%)
E. globulus20.71 ± 10.141.0912.5519.8028.7445.19
P. pinaster37.62 ± 19.182.8627.5436.3347.1467.85
Crown area
(m2 ha−1)
E. globulus3318.07 ± 1493.64139.672336.563305.694293.846672.64
P. pinaster5059.16 ± 2554.35328.893907.074605.556639.569444.33
Volume
(m3 ha−1)
E. globulus3198.48 ± 2987.507.031429.901927.564415.9113,602.75
P. pinaster3921.92 ± 3009.5453.661412.933630.604480.6011,199.45
Table 4. Correlation coefficients (r) and variance inflation factors (VIF) for predictor variables of aboveground biomass (AGB) and total biomass (TB) in Eucalyptus globulus and Pinus pinaster. VIF values are identical for AGB and TB. H: height; A: projected crown area per hectare (m2 ha−1); V: volume per hectare (m3 ha−1); CCD: canopy cover density.
Table 4. Correlation coefficients (r) and variance inflation factors (VIF) for predictor variables of aboveground biomass (AGB) and total biomass (TB) in Eucalyptus globulus and Pinus pinaster. VIF values are identical for AGB and TB. H: height; A: projected crown area per hectare (m2 ha−1); V: volume per hectare (m3 ha−1); CCD: canopy cover density.
VariablesEucalyptus globulusPinus pinaster
AGBTBBothAGBTBBoth
rrVIF1VIF2rrVIF1VIF2
H v a r 0.5420.5423.171.200.4300.3333.561.62
H s t d 0.5420.5420.4300.333
H a m p 0.8550.8550.9150.903
H P 25 0.9250.9256.621.680.9270.952
H P 40 0.9430.9430.9390.964
H P 50 0.9440.9440.9390.964
H m e a n 0.9490.9490.9390.964
H P 75 0.9370.9370.9270.939
H P 90 0.9240.9240.9270.939
H P 95 0.9150.9150.9270.939
H m a x 0.8530.8538.490.9030.9152.551.75
C C D m i n 0.5130.51313.674.710.5150.527126.35
C C D P 25 0.5480.5480.4550.491
C C D P 40 0.5510.5510.3940.442
C C D P 50 0.5590.5590.3940.442
C C D m e a n 0.5810.5810.4550.491
C C D P 75 0.5710.5710.3940.442
C C D P 90 0.6050.60566.700.3090.345
C C D P 95 0.5780.57838.443.020.1640.200
A m i n 0.5640.5640.4300.455219.27
A P 25 0.5210.5210.3580.370
A P 40 0.5050.5050.3450.358
A P 50 0.5100.5100.3450.358
A m e a n 0.5010.5010.4300.442
A P 75 0.4490.4490.3450.358
A P 90 0.4500.4500.2610.273
A P 95 0.4370.43718.650.2610.27320.69
V m i n 0.3020.3020.5030.442
V P 25 0.3730.3730.4550.41814.761.52
V P 40 0.3720.3720.4180.382
V P 50 0.3840.38415.182.590.4180.382
V m e a n 0.3950.3950.5270.491
V P 75 0.4130.4130.3700.345
V P 90 0.4590.4590.3210.309
V P 95 0.5050.5050.4300.418
Table 5. Regression performance metrics for above-ground biomass (AGB) and total biomass (TB) in Eucalyptus globulus stands. MLR: multiple linear regression; RF: random forest; R2: coefficient of determination; RMSE: root mean square error; MAE: mean absolute error.
Table 5. Regression performance metrics for above-ground biomass (AGB) and total biomass (TB) in Eucalyptus globulus stands. MLR: multiple linear regression; RF: random forest; R2: coefficient of determination; RMSE: root mean square error; MAE: mean absolute error.
ModelDatasetParameterAGBTB
MLRTrainR20.890.89
RMSE (Mg ha−1)6.708.37
MAE (Mg ha−1)5.366.69
TestR20.730.73
RMSE (Mg ha−1)10.4913.10
MAE (Mg ha−1)8.0310.03
RFTrainR20.960.96
RMSE (Mg ha−1)4.695.71
MAE (Mg ha−1)3.614.43
TestR20.720.73
RMSE (Mg ha−1)10.4212.92
MAE (Mg ha−1)7.959.76
Table 6. Regression performance metrics for above-ground biomass (AGB) and total biomass (TB) in Pinus pinaster stands. R2: coefficient of determination; RMSE: root mean square error; MAE: mean absolute error.
Table 6. Regression performance metrics for above-ground biomass (AGB) and total biomass (TB) in Pinus pinaster stands. R2: coefficient of determination; RMSE: root mean square error; MAE: mean absolute error.
DatasetParameterAGBTB
TrainR20.910.91
RMSE (Mg ha−1)56.4358.53
MAE (Mg ha−1)51.8053.31
TestR20.910.91
RMSE (Mg ha−1)89.1091.59
MAE (Mg ha−1)79.3781.33
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ferreira, L.; Sandim, A.S.d.A.; Lopes, D.A.; Sousa, J.J.; Lopes, D.M.M.; Silva, M.E.C.M.; Pádua, L. Estimating Biomass in Eucalyptus globulus and Pinus pinaster Forests Using UAV-Based LiDAR in Central and Northern Portugal. Land 2025, 14, 1460. https://doi.org/10.3390/land14071460

AMA Style

Ferreira L, Sandim ASdA, Lopes DA, Sousa JJ, Lopes DMM, Silva MECM, Pádua L. Estimating Biomass in Eucalyptus globulus and Pinus pinaster Forests Using UAV-Based LiDAR in Central and Northern Portugal. Land. 2025; 14(7):1460. https://doi.org/10.3390/land14071460

Chicago/Turabian Style

Ferreira, Leilson, André Salgado de Andrade Sandim, Dalila Araújo Lopes, Joaquim João Sousa, Domingos Manuel Mendes Lopes, Maria Emília Calvão Moreira Silva, and Luís Pádua. 2025. "Estimating Biomass in Eucalyptus globulus and Pinus pinaster Forests Using UAV-Based LiDAR in Central and Northern Portugal" Land 14, no. 7: 1460. https://doi.org/10.3390/land14071460

APA Style

Ferreira, L., Sandim, A. S. d. A., Lopes, D. A., Sousa, J. J., Lopes, D. M. M., Silva, M. E. C. M., & Pádua, L. (2025). Estimating Biomass in Eucalyptus globulus and Pinus pinaster Forests Using UAV-Based LiDAR in Central and Northern Portugal. Land, 14(7), 1460. https://doi.org/10.3390/land14071460

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop