Estimation of Individual Tree-Level Structural and Biochemical Traits for Seabuckthorn Forests in Lhasa Valley Plain by Coupling UAV-Based LiDAR and Multispectral Images with N-PROSAIL Model

Xue, Wenkai; Zhou, Kai; Dunzhu, Pubu; Xing, Zhen; Wu, Yunhua; Lin, Ling; Shen, Xin; Cao, Lin

doi:10.3390/rs18060909

Open AccessArticle

Estimation of Individual Tree-Level Structural and Biochemical Traits for Seabuckthorn Forests in Lhasa Valley Plain by Coupling UAV-Based LiDAR and Multispectral Images with N-PROSAIL Model

by

Wenkai Xue

^1,†,

Kai Zhou

^1,†

,

Pubu Dunzhu

²,

Zhen Xing

³

,

Yunhua Wu

²,

Ling Lin

³,

Xin Shen

¹ and

Lin Cao

^1,*

¹

Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China

²

Forestry Survey and Planning Institute of Tibet Autonomous Region, Lhasa 850003, China

³

College of Resources and Environment, Xizang Agricultural and Animal Husbandry University, Linzhi 860000, China

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Remote Sens. 2026, 18(6), 909; https://doi.org/10.3390/rs18060909

Submission received: 13 January 2026 / Revised: 23 February 2026 / Accepted: 10 March 2026 / Published: 16 March 2026

Download

Browse Figures

Versions Notes

Highlights

What are the main findings?

By integrating the tree–shrub separation algorithm with point cloud segmentation or marker-controlled watershed algorithms, more accurate single-tree segmentation of seabuckthorn natural forests was achieved.
Combining UAV LiDAR, multispectral data, and the N-PROSAIL model, a seabuckthorn DBH estimation model was established, and an LNC prediction system was constructed.

What are the implications of the main findings?

Provided a systematic process method for trait extraction of natural seabuckthorn forests, reducing fieldwork efforts for seabuckthorn.
Efficient and accurate extraction of single-tree phenotypic traits of seabuckthorn natural forests through remote sensing is crucial for germplasm resource development, precision forestry, and ecological restoration.

Abstract

The accurate and efficient extraction of individual tree phenotypic traits for seabuckthorn (Hippophae rhamnoides L.) in natural forests is crucial for germplasm exploration, precision silviculture, and ecological restoration. This study extracted structural and biochemical traits of seabuckthorn in Tibet’s Lhasa valley using Unmanned aerial vehicle (UAV) LiDAR, multispectral imagery, and the N-PROSAIL model. Firstly, building on a classification conducted through multi-scale spatial analysis and hierarchical clustering with dynamic thresholds, shrub interference was effectively reduced, thereby improving the accuracy of individual tree segmentation. Tree height and crown width were derived from the segmentation results, and a DBH estimation model was developed using handheld LiDAR data. Finally, leaf nitrogen content was mapped within canopies using random forest combined with the N-PROSAIL model and nitrogen reference data. The results demonstrated that the optimized segmentation method successfully extracted structural traits (F1 = 84.21%). Tree height was accurately estimated (R² = 0.814, RMSE = 0.580 m), and the DBH prediction model performed satisfactorily (R² = 0.779, RMSE = 1.725 cm). The random forest model also effectively estimated leaf nitrogen content (R² = 0.680, RMSE = 2.074 mg/g).

Keywords:

seabuckthorn forests; structural and biochemical traits; UAV; LiDAR; N-PROSAIL model; multi-spectral imageries

1. Introduction

Forest ecosystems are fundamental to human society and the living world, which depends on them for survival and growth [1,2]. Forests serve as essential reservoirs of stable organic carbon, genetic diversity, and energy resources [3,4,5], while playing a decisive role in maintaining ecological balance [6,7], supporting environmental conservation, and sustaining the fundamental conditions for human survival and development [8,9]. The natural environmental conditions and effects provided by forest ecosystems to sustain human life are called forest ecosystem services, which are formed by the structure and processes of forest ecosystems [10,11,12]. Forest ecosystem services are divided into seven categories: water conservation, soil conservation, carbon sequestration and oxygen release, nutrient accumulation, environmental purification, biodiversity conservation and forest recreation [11,12,13,14,15]. In recent years, the theory of “four reservoirs”, including water [13], food [14], money [15] and carbon [11], has further enriched our understanding of forest ecosystem services: forests support the water cycle (water reservoir) through water conservation, regulating the water cycle and supporting forest recreation. Forests regulate the water cycle through water harvesting (water bank), support agriculture and contribute to food security (food bank), provide economic values such as timber and ecotourism (money bank), and mitigate global warming by absorbing and storing carbon dioxide (carbon bank). For a long time, the view of forest ecosystem services as a source of public goods has led to a widespread undervaluation of their benefits in terms of forest resource utilization, as well as a lack of proper recognition of their importance [16]. This has resulted in the predatory exploitation of forest resources, severely damaging forest ecosystems and, in turn, contributing to increasingly critical ecological issues such as climate warming, biodiversity loss, and natural disasters. These ecological issues pose a great threat to human life and production. History has proved that the rise and fall of forest ecosystems are closely related to the sustainable development of human production and life [17,18,19], and the theory of “four reservoirs” emphasizes the central position of forest ecosystems in maintaining ecological security and socio-economic development. The protection and development of forest ecosystems is not only a necessity for environmental protection [20], but also key to achieving long-term prosperity and stability in human society [21,22].

Xizang is known for its extreme natural environment. The thin air and low oxygen conditions brought by the high altitude, together with the significant temperature difference between day and night, pose a serious challenge to the survival of plants and animals [23]. Mozhugongka County, where the study area is located, is part of the Lhasa Valley Plain and has a typical plateau temperate semi-arid monsoon climate. The high altitude results in a cold and dry climate with low oxygen content in the air. The annual precipitation is about 515.9 mm, mainly concentrated from June to September, while precipitation is scarce in winter. The annual sunshine hours reach up to 2813.5 h and the frost-free period is about 90 days. Strong windy weather is common in winter and spring, accelerating water evaporation and increasing the risk of soil erosion. Such extreme climatic conditions pose a serious challenge to the local ecosystem [24,25].

Seabuckthorn (Hippophae rhamnoides L.), as a drought-resistant, cold-resistant and sand-resistant plant [26], has significant ecological restoration functions. Seabuckthorn is botanically classified as a deciduous shrub or small tree, exhibiting remarkable morphological plasticity in response to environmental conditions. In our study area, it occurs both as low shrub thickets and as arborescent individuals exceeding 3 m in height with distinct trunk development. This growth-form continuum is central to our objective of extracting phenotypic traits from tree-like individuals to support germplasm selection and breeding programs. Seabuckthorn has a well-developed root system, which can effectively fix the soil [16], reduce soil erosion, and prevent desertification and land degradation [27]. Its drought-resistant and barrenness-resistant characteristics mean that it is able to grow under harsh environmental conditions and make it an important ecological restoration plant. Seabuckthorn also has strong nitrogen-fixing ability and can grow on wasteland, making it a pioneer plant for windy and sandy land [28]. Therefore, seabuckthorn has a strong ability to improve the ecological environment and create a biological chain. The nitrogen-fixing ability of seabuckthorn effectively contributes to soil fertilization and yield improvement. Its root nodules are formed through the infection of the roots by the endophytic bacterium Frankia [29], and exhibit a higher nitrogen fixation capacity compared to leguminous plants [30]. Through the nitrogen fixation of the seabuckthorn root system, soil organic matter and nitrogen content are notably increased [31], while the presence of and improvement in organic matter can reduce soil bulk weight and increase porosity. This is more conducive to improving the water storage capacity of the soil, and thus improves the soil’s geotechnical capacity [32]. Therefore, planting seabuckthorn is considered an effective means to solve the problems of soil erosion and sanding [33]. In China, the government attaches great importance to ecological protection and restoration. In the ecological management of the western and highland areas, seabuckthorn has become an important plant in the implementation of ecological restoration projects [34]. Policies such as “returning farmland to forests and grasslands”, natural forest protection projects and desertification prevention and control projects implemented by the state have mentioned seabuckthorn as a key plant for greening and ecological restoration. In recent years, seabuckthorn has been widely planted in ecological restoration projects in Xizang and other places to restore grassland, woodland and desertified areas. Through the support of these policies, the scale of seabuckthorn cultivation has gradually expanded and begun to drive the dual development of the local ecology and economy. The reason for choosing the river valley plain of Mozhugongka County as the study area is that the diversity of seabuckthorn forms in this area increases the generality and representativeness of the results [35]. The ecological characteristics and expression of seabuckthorn in this area can provide valuable reference cases for other, similar alpine, arid and semi-arid regions in China and worldwide [36].

Unmanned aerial vehicle (UAV) remote sensing technology, compared with traditional satellite remote sensing, is more flexible and efficient, especially in ecological environment monitoring in areas with poor natural environments and sparse human trails [37]. Compared with satellite-borne remote sensing data, UAV remote sensing data are more refined [38], and can be better applied to individual tree-level research, especially on seabuckthorn (with different morphologies, such as shrubs, small trees, tall trees, etc.). UAV remote sensing shows the potential ability to extract the position, structure, and physiological and biochemical traits of seabuckthorn with high precision [39] compared with ground-based remote sensing technology (e.g., ground-based LIDAR). UAV remote sensing can be used to more widely and quickly gather heterogeneous data from multiple sources [40,41,42]. With their ease of deployment and operation, drones can quickly cover a designated area and acquire high-resolution images instantly, regardless of ground conditions. Equipped with advanced multispectral (or hyperspectral) and LiDAR sensors, drones are able to provide detailed surface information, such as land cover types, vegetation growth and health, and soil erosion. UAV remote sensing is particularly advantageous in arid and semi-arid regions [20,43,44]. It can monitor the progress of desertification and land degradation with higher efficiency and frequency, as well as the process of plant recovery in real time, and has potential in monitoring natural seabuckthorn forests. The high-precision data collected by drones can make it possible to more precisely assess seabuckthorn vegetation cover, its health status and its relationship with soil and climate factors. In addition, it can quantify the impact of seabuckthorn planting on soil and water conservation, ecological restoration and land productivity enhancement, which provides strong data support for the effectiveness of ecological protection measures. In conclusion, the convenient deployment of UAV remote sensing and the high accuracy of the multi-source heterogeneous data it provides not only improve the efficiency of small- and medium-scale monitoring, but also provide technical support for optimizing ecological restoration strategies. The introduction of UAV remote sensing technology marks a new era of more efficient and accurate ecological environment monitoring.

Taking an individual tree as the basic unit of a forest and extracting its structural traits is the focus of forest resources investigation, and the most important traits of the individual tree structure are the tree height and crown [45]. Tree height is the main index for forest resource investigation and monitoring, and accurately obtaining the tree height of forest trees is of great significance to the management of forest resources [46]. Tree height can show the biological characteristics and growth capacity of trees, and is a critical indicator of tree growth [47]. Tree height is also an indicator for determining the stand quality of a community, which determines the biomass of the community. Tree crowns are important for the survival and growth of trees, and they are indispensable for physiological activities, such as respiration, photosynthesis, and transpiration and can directly reflect the growth and dynamic changes in individual trees. Therefore, obtaining accurate structural information of individual trees can facilitate understanding of the competitive relationship among trees and allow for the monitoring and prediction of tree growth, helping to determine the health status and biomass (e.g., carbon storage) of the trees [48].

The existing research on the use of remote sensing to monitor seabuckthorn is still in the preliminary stage, with very limited relevant research, a fragmented research scope, and a lack of systematicity and continuity [24,49,50]. Several studies focus on classifying seabuckthorn as a shrubland component at the patch or community scale [51], while others extract parameters from tree-form individuals in simplified, managed settings like plantations or shelterbelts [52]. This highlights a critical gap, where no complete technical system exists that is capable of operationally distinguishing, segmenting, and phenotyping the intermixed growth forms within natural, structurally complex seabuckthorn forests. A complete technical system, from data acquisition and processing methods to information extraction, has not yet been formed, especially for the automatic extraction of individual wood-scale traits, which is almost a blank state. Therefore, this study aims to develop a suite of individual tree remote sensing monitoring solutions that are applicable to seabuckthorn. Systematic experiments and a full-process validation are conducted, covering multi-source remote sensing data acquisition, preprocessing, and feature extraction, as well as individual tree crown segmentation and trait prediction. This work aims to address the research gap in this field and to provide a repeatable, transferable technical reference for the standardized and operational application of remote sensing in seabuckthorn monitoring.

This study develops an integrated technical solution for the remote sensing-based monitoring of seabuckthorn at the individual tree level, utilizing UAV-based multispectral and LiDAR data. The specific research objectives include (1) proposing a method for precise tree-shrub classification from LiDAR point clouds through the integration of multi-scale spatial analysis and hierarchical clustering, supported by geometric constraints, density features, and dynamic threshold optimization; (2) extracting tree height and crown width from the individual tree segmentation results, and using hand-held LiDAR to measure Diameter at Breast Height (DBH) for constructing DBH estimation models; (3) predicting leaf nitrogen content at the individual tree level by combining multispectral data with spectra simulations from the N-PROSAIL model.

2. Materials and Methods

2.1. Study Area and Technology Workflow

The study area was located in Mozhugongka County, Lhasa City, within the Xizang Autonomous Region. This area is located in central Xizang, on the west side of the Mira Mountains, and encompasses part of the middle and upper reaches of the Lhasa River Valley, forming an important section of the mid-Yarlung Zangbo River valley system. The region belongs to the Lhasa Valley Plain and falls within the plateau temperate semi-arid monsoon climate zone.

Due to its high elevation, the climate of Mozhugongka County is characterized by cold, arid conditions with thin air. Winters and springs are windy, and the annual temperature variation is small, but the diurnal temperature range is large. The area had an annual frost-free period of approximately 90 days, with sunshine hours exceeding 2813 per year. Annual precipitation was about 515 mm, predominantly occurring between June and September. The geographical coordinates of the study area ranged from 91.74°E to 91.77°E and 29.82°N to 29.84°N. The study area comprises natural, monodominant stands of seabuckthorn. The vegetation structure is characterized by a continuous morphological gradient within the species, ranging from low, multi-stemmed shrubs to distinct, single-stemmed arborescent individuals.

A comprehensive field survey was conducted on 120 sample trees. Tree height was measured using a Vertex IV ultrasonic hypsometer (Haglöf, Switzerland), diameter at breast height (DBH) was obtained with a diameter tape, and precise individual tree locations were recorded using a Qianxun SR6 RTK system (Qianxun SI, Chengdu, China). Additionally, three-dimensional point cloud data of the sample plot were collected using a LiGrip H120 handheld laser scanner (Green Valley Technology Co., Ltd., Beijing, China). Boundary markers and closed scanning paths were employed to ensure data completeness and spatial alignment accuracy. Leaf samples were collected from 37 representative trees. Leaves were then dried at 80 °C for 48 h, after which the dried seabuckthorn leaves were ground, sieved (0.25 mm), and analyzed with the micro-Kjeldahl method [53] to determine leaf nitrogen content (LNC) (mg/g). This integrated dataset provides a multi-source foundation for subsequent analyses, encompassing tree structural parameters, spatial distribution, three-dimensional morphology, and biochemical traits.

The technical workflow was structured into four key steps, as outlined below. Firstly, data were collected sequentially using three systems: UAV LiDAR data acquired with a DJI M350 RTK UAV (DJI, Shenzhen, China) equipped with a Huace AA-10 LiDAR sensor (Huace, Beijing, China); handheld LiDAR data obtained using a Ligrip H120 scanner; and multispectral imagery captured by a DJI Mavic 3M UAV (DJI, Shenzhen, China). The UAV and handheld LiDAR data were preprocessed through stitching, cropping, and denoising, while the multispectral imagery underwent stitching, cropping, radiometric calibration, and atmospheric correction. Next, tree and shrub point clouds were separated. Based on the preprocessed UAV LiDAR data, a Cloth Simulation Filter (CSF) [54] was applied to isolate ground points and extract above-ground vegetation. The normalized point cloud was then processed at multiple scales to identify candidate tree points, which were subsequently clustered using the Ordering Points to identify the clustering structure algorithm (OPTICS) [55] and finalize the separation. Subsequently, individual tree segmentation was performed. From the separated tree point cloud, a Digital Elevation Model (DEM) and a Digital Surface Model (DSM) were generated by interpolation, and a Canopy Height Model (CHM) was derived. A marker-controlled watershed algorithm [17] was applied to the CHM for crown segmentation, while a hierarchical clustering algorithm was used on the normalized point cloud for direct point-based segmentation. Finally, individual tree traits were extracted. Tree height and crown width were derived from the UAV LiDAR data, and DBH was obtained from the handheld LiDAR data. Predictive models were developed to estimate these structural parameters directly from the UAV LiDAR data. For biochemical traits, LNC was retrieved by integrating the segmented crown boundaries with the preprocessed multispectral data, supported by both field measurements and N-PROSAIL model simulations.

2.2. UAV Data Acquisition and Pre-Processing

This study was carried out within the natural seabuckthorn forests in Mozhugongka County, Lhasa City, Xizang Autonomous Region. The LiDAR point cloud data acquisition was carried out using a DJI M350 RTK UAV equipped with a Huace AA-10 LiDAR sensor. A total of nine flights were carried out, with the flight strip width set to 180 m, a flight speed of 8 m/s, and a relative altitude of 190 m (the ground elevation of the survey area ranges from 3800 m to 3950 m). The overlapping degree of heading sideways was controlled at 30–40%, the laser emission frequency was 100 kHz, the scanning speed was 90 RPM, and the starting and stopping scanning angles were set at 135–225°, so that the density of the point cloud acquired finally reached 80–120 points/m². The LiDAR acquisition equipment is shown in Figure 1a.

The multispectral data were collected by a DJI Mavic 3M multispectral UAV. The central wavelength and bandwidth information of each band are shown in Figure 1, and the sensor spectral response curve is shown in Figure 1. The preset spatial resolution of the mission was 0.12 m. The shutter-priority mode was used, with a range of shutter speeds from 1/1200 to 1/2000, and the ISO setting was set to automatic. The overlap rates of heading and sidetracking were set to 80% and 75%, respectively, and the flight speed was 14 m/s.

The point cloud data were subsequently subjected to pre-processing steps, such as splicing, cropping and denoising. The multispectral data were processed with radiometric calibration, smoothing and denoising, and precise geo-alignment. The multispectral UAV acquisition equipment is shown in Figure 1a.

2.3. Individual Tree Segmentation Methods

Addressing the significant interference caused by the mixed growth of trees and shrubs and their distinct morphological disparities in individual tree segmentation within natural seabuckthorn forests, this study adopts a two-stage strategy [56]. First, a high-precision separation of trees and shrubs is performed on the point cloud data by integrating multi-scale dynamic thresholds, using hierarchical clustering techniques to accurately extract tree points. Subsequently, individual tree segmentation is implemented using both point cloud-based clustering and marker-controlled watershed algorithms on the purified tree point cloud to enhance the overall segmentation accuracy.

This study employed a vegetation point cloud classification method that integrates multi-scale spatial analysis with hierarchical clustering. The original LiDAR point cloud first underwent preprocessing, which involved extracting the 3D coordinates of each point and removing invalid data points using a dual-threshold filtering approach. Subsequently, the (CSF) algorithm was applied to separate ground points from non-ground points, resulting in a purified above-ground vegetation dataset for the subsequent tree–shrub classification. For the obtained non-ground point cloud, this study introduced a tree point detection method that combines multi-scale spatial analysis with a dynamic height threshold. Specifically, for each point in the cloud, the relative height and point cloud density features were calculated using three different neighborhood radii: 3.0 m, 5.0 m, and 7.0 m. To capture cross-scale spatial features ranging from individual tree crown details to the broader tree group environment, different spatial resolutions were applied: 3.0 m for resolving individual tree crowns or dense shrubs, 5.0 m for analyzing the local structure of tree clusters or the canopy of larger individual trees, and 7.0 m for characterizing tree groups, forest stand edges, or forest canopy gaps. The process is illustrated in Figure 2. The formulas for calculating the relative height

Δ h_{i}

and the point cloud density

ρ_{r}

are listed as follows:

Δ h_{i} = z_{i} - m a x (z_{j} | p_{j} \in N_{r} (p_{i}))

(1)

ρ_{r} = \frac{|N_{r} (p_{i})|}{π r^{2}}

(2)

where

z_{i}

is the height of point

p_{i}

,

N_{r} (p_{i})

denotes the set of points within the neighborhood of radius r centered at

p_{i}

, and

| N_{r} (p_{i}) |

is the number of points within this neighborhood.

The dynamic threshold was set with a base height threshold of 1.5 m, incorporating a density-adaptive adjustment mechanism. In sparse regions (low density), the threshold was appropriately increased to 1.8 m to suppress misclassification. The formula for calculating the dynamic threshold

ρ_{t r e e}

is

ρ_{t r e e} = 1.5 + 0.3 \times (1 - ρ_{r})

(3)

This formula uses a density penalty term (coefficient 0.3) to reduce misjudgments in low-density areas. The density penalty coefficient was determined through an empirical optimization process. We evaluated a range of values on a representative subset of the study area, with each candidate coefficient assessed via visual inspection of the resulting separation accuracy. A value of 0.3 was optimally selected, as it effectively balanced sensitivity and specificity. It applied a sufficient penalty to raise the adaptive threshold in sparse regions, thereby reducing false inclusions from background vegetation, while remaining conservative enough to avoid incorrectly excluding genuine tree points in moderately dense areas. This calibrated value was subsequently applied consistently across the entire study area.

In such areas, where

ρ_{r}

is small,

1 - ρ_{r}

is larger, increasing the density penalty term and thus raising the value of

ρ_{t r e e}

. This means that stronger feature signals are required in low-density regions for a point to be classified as a tree. Simultaneously, the dynamic threshold adjustment allows for weak signals in sparse areas to be identified, thereby enabling the detection of isolated trees. The OPTICS algorithm is applied to the candidate tree points for hierarchical density-based clustering. The core distance and reachability distance for each point are calculated. The core distance

d_{c o r e} (p_{i})

is defined as the minimum radius required to contain at least 10 neighboring points, calculated as

d_{c o r e} (p_{i}) = i n f {r | | N_{r} (p_{i}) | \geq 10}

(4)

The reachability distance

d_{r e a c h} (p_{j}, p_{i})

is calculated as

d_{r e a c h} (p_{j}, p_{i}) = m a x (d_{c o r e} (p_{i}), ∥ p_{j} - p_{i} ∥_{2}

(5)

Subsequently, a point ordering is generated based on the ascending reachability distance. Different density clusters are identified by analyzing the slope changes in the reachability plot. A new cluster is delineated when the increase in reachability distance between consecutive points exceeds three times the standard deviation. The maximum neighborhood radius is set to

ϵ_{m a x} = 6.0 m

. This forms an efficient multi-scale cascade analysis, where a 7.0 m radius is used at the front end to capture the macro-environment for optimized initial screening, while a 6.0 m radius is applied at the back end to focus on segmenting individual tree objects at typical crown scales, effectively avoiding under-segmentation. Finally, points belonging to the clusters identified in the OPTICS result are extracted as the tree point cloud. Shrub points are separated using a logical exclusion method, achieved by inversely selecting points that are neither classified as trees nor as ground points. Matrix operations are used to label candidate tree points, and their inverse yields the candidate shrub points:

P_{s h r u b} = P_{n o n - g r o u n d} - P_{t r e e}

(6)

Additionally, height constraints (Z < 3.0 m) and density constraints (point density in shrub areas is lower than in candidate tree areas) are applied to optimize the classification result. Morphological post-processing operations (such as opening) can be further employed to remove noise points and enhance the connectivity of shrub regions.

The advantage of this method lies in its multi-scale complementarity and dynamic trait optimization: a small radius (3 m) captures isolated trees, while a large radius (7 m) identifies extensive canopies. It adaptively adjusts search radii and threshold weights based on point density and validates the rationality of the classification results through height-position profile analysis. This method effectively improves the accuracy of tree–shrub separation in complex scenarios, providing a reliable data foundation for subsequent biomass estimation and ecological analysis.

Two methods were employed for individual tree segmentation and to compare their performance: point cloud-based cluster segmentation and marker-controlled watershed segmentation based on a canopy height model. In the point cloud segmentation method, the raw point cloud data acquired by LiDAR were first systematically preprocessed. Noise filtering was achieved by statistical outlier removal, the improved fabric simulation filtering algorithm was used to separate ground and non-ground points, and the digital elevation model and digital surface model were generated based on the progressive triangular mesh encryption algorithm; then, the canopy height model was computed for the two differences, and the same specification of the digital elevation model was normalized with the point cloud data. In the selection of the segmentation algorithm, for the special morphology of seabuckthorn scrub, both the DBSCAN algorithm based on density clustering and the Euclidean clustering method were tested to achieve individual tree segmentation through three-dimensional spatial neighborhood analysis. Additionally, they were supplemented with vertical profile character analysis to detect the local extreme value points in the cross-section of 0.5 m interval of canopy height and to determine the position of tree tops by combining the characteristics of the curvature change [57].

The marker-controlled watershed segmentation scheme was then unfolded based on the canopy height model [58]. Firstly, anisotropic Gaussian filtering was applied to the 0.5 m resolution CHM to smooth out small undulations, and then local maximum detection in a dynamic window was used to locate potential tree tops. Considering the clumping nature of buckthorn scrub, morphological reconstruction techniques were introduced to eliminate pseudo-markers: firstly, small areas were removed by area-splitting operations, and then marker fusion algorithms based on canopy geometry features were applied.

The segmentation results generated by each algorithm were mainly examined by visual interpretation. We visually checked whether the segmented individual trees are reasonable, judged whether they were over-segmented or under-segmented, and checked the segmentation accuracy using precision, recall, over-segmentation rate and F1-score values.

P r e c i s i o n = \frac{T P}{T P + F P}

(7)

R e c a l l = \frac{T P}{T P + F N}

(8)

O v e r - s e g m e n t a t i o n R a t e = \frac{F P}{T P}

(9)

F 1 - S c o r e = 2 \times \frac{P r e c i s i o n \times R e c a l l}{P r e c i s i o n + R e c a l l}

(10)

where TP, FN, and FP denote the number of correct segmentations, missed segmentations and over-segmentations, respectively.

2.4. Tree Height Trait Extraction

In this study, tree height was derived based on the results of high-precision individual tree segmentation. The point clouds of individual trees were normalized to eliminate the influence of terrain fluctuations by converting the absolute elevations into relative heights above the ground. Subsequently, the highest point within each normalized point cloud was identified and extracted. The elevation value of this peak point was directly taken as the tree height.

This study employs a stratified accuracy assessment framework to systematically evaluate the performance of a tree height prediction model. The samples were first divided into a low-error group (comprising the lower two-thirds of the absolute error distribution) and a high-error group (the upper one-third) based on the tertiles of absolute error. Additionally, a high-quality subset was defined using a relative error threshold of ≤20%, establishing a three-tier quality stratification system. The evaluation incorporated R² and RMSE metrics, visualized through scatter plots showing the regression relationship between predicted and actual values (with 95% confidence intervals) against the y = x ideal line, along with boxplots comparing the distributions of actual values, predicted values, and absolute errors across the three groups. This multi-level assessment framework provides an overall accuracy measure and identifies the characteristics of high-quality predictions, offering targeted insights for model refinement and practical application under varying precision requirements.

To systematically evaluate the accuracy of the tree height extraction, field-measured tree height data that were spatially aligned with the LiDAR data were used as reference values. The root–mean–square error (RMSE) and mean absolute error (MAE) were employed to quantify the magnitude of the estimation errors, while the coefficient of determination (R²) was used to assess the linear agreement between the LiDAR-derived tree heights and the measured tree heights.

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} (y_{i} - {\hat{y}}_{i})^{2}}

(11)

M A E = \frac{1}{n} \sum_{i = 1}^{n} |y_{i} - {\hat{y}}_{i}|

(12)

R^{2} = 1 - \frac{\sum_{i = 1}^{n} (y_{i} - {\hat{y}}_{i})^{2}}{\sum_{i = 1}^{n} (y_{i} - \bar{y})^{2}}

(13)

2.5. Diameter at Breast Height Prediction Model

In this study, an individual tree diameter at breast height (DBH) prediction model was developed using multiple regression, based on airborne LiDAR point cloud data and field-measured sample tree data. Tree height (H) and crown width (CW) were extracted as predictor variables from the point clouds of segmented individual trees. The DBH reference values were obtained from high-precision handheld LiDAR ground measurements. Tree height was calculated as the vertical difference between the highest point in the individual tree point cloud and the corresponding ground elevation, while crown width was derived from the major axis of the minimum bounding rectangle of the tree crown’s horizontal projection.

To systematically evaluate the effects of model structure and feature combinations on DBH prediction accuracy, three types of regression models were constructed: linear, quadratic polynomial, and power function models. Two variable sets were designed for the independent variables: the first used only tree height as the input variable (H model), and the second incorporated both tree height and crown width (H + CW model). This setup allowed for the additional contribution of crown width to be assessed to improve the DBH estimation.

y = β_{0} + β_{1} x_{1} + β_{2} x_{2} + \dots + β_{p} x_{p} + ϵ

(14)

y = β_{0} + β_{1} x_{1} + β_{2} x_{2} + β_{3} {x_{1}}^{2} + β_{4} {x_{2}}^{2} + β_{5} x_{1} \cdot x_{2} + ϵ

(15)

y = β_{0} \cdot x_{1}^{β} \cdot x_{2}^{β_{2}} + ϵ

(16)

where y is the dependent variable,

x_{1}

,

x_{2}

, ……

x_{2}

are different independent variables, and

β_{0}

,

β_{1} {\dots \dots β}_{p}

are the coefficients corresponding to different independent variables.

The model validation is divided into two stages: internal validation is based on the handheld LiDAR data used for modeling, which is used to assess the model’s goodness-of-fit and stability; external validation uses an independent ground-truth chest diameter dataset to test the model’s generalization ability on unknown samples. Two indicators, namely the coefficient of determination (R²) and the root–mean–square error (RMSE), were selected for the accuracy evaluation to measure the prediction performance of the model.

By comparing the accuracy differences between different model structures and variable combinations, we focused on analyzing the degree of the contribution of tree height and crown width in the estimation of diameter at breast height (DBH), and explored the consistency of model performance between the training sample area and the extrapolation conditions. In addition, for samples with large prediction errors, the structural features of the point cloud were analyzed retrospectively to explore the potential causes of underestimation or overestimation (e.g., missing point cloud, branch and leaf shading, crown overlap), which provided the basis for improving the algorithms of individual tree segmentation and trait extraction.

2.6. Prediction of Leaf Nitrogen Content Within Individual Tree Canopies

The accurate estimation of leaf biochemical traits, including nitrogen content, from canopy reflectance faces a fundamental constraint: the mixed-pixel problem. In this challenge, the spectral signal from a target tree crown is contaminated by contributions from surrounding vegetation, underlying soil, and adjacent canopy elements. To address this issue, we employed precise individual tree crown delineation, derived from our LiDAR-based segmentation workflow, to define spectrally pure regions of interest within the co-registered multispectral imagery. This method ensures that the reflectance data input into the N-PROSAIL inversion model primarily originate from the sunlit canopy of each target arborescent seabuckthorn individual. By isolating the spectral signal most directly associated with foliar properties, this approach minimizes contamination and enhances the physiological relevance of the retrieved trait estimates.

In this study, the N-PROSAIL model, a coupled model integrating the leaf optical model PROSPECT-PRO and the canopy radiative transfer model 4SAIL, was employed to perform forward simulations for constructing a synthetic dataset for leaf nitrogen content prediction in seabuckthorn forests. The N-PROSAIL model simulates directional canopy reflectance spectra based on input vegetation biochemical and biophysical traits, thereby establishing a physically meaningful mapping between traits and spectra. This process provides a large volume of physically interpretable training samples for machine learning models. The workflow for mapping leaf nitrogen content (LNC) at the individual plant level using the N-PROSAIL model combined with the Random Forest (RF) algorithm is illustrated in Figure 3.

The dataset was generated through the following steps. First, key input traits affecting canopy reflectance and their plausible value ranges were identified. These include leaf biochemical traits, such as nitrogen content (N), chlorophyll content (Cab, μg/cm²), carotenoid content (Ccar, μg/cm²), equivalent water thickness (Cw, cm), and dry matter content (Cm, g/cm²), as well as canopy structural traits, such as leaf area index (LAI), average leaf inclination angle (ALIA), soil brightness coefficient (psoil), hotspot trait (hspot), and viewing geometry (solar zenith angle, view zenith angle, and relative azimuth angle).

The ecological representativeness of the look-up table (LUT) was ensured by defining prior parameter ranges from three sources, including field measurements from this study, the species-specific literature, and established model defaults (see Table 1 for a complete summary). Parameter ranges were determined as follows.

Leaf nitrogen content was parameterized directly using the values obtained from the chemical analysis of 37 sampled trees (Section 2.1). The leaf area index range was estimated by integrating hemispherical photography from sample plots with canopy gap fractions derived from the LiDAR point cloud [61]. Ranges for key leaf biochemical constituents, specifically chlorophyll and carotenoid content, were synthesized from published studies on sea buckthorn leaf’s optical properties [59,60]. The soil brightness coefficient was bounded between 0 and 1 to reflect the observed spectral variation in bare soil within the study area. Finally, all observation geometry parameters were configured to match the specific solar zenith, view zenith, and relative azimuth angles recorded during the UAV multispectral data acquisition.

Subsequently, a global sensitivity analysis strategy based on Latin Hypercube Sampling (LHS) was adopted to efficiently and uniformly sample the multidimensional trait space. The LHS method ensures broad coverage of the trait space with a limited number of samples, avoiding the clustering effects common in simple random sampling. This approach ensures that the simulated dataset adequately captures spectral responses under diverse trait combinations. Each sampled trait set was input into the N-PROSAIL model to generate corresponding hyperspectral reflectance data covering the 400–2500 nm spectral range at approximately 1 nm resolution.

To align the simulated data with field-acquired multispectral imagery, the high-resolution reflectance spectra were convolved into four broad spectral bands matching the measured multispectral sensor. The spectral response function (SRF) of each band of the target sensor was obtained, which quantifies the relative sensitivity of the sensor to different wavelengths. The equivalent reflectance for band i was calculated from the hyperspectral reflectance curve R(λ) using the following equation:

R_{i} = \frac{\int_{λ_{m i n}}^{λ_{m a x}} R (λ) \cdot {S R F}_{i} (λ) d λ}{\int_{λ_{m i n}}^{λ_{m a x}} {S R F}_{i} (λ) d λ}

(17)

This convolution process yielded a simulated dataset with the same spectral characteristics as the field multispectral data. The final product is a large lookup table (LUT), where each row represents a sample consisting of a trait set and the corresponding simulated canopy reflectance. This dataset not only exceeds the sample size achievable through field surveys but also encompasses a wide range of trait combinations that are difficult to observe in practice.

The monitoring of vegetation traits via optical remote sensing inherently relies on modeling, whether through statistical or physical approaches. While empirical statistical models are commonly used for nitrogen content retrieval, and physical models often infer nitrogen indirectly via related traits, such as chlorophyll, the direct prediction of nitrogen content remains challenging. In response, this study employed random forest machine learning-based nonlinear regression methods to estimate leaf nitrogen content in seabuckthorn forests.

Random forest (RF) [62] is a kind of tree structure integrated learning method that can be used for regression and classification. It was developed by the University of California, Berkeley Department of Statistics, by Professor Breiman, in 2001 [62]; its essence is based on the CART (classification and regression tree) decision tree integrated learning algorithm [63]. Using the bagging method from the training set of the self-help sampling method to generate a number of different sub-training sample sets, each sample is used to build a decision tree model, constituting a multi-classification model system, and finally, all the decision trees show the most voted results as the final prediction. The random forest algorithm demonstrates high predictive accuracy, strong resistance to noise, a low tendency to overfit, and excellent adaptability to diverse datasets. Due to its simple structure and excellent performance compared to other machine learning methods, it has been widely applied in various fields. In using the random forest regression method, n_estimators and max_features traits are two important traits to adjust. n_estimators defines the number of random trees. Increasing the number of trees generally leads to more reliable results, but also increases the computational cost.

The random forest regression model was configured with three vegetation indices as input features: the near-infrared band, the normalized blue-edge difference index (nNDVIblue), and the MERIS terrestrial chlorophyll index (MTCI).

n N D V I b l u e = \frac{ρ_{N I R} - ρ_{B l u e}}{ρ_{N I R} + ρ_{B l u e}}

(18)

M T C I = \frac{ρ_{N I R} - ρ_{R E}}{ρ_{R E} + ρ_{R e d}}

(19)

These three indices were extracted from the multispectral imagery for each segmented individual tree crown. For the field-measured dataset, they were calculated directly from the UAV-acquired spectra. For the N-PROSAIL-simulated dataset, they were derived from the convolved broad-band reflectances to ensure spectral consistency. This common set of features allows for the direct comparison and integration of field and simulated data during model training.

3. Results

3.1. Results of Individual Tree Splitting and Tree Height Extraction

This study evaluated the effects of different preprocessing strategies combined with segmentation algorithms on individual tree segmentation outcomes. The sample plot overview and individual tree segmentation results are shown in Figure 4. As summarized in Table 2, the application of a separate tree-planting and irrigation preprocessing strategy markedly enhanced the overall performance of all algorithms, which was primarily measured by the F1-score. When using the direct segmentation strategy, the Marker-Controlled Watershed algorithm achieved the highest recall rate of 92.00%, with only two missed segmentations. However, it was accompanied by significant over-segmentation, evident in the 89% false positives and an over-segmentation rate of 78.07%, leading to the lowest precision of 20.18% and a consequent F1-score of 33.09. In contrast, the Hierarchical Clustering algorithm achieved a more favorable balance, attaining a higher F1-score of 68.00% compared to the DBSCAN algorithm, which had an F1-score of 57.78%. The introduction of a separate tree-planting and irrigation preprocessing step yielded substantial improvements across all methods. The over-segmentation issue associated with the Watershed algorithm was effectively mitigated, as the over-segmentation rate dropped to 51.22% and the F1-score rose to 65.57%. The DBSCAN algorithm also showed improved performance, with the F1-score increasing to 77.27%. Most notably, the Hierarchical Clustering algorithm delivered the most outstanding performance by eliminating over-segmentation with zero false positives. It maintained high precision and recall rates, both at 84.21%, thereby achieving the highest F1-score of 84.21%.

In summary, the results demonstrated that effective preprocessing is crucial for enhancing segmentation accuracy by controlling over-segmentation. Furthermore, under the conditions of this study, the hierarchical clustering algorithm demonstrated the best overall performance and stability.

Point cloud-based segmentation proves to be a more suitable method for individual seabuckthorn tree delineation. The tree heights derived from the segmented point clouds were subsequently validated against field-measured tree height data. Based on the comprehensive evaluation and data exploration, the tree height prediction model demonstrated a clear performance stratification that is closely associated with canopy structure characteristics. The tree height extraction results and their errors are shown in Figure 5.

The model achieved reliable prediction accuracy in the majority of cases, with the low-error group (accounting for 66% of the samples, n = 67) showing excellent performance (R² = 0.874, RMSE = 0.390 m). However, an important finding emerged from the prediction error analysis: high-error predictions predominantly occur in areas characterized by high canopy density and small crown size. This pattern indicates that the degradation in model performance was not random but systematically associated with specific forest structural conditions.

There are several reasons for canopy-related limitations. Dense canopies may lead to signal attenuation and limited penetration, while small crowns provide insufficient structural information for accurate height retrieval. These factors collectively contribute to the accuracy reduction observed in the high-error group (R² = 0.315, RMSE = 1.755 m). Nevertheless, the model maintains satisfactory performance in the high-quality subset (relative error ≤ 20%, n = 82), with R² = 0.814 and RMSE = 0.580, demonstrating its robustness under favorable conditions.

3.2. DBH Prediction Model

This study systematically compared the prediction performance of buckthorn breast diameter based on different model forms with different combinations of variables. The DBH modeling results are shown in Figure 6 and Table 3. The results showed that both the linear model and the polynomial regression model achieved a good fit when only tree height was used as the independent variable. The linear model can be expressed as Equation (20), with a coefficient of determination (R²) of 0.780 and a root–mean–square error (RMSE) of 1.718 cm, and the polynomial regression model can be expressed as Equation (21), with an R² of 0.765 and a slightly lower RMSE of 1.731 cm. In contrast, the power function model can be expressed as Equation (22), with an R² of 0.771, and the RMSE is 1.726 cm. Therefore, the overall predictive stability is less than the first two types of models.

D B H = 2.48 \times H + 0.35

(20)

D B H = 1.40 \times H + 0.12 \times H^{2}

(21)

D B H = 2.63 \times H^{0.97}

(22)

When using tree height and crown width jointly as input variables, the linear model can be expressed as shown in Equation (23), with a coefficient of determination of 0.781 and a root–mean–square error of 1.718 cm, which does not show a significant improvement compared to the use of tree height only. The polynomial regression model can be expressed as shown in Equation (24), which decreases the coefficient of determination to 0.785 and the root–mean–square error to 1.701 cm. The power function model can be expressed as shown in Equation (25), which has a coefficient of determination of 0.779 and an RMSE of 1.725 cm. Overall, the introduction of crown width did not significantly improve the model accuracy, indicating that tree height was still the most important factor in predicting buckthorn breast diameter in this study.

D B H = 2.43 \times H + 0.031 \times C W + 0.43

(23)

D B H = 0.37 \times H + 0.29 \times C W + 0.24 \times H^{2}

(24)

D B H = 2.68 \times H^{0.95} \times {C W}^{0.12}

(25)

In addition, a comparison of the model performance on the two validation sets, including LiDAR data and independently measured data, revealed only minor differences in accuracy metrics. This suggests that the proposed model exhibits a certain level of generalizability and stability across distinct data sources. The study area and structural shape results are shown in Figure 7.

3.3. Mapping LNC of Individual Seabuckthorn Trees

This study employed the N-PROSAIL model as a supplementary data source and the Random Forest algorithm for nitrogen content prediction modeling. To comprehensively evaluate model performance, three different modeling and validation scenarios were designed. The prediction accuracy of leaf nitrogen content (LNC) is shown in Figure 8.

In the measured–measured scenario (i.e., the model was trained and evaluated via leave-one-out cross-validation on the same set of measured data), the model demonstrated good fitting capability and stability, with an RMSE of 1.562 mg/g, R² of 0.818, and MAE of 1.173 mg/g, suggesting the model has strong explanatory power for the measured samples.

In the simulated scenario (i.e., the model was trained on N-PROSAIL-simulated data and evaluated via leave-one-out cross-validation on the same simulated dataset), the performance metrics were RMSE = 1.300 mg/g, R² = 0.870, and MAE = 1.014 mg/g. Although the simulated data helped increase the sample diversity, the structural differences from real-world conditions may have limited the model’s ability to accurately capture actual features.

In the more practically meaningful simulated–measured generalization test (i.e., the model was trained on simulated data and validated on an independent set of measured data), it achieved an RMSE of 2.074 mg/g, R² of 0.680, and MAE of 1.742 mg/g. As compared to the simulated-only scenario, the prediction accuracy showed a significant improvement, with some metrics even exceeding those of the model trained exclusively on measured data. These findings indicated that the N-PROSAIL-simulated data effectively enhanced the model’s capacity to capture nitrogen content variations, thereby improving its generalization performance in practical applications. The LNC prediction results are shown in Figure 9.

4. Discussion

The primary contribution of this study resides in the design and validation of an integrated analytical workflow for resolving a persistent challenge in remote sensing of natural forests, namely the extraction of individual tree-level traits in monodominant stands where conspecifics exhibit a growth-form continuum from shrubs to trees. The key methodological advancement is the introduction of a dedicated tree–shrub separation step, which directly addresses the structural interference that has fundamentally limited previous approaches. Our empirical results demonstrate that this step is not merely beneficial but essential. When applied to raw point clouds, conventional segmentation algorithms performed poorly, with F1-scores as low as 33.09 percent. Following the application of our separation module, all methods showed a marked improvement, culminating in an optimal F1-score of 84.21 percent and the complete elimination of over-segmentation for hierarchical clustering. This sequence confirms that the synergistic integration of separation and segmentation, not the performance of any single algorithm, enables reliable individual tree mapping in such complex environments. Furthermore, by structurally coupling LiDAR-derived architecture with multispectral-derived biochemistry via radiative transfer modeling, the workflow delivers a comprehensive phenotypic portrait at the individual tree scale. This capacity directly supports downstream applications such as precision phenotyping for germplasm selection. Ultimately, the primary contribution of this work lies in integrating established components into a coherent workflow that addresses a specific ecological challenge, one that has proven difficult to tackle with conventional methods.

This study proposes a hierarchical and logically rigorous technical workflow for separating trees and shrubs from mixed seabuckthorn point clouds. Its core strength lies in a multi-step, progressive processing approach that effectively discriminates between the two vegetation types. The efficacy of the method stems mainly from the following key designs. First, the parallel multi-scale extraction of candidate feature points represents a major innovation. By detecting tree crown candidate points simultaneously at three scales (3 m, 5 m, and 7 m), the method adapts effectively to structural heterogeneity among stands. Given the considerable variation in crown size among species, a single scale would be insufficient to capture all treetops completely. The multi-scale strategy enables the synchronous detection of both pioneer species with smaller crowns and dominant species with larger crowns, significantly improving the completeness of crown detection and thereby supplying richer feature information for accurate tree–shrub separation. Second, the OPTICS clustering algorithm, incorporating both height and density constraints, is crucial for achieving fine classification. As a density-based clustering method, OPTICS performs well in identifying arbitrarily shaped clusters and handling non-uniformly distributed data, making it highly suitable for natural scenarios where sparse shrubs coexist with dense tree crowns. However, under complex conditions, relying solely on density may be inadequate for clearly distinguishing shrubs and low trees with significant vertical overlap. Our method introduces a height constraint, integrating the vertical vegetation structure into the clustering process. This allows low-lying, dense point clouds to be more reliably identified as shrubs, and taller, dense point clusters as trees. The dual “density + height” criterion considerably enhances the robustness and accuracy of classification.

The structural complexity inherent in natural seabuckthorn forests necessitated the two-stage methodology presented in this study. Unlike monocultural or plantation stands, these ecosystems are characterized by a continuous gradient of growth forms, where trees and shrubs are not merely adjacent but structurally interwoven. This heterogeneity presents a fundamental challenge for standard individual tree detection algorithms, which typically assume a distinct separation between target trees and the understory. Our findings indicate that applying such algorithms directly to the raw point cloud leads to systematic errors: under-segmentation due to the absorption of shrub points into tree crowns and over-segmentation arising from the complex internal architecture of large crowns. Consequently, the initial separation of tree points emerges not as an optional preprocessing step, but as a critical prerequisite for accurate delineation in structurally complex stands.

The efficacy of our approach lies in the complementary functions of its two stages. The first stage, tree-shrub separation via multi-scale spatial analysis and adaptive clustering, serves as a contextual filter. It explicitly addresses the primary source of commission error by removing non-tree vegetation, thereby transforming a mixed point cloud into a purified set of candidate tree points. The second stage, performing fine-scale individual crown segmentation, then operates on this optimized dataset. This sequential design allows each stage to specialize: the separation step mitigates noise, and the segmentation step maximizes fidelity in boundary detection. The quantitative improvements reported in Table 2, notably the rise in F1-score for hierarchical clustering from 68.00% to 84.21% and the elimination of over-segmentation, directly validate this synergistic relationship. The separation stage resolves the coarse-scale confusion between growth forms, enabling the segmentation stage to achieve precision at the individual tree level.

This methodological framework carries significant implications for the remote sensing of structurally heterogeneous forests. It demonstrates that in ecosystems where biological form and spatial arrangement introduce significant noise, a single-algorithm solution is often insufficient. A hierarchical processing strategy, which decouples the problem of “what is a tree” from “where are its boundaries,” proves to be more robust. Future work could explore automating the adaptive thresholds used in the separation step or testing this two-stage paradigm on other forest types with similarly intermixed vegetation. Ultimately, this study underscores that advancing individual tree mapping in complex natural environments may depend less on refining a single algorithm and more on strategically orchestrating multiple analytical steps to mirror the ecological complexity on the ground.

A limitation of this study, however, is the relatively small amount of ground validation data. As a result, the evaluation relied primarily on visual interpretation and the manual labeling of sample area point clouds. Although this approach allows for a preliminary assessment of the reasonableness of the classification results, it has certain drawbacks. On the one hand, visual interpretation outcomes are susceptible to the interpreter’s experience and subjectivity, lacking a unified and objective quantitative evaluation standard. On the other hand, in the absence of high-precision ground truth data, such as precisely georeferenced individual tree survey data, it is difficult to perform a rigorous quantitative assessment of classification accuracy using metrics like overall accuracy, recall, and precision. In subsequent research, incorporating more sample plots and more detailed field survey data would improve the reliability and persuasiveness of the method validation, while also furnishing a more objective basis for parameter optimization.

The segmentation results demonstrate that the pre-processing step of separating trees and shrubs significantly enhances the accuracy of individual tree detection within seabuckthorn stands. This underscores the critical importance of distinguishing between growth forms prior to segmentation, as it effectively reduces commission errors and over-segmentation. However, it is important to note that the scope of this study is limited to the segmentation of trees; the shrub component, once separated, was not subjected to individual plant segmentation. Consequently, the performance of the proposed method for shrub identification remains uncertain. Future work should therefore include a dedicated evaluation of shrub segmentation to comprehensively assess its applicability across entire plant communities.

The comparative analysis confirms hierarchical clustering as the most effective segmentation method for our seabuckthorn forest data, achieving an optimal balance between detection accuracy and segmentation integrity (F1-score = 84.21%; over-segmentation rate = 0.00%). This performance advantage stems from a fundamental alignment between the algorithm’s design and the ecosystem’s structural complexity. Unlike DBSCAN, which is constrained by fixed density thresholds and thus prone to errors amid variable crown densities, or the marker-controlled watershed approach, whose over-segmentation (OR = 78.07%) reveals a critical sensitivity to local maxima in the CHM, hierarchical clustering operates through a flexible, proximity-based nested grouping. This method intrinsically accommodates key forest attributes: the growth-form continuum from shrub to tree, multi-axis crown architectures, variable within-crown point densities, and naturally clumped spatial distributions. Consequently, the selection of hierarchical clustering is justified not only by its quantitative superiority, but by its conceptual congruence with the ecological and structural reality of seabuckthorn stands, especially when deployed following the essential preprocessing step of tree–shrub separation, which removes understory interference and allows the algorithm to focus on genuine crown-scale topology.

To validate tree height accuracy, we stratified prediction errors using tertiles instead of the median or quartiles. This approach was chosen to create groups of markedly different sizes, which amplifies diagnostic contrast within an error distribution heavily skewed toward minor inaccuracies. The tertile-based stratification revealed a clear and interpretable pattern. The high-error group, which corresponds to the top tertile, consisted primarily of trees located in high-density canopy areas with small crowns. These conditions are known to challenge LiDAR-based height retrieval due to signal occlusion and limited crown definition. In contrast, the larger low-error group, comprising the bottom two tertiles, was dominated by open-grown trees with well-defined crowns, where the model performed reliably. Therefore, the tertile split served as a purposeful analytical tool rather than a mere statistical convention. It effectively separated the majority of accurately predicted trees from a distinct minority of problematic cases, with the latter directly linked to specific and ecologically meaningful forest structural conditions. Employing the median would have obscured this pattern by combining moderately accurate samples from dense areas with genuine algorithmic outliers. This method thereby enhances the diagnostic value of error analysis by cleanly isolating systematic failure modes for focused methodological investigation.

In constructing the DBH prediction model for seabuckthorn, tree height emerged as the most influential predictor, while the inclusion of crown width did not significantly enhance model accuracy. This could be attributed to the limited variability in crown size among seabuckthorn individuals or a weak inherent correlation between crown width and DBH. While the power function model yielded a lower root–mean–square error (RMSE) in certain cases, its coefficient of determination (R²) did not improve markedly, potentially due to sample distribution characteristics or the model’s sensitivity to extreme values. Future studies could incorporate additional structural attributes, such as the height to the first live branch or the crown volume, to further strengthen the robustness of DBH estimation. Moreover, the high consistency between validation results based on field-measured data and those derived from handheld device extraction confirms the feasibility of applying this method at a regional scale, though its stability across larger sample areas requires further verification.

The finding that incorporating crown width did not significantly improve DBH estimation accuracy can be attributed to three interrelated factors rooted in the ecology and the remote sensing system. First, arborescent seabuckthorn in the high-altitude semi-arid study area exhibits a limited phenotypic range in terms of crown architecture. A conservative growth strategy prioritizing vertical and belowground investment over lateral crown expansion under harsh conditions naturally constrains crown width variation, thereby reducing its statistical utility as a predictive variable. Second, the allometric relationship between DBH and crown width is highly asymmetric and variable. For individuals of comparable DBH, crown width can differ substantially due to localized competition, microtopographic effects, and inherent genetic variation. This variability weakens the bivariate correlation and diminishes the marginal explanatory power of crown width beyond what is already captured by tree height. Third, the remote measurement of crown width introduces inherent uncertainty. While LiDAR-derived widths were validated against field data, delineations of the irregular and often asymmetric crowns of natural stands remain approximations. Representing such complex three-dimensional structures with a single planar metric inevitably oversimplifies the true functional allometry. This oversimplification, stemming from deriving a single diameter value from the major axis of a minimum bounding rectangle, further attenuates the predictive contribution of the crown width variable. Collectively, these ecological and methodological factors explain why crown width did not enhance model performance in this context.

Our methodology reveals a synergistic interdependence between structural segmentation and biochemical estimation. Reliable crown delineation serves not merely as a preprocessing step but as a critical prerequisite for accurate leaf nitrogen content retrieval. By ensuring that each spectral sample corresponds to a correctly isolated, single-tree crown, we substantially mitigate spectral contamination from mixed pixels, shadowed canopy components, and adjacent vegetation. This refinement of the input signal suppresses noise within the spectral–biochemical relationship, producing a prediction model that is both more accurate and more physically interpretable. Consequently, structural segmentation precision directly enhances biochemical credibility, integrating the two analytical streams into a coherent phenotyping framework whose combined value exceeds that of either component in isolation.

It is important to clarify the specific role of the N-PROSAIL model within our workflow to preempt concerns regarding undue complexity. The model was employed exclusively during the development phase of this study. Its primary function was not to establish a full, operational inversion pipeline, but rather to serve as a computational bridge, generating a synthetic training dataset to overcome the severe constraint of limited ground truth measurements. The final, practical output of this process is a calibrated Random Forest model, denoted RF_DA, which performs inference using only a small set of common vegetation indices. This design strategically encapsulates the underlying complexity offline, ultimately delivering a straightforward, transparent, and directly applicable tool for potential end-users. In our research, we employed traditional vegetation indices for regression prediction of LNC, but the results were unsatisfactory (specific details are provided in Table A1). For future application in similar seabuckthorn stands, the requirement is simply to compute three standard spectral indices from the available UAV imagery, thereby eliminating any need for complex radiative transfer modeling or parameterization. Although the LNC prediction models, calibrated on field data and enriched with N-PROSAIL simulations, showed strong agreement with independent spectral measurements, the sampling approach presents a notable constraint. The collection of merely one sample per tree fails to represent potential intra-canopy nitrogen variation. Consequently, future work should prioritize stratified canopy samplings to account for vertical heterogeneity, thereby enhancing the granularity and reliability of nitrogen content estimates.

This study introduces a scalable technical workflow predicated on a critical preprocessing step: intraspecific growth-form separation, followed by individual tree segmentation and trait extraction. The framework is explicitly designed to overcome a persistent challenge in shrubland remote sensing, namely the difficulty in reliably delineating distinct woody individuals from a continuous and morphologically diverse vegetation matrix. Consequently, the approach shows considerable promise for application across other arid and semi-arid ecosystems dominated by monodominant woody species that exhibit a pronounced growth-form continuum, such as Tamarix spp., Caragana spp., or certain Artemisia species. The core methodology is inherently transferable, provided that the target species exhibits a polymorphic mixture of shrub-like and tree-like architectures within the same stand.

Successful operationalization in new contexts, however, will necessitate deliberate adaptation. First, the multi-scale spatial analysis and dynamic thresholding parameters central to the growth-form separation step must be recalibrated to reflect the characteristic crown dimensions and point cloud densities of the novel target species. Second, data acquisition parameters, particularly LiDAR point density and the selection of multispectral bands, should be optimized to capture the specific structural and physiological traits of interest. Third, while the N-PROSAIL model can be re-parameterized for different species, its efficacy as a data-augmentation tool hinges on the availability of reliable, species-specific leaf optical property libraries. In regions where such foundational biophysical data are scarce, its initial implementation might adopt a more pragmatic, empirical approach utilizing established vegetation indices as a precursor to full radiative transfer modeling.

In essence, the proposed workflow provides a generalizable conceptual architecture for individual-level mapping in complex woody communities. Its practical implementation, however, transforms it from a fixed protocol into an adaptable framework, whose underlying algorithms require careful tuning to the local species characteristics and data conditions. Future research should prioritize validating this framework across diverse biogeographic regions to develop robust, empirically grounded guidelines for parameter adaptation, thereby enhancing its utility for large-scale ecological monitoring.

5. Conclusions

This study demonstrates the effectiveness of a hierarchical processing approach for individual tree segmentation and parameter inversion in seabuckthorn forests. The results indicate that distinguishing trees from shrubs prior to segmentation significantly improves the accuracy of all evaluated methods, with hierarchical clustering showing the most robust performance (F1-score: 84.21%; over-segmentation rate: 0.00%). For DBH prediction, tree height was identified as the most critical factor, while the inclusion of crown width did not significantly enhance model accuracy. The power function model exhibited a lower RMSE in some cases, but no consistent improvement was observed in the coefficient of determination. The leaf nitrogen content prediction models, developed based on both measured and N-PROSAIL-simulated data, performed well. Moreover, a high consistency was found between data extracted by handheld sensors and ground measurements, indicating the potential of this method for regional-scale application. In summary, the proposed technical workflow provides reliable support for extracting the structural and biochemical parameters of sea buckthorn, with the separation of trees and shrubs being a crucial step. Future research could introduce more structural feature parameters and validate the approach under diverse site conditions to further enhance its applicability and robustness.

Author Contributions

Conceptualization, W.X. and L.C.; methodology, W.X. and K.Z.; software, W.X.; validation, W.X. and K.Z.; formal analysis, W.X.; resources, P.D. and L.C.; data curation, K.Z.; writing—original draft preparation, W.X.; writing—review and editing, W.X., K.Z., P.D., Z.X., Y.W., L.L., X.S. and L.C.; visualization, W.X.; supervision L.C.; funding acquisition, L.C. and K.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by Biological Breeding-National Science and Technology Major Project (2023ZD04056), Science and Technology Plan Project (Topic) of Xizang Autonomous Region (XZ202502ZY0044) and the Jiangsu Agriculture Science and Technology Innovation Fund (CX(23)1027).

Data Availability Statement

The data and the code of this study are available from the corresponding author upon request.

Acknowledgments

We gratefully acknowledge the graduate students from the Forest Management (Nanjing Forestry University) for helping with field data collection.

Conflicts of Interest

The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

UAV	Unmanned Aerial Vehicle
LiDAR	Light Detection and Ranging
DBH	diameter at breast height
LNC	leaf nitrogen content
CSF	Cloth Simulation Filter
DEM	Digital Elevation Model
DSM	Digital Surface Model
CHM	Canopy Height Model

Appendix A

Table A1. Performance comparison of traditional vegetation index-based regression models for LNC prediction.

Model Type	Input Features	R²	RMSE (mg/g)	MAE (mg/g)
Simple Linear Regression	NIR band	0.214	3.892	3.124
Simple Linear Regression	nNDVIblue	0.287	3.706	2.985
Simple Linear Regression	MTCI	0.326	3.603	2.876
Multiple Linear Regression	NIR + nNDVIblue + MTCI	0.358	3.517	2.768
Polynomial Regression (quadratic)	NIR + nNDVIblue + MTCI	0.382	3.448	2.691
N-PROSAIL + Random Forest (this study)	NIR + nNDVIblue + MTCI	0.680	2.074	1.742

Note: All traditional models were trained and validated using the same field-measured dataset (n = 37) with leave-one-out cross-validation. The N-PROSAIL + Random Forest model was trained on simulated data and validated on field-measured data.

References

Wang, Y.K.; Guo, S.X.; Wang, J.; Yuan, H.; Xu, B.L.; Wang, D.Y. Valuation of forest ecosystem services in Gansu Qilian Mountain National Nature Reserve. J. Desert Res. 2013, 33, 1905–1911. [Google Scholar]
Pan, Y.D.; Birdsey, R.A.; Fang, J.Y.; Houghton, R.; Kauppi, P.E.; Kurz, W.A.; Phillips, O.L.; Shvidenko, A.; Lewis, S.L.; Canadell, J.G.; et al. A Large and Persistent Carbon Sink in the World’s Forests. Science 2011, 333, 988–993. [Google Scholar] [CrossRef]
Zhao, Z.B.; Li, K.G.; Zeng, G.J.; Li, D.C. Evaluation of forest ecosystem service functions in Qinhuangdao City. J. Arid Land Resour. Environ. 2012, 26, 31–36. [Google Scholar]
Watson, J.E.M.; Evans, T.; Venter, O.; Williams, B.; Tulloch, A.; Stewart, C. The exceptional value of intact forest ecosystems. Nat. Ecol. Evol. 2018, 2, 599–610. [Google Scholar] [CrossRef]
Messier, C.; Bauhus, J.; Doyon, F.; Maure, F.; Sousa-Silva, R.; Nolet, P.; Mina, M.; Aquilué, N.; Fortin, M.-J.; Puettmann, K. The functional complex network approach to foster forest resilience to global changes. For. Ecosyst. 2019, 6, 194–209. [Google Scholar] [CrossRef]
Nakamura, K.; Kitching, R.L.; Cao, M.; Creedy, T.J.; Fayle, T.M.; Freiberg, M.; Hewitt, C.N.; Itioka, T.; Koh, L.P.; Ma, K.P.; et al. Forests and Their Canopies: Achievements and Horizons in Canopy Science. Trends Ecol. Evol. 2017, 32, 438–451. [Google Scholar] [CrossRef]
Chausson, A.; Turner, B.; Seddon, D.; Chabaneix, N.; Girardin, C.A.J.; Kapos, V.; Key, I.; Roe, D.; Smith, A.; Woroniecki, S.; et al. Mapping the effectiveness of nature-based solutions for climate change adaptation. Glob. Change Biol. 2020, 26, 6134–6155. [Google Scholar] [CrossRef]
Zhang, C.; Ren, Z.Y.; Gao, M.X.; Yan, W.H. Evaluation of forest ecological service function and value in Gansu Province. J. Arid Land Resour. Environ. 2007, 21, 147–151. [Google Scholar]
Fu, B.J.; Zhao, W.W.; Chen, L.D. Progress and prospects of geography-ecological process research. Acta Geogr. Sin. 2006, 61, 1123–1131. [Google Scholar]
Peng, J.; Wang, Y.L.; Zhang, Y.; Ye, M.T.; Wu, J.S. Research on the influence of land use classification on landscape pattern index. Acta Geogr. Sin. 2006, 61, 157–168. [Google Scholar]
Bastin, J.-F.; Finegold, Y.; Garcia, C.; Mollicone, D.; Rezende, M.; Routh, D.; Zohner, C.M.; Crowther, T.W. The global tree restoration potential. Science 2019, 365, 76–79. [Google Scholar] [CrossRef]
Showstack, R. Global Forest Watch Initiative Provides Opportunity for Worldwide Monitoring. Eos Trans. Am. Geophys. Union 2014, 95, 77–79. [Google Scholar] [CrossRef]
Farley, K.A.; Jobbágy, E.G.; Jackson, R.B. Effects of afforestation on water yield: A global synthesis with implications for policy. Glob. Change Biol. 2005, 11, 1565–1576. [Google Scholar] [CrossRef]
Turner, C.; Aggarwal, A.; Walls, H.; Herforth, A.; Drewnowski, A.; Coates, J.; Kalamatianou, S.; Kadiyala, S. Concepts and critical perspectives for food environment research: A global framework with implications for action in low- and middle-income countries. Glob. Food Secur. 2018, 18, 93–101. [Google Scholar] [CrossRef]
Moxnes, E. Discounting, climate and sustainability. Ecol. Econ. 2014, 102, 158–166. [Google Scholar] [CrossRef]
Kombiadou, K.; Matias, A.; Costas, S.; Carrasco, A.R.; Plomaritis, T.A.; Ferreira, Ó. Barrier island resilience assessment: Applying the ecological principles to geomorphological data. CATENA 2020, 194, 104755. [Google Scholar] [CrossRef]
Chen, C.; Park, T.; Wang, X.H.; Piao, S.L.; Xu, B.D.; Chaturvedi, R.K.; Fuchs, R.; Brovkin, V.; Ciais, P.; Fensholt, R.; et al. China and India lead in greening of the world through land-use management. Nat. Sustain. 2019, 2, 122–129. [Google Scholar] [CrossRef]
Yu, Y.N.; Peng, S.L. Research advances in ecosystem service value assessment. Ecol. Environ. Sci. 2010, 19, 2246–2252. [Google Scholar]
Liu, Y.L.; Ma, J.J.; Jin, X.L.; Wang, B.D.; Lin, J.Q.; Zhang, M. Review on valuation of ecosystem services. China Popul. Resour. Environ. 2005, 15, 91–95. [Google Scholar]
Zhang, J.; Qi, Y.; Li, Q.; Zhang, J.L.; Yang, R.; Wang, H.W.; Li, X.F. Combining UAV-Based Multispectral and Thermal Images to Diagnosing Dryness Under Different Crop Areas on the Loess Plateau. Agriculture 2025, 15, 126. [Google Scholar] [CrossRef]
Liu, H.X.; Cao, X.Y.; Zhang, C.C. Research progress in ecosystem service value assessment. J. Green Sci. Technol. 2024, 26, 273–280. [Google Scholar]
Seddon, N.; Chausson, A.; Berry, P.; Girardin, C.A.J.; Smith, A.; Turner, B. Understanding the value and limits of nature-based solutions to climate change and other global challenges. Philos. Trans. R. Soc. B Biol. Sci. 2020, 375, 20190120. [Google Scholar] [CrossRef]
Zhang, H.; Ying, B.B.; Hu, Y.J.; Wang, Y.X.; Yu, X.H.; Tang, C.X. Response of soil respiration to thinning is altered by thinning residue treatment in Cunninghamia lanceolata plantations. Agric. For. Meteorol. 2022, 324, 109089. [Google Scholar] [CrossRef]
Zhao, L.; Fang, Q.; Algeo, T.J.; Lu, A.; Yin, K.; Duan, Z.; Hong, H. Formation of plinthite mediated by redox fluctuations and chemical weathering intensity in a Quaternary red soil, southern China. Geoderma 2021, 386, 114924. [Google Scholar] [CrossRef]
Maclean, I.M.; Klinges, D.H. Microclimc: A mechanistic model of above, below and within-canopy microclimate. Ecol. Model. 2021, 451, 109567. [Google Scholar] [CrossRef]
Wongshaya, P.; Chayjarung, P.; Tothong, C.; Pilaisangsuree, V.; Somboon, T.; Kongbangkerd, A.; Limmongkon, A. Effect of light and mechanical stress in combination with chemical elicitors on the production of stilbene compounds and defensive responses in peanut hairy root culture. Plant Physiol. Biochem. 2020, 157, 93–104. [Google Scholar] [CrossRef]
Woźniak, G.; Dyderski, M.K.; Kompała-Bąba, A.; Jagodziński, A.M.; Pasierbiński, A.; Błońska, A.; Bierza, W.; Magurno, F.; Sierka, E. Use of remote sensing to track postindustrial vegetation development. Land Degrad. Dev. 2020, 32, 1426–1439. [Google Scholar] [CrossRef]
Han, G.; Yang, K.; Zeng, J. Distribution and fractionation of rare earth elements in suspended sediment of the Zhujiang River, Southwest China. J. Soils Sediments 2021, 21, 2981–2993. [Google Scholar] [CrossRef]
Giachino, A.; Focarelli, F.; Marles-Wright, J.; Waldron, K.J. Synthetic biology approaches to copper remediation: Bioleaching, accumulation and recycling. FEMS Microbiol. Ecol. 2021, 97, fiaa249. [Google Scholar] [CrossRef]
Yadav, A.; Thakur, U.; Saxena, R.; Pal, V.; Bhateja, V.; Lin, J.C.W. AFD-Net: Apple Foliar Disease multi classification using deep learning on plant pathology dataset. Plant Soil 2022, 477, 595–611. [Google Scholar] [CrossRef]
Ghosh, D.; Maiti, S.K. Biochar-assisted eco-restoration of coal mine degraded land to meet United Nation Sustainable Development Goals. Land Degrad. Dev. 2021, 32, 4494–4508. [Google Scholar] [CrossRef]
Yu, B.; Yang, C.; Yu, M. Experimental study on the critical condition of river blockage by a viscous debris flow. CATENA 2022, 213, 106198. [Google Scholar] [CrossRef]
Yao, T. Development Status of Sea Buckthorn Industry in Inner Mongolia and Its Ecological Response to Wind and Sand. China Fruits 2022, 104–108. [Google Scholar] [CrossRef]
Gan, T.J.; Zeng, Z.X.; Pei, W.H.; Jia, Q.; He, Y.X.; Chen, J. Plant responses and rhizosphere soil characteristics of sea-buckthorn from different sex combinations in an abandoned lead-zinc mine. Front. Plant Sci. 2025, 16, 1601834. [Google Scholar] [CrossRef]
Huang, Y.Y.; Zhang, Y.; Zhang, T.T.; Chen, X.Q. Fingerprint and difference analysis of flavonoids of Hippophae plants grown on the Tibetan plateau. J. Food Compos. Anal. 2024, 128, 106010. [Google Scholar] [CrossRef]
Du, Z.Y.; Bai, H.H.; Liu, M.L.; Liu, Y.; Zhu, G.D.; Chai, G.Q.; He, Y.M.; Shi, J.G.; Duan, Y.Z. Response of ecological stoichiometry and homeostasis characteristic to nitrogen addition in Hippophae rhamnoides L. Sci. Total Environ. 2024, 951, 175591. [Google Scholar] [CrossRef]
Lister, A.J.; Andersen, H.; Frescino, T.; Gatziolis, D.; Healey, S.; Heath, L.S.; Liknes, G.C.; McRoberts, R.; Moisen, G.G.; Nelson, M.; et al. Use of Remote Sensing Data to Improve the Efficiency of National Forest Inventories: A Case Study from the United States National Forest Inventory. Forests 2020, 11, 1364. [Google Scholar] [CrossRef]
Nhamo, L.; Magidi, J.; Nyamugama, A.; Clulow, A.D.; Sibanda, M.; Chimonyo, V.G.P.; Mabhaudhi, T. Prospects of Improving Agricultural and Water Productivity through Unmanned Aerial Vehicles. Agriculture 2020, 10, 256. [Google Scholar] [CrossRef]
Xie, C.; Yang, C. A review on plant high-throughput phenotyping traits using UAV-based sensors. Comput. Electron. Agric. 2020, 178, 105731. [Google Scholar] [CrossRef]
Bayomi, N.; Fernandez, J.E. Eyes in the Sky: Drones Applications in the Built Environment under Climate Change Challenges. Drones 2023, 7, 637. [Google Scholar] [CrossRef]
Yao, H.; Qin, R.; Chen, X. Unmanned Aerial Vehicle for Remote Sensing Applications—A Review. Remote Sens. 2019, 11, 1443. [Google Scholar] [CrossRef]
Zhang, Z.; Zhu, L. A Review on Unmanned Aerial Vehicle Remote Sensing: Platforms, Sensors, Data Processing Methods, and Applications. Drones 2023, 7, 398. [Google Scholar] [CrossRef]
Abdullah, M.M.; Al Ali, Z.M.; Blanton, A.; Charabi, Y.; Abulibdeh, A.; Al Awadhi, T.; Srinivasan, S.; Fadda, E.; Mohan, M. UAVs for improving seasonal vegetation assessment in arid environments. Front. Environ. Sci. 2024, 12, 1366712. [Google Scholar] [CrossRef]
Ullah, S.; Ilniyaz, O.; Eziz, A.; Ullah, S.; Fidelis, G.D.; Kiran, M.; Azadi, H.; Ahmed, T.; Elfleet, M.S.; Kurban, A. Multi-Temporal and Multi-Resolution RGB UAV Surveys for Cost-Efficient Tree Species Mapping in an Afforestation Project. Remote Sens. 2025, 17, 949. [Google Scholar] [CrossRef]
Ting, Y.; Jiang, K.; Li, G.C.; Eichhorn, M.P.; Fan, J.C.; Liu, F.Z.; Chen, B.Q.; An, F.; Cao, L. Individual tree crown segmentation from airborne LiDAR data using a novel Gaussian filter and energy function minimization-based approach. Remote Sens. Environ. 2021, 256, 112307. [Google Scholar]
Qin, H.M.; Zhou, W.Q.; Yao, Y.; Wang, W.M. Individual tree segmentation and tree species classification in subtropical broadleaf forests using UAV-based LiDAR, hyperspectral, and ultrahigh-resolution RGB data. Remote Sens. Environ. 2022, 280, 113143. [Google Scholar] [CrossRef]
Sun, L.; Wang, H.F.; Cai, Y.; Yang, Q.; Chen, C.J.; Lv, G.H. Disentangling the Interspecific and Intraspecific Variation in Functional Traits of Desert Plant Communities under Different Moisture Gradients. Forests 2022, 13, 1088. [Google Scholar] [CrossRef]
Chai, G.Q.; Zheng, Y.F.; Lei, L.T.; Yao, Z.Q.; Chen, M.Y.; Zhang, X.L. A novel solution for extracting individual tree crown parameters in high-density plantation considering inter-tree growth competition using terrestrial close-range scanning and photogrammetry technology. Comput. Electron. Agric. 2023, 209, 107849. [Google Scholar] [CrossRef]
Fiston, N.; Mathieu, V.; Jérôme, T. Mapping common and glossy buckthorns (Frangula alnus and Rhamnus cathartica) using multi-date satellite imagery WorldView-3, GeoEye-1 and SPOT-7. Int. J. Digit. Earth 2023, 16, 31–42. [Google Scholar]
Guo, Y.D.; Zhang, Q.L.; Zhang, R.; Chen, X.Y.; Mi, H.Z. Construction of remote sensing estimation model for biomass of artificial shrub forest in Kubuqi Desert. J. Northeast. For. Univ. 2022, 50, 56–60. [Google Scholar]
Bulluck, L.; Lin, B.; Schold, E. Fine Resolution Imagery and LIDAR-Derived Canopy Heights Accurately Classify Land Cover with a Focus on Shrub/Sapling Cover in a Mountainous Landscape. Remote Sens. 2022, 14, 1364. [Google Scholar] [CrossRef]
Jiao, Y.H. Estimation of Seabuckthorn Phenotypic Information and Leaf Area Index Based on UAV Imagery. Master’s Thesis, Xinjiang Agricultural University, Urumqi, China, 2021. [Google Scholar]
Zhang, W.G. Discussion on Kjeldahl nitrogen determination method. Chin. J. Anal. Chem. 1976, 4, 410. [Google Scholar]
Zhang, W.M.; Qi, J.B.; Wan, P.; Wang, H.T.; Xie, D.H.; Wang, X.Y.; Yan, G.J. An Easy-to-Use Airborne LiDAR Data Filtering Method Based on Cloth Simulation. Remote Sens. 2016, 8, 501. [Google Scholar] [CrossRef]
Kim, K.H.; Baek, J.G. A Prediction of Chip Quality using OPTICS (Ordering Points to Identify the Clustering Structure)-based Feature Extraction at the Cell Level. J. Korean Inst. Ind. Eng. 2014, 40, 257–266. [Google Scholar] [CrossRef]
Dai, Z.; He, R.; Wang, H.; Bai, W. Adaptive individual tree extraction method by integrating airborne LiDAR and vegetation index. Opt. Precis. Eng. 2023, 31, 3331–3344. [Google Scholar] [CrossRef]
Li, W.K.; Guo, Q.H.; Jakubowski, M.K.; Kelly, M. A New Method for Segmenting Individual Trees from the Lidar Point Cloud. Photogramm. Eng. Remote Sens. 2012, 78, 75–84. [Google Scholar] [CrossRef]
Ayrey, E.; Fraver, S.; Kershaw, J.A.; Kenefic, L.S.; Hayes, D.; Weiskittel, A.R.; Roth, B.E. Layer Stacking: A Novel Algorithm for Individual Forest Tree Segmentation from LiDAR Point Clouds. Can. J. Remote Sens. 2017, 43, 16–27. [Google Scholar] [CrossRef]
Huang, G.F. Effects of Drought Stress on the Growth of Seabuckthorn Seedlings. Agric. Eng. Technol. 2023, 43, 49–50. [Google Scholar]
Wang, Y.F.; Zhang, X.Y.; Liu, J.H.; Chang, R.X.; Wang, X.S. Research Progress and Development Prospects of Seabuckthorn Functions. China Fruit Veg. 2021, 41, 49–53. [Google Scholar]
Zhou, S. Study on Morphological and Anatomical Characteristics and Environmental Adaptability of Three Seabuckthorn Species in Tibet. Master’s Thesis, Tibet Agriculture and Animal Husbandry College, Nyingchi, China, 2023. [Google Scholar]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Knable, M.B.; Barci, B.M.; Bartko, J.J.; Webster, M.J.; Torrey, E.F. Molecular abnormalities in the major psychiatric illnesses: Classification and Regression Tree (CRT) analysis of post-mortem prefrontal markers. Mol. Psychiatry 2002, 7, 392–404. [Google Scholar] [CrossRef][Green Version]

Figure 1. The technical workflow of the study. (a) Step 1: Data acquisition and preprocessing, showing the deployed platforms and sensors for collecting UAV LiDAR, handheld LiDAR, and multispectral data. (b) Step 2: Tree–shrub separation, showing the processing chain from the raw point cloud to the classified result. Key steps include ground filtering with CSF and object classification with OPTICS. (c) Step 3: Individual tree segmentation, showing the methodology for delineating individual trees from the CHM and point cloud using a watershed algorithm and clustering. All LiDAR points in the image are color-coded by elevation, with blue representing lower areas and red indicating higher elevations. (d) Step 4: Tree trait estimation, showing the extraction and inversion of structural (height, crown, DBH) and biochemical (nitrogen) traits from the multi-source data.

Figure 2. Flowchart of the tree–shrub separation method. (a) Overall procedure, which consists of three main steps: filtering the normalized LiDAR data to obtain above-ground points; extracting candidate feature points to generate a tree canopy candidate point cloud; and performing clustering to separate tree and shrub point clouds. (b) Step 1: Filtering. The raw data first undergo dual-threshold filtering to remove noise, followed by ground point extraction using the Cloth Simulation Filter (CSF) algorithm. (c) Step 2: Candidate feature point extraction. A multi-scale parallel processing approach is applied to extract candidate tree points at neighborhood radii of 3 m, 5 m, and 7 m. (d) Step 3: Tree–shrub clustering. The OPTICS algorithm is used in combination with height and density constraints to cluster and distinguish between tree and shrub point clouds.

Figure 3. The workflow of mapping the leaf nitrogen content (LNC) of individual trees using the N-PROSAIL model combined with the Random Forest (RF) algorithm. The process begins by inputting canopy traits into the 4SAIL model and leaf traits into the N-PROSPECT model, respectively. The N-PROSAIL model then performs a forward simulation to generate simulated vegetation indices (VIs). The colored curves in the image represent simulated spectral curves, and the four colored borders correspond to the four bands of the sensor. These simulated VIs are then used to train a RF model whose performance is subsequently validated with field-measured remote sensing data to achieve the accurate estimation of LNC.

Figure 4. Sample plot overview and single tree segmentation results: (a-1–a-3) sample plot: (a-1) RGB image of the sample plot; (a-2) point cloud data of the sample plot; (a-3) CHM of the sample plot; (b-1–b-3) segmentation on original point clouds (each color represents a distinct tree); (b-1) DBSCAN clustering; (b-2) hierarchical clustering; (b-3) marker-controlled watershed segmentation; (c-1–c-3) individual trees, segmented and color-coded, from the canopy (gray point clouds represent shrub-form background vegetation; colored point clouds represent individual arborescent seabuckthorn trees, with each color indicating a distinct individual); (c-1) DBSCAN clustering; (c-2) hierarchical clustering; (c-3) marker-controlled watershed segmentation.

Figure 5. Tree height extraction results. (a) Scatter plot of prediction accuracy based on absolute error grouping; samples were divided into a low-error group (lower 2/3 of error distribution) and a high-error group (upper 1/3 of error distribution) based on the textiles of absolute error, shown in red and green, respectively. (b) Validation of prediction accuracy for high-quality sample subset; a high-quality prediction subset was formed by screening samples with relative error ≤ 20% from the entire sample set. (c) Comparison of distribution characteristics of tree height predictions across three groups, showing a comprehensive comparison of the distribution differences in actual tree height, predicted tree height, and absolute error among three groups: low-error group (lower 2/3 of absolute error), high-error group (upper 1/3 of absolute error), and high-quality group (relative error ≤ 20%).

Figure 6. DBH modeling results: (a) DBH predicted using tree height in a linear regression model, validated against DBH extracted from handheld LiDAR data; (b) DBH predicted using tree height in a polynomial regression model, validated against field-measured DBH; (c) DBH predicted using tree height and crown width in a power regression model, validated against DBH extracted from handheld LiDAR data; (d) DBH predicted using tree height and crown width in a polynomial regression model, validated against field-measured DBH. The red area in the figure represents the confidence interval.

Figure 7. Study area and structural trait results. (a-1–a-3) The RGB orthophoto of the study area. (b-1–b-3) The result of extracting the structural trait of the crown width of the study area. (c-1–c-3) The result of extracting the tree height trait of the study area. (d-1–d-3) The result of inverting the trait of the DBH of the study area.

Figure 8. The prediction accuracies of leaf nitrogen content (LNC) using the models built based on measured data with self-testing (a); the models established based on simulated data with self-testing (b); the models built based on simulated data and validation with measured data (c). The red area in the figure represents the confidence interval.

Figure 9. Mapping the leaf nitrogen content of individual trees within the seabuckthorn forests. (a) The LNC prediction results for the entire study area; (b) the LNC prediction results for a partial region; (c) the LNC prediction results at the individual tree scale.

Table 1. Basic traits and their ranges for the N-PROSAIL model.

Trait Category	Trait (Unit)	Symbol	Range/Value	Data Source
Leaf Biochemical	Nitrogen Content (μg/cm²)	N	20–50	Field measurement
	Chlorophyll Content (μg/cm²)	Cab	20–40	[59]
	Carotenoid Content (μg/cm²)	Ccar	4–12	[60]
	Equivalent Water Thickness (cm)	Cw	0.01–0.05	[59]
	Dry Matter Content (g/cm²)	Cm	0.003–0.009	[60]
Canopy Structural	Leaf Area Index	LAI	1.5–6.0	LiDAR estimate
	Average Leaf Inclination Angle (°)	ALA/ALIA	30–70	[61]
	Hotspot Trait	hspot	0.01–0.2	Model default range
	Leaf Structure Trait	N	1.2–2.0	Model default range
Soil	Soil Brightness Coefficient	psoil	0–1	Field observation
Observation Geometry	Solar Zenith Angle (°)	θs/SZA	20–50	Flight parameter
	View Zenith Angle (°)	θv/VZA	0–10	Flight parameter
	Relative Azimuth Angle (°)	φ/RAA	0–180	Flight parameter

Table 2. Effects of different preprocessing methods on segmentation results.

Preprocessing Strategy	Segmentation Method	TP	FN	FP	P	R	OR	F1-Score
Segmentation on Original Point Clouds	DBSCAN Clustering	13	6	7	50.00%	68.42%	26.92%	57.78%
	Hierarchical Clustering	17	7	2	65.38%	70.83%	7.69%	68.00%
	Marker-Controlled Watershed	23	2	89	20.18%	92.00%	78.07%	33.09%
Segmentation on Separated Trees	DBSCAN Clustering	17	3	4	70.83%	85.00%	16.67%	77.27%
	Hierarchical Clustering	16	3	0	84.21%	84.21%	0.00%	84.21%
	Marker-Controlled Watershed	20	0	21	48.78%	100.00%	51.22%	65.57%

Note: TP is correct segmentation, FN is missing segmentation, FP is over-segmentation, P is precision, R is recall, and OR is over-segmentation rate.

Table 3. DBH modeling results.

Input Parameters	Model Type	Validation Data	RMSE (cm)	R²
Tree height	Linear Regression	Modeled Data	1.719	0.780
	Polynomial Regression		1.731	0.765
	Power Function		1.726	0.771
	Linear Regression	Measured Data	3.384	0.524
	Polynomial Regression		3.287	0.551
	Power Function		3.453	0.504
Tree height & Crown width	Linear Regression	Modeled Data	1.718	0.781
	Polynomial Regression		1.701	0.785
	Power Function		1.725	0.779
	Linear Regression	Measured Data	3.407	0.517
	Polynomial Regression		3.254	0.560
	Power Function		3.462	0.502

Note: Modeled DBH is extracted from handheld LiDAR data, while measured DBH is obtained from field surveys.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xue, W.; Zhou, K.; Dunzhu, P.; Xing, Z.; Wu, Y.; Lin, L.; Shen, X.; Cao, L. Estimation of Individual Tree-Level Structural and Biochemical Traits for Seabuckthorn Forests in Lhasa Valley Plain by Coupling UAV-Based LiDAR and Multispectral Images with N-PROSAIL Model. Remote Sens. 2026, 18, 909. https://doi.org/10.3390/rs18060909

AMA Style

Xue W, Zhou K, Dunzhu P, Xing Z, Wu Y, Lin L, Shen X, Cao L. Estimation of Individual Tree-Level Structural and Biochemical Traits for Seabuckthorn Forests in Lhasa Valley Plain by Coupling UAV-Based LiDAR and Multispectral Images with N-PROSAIL Model. Remote Sensing. 2026; 18(6):909. https://doi.org/10.3390/rs18060909

Chicago/Turabian Style

Xue, Wenkai, Kai Zhou, Pubu Dunzhu, Zhen Xing, Yunhua Wu, Ling Lin, Xin Shen, and Lin Cao. 2026. "Estimation of Individual Tree-Level Structural and Biochemical Traits for Seabuckthorn Forests in Lhasa Valley Plain by Coupling UAV-Based LiDAR and Multispectral Images with N-PROSAIL Model" Remote Sensing 18, no. 6: 909. https://doi.org/10.3390/rs18060909

APA Style

Xue, W., Zhou, K., Dunzhu, P., Xing, Z., Wu, Y., Lin, L., Shen, X., & Cao, L. (2026). Estimation of Individual Tree-Level Structural and Biochemical Traits for Seabuckthorn Forests in Lhasa Valley Plain by Coupling UAV-Based LiDAR and Multispectral Images with N-PROSAIL Model. Remote Sensing, 18(6), 909. https://doi.org/10.3390/rs18060909

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Estimation of Individual Tree-Level Structural and Biochemical Traits for Seabuckthorn Forests in Lhasa Valley Plain by Coupling UAV-Based LiDAR and Multispectral Images with N-PROSAIL Model

Highlights

Abstract

1. Introduction

2. Materials and Methods

2.1. Study Area and Technology Workflow

2.2. UAV Data Acquisition and Pre-Processing

2.3. Individual Tree Segmentation Methods

2.4. Tree Height Trait Extraction

2.5. Diameter at Breast Height Prediction Model

2.6. Prediction of Leaf Nitrogen Content Within Individual Tree Canopies

3. Results

3.1. Results of Individual Tree Splitting and Tree Height Extraction

3.2. DBH Prediction Model

3.3. Mapping LNC of Individual Seabuckthorn Trees

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI