Next Article in Journal
Genomic Footprints of Multiple Host Lineages in the Mitochondrial and Nuclear Genomes of the Holoparasite Prosopanche americana
Previous Article in Journal
Functional Analysis of CbbHLH35 Reveals Its Role in Drought and Cold Stress Tolerance in Caladium bicolor
Previous Article in Special Issue
Mulberry Drought Diagnosis: Integrating Proximal Sensing and Metabolomics for Remote Monitoring
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimation of Nitrogen Status in Zanthoxylum armatum var. novemfolius Using Machine Learning Algorithms and UAV Hyperspectral and LiDAR Data Fusion

1
College of Resources and Environment, Southwest University, Chongqing 400716, China
2
Chongqing District Agricultural Technology Extension Center, Chongqing 401121, China
3
National Agricultural Science Beibei Observation and Experiment Station, Chongqing 400716, China
4
Chongqing Field Scientific Observatory of Soil Quality and Ecological Environment, Hechuan 401519, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Plants 2026, 15(7), 1119; https://doi.org/10.3390/plants15071119
Submission received: 22 January 2026 / Revised: 26 March 2026 / Accepted: 3 April 2026 / Published: 6 April 2026
(This article belongs to the Special Issue Remote Sensing for Diagnosis of Plant Health)

Abstract

Accurate monitoring of nitrogen (N) status is critical for precision N management and optimizing the yield and quality of Zanthoxylum armatum var. novemfolius (ZA). However, individual sensors often struggle to simultaneously capture the biochemical variations and complex canopy structural changes of ZA. Therefore, field experiments were conducted over two consecutive years, applying four N-application rates (0, 150, 300, and 450 kg N ha−1) to ZA. At each phenological stage, hyperspectral imagery and LiDAR point clouds were collected via three UAV flight altitudes (60 m, 80 m, and 100 m), and canopy nitrogen concentration (CNC) and aboveground nitrogen accumulation (AGNA) were measured. This study developed a framework by synergistically fusing UAV-derived hyperspectral imaging (HSI) and LiDAR data for CNC and AGNA monitoring. Results showed that the response of nitrogen status indicators to fertilization was phenology-specific: CNC showed no significant difference (p > 0.05) among treatments during the vigorous vegetative growth stage (VGS) but differed significantly (p < 0.05) during the fruit expansion stage (FES); AGNA differed significantly among treatments at VGS and FES (p < 0.05). The two-step screening yielded NDSI (732, 879) and NDSI (560, 690) as the optimal CNC indicators at VGS and FES, respectively (r = 0.83 and 0.93), whereas the NDSI (711, 986) and NDSI (515, 736) were identified as the optimal AGNA indicators at VGS and FES, respectively (r = 0.91 and 0.71). Across all phenological stages, Random Forest Regression consistently delivered the highest accuracy for CNC (R2 = 0.93–0.98, RMSE = 0.87–1.02 g kg−1) and AGNA (R2 = 0.95–0.97, RMSE = 1.92–2.55 g plant−1), outperforming MLR, PLSR, and SVR. This synergistic framework provides a high-precision, non-destructive methodology for the precision N monitoring of woody crops.

1. Introduction

Zanthoxylum armatum var. novemfolius (ZA), a characteristic economic woody spice in China, is widely cultivated for its unique aroma and medicinal properties [1]. Nitrogen (N) is one of the essential nutrient elements required for the growth of ZA [2]. A previous study has demonstrated that nitrogen status is the crucial diagnostic indicator for fruit development of ZA [3]. Precise N-monitoring techniques for the woody crop have been hindered by its complex canopy architecture and distinct phenological characteristics [4]. Previous studies have demonstrated that the phenological stages significantly influence nitrogen dynamics [5]. For instance, apple trees maintain higher N levels during flowering than during the fruit expansion stage [6]. Nevertheless, research investigating the interactive effects of N application rates and phenological stages on the nutritional status of ZA remains scarce. Furthermore, traditional diagnostic methods—primarily laboratory-based chemical analyses—are destructive, labor-intensive, and costly [7], necessitating the development of more efficient, non-destructive monitoring frameworks [8]. Currently, UAV-based hyperspectral imaging (HSI), which captures continuous narrow-band reflectance, offers significant advantages in quantifying nitrogen-related biochemical traits [6]. However, its application is often hindered by the “saturation effect” in dense perennial canopies [9]. This phenomenon occurs because HSI primarily captures surface reflectance, where upper leaves physically obscure the lower layers, effectively “blinding” the sensor to the biochemical information within the internal canopy volume once the leaf area reaches a certain density. To overcome this two-dimensional limitation, Light Detection and Ranging (LiDAR) is introduced to provide a critical three-dimensional perspective [10]. By penetrating the canopy to quantify structural parameters such as volume and density, LiDAR complements spectral data by accounting for the biomass depth that HSI fails to perceive, thereby significantly enhancing the accuracy of canopy nitrogen status estimation [11,12]. Furthermore, the optimization of UAV flight parameters is critical for accurate data acquisition, although the influence of flight altitude on nitrogen estimation has not been fully consistent across studies [13,14]. For instance, studies in citrus orchards have shown that lower altitudes (e.g., 50 m) provide richer structural detail but may introduce motion blur and increase data processing complexity, while higher altitudes (e.g., 120 m) may lead to the loss of small-scale canopy gaps essential for nitrogen estimation [4]. In cotton crops, research has demonstrated that as UAV flight altitude increases (e.g., from 60 m to 100 m), spectral reflectance in the visible range (500–550 nm) increases, while near-infrared reflectance decreases due to the inclusion of more soil background information in the pixels [14]. Determining the optimal flight altitude is therefore essential for improving data quality.
In this study, a multi-rotor UAV platform equipped with hyperspectral and LiDAR sensors was employed to collect data from a ZA orchard over two consecutive years. The specific objectives were to: (1) Analyze the variation in canopy nitrogen concentration (CNC) and aboveground nitrogen accumulation (AGNA) across the different phenological stages of ZA; (2) to develop a workflow for screening phenological sensitive UAV hyperspectral and LiDAR parameters for monitoring CNC and AGNA, including multiple flight altitudes (60, 80, and 100 m); (3) to develop highly accurate and robust models for CNC and AGNA monitoring by machine learning algorithms and UAV hyperspectral and LiDAR parameter fusion.

2. Results

2.1. Nitrogen Status Affected by Nitrogen Application Ratio and Different Stages

Both nitrogen application rates and growing stages significantly influenced the biomass, canopy nitrogen concentration, and aboveground nitrogen accumulation of Zanthoxylum armatum (ZA) (Table 1). During the vigorous vegetative growth stage (VGS), the nitrogen application ratio significantly influenced the biomass and aboveground nitrogen accumulation (AGNA) of ZA (p < 0.05), whereas the canopy nitrogen concentration (CNC) showed no significant differences among the nitrogen application treatments (p > 0.05), which is because of nitrogen dilution effect. Both biomass and AGNA exhibited a clear increasing trend with rising nitrogen application rates, reaching their peak values under the N3 treatment (2.73–4.04 kg plant−1; 37.12–49.15 g plant−1). However, the average CNC value among N1–N3 treatments was only 3.04% higher than that of the N0 treatment. During the fruit expansion stage (FES), the biomass showed no significant differences among the nitrogen application treatments (p > 0.05), whereas the CNC and AGNA showed significant differences among the nitrogen application treatments (p < 0.05). The CNC and AGNA exhibited a clear increasing trend with rising nitrogen application rates, reaching their peak values under the N2 treatment (35.41–39.76 g kg−1; 47.18–59.92 g plant−1). The yield of N2 was found to be significantly higher than the N0 and N1 treatments, with 5.24 kg plant−1 at Lishi town and 5.6 kg plant−1 at Ciyun town.

2.2. Correlations of Canopy Nitrogen Content and Above Ground Nitrogen Accumulation with UAV Hyperspectral and LiDAR Parameters Across Flight Altitudes and Growth Stages

The Spearman correlations of hyperspectral vegetation indices and LiDAR parameters with the canopy nitrogen concentration (CNC) and aboveground nitrogen accumulation (AGNA) also exhibited varying trends across different phenological stages (Figure 1). The correlation between LiDAR parameters (PH + CD + CV) and CNC reached 0.79 at the fruit expansion stage (FES), which was markedly higher than the correlation during the vigorous vegetative growth stage (VGS, r = 0.35). During the VGS, the correlations of 11 spectral parameters with CNC ranged from −0.05 to −0.36 (negative) and 0.47 to 0.83 (positive), with the correlation of NDSI (732, 879) being the highest (0.83). At the FES, the correlations of 11 spectral parameters with CNC ranged from −0.18 to −0.90 (negative) and 0.58 to 0.93 (positive), among which the correlation of NDSI (560, 690) was the highest (0.93). The direction (positive or negative) of these correlations is primarily determined by the mathematical structure of the index formulas. The correlations of LiDAR parameters (PH + CD + CV) with AGNA remained at a consistently high level, ranging from 0.85 to 0.92. During the VGS, the correlations of 11 spectral parameters with AGNA ranged from −0.41 to −0.76 (negative) and 0.45 to 0.91 (positive), with the correlation of NDSI (711, 986) being the highest (0.91). At the FES, the correlations of 11 spectral parameters with CNC ranged from −0.01 to −0.46 (negative) and 0.04 to 0.78 (positive), among which the correlation of NDSI (515, 736) was the highest (0.78). Furthermore, multicollinearity diagnostics and feature contribution analysis were conducted to verify model robustness and interpretability (Table S3).

2.3. Nitrogen Status Retrieval Models by Machine Learning Methods and Data Fusion

The predictive performance for CNC was significantly influenced by the regression models, with the coefficient of determination (R2) of Random Forest Regression (RFR, R2 = 0.93–0.98, RMSE = 0.87–1.02 g kg−1) always outperforming that of Multiple Linear Regression (MLR), Partial Least Squares Regression (PLSR), and Support Vector Regression (SVR) (R2 = 0.71–0.93, RMSE = 0.10–0.18 g kg−1) (Table 2). Similarly, the predictive performance for AGNA was significantly influenced by the regression models, with the R2 of Random Forest Regression (R2 = 0.95–0.97, RMSE = 1.92–2.55 g plant−1) always outperforming that of MLR, PLSR, and SVR (R2 = 0.80–0.93, RMSE = 0.21–0.52 g plant−1). The fitting relationship between the measured values and the predicted values obtained via the leave-one-out cross-validation method for CNC and AGNA using the RFR regression models across distinct phenological stages is displayed in Figure 2. Furthermore, our analysis of configuration trade-offs revealed that an Optimized Altitude (60 m LiDAR + 100 m HSI) achieved a 4.40–8.99% improvement in predictive accuracy compared to a uniform altitude (Table S4). While a uniform altitude simplifies flight planning, the Optimized Altitude approach maximizes the synergy between high-resolution structural metrics and noise-mitigated biochemical signals.

3. Discussion

3.1. Divergent Dominance of Pigment Sensitivity and Structural Biomass in Nitrogen Status Diagnostics Across Phenological Stages

Nitrogen in fruit trees exhibits a phase-transition trend across different phenological stages. To improve the accuracy of nitrogen monitoring, we investigate how vigorous growth stage (VGS) and fruit expansion stage (FES) affect nitrogen monitoring in fruit trees [15]. During the VGS of Zanthoxylum armatum (ZA), the elongating canopy and rapidly expanding leaf area require a sustained, adequate N supply to drive chlorophyll synthesis; yet this supply must remain balanced—excess N triggers uneven partitioning between flowers and leaves, while the N that is taken up is continuously channeled into photosynthetic production that fuels new leaf growth. During VGS, our results show that canopy nitrogen concentration (CNC) remained virtually flat, whereas aboveground nitrogen accumulation (AGNA) responded sharply to incremental nitrogen supply. During the VGS, a study of ZA revealed an equivalent split-response: CNC in leaves and fruits shifted by only 21.11% and 11.22% across fertilizer levels, whereas nitrogen accumulation AGNA extended across 53.84% and 54.31% [2]. CNC prediction performed better during FES than VGS. This is due to the nitrogen dilution effect, where rapid structural dry matter accumulation dilutes nitrogen concentration as trees develop [16,17]. Furthermore, AGNA correlate more strongly with LiDAR structural parameters, because it integrates both nitrogen concentration and total biomass. LiDAR effectively captures three-dimensional canopy structures (e.g., volume and height), which directly reflect the biomass allocation dynamics [18]. Consequently, the pronounced phase-transition of Nitrogen status between VGS and FES generates corresponding, fine-scale variations in canopy reflectance that conventional fixed-band indices often miss. The superior performance of the reconstructed Normalized Difference Spectral Index (NDSI) across all flight altitudes and growth stages confirms that full-spectrum optimization (400–1000 nm) is critical for overcoming signal saturation in dense perennial canopies [19]. By dynamically identifying optimal band pairs, the NDSI framework captures subtle nitrogen-induced shifts that fixed-band indices often fail to detect [20]. Furthermore, our findings reveal a significant phenological link between the optimal spectral regions and the physiological priorities of the crop: during the VGS, the optimal NDSI band pairs (e.g., 711, 732, and 986 nm) shifted toward the 700–1000 nm near-infrared region, which is highly indicative of plant biomass and internal leaf cell structure [21]. As VGS is characterized by rapid dry matter accumulation and structural expansion, these NIR signals reflect the plant’s physical volume and density, aligning with the retrieval requirements for AGNA. Conversely, during the FES, the optimal band pairs (e.g., 515, 560, and 690 nm) were consistently located within the 400–700 nm visible light range, a region dominated by the absorption of photosynthetic pigments such as chlorophyll and carotenoids [22]. The massive translocation of nitrogen from leaves to fruits during this stage triggers sensitive fluctuations in pigment concentration, making the visible spectrum the most potent window for diagnosing CNC. To leverage these spectral sensitivities, this framework integrates hyperspectral-based chemical analysis with LiDAR-derived structural mapping, thereby enabling a more comprehensive assessment of nitrogen status by combining nitrogen concentration data with total biomass estimates. This approach ensures that both nutrient intensity and total nitrogen content are accurately quantified within the complex spatial architecture of the orchard [18].

3.2. Synergistic Effects of Multi-Source Data Fusion and Machine Learning

The integration of hyperspectral imagery and LiDAR point clouds demonstrated a superior capacity for capturing the complex nitrogen (N) dynamics of Zanthoxylum armatum (ZA) compared to single-sensor approaches. In this study, the Random Forest Regression (RFR) model consistently outperformed traditional methods like MLR and PLSR, achieving a peak R2 of 0.98. The superior performance of RFR over SVR (R2: 0.78–0.93) stems from its inherent advantage in managing the highly non-linear, high-dimensional data typical of dense canopies, whereas, SVR relies on a global optimal hyperplane that is often overly sensitive to hyperparameter tuning in heterogeneous datasets [23]. Similar robust performances of RFR in multi-source fusion have been documented in other woody crops, such as almond orchards, where the inclusion of structural variables significantly refined canopy-scale N estimations [18]. The synergy between the two sensors effectively addresses the “saturation effect” that often limits the precision of passive optical sensors in dense perennial canopies. While optimized hyperspectral indices (e.g., NDSI) are highly sensitive to N-induced chlorophyll shifts in the upper canopy layer, they frequently fail to reflect the N status of the sub-canopy. By incorporating LiDAR-derived variables—specifically the PH + CD + CV triplet—the model accounts for the “physical” context of the 3D architecture, which is a critical predictor for nitrogen accumulation (AGNA). This aligns with recent findings that integrating canopy height and volume data into spectral models provides a more complete picture of nitrogen status by combining LiDAR-derived structure and hyperspectral-derived chemistry [13].

3.3. Impact of UAV Flight Altitude on Retrieval Precision

A critical finding of this research is that the “optimal” UAV flight altitude is not universal but is highly dependent on both the target nitrogen trait and the specific growth stage. For canopy nitrogen concentration (CNC) retrieval, a higher altitude of 100 m generally yielded the best performance, with fusion data reaching peak R2 values during both growth stages. At increased heights, the sensor’s larger footprint effectively averages out “leaf-level” noise, such as localized specular reflection and shadow occlusion within the canopy [13,24]. By contrast, Aboveground nitrogen accumulation (AGNA) retrieval responded differently to flight height depending on phenological stage. During FES the 100 m altitude again delivered the highest fitting accuracy (R2 = 0.95), yet in VGS the 60 m flight level outperformed both the 80 m and 100 m, boosting fitting accuracy by 2.95–3.27%. We attribute this switch to the biomass-oriented band pair selected by the reconstructed NDSI at 60 m (711 and 986 nm); at higher altitudes the same index shifted toward shorter wavelengths that are less sensitive to canopy volume, thereby weakening the correlation with AGNA [1,20]. For the LiDAR component, 60 m was consistently superior for both growth stages and both nitrogen traits (Table S2). The 60 m flight altitude provides significantly higher point cloud density, enabling the precise quantification of canopy physical parameters like height and volume [25]. This physical precision improves AGNA retrieval by providing a more accurate biomass basis for mass-based estimation, while also helping CNC models account for shadowing and canopy heterogeneity [26]. Collectively, the results prescribe a flight plan that is sensor-specific rather than universal: 100 m for hyperspectral acquisition to maximize biochemical signal-to-noise, 60 m for LiDAR to preserve the structural fidelity required for biomass-linked nitrogen mapping.

3.4. Limitations and Future Perspectives

While the integration of UAV-derived HSI and LiDAR data demonstrated significant potential for nitrogen monitoring, several limitations should be acknowledged. First, the sample size in this study was relatively limited (n = 16 per stage), which is a common challenge in research involving perennial woody crops due to the high costs and the destructive sampling (over 500 yuan per tree). Thus, we ultimately used leave-one-out cross-validation to build the model. Although leave-one-out cross-validation was utilized to maximize the utility of the available data, but it may tend to overestimate performance. Second, due to the limited total number of samples (n = 32), this study did not implement a fully independent validation set from different years or varieties. In future work, the dataset should be expanded to include a larger sample size, which can then be divided into training, validation, and test sets to further verify the generalizability and robustness of the diagnostic framework. Third, regarding the modelling methods, this study primarily evaluated traditional machine learning algorithms. While Deep Learning (DL) architectures offer powerful capabilities for multi-modal data fusion, modern deep learning architectures (e.g., convolutional neural networks for spectral–spatial cubes, pointNet for point clouds or transformer-based spectral models) could potentially capture non-linear canopy–nitrogen relationships more effectively than ensemble trees. Although our current dataset is limited, future research should leverage these DL frameworks to enhance modeling performance as larger datasets become available. Finally, external environmental factors such as wind speed, complex terrain, and orchard spacing were not fully integrated into the current model. Future work should incorporate these practical constraints and environmental variables to enhance the operational feasibility of the framework in diverse and complex orchard environments.

4. Materials and Methods

4.1. Study Site and Experimental Design

The study was conducted from 2020 to 2021 at the cultivation bases of Lishi Town and Ciyun Town, Chongqing, China (106.1800° E, 29.0700° N, altitude 330 m) (Figure 3). These two orchards were selected as independent experimental sites because they represent the two most typical large-scale commercial cultivation models in the region. Specifically, the Ciyun site is a high-yield demonstration base characterized by intensive precision management, while the Lishi site follows traditional large-scale farming practices with moderate productivity levels. The study region is characterized by an annual average temperature of 18.2 °C and an annual average precipitation of 1034.7 mm, which belongs to a subtropical humid monsoon climate (Figure 1). The cultivars used in this experiment were the primary Zanthoxylum armatum (ZA) grown in Jiangjin, Chongqing. These 8-year-old trees have been in continuous fruit production for over 5 years, exhibit strong adaptability to the local climate, and are recognized for their stable performance These characteristics ensure that the selected ZA are both representative and suitable for evaluating canopy nitrogen concentration (CNC) and aboveground nitrogen accumulation (AGNA) under different nitrogen concentration.
The fertilizer treatments and soil properties are summarized in Table 3. This experiment included four N application rates: N0 (0 kg N ha−1, control), N1 (150 kg N ha−1), N2 (300 kg N ha−1, the local recommended rate), and N3 (450 kg N ha−1, the conventional rate used by local farmers) [2]. Nitrogen was supplied as urea (CO(NH2)2, 46% N). Phosphorus and potassium were applied at total rates of 120 kg P2O5 ha−1 and 100 kg KCl ha−1, respectively. Phosphorus and potassium were supplied as calcium superphosphate, 12% P2O5 and muriate of potash, 62%. These treatments were arranged in a randomized complete block design with three replicates, with each replicate comprising 10 individual trees to ensure representative sampling. Fertilizers were applied in four phenological stages: shoot-promoting (June), autumn basal (late September), bud break (next February), and fruit expansion (next April). These nutrients were partitioned across the four application timings according to the following proportions: nitrogen (30%, 30%, 20%, and 20%), phosphorus (50%, 20%, 20%, and 10%), and potassium (30%, 10%, 20%, and 40%), respectively. The planting density was 1667 plants per hectare, with a row spacing of 2 m and a plant spacing of 3 m. The predominant soil type in the area is purple soil. The soil nutrient backgrounds (Table 3) provide different alkaline nitrogen. Adequate irrigation and pest management ensured minimal stress during crop growth.

4.2. The Framework of This Study

The study workflow (Figure 4) utilizes a multi-source UAV platform for simultaneously acquisition of hyperspectral and LiDAR data. Two-year experiments (2020–2021) provided data for constructing Zanthoxylum armatum (ZA) canopy nitrogen concentration (CNC) and aboveground nitrogen accumulation (AGNA) estimation models at the vigorous vegetative and fruit-expansion stages. Machine learning algorithms (MLR, PLSR, SVM, and RFR) were applied to different flight altitude combinations (60 m, 80 m, and 100 m) to develop these models. Factors influencing model estimation precision were thoroughly examined, with particular attention paid to the growth stage, flight height, and algorithm performance impacts.

4.3. Ground Data Acquisition

Prior to remote-sensing data collection, two Zanthoxylum armatum (ZA) trees with uniform growth statuses were selected and marked. Immediately after the remote sensing data were collected, destructive aboveground sampling was conducted on the target ZA trees. Sixteen trees were sampled during each growth stage (vigorous vegetative growth and fruit expansion stages), resulting in a total of 32 sample trees. The specific sampling method was as follows: the trunk was cut off near the ground level, and the aboveground parts were separated into four sections: branches, leaves, fruits, and main stem. The fresh weights of each part were measured. For each section, approximately 200 g of samples were collected from the east, west, south, and north sides of the canopy respectively. Of these, 100 g was placed in a self-sealing bag and stored in a refrigerator for further measurement; the remaining 100 g was thoroughly washed with deionized water, air-dried, weighed for fresh weight, then subjected to enzyme inactivation at 105 °C for 30 min in an oven (Shanghai Yiheng Scientific Instrument Co., Ltd., Shanghai, China), and dried at 65 °C to constant weight before being weighed and recorded. The dried samples were ground, passed through a 60-mesh sieve, and used for nitrogen content determination. The samples were digested using the concentrated H2SO4-H2O2 (Sinopharm Chemical Reagent Co., Ltd., Shanghai, China) method, and the total nitrogen content was measured using a Kjeldahl apparatus (Hanon Advanced Technology Group Co., Ltd., Jinan, China) [27].
CNC = NCleaf
AGNA = NCleaf × Mleaf + NCshoot × Mshoot + NCfruit × Mfruit + NCtrunk × Mfruit
where CNC = canopy nitrogen concentration (g kg−1); AGNA = above-ground nitrogen accumulation (g plant−1); NCleaf, NCshoot, NCfruit, NCtrunk = nitrogen concentration in leaf, shoot, fruit, and trunk, respectively (g kg−1); Mleaf, Mshoot, Mfruit, Mtrunk = dry biomass of leaf, shoot, fruit, and trunk, respectively (g plant−1).

4.4. UAV Data Acquisition and Processing

4.4.1. UAV Platform

In this study, a DJI M600 Pro hexacopter UAV (DJI Company, Shenzhen, China) equipped with an IRIS integrated hyperspectral-LiDAR imaging system (LICA United Technology Limited, Beijing, China) was used to simultaneously collect hyperspectral imagery and LiDAR point cloud data of the Zanthoxylum armatum (ZA) canopy. Table 4 outlines the specifications of the IRIS integrated hyperspectral-LiDAR imaging system. To ensure the quality of the hyperspectral images and point cloud stitching, the UAV was programmed with an 80% forward overlap and a 60% side overlap, maintaining a flight speed of 1 m s−1 at altitudes of 60 m, 80 m, and 100 m. Each acquisition was conducted under clear and cloudless conditions between 11:00 a.m. and 2:00 p.m. Furthermore, to cover the growth characteristics of ZA during its key developmental stages, the above-mentioned remote sensing data were collected in the experimental areas of Lishi Town and Ciyun Town during late October (the end of the vigorous vegetative growth stage, VGS) and mid-April (the end of the fruit-expansion stage, FES) of both 2020 and 2021, respectively.

4.4.2. Hyperspectral Image and LiDAR Data Processing

Hyperspectral images were collected and initially stitched using Agisoft PhotoScan to generate digital ortho-reflectance images. To ensure the positional accuracy of the hyperspectral image, nine pieces of 60 cm × 60 cm white reference plates were evenly placed around and in the center of the sample orchard as control points, and their geographical coordinates were recorded for geometric correction. The corrected images were radiometrically calibrated using deployed PTFE standard gray panels to convert the image’s Digital Number (DN) values into the reflectance of the Zanthoxylum armatum (ZA) canopy. Regions of Interest (ROIs) for the target ZA canopy were constructed based on marked information (selecting 50 pixels in each cardinal direction within the canopy while avoiding soil, ground vegetation, and shaded areas), and the average reflectance was extracted as the spectral reflectance for each sample point. Finally, the obtained hyperspectral data were subjected to noise reduction using the Savitzky–Golay (SG) convolution smoothing algorithm, resulting in canopy reflectance curves for modeling analysis. Simultaneously, Raw point clouds were pre-processed in LiDAR360 (version 5.0, GreenValley International, Berkeley, CA, USA) for noise removal and ground classification via the Progressive Densification TIN filtering algorithm. A smoothed Canopy Maxima Model was then generated to facilitate individual tree segmentation using a marker-controlled watershed algorithm. Finally, structural parameters were extracted using PCM software (version 2.0, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China): Plant Height and Crown Diameter were derived from the maximum vertical height and the average of cardinal crown widths, respectively, while Canopy Volume was quantified using the 3D convex hull method to accurately characterize the ZA canopy architecture [23]. These features were then used for machine learning modeling; specifically, the Random Forest Regression model was constructed with 100 decision trees, a minimum samples split of 2, a minimum samples leaf of 2, and a maximum depth of 3 [28,29]. A leave-one-out cross-validation strategy was employed to evaluate model stability. Additionally, Variance Inflation Factor (VIF) analysis and SHAP-based feature attribution were conducted to evaluate multi-collinearity and variable importance, respectively.

4.5. Modeling Methods

4.5.1. Feature Selection

We implemented a comprehensive two-step sequential feature screening and optimization workflow (Figure 5) to identify the most robust predictors for canopy nitrogen concentration (CNC) and aboveground nitrogen accumulation (AGNA) in Zanthoxylum armatum (ZA). Initially, the regions with the coefficient of determination (R2) values for NDSI, CNC, and AGNA are calculated from 400–1000 nm when the UAV flight altitude is 60–100 m (Figures S1 and S2). Secondly, the correlation coefficient between nitrogen status and LiDAR based characteristic parameters is calculated when the UAV flight altitude is 60–100 m (Table S2).

4.5.2. Machine Learning

To predict the CNC and AGNA contents in Zanthoxylum armatum (ZA), four machine learning algorithms were evaluated: Multiple Linear Regression (MLR), Partial Least Squares Regression (PLSR), Support Vector Regression (SVR), and Random Forest Regression (RFR). To enhance the spatiotemporal robustness of the models, data from different years (2020 and 2021) and locations (Lishi and Ciyun) were pooled into a single dataset for analysis. Conversely, separate models were constructed for each growth stage (vigorous growth stage and fruit expansion stage) and each UAV flight altitude (60 m, 80 m, and 100 m) to evaluate stage-specific physiological fluctuations and the impact of observation height on diagnostic accuracy.
Given the limited sample size (n = 16 per phenological stage), leave-one-out cross-validation was adopted to maximize data utility and ensure unbiased estimation. Furthermore, individual trees within the same treatment block or spatially adjacent trees could appear in both training and validation folds during LOOCV. This iterative procedure reserves each observation as a validation set once while training the model on the remaining n-1 samples; the resulting performance metrics (R2 and RMSE) thus represent the aggregate results across all cross-validation iterations. Specifically, MLR was used to model linear dependencies, while PLSR was selected for its ability to manage high-dimensional, collinear spectral data through latent variable projection [30]. For non-linear modeling, RFR was utilized for its ensemble-based robustness against noise, While SVR was employed to define a maximum-margin hyperplane using the loss function [31,32].

4.5.3. Model Accuracy Evaluation

Statistical analysis was performed using various metrics to evaluate the performance of the model. The coefficient of determination (R2), Root Mean Square Error (RMSE), and Mean Relative Error (MRE) were used to assess the model. An R2 value closer to 1 indicates a better estimation performance by the model. Lower RMSE and MRE values also indicate better estimation performance. The formulas for these metrics are as follows:
R 2 = i = 1 n ( y ^ i y ¯ ) 2 / i = 1 n ( y i y ¯ ) 2
R M S E = i = 1 n ( y ^ i y i ) 2 / n
M R E = 1 n i = 1 n y i y ^ i y i
where y i represents the observed value, y ¯ represents the mean of the observed values, y ^ i represents the predicted value, i represents the sample index, and n represents the number of samples.

4.5.4. Statistical Analysis

Spectral data were processed using ENVI 5.3 (L3Harris Geospatial, Boulder, CO, USA), point cloud data were processed using LiDAR360 and PCM software. Vegetation indices were constructed using Python 3.1.4. Statistical differences in biomass, CNC, and AGNA among the nitrogen treatments were evaluated using one-way analysis of variance (ANOVA) followed by Tukey’s post-hoc test at a significance level of p < 0.05. All models were implemented in Python (v3.8) using the scikit-learn library (v0.24.2). Graphs were plotted using Origin 2024 (OriginLab Corporation, Northampton, MA, USA).

5. Conclusions

Stage-specific fusion of UAV hyperspectral and LiDAR data, refined by a two-step feature selector and Random Forest regression, delivered CNC R2 = 0.93–0.98 (RMSE 0.87–1.02 g kg−1) and AGNA R2 = 0.95–0.97 (RMSE 1.92–2.55 g plant−1) across VGS and FES, providing a robust, non-destructive nitrogen monitoring tool for precision management of Zanthoxylum armatum and other woody crops.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants15071119/s1, Table S1: Vegetation Indices, LiDAR induces and their corresponding descriptions and references; Table S2: correlation coefficient between nitrogen status and LiDAR based characteristic parameters; Table S3: Multicollinearity diagnostics (VIF) and feature importance analysis (SHAP) for the optimal LiDAR–HSI fusion models across phenological stages; Table S4. Comparison of model performance (R2) under Uniform Altitude and Optimized Altitude for CNC and AGNA estimation; Figure S1: Equipotential diagram of the correlation between any two spectral indices at different heights and the CNC of Zanthoxylum armatum; Figure S2: Equipotential diagram of the correlation between any two spectral indices at different heights and the AGNA of Zanthoxylum armatum. References [33,34,35,36,37,38,39,40] are cited in Supplementary Materials.

Author Contributions

Conceptualization, Y.W. and J.W.; Methodology, Y.W.; Software, S.Z.; Validation, S.Z., J.Z., S.W. and X.Y.; Data curation, X.S.; Writing—original draft, S.Z.; Writing—review & editing, X.S. and J.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Central Universities Basic Scientific Research Operating Funds (No. SWU-KQ22022).

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Wang, M.; Tong, S.; Ma, T.; Xi, Z.; Liu, J. Chromosome-level genome assembly of Sichuan pepper provides insights into apomixis, drought tolerance, and alkaloid biosynthesis. Mol. Ecol. Resour. 2021, 21, 2533–2545. [Google Scholar]
  2. Zhao, Y.M.; Kong, F.M.; Zhao, J.K.; Wang, S.; Peng, Q.; Li, Z.Q.; Yang, L.; Bai, Z.L.; Jiang, H.L.; Shi, X.J.; et al. Organic substitution improves yield and quality of Zanthoxylum bungeanum by enhancing soil quality index and nutrient accumulation. Sci. Hortic. 2026, 356, 114603. [Google Scholar] [CrossRef]
  3. Lu, S.J.; Qiu, J.; Wang, J.Y.; Hui, W.K.; Zhai, Y.F.; Xv, J.; Yang, L.; Muhammad, F.D.; Ali, M.A.; Gong, W. Seasonal changes of mineral nutrients absorption and allocation in fruit of Zanthoxylum bungeanum ‘Hanyuan’ during the development. Sci. Hortic. 2024, 324, 112586. [Google Scholar] [CrossRef]
  4. Kwon, S.H.; Ku, K.B.; Le, A.T.; Han, G.D.; Park, Y.; Kim, J.; Tuan, T.T.; Chung, Y.S.; Mansoor, S. Enhancing citrus fruit yield investigations through flight height optimization with UAV imaging. Sci. Rep. 2024, 14, 322. [Google Scholar] [CrossRef] [PubMed]
  5. Zhang, Y.; Guo, J.; Ren, F.; Jiang, Q.; Zhou, X.; Zhao, J.; Liu, X. Integrated Physiological, Transcriptomic, and Metabolomic Analyses of the Response of Peach to Nitrogen Levels during Different Growth Stages. Int. J. Mol. Sci. 2022, 23, 10876. [Google Scholar] [CrossRef]
  6. Li, W.; Zhu, X.; Yu, X.; Li, M.; Tang, X.; Zhang, J.; Xue, Y.; Zhang, C.; Jiang, Y. Inversion of Nitrogen Concentration in Apple Canopy Based on UAV Hyperspectral Images. Sensors 2022, 22, 3503. [Google Scholar] [CrossRef]
  7. Mohd Asaari, M.S.; Mertens, S.; Verbraeken, L.; Dhondt, S.; Inzé, D.; Bikram, K.; Scheunders, P. Non-destructive analysis of plant physiological traits using hyperspectral imaging: A case study on drought stress. Comput. Electron. Agric. 2022, 195, 106806. [Google Scholar] [CrossRef]
  8. Chen, Z.C.; Miao, Y.X.; Lu, J.J.; Zhou, L.; Li, Y.; Zhang, H.Y.; Lou, W.D.; Zhang, Z.; Kusnierek, K.; Liu, C.H. In-Season Diagnosis of Winter Wheat Nitrogen Status in Smallholder Farmer Fields Across a Village Using Unmanned Aerial Vehicle-Based Remote Sensing. Agronomy 2019, 9, 619. [Google Scholar] [CrossRef]
  9. Mutanga, O.; Skidmore, A.K. Narrow band vegetation indices overcome the saturation problem in estimating grass biomass estimation. Int. J. Remote Sens. 2004, 25, 3999–4014. [Google Scholar] [CrossRef]
  10. Gao, L.; Chai, G.; Zhang, X. Above-Ground Biomass Estimation of Plantation with Different Tree Species Using Airborne LiDAR and Hyperspectral Data. Remote Sens. 2022, 14, 2568. [Google Scholar] [CrossRef]
  11. Almeida, D.R.A.; Broadbent, E.N.; Ferreira, M.P.; Meli, P.; Zambrano, A.M.A.; Gorgens, E.B.; Resende, A.F.; Silveira, C.A.; Romanelli, J.P.A.; Hummel, G.; et al. Monitoring restored tropical forest diversity and structure through UAV-borne hyperspectral and lidar fusion. Remote Sens. Environ. 2021, 264, 112582. [Google Scholar]
  12. Dilmurat, K.; Sagan, V.; Moose, S. AI-driven maize yield forecasting using unmanned aerial vehicle-based hyperspectral and LiDAR data fusion. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2022, 2022, 193–199. [Google Scholar] [CrossRef]
  13. Wang, J.; Zhang, S.; Lizaga, I.; Zhang, Y.; Ge, X.; Zhang, Z.; Zhang, W.; Huang, Q.; Hu, Z. UAS-based remote sensing for agricultural monitoring: Current status and perspectives. Comput. Electron. Agric. 2024, 227, 109501. [Google Scholar] [CrossRef]
  14. Yin, C.; Lv, X.; Zhang, L.; Ma, L.; Wang, H.; Zhang, L.; Zhang, Z. Hyperspectral UAV Images at Different Altitudes for Monitoring the Leaf Nitrogen Content in Cotton Crops. Remote Sens. 2022, 14, 2576. [Google Scholar] [CrossRef]
  15. Hueso, A.; Camacho, G.; Gomez-del-Campo, M. Spring deficit irrigation promotes significant reduction on vegetative growth, flowering, fruit growth and production in hedgerow olive orchards (cv. Arbequina). Agric. Water Manag. 2021, 248, 106695. [Google Scholar] [CrossRef]
  16. Iqbal, M.A.; Khan, M.A.; Azam, M.; Bhatti, M.I.N.; Feroze, M.A.; Zulfiqar, B.; Umar, H.; Asif, M. Seasonal fluctuations of N, P and K in leaves influenced nutrient requirement during fruit development stages in different olive genotypes. Braz. Arch. Biol. Technol. 2022, 65, 1–12. [Google Scholar] [CrossRef]
  17. Zhao, L.; Jia, Z.; Li, G.; Zhang, T.; Wei, M. N Utilization, Residual and Loss Characteristics of Spring-Topdressing (15N-urea) Pear Orchards in the Old Course of the Yellow River Area. Agronomy 2022, 12, 2682. [Google Scholar]
  18. Woldenberg, O.; Dubinin, V.M.; Baram, S.; Paz-Kagan, T. Bridging the Gap Between Leaf and Canopy Nitrogen Content in Almond Orchard with UAV-Based Sensor Fusion. Smart Agric. Technol. 2025, 12, 101355. [Google Scholar] [CrossRef]
  19. Inoue, Y.; Guérif, M.; Baret, F.; Skidmore, A.; Tangbhaiturntana, A.; Schlerf, M.; Darvishzadeh, R. Simple and robust methods for remote sensing of canopy chlorophyll content: A comparative analysis of hyperspectral data for different types of vegetation. Plant Cell Environ. 2016, 39, 2609–2623. [Google Scholar] [PubMed]
  20. Maimaitijiang, M.; Sagan, V.; Sidike, P.; Daloye, A.M.; Erkbol, H.; Fritschi, F.B. Soybean yield prediction using multimodal remote sensing data fusion and deep learning. Remote Sens. Environ. 2020, 248, 111961. [Google Scholar]
  21. Berger, K.; Verrelst, J.; Féret, J.B.; Wang, Z.; Ganeva, D.; Tansey, K. Crop nitrogen monitoring: Recent progress and principal developments in the optical remote sensing techniques. Remote Sens. 2020, 12, 3916. [Google Scholar]
  22. Thenkabail, P.S.; Smith, R.B.; De Pauw, E. Hyperspectral vegetation indices and their relationships with agricultural crop characteristics. Remote Sens. Environ. 2000, 71, 158–182. [Google Scholar] [CrossRef]
  23. Cao, J.; Liu, K.; Zhuo, L.; Liu, L.; Zhu, Y.; Peng, L. Combining UAV-based hyperspectral and LiDAR data for mangrove species classification using the rotation forest algorithm. Int. J. Appl. Earth Obs. Geoinf. 2021, 102, 102414. [Google Scholar]
  24. Zhang, S.; Wang, X.; Lin, H.; Dong, Y.; Qiang, Z. A review of the application of UAV multispectral remote sensing technology in precision agriculture. Smart Agric. Technol. 2025, 12, 101406. [Google Scholar] [CrossRef]
  25. Walter, J.D.C.; Edwards, J.; McDonald, G.; Kuchel, H. Estimating biomass and canopy height with LiDAR for field crop breeding. Front. Plant Sci. 2019, 10, 1145. [Google Scholar] [CrossRef]
  26. Lin, L.; Yu, K.; Yao, X.; Deng, Y.; Hao, Z.; Chen, Y.; Wu, N.; Liu, J. UAV Based Estimation of Forest Leaf Area Index (LAI) through Oblique Photogrammetry. Remote Sens. 2021, 13, 803. [Google Scholar]
  27. Gao, H.J.; Chen, X.W.; Liang, A.Z.; Peng, C.; Zhu, P.; Zhang, X.Z. Combined Effects of Straw Returning and Nitrogen Fertilizer Application on Crop Yield and Nitrogen Utilization in the Chernozem of Northeast China. Appl. Ecol. Environ. Res. 2022, 20, 893–903. [Google Scholar] [CrossRef]
  28. Wang, L.J.; Ling, Q.H.; Liu, Z.; Dai, M.Z.; Zhou, Y.; Shi, X.J.; Wang, J. Precision Estimation of Rice Nitrogen Fertilizer Topdressing According to the Nitrogen Nutrition Index Using UAV Multi-Spectral Remote Sensing: A Case Study in Southwest China. Plants 2025, 14, 1195. [Google Scholar] [CrossRef]
  29. Li, H.; Huang, C.; Zhang, Y.; Li, S.; Liu, Y.; Yang, K.; Lu, J. Estimating Maize Leaf Area Index Using Multi-Source Features Derived from UAV Multispectral Imagery and Machine Learning Models. Plants 2025, 14, 3534. [Google Scholar] [CrossRef]
  30. Chen, B.; Lu, X.; Yu, S.; Gu, S.; Huang, G.; Guo, X.; Zhao, C. The Application of Machine Learning Models Based on Leaf Spectral Reflectance for Estimating the Nitrogen Nutrient Index in Maize. Agriculture 2022, 12, 1839. [Google Scholar] [CrossRef]
  31. Smola, A.J.; Schölkopf, B. A tutorial on support vector regression. Stat. Comput. 2004, 14, 199–222. [Google Scholar] [CrossRef]
  32. Wang, L.; Zhou, X.; Zhu, X.; Dong, Z.; Guo, W. Estimation of biomass in wheat using random forest regression algorithm and remote sensing data. Crop J. 2016, 4, 212–219. [Google Scholar] [CrossRef]
  33. Yin, C.X.; Lin, J.; Ma, L.L.; Zhang, Z.; Hou, T.Y.; Zhang, L.F.; Lv, X. Study on the Quantitative Relationship Among Canopy Hyperspectral Reflectance, Vegetation Index and Cotton Leaf Nitrogen Content. J. Indian Soc. Remote Sens. 2021, 49, 1787–1799. [Google Scholar]
  34. Zhou, X.; Huang, W.; Zhang, J.; Kong, W.; Casa, R.; Huang, Y. A Novel Combined Spectral Index for Estimating the Ratio of Carotenoid to Chlorophyll Content to Monitor Crop Physiological and Phenological Status. Int. J. Appl. Earth Obs. Geoinf. 2019, 76, 128–142. [Google Scholar] [CrossRef]
  35. Daughtry, C.S.T.; Walthall, C.L.; Kim, M.S.; Colstoun, E.B.; McMurtrey, J.E., III. Estimating Corn Leaf Chlorophyll Concentration from Leaf and Canopy Reflectance. Remote Sens. Environ. 2000, 74, 229–239. [Google Scholar]
  36. Gamon, J.A.; Peñuelas, J.; Field, C.B. A Narrow-Waveband Spectral Index That Tracks Diurnal Changes in Photosynthetic Efficiency. Remote Sens. Environ. 1992, 41, 35–44. [Google Scholar] [CrossRef]
  37. Gitelson, A.; Viña, A.; Ciganda, V.; Rundquist, D.; Arkebauer, T. Remote Estimation of Canopy Chlorophyll in Crops. Geophys. Res. Lett. 2005, 32, L08403. [Google Scholar] [CrossRef]
  38. Haboudane, D.; Miller, J.R.; Tremblay, N.; Zarco-Tejada, P.J.; Dextraze, L. Integrated Narrow-Band Vegetation Indices for Prediction of Crop Chlorophyll Content for application to precision agriculture. Remote Sens. Environ. 2002, 81, 416–426. [Google Scholar] [CrossRef]
  39. Hernández-Clemente, R.; Navarro-Cerrillo, R.M.; Zarco-Tejada, P.J. Carotenoid Content Estimation in a Heterogeneous Conifer Forest Using Narrow-Band Indices and PROSPECT + DART Simulations. Remote Sens. Environ. 2012, 127, 298–315. [Google Scholar] [CrossRef]
  40. Wang, Z.J.; Wang, J.H.; Liu, L.Y.; Huang, W.J.; Zhao, C.J.; Wang, C.Z. Prediction of grain protein content in winter wheat (Triticum aestivum L.) using plant pigment ratio (PPR). Field Crops Res. 2004, 90, 311–321. [Google Scholar]
Figure 1. Pooled Spearman correlation analysis (2020–2021) of canopy nitrogen concentration (CNC) and aboveground nitrogen accumulation (AGNA) with hyperspectral vegetation indices and LiDAR structural parameters. Panels (a) and (b) represent CNC during the vigorous vegetative growth stage (VGS) and fruit expansion stage (FES), respectively, with hyperspectral data collected at a flight altitude of 100 m. Panels (c,d) represent AGNA during VGS and FES, where hyperspectral data were collected at 60 m for panel (c) and at 100 m for panel (d). For all scenarios, the LiDAR structural parameters were consistently derived from data collected at a flight altitude of 60 m.
Figure 1. Pooled Spearman correlation analysis (2020–2021) of canopy nitrogen concentration (CNC) and aboveground nitrogen accumulation (AGNA) with hyperspectral vegetation indices and LiDAR structural parameters. Panels (a) and (b) represent CNC during the vigorous vegetative growth stage (VGS) and fruit expansion stage (FES), respectively, with hyperspectral data collected at a flight altitude of 100 m. Panels (c,d) represent AGNA during VGS and FES, where hyperspectral data were collected at 60 m for panel (c) and at 100 m for panel (d). For all scenarios, the LiDAR structural parameters were consistently derived from data collected at a flight altitude of 60 m.
Plants 15 01119 g001
Figure 2. Comparison of measured and LOOCV-predicted (a,b) CNC and (c,d) AGNA using UAV-RFR models across phenological stages. Panels (a,c) represent VGS and (b,d) represent FES. Flight altitudes are 100 m for (a,b,d) and 60 m for (c). Model predictions were evaluated via LOOCV based on stage-specific datasets. For each phenological stage (VGS and FES), a sample size of n = 16 was utilized for model development and validation.
Figure 2. Comparison of measured and LOOCV-predicted (a,b) CNC and (c,d) AGNA using UAV-RFR models across phenological stages. Panels (a,c) represent VGS and (b,d) represent FES. Flight altitudes are 100 m for (a,b,d) and 60 m for (c). Model predictions were evaluated via LOOCV based on stage-specific datasets. For each phenological stage (VGS and FES), a sample size of n = 16 was utilized for model development and validation.
Plants 15 01119 g002
Figure 3. Study area location and the variations of temperature and precipitation in 2020 and 2021. The red triangle indicates the location of the experimental site.
Figure 3. Study area location and the variations of temperature and precipitation in 2020 and 2021. The red triangle indicates the location of the experimental site.
Plants 15 01119 g003
Figure 4. The technical diagram of this study. (A) UAV hyperspectral data processing; (B) UAV LiDAR data processing; (C) field destructive data collection; (D) model construction and evaluation. The color scale from blue to red represents the transition from low to high point cloud density.
Figure 4. The technical diagram of this study. (A) UAV hyperspectral data processing; (B) UAV LiDAR data processing; (C) field destructive data collection; (D) model construction and evaluation. The color scale from blue to red represents the transition from low to high point cloud density.
Plants 15 01119 g004
Figure 5. Flowchart of the optimized feature extraction and fusion process for CNC and AGNA estimation.
Figure 5. Flowchart of the optimized feature extraction and fusion process for CNC and AGNA estimation.
Plants 15 01119 g005
Table 1. Statistical analysis (mean ± SD; ANOVA post hoc Tukey) of biomass, canopy nitrogen concentration (CNC), aboveground nitrogen accumulation (AGNA), and yield of Zanthoxylum armatum var. novemfolius under different nitrogen application rate treatments (N0, N1, N2, and N3) at the Vigorous vegetative growth stage (VGS) and the fruit expansion stage (FES). Different lowercase letters within a row indicate significant differences among treatments at p < 0.05. The significant variables were colored blue ranging from the lowest (light blue) to the highest (dark blue) values.
Table 1. Statistical analysis (mean ± SD; ANOVA post hoc Tukey) of biomass, canopy nitrogen concentration (CNC), aboveground nitrogen accumulation (AGNA), and yield of Zanthoxylum armatum var. novemfolius under different nitrogen application rate treatments (N0, N1, N2, and N3) at the Vigorous vegetative growth stage (VGS) and the fruit expansion stage (FES). Different lowercase letters within a row indicate significant differences among treatments at p < 0.05. The significant variables were colored blue ranging from the lowest (light blue) to the highest (dark blue) values.
LishiCiyun
N0N1N2N3p-ValueN0N1N2N3p-Value
VGSBiomass (kg plant−1)1.38 ± 0.05 c1.86 ± 0.15 b1.91 ± 0.18 b2.73 ± 0.16 a0.0042.81 ± 0.12 c3.18 ± 0.09 b3.89 ± 0.05 a4.08 ± 0.10 a<0.001
CNC (g kg−1)38.06 ± 0.73 a38.86 ± 1.25 a39.35 ± 1.06 a39.95 ± 1.38 a0.47732.19 ± 0.53 a32.48 ± 0.39 a33.02 ± 0.33 a33.58 ± 0.72 a0.172
AGNA (g plant−1)17.70 ± 2.58 c24.48 ± 0.21 b26.61 ± 1.10 b37.12 ± 0.76 a<0.00124.44 ± 1.52 d32.06 ± 1.00 c40.53 ± 1.75 b49.15 ± 4.10 a0.037
FESBiomass (kg plant−1)2.58 ± 0.12 a2.69 ± 0.16 a2.81 ± 0.12 a2.87 ± 0.18 a0.3303.68 ± 0.18 a3.74 ± 0.10 a3.98 ± 0.18 a4.12 ± 0.32 a0.277
CNC (g kg−1)32.58 ± 0.31 c35.62 ± 0.63 b39.76 ± 1.28 a38.54 ± 0.56 a0.00322.63 ± 1.23 d28.00 ± 1.24 c35.41 ± 1.26 a32.35 ± 1.11 b0.012
AGNA (g plant−1)32.65 ± 3.80 c38.24 ± 0.98 bc47.18 ± 2.30 a41.63 ± 1.75 ab0.13132.89 ± 1.15 d40.15 ± 3.24 c59.92 ± 3.09 a51.51 ± 3.22 b0.002
Yield (kg plant−1)3.08 ± 0.53 c3.77 ± 0.48 bc5.24 ± 0.45 a4.35 ± 0.54 ab0.0043.27 ± 0.58 c3.92 ± 0.30 c5.61 ± 0.26 a4.76 ± 0.55 b0.001
Note: Different lowercase letters in columns indicate significant differences between different nitrogen application treatments (p < 0.05). VGS represents the vigorous vegetative growth stage, which is primarily the structural development stage of the tree, involving the rapid expansion of leaves and the elongation of new shoots. FES represents the fruit expansion stage, which is characterized by the rapid growth and volume increase of the Zanthoxylum armatum fruit.
Table 2. Performance comparison of machine learning algorithms for canopy nitrogen concentration (CNC) and above-ground nitrogen accumulation (AGNA) estimation based on optimal feature fusion (LiDAR + Hyperspectral) across phenological stages and leave-one-out cross-validation (LOOCV).
Table 2. Performance comparison of machine learning algorithms for canopy nitrogen concentration (CNC) and above-ground nitrogen accumulation (AGNA) estimation based on optimal feature fusion (LiDAR + Hyperspectral) across phenological stages and leave-one-out cross-validation (LOOCV).
Phenological StageParametersPLSRMLRSVRRFR
PH + CD + CV *NDSIR2RMSER2RMSER2RMSER2RMSE
CNC (g kg−1)VGS732, 879 #0.71 ± 0.082.09 ± 0.180.72 ± 0.082.05 ± 0.170.78 ± 0.061.81 ± 0.150.93 ± 0.021.02 ± 0.09
FES560, 690 #0.86 ± 0.042.16 ± 0.110.87 ± 0.042.05 ± 0.100.93 ± 0.021.48 ± 0.060.98 ± 0.010.87 ± 0.05
AGNA (g plant−1)VGS711, 986 †0.89 ± 0.514.76 ± 0.290.91 ± 0.033.36 ± 0.210.93 ± 0.024.71 ± 0.310.97 ± 0.011.92 ± 0.18
FES515, 736 #0.87 ± 0.044.12 ± 0.250.88 ± 0.043.92 ± 0.240.80 ± 0.068.38 ± 0.520.95 ± 0.022.55 ± 0.22
Note: VGS and FES represent the vigorous vegetative growth stage and fruit expansion stage, respectively. Accuracy metrics (R2 and RMSE) were calculated via LOOCV based on stage-specific datasets. For each phenological stage (VGS and FES), a sample size of n = 16 was utilized for model development and LOOCV. The symbols in the table indicate the specific UAV flight altitudes at which the corresponding parameters were acquired: *: 60 m, #: 100 m, and †: 60 m. MLR, PLSR, SVR, and RFR represent the machine learning algorithms evaluated for nitrogen status estimation. Model performance is quantified by R2 (coefficient of determination) and RMSE (root mean square error; expressed in g kg−1 for CNC and g plant−1 for AGNA).
Table 3. Basic information regarding two field experiments.
Table 3. Basic information regarding two field experiments.
Experimental SiteYearTreatmentsSoil Characteristics
Lishi Town2020–2021N application (kg N ha−1): 0, 150, 300, 450pH: 4.4
Organic matter: 12.06 g kg−1
Alkaline-N: 174.20 mg kg−1
Olsen-P: 46.16 mg kg−1
Available-K: 232 mg kg−1
Ciyun Town2020–2021N application (kg N ha−1): 0, 150, 300, 450pH: 4.9
Organic matter: 14.38 g kg−1
Alkaline-N: 78.65 mg kg−1
Olsen-P: 21.49 mg kg−1
Available-K: 224 mg kg−1
Table 4. Specifications of the IRIS Integrated Hyperspectral LiDAR Imaging System.
Table 4. Specifications of the IRIS Integrated Hyperspectral LiDAR Imaging System.
Host PlatformDJI M600 Multi-Rotor Drone
Support systemMultiple GNSS systems including BeiDou, GPS, GLONASS, and RTK
HyperspectralSpectral range400–1000 nm
Spectral resolution2.1 nm
Sampling interval1.07 nm
Number of spectral channels561
Number of spatial channels900
Frames per second249
LiDARScan frequency320,000 points s−1
Number of scan lines16, 32
Range150 m
Horizontal Accuracy±2 cm (1σ)
Elevation accuracy±1 cm (1σ)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhao, S.; Wei, Y.; Zhao, J.; Wang, S.; Ye, X.; Shi, X.; Wang, J. Estimation of Nitrogen Status in Zanthoxylum armatum var. novemfolius Using Machine Learning Algorithms and UAV Hyperspectral and LiDAR Data Fusion. Plants 2026, 15, 1119. https://doi.org/10.3390/plants15071119

AMA Style

Zhao S, Wei Y, Zhao J, Wang S, Ye X, Shi X, Wang J. Estimation of Nitrogen Status in Zanthoxylum armatum var. novemfolius Using Machine Learning Algorithms and UAV Hyperspectral and LiDAR Data Fusion. Plants. 2026; 15(7):1119. https://doi.org/10.3390/plants15071119

Chicago/Turabian Style

Zhao, Shangyuan, Yong Wei, Jinkun Zhao, Shuai Wang, Xin Ye, Xiaojun Shi, and Jie Wang. 2026. "Estimation of Nitrogen Status in Zanthoxylum armatum var. novemfolius Using Machine Learning Algorithms and UAV Hyperspectral and LiDAR Data Fusion" Plants 15, no. 7: 1119. https://doi.org/10.3390/plants15071119

APA Style

Zhao, S., Wei, Y., Zhao, J., Wang, S., Ye, X., Shi, X., & Wang, J. (2026). Estimation of Nitrogen Status in Zanthoxylum armatum var. novemfolius Using Machine Learning Algorithms and UAV Hyperspectral and LiDAR Data Fusion. Plants, 15(7), 1119. https://doi.org/10.3390/plants15071119

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop