Next Article in Journal
Soil Organic Carbon Dynamics in Contrasting Soil Types Under Short-Rotation Woody Crop Production
Previous Article in Journal
Enhancing Broiler Weight Prediction via Preprocessed Kernel Density Estimation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Temporal Point Cloud Alignment for Accurate Height Estimation of Field-Grown Leafy Vegetables

1
College of Engineering, South China Agricultural University, Guangzhou 510642, China
2
Ministry of Education Key Technologies and Equipment Laboratory of Agricultural Machinery and Equipment in South China, South China Agricultural University, Guangzhou 510642, China
*
Author to whom correspondence should be addressed.
Agriculture 2026, 16(2), 280; https://doi.org/10.3390/agriculture16020280
Submission received: 16 November 2025 / Revised: 5 January 2026 / Accepted: 9 January 2026 / Published: 22 January 2026
(This article belongs to the Topic Digital Agriculture, Smart Farming and Crop Monitoring)

Abstract

Accurate measurement of plant height in leafy vegetables is challenging due to their short stature, high planting density, and severe canopy occlusion during later growth stages. These factors often limit the reliability of single-plant monitoring across the full growth cycle in open-field environments. To address this, we propose a multi-temporal point cloud alignment method for accurate plant height measurement, focusing on Choy Sum (Brassica rapa var. parachinensis). The method estimates plant height by calculating the vertical distance between the canopy and the ground. Multi-temporal point cloud maps are reconstructed using an enhanced Oriented FAST and Rotated BRIEF–Simultaneous Localization and Mapping (ORB-SLAM3) algorithm. A fixed checkerboard calibration board, leveled using a spirit level, ensures proper vertical alignment of the Z-axis and unifies coordinate systems across growth stages. Ground and plant points are separated using the Excess Green (ExG) index. During early growth stages, when the soil is minimally occluded, ground point clouds are extracted and used to construct a high-precision reference ground model through Cloth Simulation Filtering (CSF) and Kriging interpolation, compensating for canopy occlusion and noise. In later growth stages, plant point cloud data are spatially aligned with this reconstructed ground surface. Individual plants are identified using an improved Euclidean clustering algorithm, and consistent measurement regions are defined. Within each region, a ground plane is fitted using the Random Sample Consensus (RANSAC) algorithm to ensure alignment with the X–Y plane. Plant height is then determined by the elevation difference between the canopy and the interpolated ground surface. Experimental results show mean absolute errors (MAEs) of 7.19 mm and 18.45 mm for early and late growth stages, respectively, with coefficients of determination (R2) exceeding 0.85. These findings demonstrate that the proposed method provides reliable and continuous plant height monitoring across the full growth cycle, offering a robust solution for high-throughput phenotyping of leafy vegetables in field environments.

1. Introduction

Plant height is a key phenotypic trait for assessing plant physiological status and growth dynamics and is commonly defined as the perpendicular distance from the root collar to the highest photosynthetically active tissue within the canopy [1,2]. In agronomic research and crop phenotyping, plant height is typically estimated as the elevation difference between the canopy surface and the ground [3]. However, manual measurement is labor-intensive, invasive, and prone to sampling bias due to limited sample sizes, which constrains its applicability for large-scale and long-term field monitoring [4]. With the development of high-throughput, non-destructive sensing technologies, image- and point cloud–based methods have increasingly replaced manual measurements. Two-dimensional (2D) image-based approaches are attractive due to their low cost and simplicity but are inherently limited by the absence of three-dimensional (3D) structural information, resulting in reduced robustness under complex field conditions [5,6,7]. In contrast, three-dimensional sensing technologies—such as stereovision, depth cameras, and Light Detection and Ranging (LiDAR)—enable automated plant height estimation through point cloud analysis and provide additional spatial information related to canopy structure and geometry [8].
Stereovision-based methods reconstruct three-dimensional crop structures via multi-view image matching and have been widely applied in unmanned aerial vehicle (UAV) platforms and ground-based phenotyping systems due to their flexibility and relatively low cost [9,10,11]. Recent studies have further combined stereovision with Simultaneous Localization and Mapping (SLAM) techniques to support multi-temporal field reconstruction [12,13,14]. For example, Dong et al. [12] proposed a four-dimensional reconstruction framework that integrates multi-sensor SLAM with Real-Time Kinematic Global Positioning System (RTK-GPS), enabling the unified registration of point clouds acquired from a peanut field at different growth stages and the estimation of plant height over time. However, this method relies primarily on feature-based stereo vision, which remains sensitive to illumination variations and appearance degradation in agricultural environments characterized by dense canopies, near-ground regions, or highly repetitive textures. Moreover, the consistent alignment of multi-temporal point clouds depends heavily on high-precision RTK-GPS priors, while at the crop scale, pose estimation errors in visual–inertial SLAM may still accumulate over time, adversely affecting reconstruction accuracy in later stages. These limitations further reduce the applicability of such approaches to short-stature crops or scenarios with severe occlusion during advanced growth stages.
Laser-based sensing technologies, including terrestrial laser scanners and LiDAR systems, can acquire high-precision three-dimensional point cloud data and have been widely applied to plant height estimation on ground-based platforms and UAVs. High measurement accuracy has been reported under relatively sparse canopy conditions. For instance, Sun et al. [15] achieved a coefficient of determination (R2) of 0.934 and a root mean square error (RMSE) of 3.51 mm when measuring rapeseed height using a mobile LiDAR system, while Obanawa et al. [13] demonstrated that handheld LiDAR outperformed Structure-from-Motion (SfM) approaches in grassland height estimation. Despite these advantages, LiDAR-based methods are constrained by high equipment costs, substantial computational requirements, and reduced data reliability under dense canopies due to occlusion and multiple laser returns. In addition, the lack of color information in most LiDAR datasets limits their applicability for continuous, individual-plant height monitoring throughout the full growth cycle [13,16,17].
RGB–Depth (RGB-D) cameras, which simultaneously acquire color and depth information, have emerged as a promising alternative for plant height estimation in both controlled and open-field environments [4,18]. Owing to their limited sensing range, these systems are primarily deployed on ground-based platforms [19]. Previous studies have demonstrated good performance at early growth stages. For example, Zhao et al. [20] achieved millimeter-level accuracy when estimating lettuce height under controlled conditions, while Qiu et al. [18] reported strong correlations between RGB-D–derived and manual measurements for maize at the jointing stage. However, field applications using RGB-D sensors often suffer from performance degradation under uneven terrain and complex soil backgrounds [21,22]. More importantly, most existing RGB-D–based approaches rely on independent frame-level measurements and lack a unified spatial reference for multi-temporal alignment. Consequently, cumulative spatial drift and inconsistent ground references limit their suitability for long-term field monitoring. Furthermore, these methods are typically restricted to seedling or early growth stages, when ground surfaces remain visible and canopy occlusion is minimal [23,24], making them inadequate for dense, late-stage field-grown leafy vegetables.
Overall, existing plant height measurement methods remain inadequate for field-grown leafy vegetables, particularly for continuous multi-temporal monitoring across the full growth cycle. Near-ground sensing approaches typically rely on persistent ground visibility and clear plant separation, assumptions that fail under dense planting and severe interleaf occlusion at later growth stages. Aerial and multi-view methods, on the other hand, often depend on static terrain references or implicit global coordinate alignment, making them vulnerable to spatial drift and ground reference inconsistencies caused by transplantation and early-stage soil disturbance. Consequently, robust multi-temporal point cloud alignment and consistent canopy-to-ground height estimation remain challenging for leafy vegetables characterized by short stature, prostrate growth habits, dense canopies, and rapidly diminishing ground visibility.
To address these challenges, this study takes Choy Sum (Brassica rapa var. parachinensis) as a representative leafy vegetable and proposes a multi-temporal point cloud alignment framework for plant height monitoring throughout the entire growth cycle. The framework establishes a unified world coordinate system using fixed reference constraints, enabling reliable alignment of point clouds acquired at different time points without reliance on GPS. A persistent ground reference model is reconstructed from early-stage ground observations and subsequently integrated with later-stage plant point clouds, effectively decoupling ground modeling from canopy measurement when the ground surface becomes occluded. Furthermore, geometric constraints and planting structure priors are incorporated to enhance the robustness of individual plant height estimation under dense planting and uneven terrain.
The main contributions of this study are summarized as follows:
  • A multi-temporal point cloud alignment framework is proposed for short-stature, densely planted leafy vegetables under severe canopy occlusion, enabling reliable plant height monitoring throughout the entire growth cycle in open-field environments.
  • A persistent reference ground modeling strategy is introduced by reconstructing the ground surface from early-stage point clouds and integrating it with later-stage plant data, effectively decoupling ground modeling from canopy measurement when ground visibility is lost.
  • A unified and stable spatial reference is established using fixed calibration constraints in combination with local ground plane correction and planting structure priors, improving the robustness and accuracy of individual plant height estimation under uneven terrain and dense planting conditions.

2. Materials and Methods

2.1. Experimental Site and Data Acquisition

This study was conducted in November 2023 at the Qilin Farm of South China Agricultural University (113°22′38.95″ E, 23°10′9.31″ N). The experiment utilized Choy Sum (Brassica rapa var. parachinensis) plants, which were cultivated in deep furrows and high ridges with 15 cm spacing between plants. Data collection began on the 5th day after transplanting and was repeated every 48 h, spanning 11 time points from day 5 to day 25 to continuously monitor the growth of plant throughout its entire growth cycle. At each time point, RGB-D sequences were captured from approximately 200 plants, with each sequence comprising 1500–2000 frames of imagery along with corresponding Inertial Measurement Unit (IMU) data. The point cloud map covered an area of approximately 6 m × 1.5 m.
To ensure data consistency and minimize the influence of ambient light variations, all data were collected between 17:00 and 18:00, a time period with relatively low and stable light intensity, optimizing the quality of depth data. The data acquisition was performed using a custom-built mobile platform (Figure 1a,b), which was equipped with an Intel RealSense D435i depth camera (Intel Corporation, Santa Clara, CA, USA). Sensor control and data synchronization were managed within the Robot Operating System (ROS) framework on an Ubuntu 20.04 operating system.
Before data acquisition, the D435i camera underwent a detailed calibration process. The Kalibr toolkit was utilized for the intrinsic calibration of both the RGB camera and the infrared stereo module. Noise calibration of the built-in IMU was carried out using the imu_utils toolkit. A final joint calibration between the camera and IMU was conducted to align their coordinate systems. The accuracy of the depth data was verified using the official RealSense depth quality tool, which confirmed a Z-axis accuracy error of less than 0.6%. To enhance data quality, spatial filtering, temporal filtering, and hole-filling algorithms were employed to reduce noise and address missing data resulting from environmental conditions.
For establishing a consistent world coordinate system, a high-precision checkerboard calibration board was permanently fixed in the data collection area. The board was leveled with a spirit level to ensure vertical alignment with an error not exceeding 0.05°, serving as the spatial reference. The camera was mounted vertically downward on an adjustable stand, positioned 60–100 cm above the ground, and integrated with a mini pan-tilt unit to optimize its viewing angle. The platform traversed the target area at a constant speed of 10 cm/s, performing back-and-forth scans to generate comprehensive point clouds covering the entire ridge.
To validate the proposed method, the true height of each Choy Sum plant was manually measured using a digital caliper (accuracy: 0.02 mm), defined as the vertical distance from the root–stem junction (root collar) to the highest point of the canopy. Each measurement was repeated three times per plant, and the average value was taken as the reference height. Manual measurements were synchronized with point cloud acquisition to minimize errors caused by plant growth between sampling times. A total of 202 plants were monitored throughout the entire experimental period.

2.2. Overview of the Proposed Framework

The proposed framework processes color images, depth images, and IMU data streams from an RGB-D camera to continuously monitor the plant height of Choy Sum (Brassica rapa var. parachinensis) throughout its entire growth cycle. The workflow is illustrated in Figure 2 and consists of three primary stages:
  • Three-dimensional Point Cloud Reconstruction and Coordinate System Alignment
    Initially, a 3D point cloud map is reconstructed from the RGB-D data using an improved Oriented FAST and Rotated BRIEF–Simultaneous Localization and Mapping (ORB-SLAM3) algorithm (Figure 2b). A stable world coordinate system is then established based on a fixed checkerboard calibration board placed within the scene. The point cloud map is transformed into this unified coordinate system (Figure 2c). Subsequently, background points, including non-target point clouds, are filtered out (Figure 2d). This step ensures precise spatial alignment of multi-temporal point cloud datasets into a single, absolute reference frame through feature matching and coordinate transformation. Additionally, non-field objects, such as ridges and paths, are identified and removed based on their spatial relationship to the coordinate system and plot boundaries. Importantly, the Z-axis of the world coordinate system is orthogonal to the ground plane, providing a geometrically consistent foundation for subsequent Kriging interpolation and ground model reconstruction.
  • Reference Ground Model Reconstruction and Multi-Temporal Point Cloud Alignment
    Given the short growth cycle of leafy vegetables, minimal ground deformation after transplantation, and the completeness of early-stage point clouds, ground points are initially extracted using color-based segmentation (Figure 2e). The ground point cloud from the early growth stage is selected as the reference. However, due to seedling occlusion and denoising processes, some ground points are missing. To address this, a continuous surface is reconstructed using Kriging interpolation (Figure 2f). The vegetable point clouds from each subsequent growth stage are then fused with this reference ground model (Figure 2g), providing the data needed for calculating plant height by determining the canopy-to-ground elevation difference.
  • Plant Height Calculation Across the Growth Cycle
    Early Stage Measurement Area Delineation: For the early-growth stage, individual plants are segmented using a fast Euclidean clustering algorithm (Figure 2h). The measurement area for each plant is defined based on the extent of its canopy (Figure 2i).
    Late-Stage Measurement Area Delineation: During the late growth stage, when the leaves expand radially from the center, measurement areas are defined using the known planting spacing and the pre-determined center points from the early-stage areas.
    Coordinate Correction: The Random Sample Consensus (RANSAC) algorithm is applied to estimate the normal vector of the local ground plane within each measurement area. A rotational transformation is then applied to align this normal vector with the Z-axis of the world coordinate system, eliminating any systematic errors caused by pose misalignment (Figure 2i).
Height Computation: Finally, the plant height is calculated as the elevation difference between the highest canopy point and the reference ground model within the defined measurement area (Figure 2j).

2.3. Three-Dimensional Reconstruction and Point Cloud Preprocessing

2.3.1. Reconstruction of Open-Field Choy Sum Point Clouds Based on an Improved ORB-SLAM3 Algorithm

Previous studies have demonstrated the feasibility of using ORB-SLAM-based frameworks to measure crop height in open-field environments [14,25,26,27]. In this study, we employ the enhanced ORB-SLAM3 algorithm proposed by Li et al. [25] for 3D reconstruction of the agricultural field scene. This improved version incorporates a dense mapping module and a YOLOv8-based object detection module for dynamic object removal, effectively addressing the original framework’s limitations of generating only sparse maps and mitigating tracking errors caused by wind-induced plant movement. The specific workflow is as follows.
Prior to feeding images into the SLAM tracking thread, dynamic vegetable regions are detected and masked out via object detection, retaining only static background feature points for camera pose estimation. Subsequently, RGB images and depth maps are fused to generate colored 3D point clouds (containing X, Y, Z coordinates and RGB color information). IMU data is tightly coupled to refine the inter-frame transformation matrices during point cloud registration, enabling precise stitching of point clouds from multiple keyframes and reduction in redundant points in overlapping regions. Finally, loop closure detection and global bundle adjustment are applied to construct a large-scale, metrically consistent, dense point cloud map (Figure 2b). This approach maintains the high real-time performance of ORB-SLAM3 while significantly enhancing the geometric completeness, density, and robustness of the reconstructed scenes, providing a reliable data foundation for subsequent plant height measurement.

2.3.2. World Coordinate System Unification and Background Removal

In outdoor agricultural 3D modeling, coordinate registration methods based on ground control points (GCPs) are well-established [11,17,28]. For instance, Jamil et al. [11] utilized 10 GCPs placed 53 m apart to achieve centimeter-level Digital Terrain Model (DTM) modeling, a technique suited for large-scale scenarios (>50 m). In contrast, for the small-scale experimental field in this study (with both length and width < 7 m), we propose an alternative method for constructing a coordinate system based on a visual calibration target.
An 8 × 11 checkerboard calibration board was employed as the reference for the world coordinate system. The board was carefully leveled using a spirit level, ensuring a verticality error of less than 0.05°. Feature points were then detected with high precision using the Harris corner detection algorithm, resulting in a single-point pixel error of less than 1.2 px. Leveraging the camera calibration results from Section 2.1, we applied the EPnP algorithm [29] to solve the rigid transformation parameters from the world coordinate system to the camera coordinate system. This included the rotation matrix R ∈ SO(3) and translation vector TR3, which together define the transformation that maps any world coordinate point P  = [XW, YW, ZW] T to its corresponding camera coordinate p  = [XC, YC, ZC] T, as described by Equation (1).
In small-scale scenarios, this method can effectively satisfy the requirements for coordinate transformation. However, as the spatial scale increases, it becomes necessary to employ multiple calibration boards or GCPs to achieve large-area map calibration (Section 3.1 for the relevant error analysis).
p   = R   ·   P   + T
A point p in the camera coordinate system is transformed back to the world coordinate system as Q = [X, Y, Z] T, which corresponds to the actual 3D location of the physical calibration corner. The effect of this coordinate transformation is illustrated in Figure 2c, and the transformation formula is provided in Equation (2).
Q   = R   1   ·   P     T  
The initial point cloud contained non-field objects (e.g., ridges, pathways, and the calibration board) and exhibited boundary artifacts from multi-frame stitching (indicated by the red areas in Figure 2d). To remove these artifacts and background elements, a two-step filtering pipeline was implemented. First, spatial threshold filtering, based on the known physical dimensions of the experimental plot, was applied to remove the majority of non-target points. Second, anisotropic Gaussian filtering was used to smooth jagged edges, followed by connected component analysis to isolate and retain only the point cloud corresponding to the planting area. This workflow effectively achieves both coordinate system unification and background removal, providing a clean and consistent spatial reference for subsequent analysis.

2.4. Plant and Ground Segmentation with Reference Surface Reconstruction

The acquired 3D point cloud contained mixed data of plants and the ground, requiring effective segmentation to extract distinct plant and ground information. The short growth cycle of leafy vegetables, minimal ground deformation after transplantation, and the high completeness of the early-stage ground point cloud made this period ideal for establishing a persistent reference surface, providing reliable ground elevation data for subsequent continuous plant height measurement.

2.4.1. Plant-Ground Segmentation

The raw point cloud was initially denoised. Discrete noise was addressed using the Statistical Outlier Removal (SOR) algorithm [30], which analyzes the local point density around each point to identify and remove statistical outliers, resulting in a smoother point cloud. Given the standardized field management (weed-free) and the optimal data acquisition period (low ambient light interference), the pre-processed point cloud predominantly consisted of the ground surface and Choy Sum plants.
Plant-ground segmentation was performed using color-depth fusion data from the RGB-D camera, based on chromaticity analysis. After evaluating performance across multiple color spaces (RGB, HSV, and YCrCb), the Excess Green (ExG) index in the RGB space (Figure 2e) proved most effective for this application. The ExG index leverages the chlorophyll reflection properties, which cause plant points to exhibit a stronger green (G) channel response, while the soil surface typically shows a dominant red (R) channel response, allowing for reliable separation between plants and ground.
The segmentation result contained two primary types of noise:
  • High-frequency jagged noise at the leaf-ground boundary, caused by perspective projection and spectral aliasing, which risked misclassification as ground points.
  • Discrete outliers from green pixels projected onto the ground, which could be misidentified as plant points.
To address these, anisotropic filtering [31] was applied to smooth the edge artifacts, while radius filtering (search radius: 5 cm; minimum neighbors: 20; these parameters were finalized through iterative testing and visual evaluation; specific values should be adjusted according to individual application requirements) [32] was used to remove the discrete outliers. This resulted in high-fidelity plant and ground segmentation, ready for further analysis.

2.4.2. Surface Reconstruction

The reference ground surface was reconstructed in two stages: ground point cloud filtering and surface interpolation (Figure 3).
First, the Cloth Simulation Filter (CSF) algorithm [33] was applied. This algorithm simulates a virtual cloth settling under gravity onto the inverted point cloud, utilizing the geometric continuity of the ground to adaptively separate and remove residual vegetation and debris (Figure 3c).
Next, to address localized gaps in the filtered ground point cloud, Kriging interpolation [34] was employed to reconstruct a continuous terrain surface. This geostatistical method models the spatial autocorrelation of elevation data via a semivariogram. A Gaussian covariance function and least-squares criteria were used to optimize the spatial weighting for interpolation. Simultaneously, a standard deviation field was generated to quantify spatial uncertainty (Figure 3d). Compared to deterministic interpolation methods, Kriging interpolation not only effectively fills data gaps but also provides an error estimate for subsequent plant height measurements. The result is a high-precision, continuous DTM that serves as the ground reference.

2.5. Multi-Temporal Point Cloud Fusion

This stage focuses on fusing the reconstructed reference ground model (described in Section 2.4.2) with the plant point clouds collected across different time periods. The fusion process varies significantly between the early and late growth stages.
In the early growth stage, both the reference ground model and the plant point clouds come from data captured in the same acquisition sequence (Figure 4a). As a result, they are inherently aligned in the same coordinate system, introducing no additional spatial registration errors (Figure 4b). The fusion of these datasets enhances the integrity of the ground surface, ensuring high accuracy for plant height measurements.
In the late growth stage, the reference ground model, derived from early-stage data, is fused with plant point clouds from a later period. This requires cross-temporal registration, using the Harris corner detection and EPnP algorithms outlined in Section 2.3.2. This process introduces some spatial alignment error (see Section 4.1 for quantitative error analysis). As shown in Figure 4c, the ground in the original point cloud is heavily occluded by the dense canopy and contains substantial noise, making it unsuitable for direct measurement. In contrast, after multi-temporal fusion (Figure 4d), the ground reference is effectively recovered, providing a reliable and accurate reference surface for plant height calculation.

2.6. Plant Height Extraction

2.6.1. Measurement Region Delineation and Coordinate Correction

Accurate measurement of individual plant height requires the delineation of a measurement region for each plant and applying a coordinate correction to eliminate systematic errors induced by local ground slope. As outlined in Section 2.2, the strategy for defining this region differs between the early and late growth stages.
Necessity of geometric formulations. The introduction of Equations (1)–(7) is essential to ensure consistent and accurate plant height estimation across multiple growth stages. Specifically, Equations (1) and (2) transform all reconstructed point clouds into a unified world coordinate system defined by the fixed checkerboard, which is a prerequisite for multi-temporal alignment. Equations (3)–(7) further address local ground inclination and pose-induced bias by aligning the fitted local ground plane with the global vertical (Z-axis). Without these geometric transformations and corrections, canopy-to-ground height differences would be affected by coordinate drift and local terrain slope, leading to systematic errors, particularly during late growth stages when multi-temporal fusion is required.
In the early stage of vegetable growth, the distance between plants is relatively large, making it easy to distinguish individual plants. A fast Euclidean clustering algorithm is first employed to segment the point cloud, effectively isolating individual plants (Figure 5a), with distinct colors representing each plant. Next, based on the projection of a plant’s canopy points onto the XY plane of the world coordinate system, a circular measurement region, slightly larger than the canopy extent, is automatically delineated. This region contains both the plant’s points and the corresponding portion of the reconstructed reference ground model (Figure 5b).
To mitigate measurement bias due to local ground slope, a coordinate correction is applied. The RANSAC algorithm is used to fit a plane to the local ground points within the delineated region, defined by the equation:
A X   +   B Y   +   C Z   +   D   =   0
where n  = (A, B, C) is the normal vector of the fitted plane and D is the distance from the origin to the plane. The angle θ between this normal vector and the world coordinate system’s Z-axis vector k  = (0, 0, 1) is computed using the dot product formula:
θ   =   arccos n   ·   k / n     ×   k  
The axis of rotation c is determined by the cross product of vectors n and k:
c   =   n ×   k
Let u be the unit vector in the direction of c (i.e., u   =   c   / c   ). Using the angle θ obtained above, the rotation matrix Rrot that aligns the RANSAC-fitted plane with the world coordinate XY plane is computed using the Rodrigues’ rotation formula:
R rot   =   I   +   sin θ   · u ×   +   1     cos θ · u × 2
where I is the 3 × 3 identity matrix and u × is the skew-symmetric matrix form of vector u. Let Po ∈ R3 × N denote the original point cloud, where each column corresponds to a 3D point in homogeneous coordinates. The corrected point cloud Pc is then obtained by applying this rotation matrix:
P c   =   R rot   ·     P o
where t PcR3 × N represents the rotated point cloud aligned with the XY plane. The coplanarity between the local ground surface and the world coordinate system’s XY-plane is significantly enhanced in the corrected point cloud (as shown by comparing Figure 5b,c,e,f), thereby providing a more reliable reference benchmark for subsequent plant height measurement.
During the late growth stage, severe canopy closure and mutual leaf occlusion make it impossible to automatically delineate measurement regions through projection. To overcome these challenges, our approach leverages two key characteristics: the fixed planting spacing of Choy Sum (15 cm) and its radial leaf expansion pattern from the center.
The previously acquired center coordinates of individual plants from the early growth stage (Figure 5d) served as anchor points for the late-stage measurements. We manually delineated circular measurement regions centered on these coordinates (Figure 5e) and applied the identical coordinate correction methodology described previously (Figure 5f). This strategy ensures measurement consistency across growth stages and significantly improves the reliability of plant height extraction during the critical late growth phase.
Although Equations (1)–(7) are based on standard geometric formulations widely used in computer vision and point cloud processing, they are indispensable in the proposed framework. Their systematic integration explicitly enforces multi-temporal coordinate consistency, vertical height normalization, and local ground inclination correction. This ensures that plant height is always measured along a unified and physically meaningful vertical direction, even under uneven terrain and severe canopy occlusion in later growth stages.

2.6.2. Plant Height Calculation

Plant height is determined by calculating the maximum vertical distance between the canopy points and the reference ground surface within the coordinate-corrected measurement region for each plant. The calculation process begins by considering an individual canopy point with coordinates (x, y, z). To estimate the ground elevation directly beneath this point, the method first projects the point onto the XY-plane at coordinate (x, y). Since no ground point exists exactly at this projected location due to non-overlapping distributions of canopy and ground points after correction (Figure 6a), the algorithm identifies the four nearest neighboring ground points to (x, y) in the XY-plane. The average z-coordinate of these four points, denoted as zg, is computed to generate an interpolated ground point at (x, y, zg).
The vertical distance from the canopy point to the ground is then calculated as Δz (Figure 6b):
Δ z   =   z     z g  
This process is repeated for all canopy points within the measurement region, generating a set of vertical distance values. The final plant height H is determined as the maximum value from this set of calculated Δz values, representing the highest point of the plant relative to the reconstructed ground surface beneath it.
Plant height measurement methods are typically validated by comparing automated results with manual ground-truth values using MAE and R2. This study adopts the same evaluation metrics to assess method accuracy.

3. Results

3.1. Terrain Error Analysis

To address missing data in the original ground point cloud and to select an appropriate interpolation method, terrain reconstruction accuracy was compared between the Inverse Distance Weighting (IDW) method and Kriging, using the ground data described in Section 2.4.2. Due to gaps in the sampled ground points, interpolation was required to generate a continuous reference surface.
For a fair comparison, 45,086 points were uniformly sampled from the interpolated surfaces of both methods using nearest-neighbor search, and their elevation errors were computed against the corresponding original ground points. The error distributions are shown in Figure 7. Kriging produced scatter points that were more concentrated around the regression line (Figure 7b), with an RMSE of 2.364 mm, which was lower than that of IDW (3.295 mm). Therefore, Kriging was used for terrain reconstruction in subsequent analyses.
To further characterize the Kriging-based terrain reconstruction, an elevation distribution map and a prediction standard error map were generated (Figure 8a,b). The elevation map indicates that terrain height ranges from −16.30 cm to 42.06 cm, with a total relief of 58.36 cm across the experimental plot (Figure 8a). The prediction standard error map shows lower uncertainty in the interior regions (2.2–2.5 mm) and higher uncertainty near plot boundaries (2.5–3.9 mm), consistent with the spatial distribution of available ground points (Figure 8b).
Overall, Kriging interpolation achieved an R2 of 0.9993 with an RMSE of 2.364 mm. Most interior areas exhibited prediction standard errors between 2.2 mm and 2.5 mm, while higher uncertainty was observed near the plot edges.

3.2. Plant Height Measurement Analysis

Continuous data collection was conducted over a 25-day period following the transplantation of Choy Sum, covering key growth stages from day 5 to maturity. To represent early and late growth conditions, point cloud data acquired on day 9 and day 21 after transplantation were selected for plant height evaluation.
The experimental field contained 202 plants with complete data at both time points. Due to the requirement of manual delineation of measurement regions during the late growth stage, 60 plants were selected for quantitative evaluation using a sampling strategy. Specifically, 10 plants were selected at 1 m intervals from near to far relative to the checkerboard calibration target.
Plant height was estimated using a terrain-referenced point cloud–based algorithm, in which a Kriging-interpolated ground surface served as the reference terrain model. For each plant, height was calculated as the vertical difference between canopy points and the corresponding terrain elevation within the defined measurement region, as described in Section 2.6.
To further evaluate the contribution of key components in the proposed framework, a component analysis was conducted based on plant height estimation accuracy. A direct ablation that removes the reference ground model is not feasible for late growth stages because the ground surface is severely occluded in the raw RGB-D point clouds, making canopy-to-ground height computation ill-posed without a persistent ground reference.
Therefore, the analysis focuses on components that can be evaluated under the same experimental setting. Specifically, we compared the full proposed method with a variant in which the local ground plane correction was disabled. In the ablated variant, plant height was computed directly using the original world coordinate system without applying the RANSAC-based plane fitting and rotation alignment.
The comparison results are summarized in Table 1. Removing the local plane correction leads to increased estimation error, particularly under uneven terrain conditions, demonstrating that local plane normalization plays an important role in ensuring accurate and robust plant height estimation.
The comparison between estimated plant height and manual measurements is shown in Figure 9. Strong correlations were observed at both growth stages, with coefficients of determination (R2) of 0.902 for the early stage and 0.855 for the late stage. The mean absolute error (MAE) was 7.19 mm in the early growth stage and 18.45 mm in the late growth stage.

4. Discussion

4.1. Impact of Coordinate Transformation Errors on Plant Height Estimation

The accuracy of plant height estimation in the proposed framework is closely related to the quality of coordinate transformation during multi-temporal point cloud alignment. Although all point clouds are transformed into a unified world coordinate system (WCS) defined by a fixed checkerboard calibration board, residual transformation errors can still propagate into height measurements under specific conditions.
In the early stage of growth, both the reference ground surface and plant point clouds are acquired within the same data collection sequence and transformed into the WCS using identical transformation parameters. As a result, relative height measurements between the canopy and ground are preserved, and coordinate transformation errors do not significantly affect plant height estimation at this stage.
In contrast, during the later growth stages, the reference ground model reconstructed from early-stage data must be fused with plant point clouds acquired at later time points. Because these datasets undergo independent coordinate transformations, residual errors in camera pose estimation and transformation parameters directly influence the accuracy of canopy-to-ground height differences. This effect becomes more pronounced as the distance between the target plant and the calibration board increases, consistent with the error propagation characteristics of the pinhole camera model [29].
This distance-dependent error behavior explains the increasing residual magnitude and variability observed for plants located farther from the checkerboard in Figure 9b. Although the adopted coordinate transformation strategy provides sufficient precision for small-scale field plots, the accumulation of residual transformation errors represents a dominant source of uncertainty in late-stage plant height estimation when multi-temporal fusion is required.
Overall, the experimental results indicate that coordinate transformation errors have a negligible impact on early-stage measurements but become an important limiting factor for late-stage plant height estimation. By constraining the measurement area and using a fixed calibration reference, the proposed framework maintains measurement accuracy within acceptable limits for field-grown leafy vegetables. For larger measurement areas, additional spatial reference constraints would be required to suppress error accumulation.

4.2. Influence of Terrain Reconstruction Accuracy on Plant Height Estimation

Accurate terrain reconstruction is a prerequisite for reliable plant height estimation in open-field environments with uneven ground surfaces. In the proposed framework, plant height is defined as the vertical difference between canopy points and a reconstructed reference terrain surface. Consequently, any error in terrain elevation directly propagates into the estimated plant height.
As shown in Figure 8a, the experimental plot exhibits a vertical terrain relief of 58.36 cm, which is comparable to or exceeds the height of mature leafy vegetables. Under such conditions, assuming a flat or locally uniform ground surface would lead to systematic bias in canopy-to-ground height estimation. The Kriging-based terrain reconstruction provides a continuous reference surface that preserves local elevation variations and reduces this source of error.
The prediction standard error map in Figure 8b further indicates that terrain reconstruction uncertainty is spatially heterogeneous. Lower standard errors are observed in interior regions with dense ground point coverage, while higher uncertainty occurs near plot boundaries where ground data are sparse. This spatial pattern implies that plant height estimation is more sensitive to terrain reconstruction accuracy in edge regions, which should be considered when interpreting measurement results.
Overall, the terrain reconstruction accuracy achieved in this study (RMSE = 2.364 mm) provides a stable and sufficiently precise ground reference for plant height estimation at the centimeter scale. When combined with multi-temporal point cloud alignment, this approach enables reliable canopy-to-ground height measurement even under conditions of severe canopy occlusion during later growth stages.

4.3. Overall Performance and Comparison with Existing Methods

The results presented in this study demonstrate that reliable plant height estimation for field-grown leafy vegetables can be achieved across the full growth cycle, despite the challenges posed by dense canopy occlusion and uneven terrain. This capability is particularly important for short-stature crops with high planting density, where ground visibility rapidly decreases during later growth stages and conventional ground-referenced methods become unreliable.
Building upon previous studies by Obanawa, Dong, Zhang, and colleagues [12,13,22], this study adopts the canopy-to-ground height difference as the basis for plant height estimation, thereby avoiding the need for full-plant reconstruction. Compared with LiDAR-based or multi-view stereo approaches, which often require expensive hardware or are sensitive to canopy occlusion, the proposed framework leverages RGB-D sensing and multi-temporal point cloud alignment to balance measurement accuracy and system practicality under open-field conditions.
Several design choices contribute to the observed performance. The use of a fixed checkerboard calibration target provides a stable local spatial reference, enabling accurate coordinate unification in small field plots without relying on GPS. In addition, the reconstruction of a persistent reference ground surface from early-stage data allows plant height estimation to remain feasible even when the ground is fully occluded during later growth stages. This temporal decoupling of ground modeling and canopy measurement distinguishes the proposed method from frame-level RGB-D approaches that are typically limited to early growth stages.
Under consistent growth conditions, the proposed method achieved a mean absolute error of 7.19 mm in the early growth stage, which is lower than the 9.23 mm reported by Zhao et al. [22] for lettuce height measurement in a potted environment. Although direct numerical comparison across studies should be interpreted with caution due to differences in crop type and experimental setup, this result indicates that the proposed framework can achieve competitive accuracy in open-field environments characterized by uneven terrain and severe canopy occlusion. More importantly, the method uniquely supports continuous monitoring throughout the entire growth cycle, which remains a key limitation for many existing RGB-D–based plant height measurement approaches.
A quantitative comparison between the proposed method and representative state-of-the-art approaches is summarized in Table 2. Although direct numerical comparison is constrained by differences in crop type, sensor configuration, and experimental conditions, the proposed framework achieves competitive accuracy while uniquely supporting continuous plant height monitoring throughout the entire growth cycle under open-field conditions with dense canopy occlusion.
To further differentiate the proposed approach from existing plant height estimation methods, qualitative aspects related to sensing modality, platform characteristics, reference and alignment strategies, as well as cost and computational considerations, are summarized in Table 3.

4.4. Limitations and Future Research Directions

Despite the promising performance of the proposed framework, several limitations should be acknowledged. First, although the use of a persistent reference ground surface alleviates the impact of ground occlusion, extremely dense or overlapping canopies may still affect the accurate delineation of individual plant regions, particularly during late growth stages with severe inter-leaf occlusion. Second, the performance of RGB-D sensors is inherently sensitive to illumination conditions and surface reflectance. While data acquisition in this study was conducted under relatively stable lighting conditions, strong sunlight, shadows, or highly reflective leaf surfaces may introduce additional noise in depth measurements. Future work will investigate illumination-robust sensing strategies and adaptive filtering methods to improve measurement stability under varying field conditions. Third, the scalability of the proposed framework to larger field plots is constrained by coordinate transformation accuracy and computational requirements. Expanding the measurement area would require additional calibration targets or ground control points to suppress error accumulation, as well as algorithmic optimization to reduce processing time and memory consumption.
Beyond plant height estimation, the proposed multi-temporal framework provides a foundation for integrating structural information with other data sources. Future research may explore the fusion of time-series plant height data with multispectral indices, soil properties, or environmental data to model downstream agronomic variables, such as biomass accumulation, nitrogen uptake, plant health status, or variable-rate fertilization prescriptions. Such integration represents a natural and valuable extension of the proposed approach toward data-driven decision support in precision agriculture.

5. Conclusions

This study proposes a multi-temporal point cloud alignment framework for accurate plant height estimation of field-grown leafy vegetables. By integrating a unified world coordinate system with a persistent reference ground model reconstructed from early-stage data, the proposed method enables continuous and reliable height monitoring throughout the full growth cycle, even under severe canopy occlusion.
Field experiments on Choy Sum demonstrated that the proposed method achieved mean absolute errors of 7.19 mm and 18.45 mm in the early and late growth stages, respectively, with coefficients of determination exceeding 0.85. These results indicate that the framework is well suited for high-throughput phenotyping of short-stature, densely planted leafy vegetables in open-field environments, where conventional ground-based or frame-level methods often fail.

Author Contributions

Q.W.: conceptualization, methodology, software, data curation, writing—original draft. K.Y.: formal analysis, investigation, writing—review and editing. Y.L.: data curation, visualization. Y.S.: data curation, visualization. Z.Z.: validation, supervision, project administration, funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

The authors would like to acknowledge the support of this study by the State Key Research Program of China (Grant No. 2022YDF2001901-01).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

During the preparation of this work the authors used ChatGPT 5.0 in order to Check and optimize the English translation. After using this tool, the authors reviewed and edited the content as needed and take full responsibility for the content of the published article.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Guan, H.; Liu, M.; Ma, X.; Yu, S. Three-dimensional reconstruction of soybean canopies using multisource imaging for phenotyping analysis. Remote Sens. 2018, 10, 1206. [Google Scholar] [CrossRef]
  2. Zhang, Y.; Liu, P.; Zhang, X.; Zheng, Q.; Chen, M.; Ge, F.; Li, Z.; Sun, W.; Guan, Z.; Liang, T.; et al. Multi-locus genome-wide association study reveals the genetic architecture of stalk lodging resistance-related traits in maize. Front. Plant Sci. 2018, 9, 611. [Google Scholar] [CrossRef] [PubMed]
  3. Grenzdörffer, G.J. Crop height determination with uas point clouds, The International Archives of the Photogrammetry. Remote Sens. Spat. Inf. Sci. 2014, 40, 135–140. [Google Scholar] [CrossRef]
  4. Qiu, R.; Miao, Y.; Ji, Y.; Zhang, M.; Li, H.; Liu, G. Measurement of individual maize height based on rgb-d camera. Trans. Chin. Soc. Agric. 2017, 48, 211. [Google Scholar] [CrossRef]
  5. Galkovskyi, T.; Mileyko, Y.; Bucksch, A.; Moore, B.; Symonova, O.; Price, C.A.; Topp, C.N.; Iyer-Pascuzzi, A.S.; Zurek, P.R.; Fang, S.; et al. Gia roots: Software for the high throughput analysis of plant root system architecture. BMC Plant Biol. 2012, 12, 116. [Google Scholar] [CrossRef]
  6. Paproki, A.; Sirault, X.; Berry, S.; Furbank, R.; Fripp, J. A novel mesh processing based technique for 3d plant analysis. BMC Plant Biol. 2012, 12, 63. [Google Scholar] [CrossRef]
  7. Gibbs, J.A.; Pound, M.; French, A.P.; Wells, D.M.; Murchie, E.; Pridmore, T. Plant phenotyping: An active vision cell for three-dimensional plant shoot reconstruction. Plant Physiol. 2018, 178, 524–534. [Google Scholar] [CrossRef]
  8. Smith, L.N.; Zhang, W.; Hansen, M.F.; Hales, I.J.; Smith, M.L. Innovative 3d and 2d machine vision methods for analysis of plants and crops in the field. Comput. Ind. 2018, 97, 122–131. [Google Scholar] [CrossRef]
  9. Gilliot, J.M.; Michelin, J.; Hadjard, D.; Houot, S. An accurate method for predicting spatial variability of maize yield from uav-based plant height estimation: A tool for monitoring agronomic field experiments. Precis. Agric. 2021, 22, 897–921. [Google Scholar] [CrossRef]
  10. Oehme, L.H.; Reineke, A.; Weiß, T.M.; Würschum, T.; He, X.; Müller, J. Remote sensing of maize plant height at different growth stages using uav-based digital surface models (dsm). Agronomy 2022, 12, 958. [Google Scholar] [CrossRef]
  11. Jamil, N.; Kootstra, G.; Kooistra, L. Evaluation of individual plant growth estimation in an intercropping field with uav imagery. Agriculture 2022, 12, 102. [Google Scholar] [CrossRef]
  12. Dong, J.; Burnham, J.G.; Boots, B.; Rains, G.C.; Dellaert, F. 4D crop monitoring: Spatio-temporal reconstruction for agriculture. In Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore, 29 May–3 June 2017; pp. 3878–3885. [Google Scholar] [CrossRef]
  13. Obanawa, H.; Yoshitoshi, R.; Watanabe, N.; Sakanoue, S. Portable lidar-based method for improvement of grass height measurement accuracy: Comparison with sfm methods. Sensors 2020, 20, 4809. [Google Scholar] [CrossRef] [PubMed]
  14. Shu, F.; Lesur, P.; Xie, Y.; Pagani, A.; Stricker, D. SLAM in the Field: An evaluation of monocular mapping and localization on challenging dynamic agricultural environment. arXiv 2020, arXiv:2011.01122. [Google Scholar] [CrossRef]
  15. Sun, B.; Zain, M.; Zhang, L.; Han, D.; Sun, C. Stem-leaf segmentation and morphological traits extraction in rapeseed seedlings using a three-dimensional point cloud. Agronomy 2025, 15, 276. [Google Scholar] [CrossRef]
  16. Banerjee, B.P.; Spangenberg, G.; Kant, S. Cbm: An iot enabled lidar sensor for in-field crop height and biomass measurements. Biosensors 2022, 12, 16. [Google Scholar] [CrossRef]
  17. Harkel, J.T.; Bartholomeus, H.; Kooistra, L. Biomass and crop height estimation of different crops using uav-based lidar. Remote Sens. 2020, 12, 17. [Google Scholar] [CrossRef]
  18. Qiu, R.; Zhang, M.; He, Y. Field estimation of maize plant height at jointing stage using an rgb-d camera. Crop J. 2022, 10, 1274–1283. [Google Scholar] [CrossRef]
  19. González-Barbosa, J.; Ramírez-Pedraza, A.; Ornelas-Rodríguez, F.; Cordova-Esparza, D.; González-Barbosa, E. Dynamic measurement of portos tomato seedling growth using the kinect 2.0 sensor. Agriculture 2022, 12, 449. [Google Scholar] [CrossRef]
  20. Zhao, Y.; Zhang, X.; Sun, J.; Yu, T.; Cai, Z.; Zhang, Z.; Mao, H. Low-cost lettuce height measurement based on depth vision and lightweight instance segmentation model. Agriculture 2024, 14, 1596. [Google Scholar] [CrossRef]
  21. Zhang, J.; Xing, W.; Song, X.; Cui, Y.; Li, W.; Zheng, D. Buckwheat plant height estimation based on stereo vision and a regression convolutional neural network under field conditions. Agronomy 2023, 13, 2312. [Google Scholar] [CrossRef]
  22. Zhang, J.; Xing, W.; Ye, S.; Zhang, C.; Zheng, D. Oat plant height estimation based on a dual output regression convolutional neural network. Sci. Agric. Sin. 2024, 57, 3974–3985. [Google Scholar] [CrossRef]
  23. Jiang, Y.; Li, C.; Paterson, A.H.; Sun, S.; Xu, R.; Robertson, J. Quantitative analysis of cotton canopy size in field conditions using a consumer-grade rgb-d camera. Front. Plant Sci. 2018, 8, 2233. [Google Scholar] [CrossRef] [PubMed]
  24. Yang, S.; Gao, W.; Mi, J.; Wu, M.; Wang, M.; Zheng, L. Method for measurement of vegetable seedlings height based on rgb-d camera. Trans. Chin. Soc. Agric. Mach. 2019, 50, 128–135. [Google Scholar] [CrossRef]
  25. Li, Y.; Shen, H.; Fu, Y.; Wang, K. A method of dense point cloud slam based on improved yolov8 and fused with orb-slam3 to cope with dynamic environments. Expert Syst. Appl. 2024, 255, 124918. [Google Scholar] [CrossRef]
  26. Huo, J.; Zhou, C.; Yuan, B.; Yang, Q.; Wang, L. Real-time dense reconstruction with binocular endoscopy based on stereonet and orb-slam. Sensors 2023, 23, 2074. [Google Scholar] [CrossRef]
  27. Sewtz, M.; Fanger, Y.; Luo, X.; Bodenmüller, T.; Triebel, R. Indoormcd: A benchmark for low-cost multi-camera slam in indoor environments. IEEE Robot. Autom. Lett. 2023, 8, 1707–1714. [Google Scholar] [CrossRef]
  28. Moeckel, T.; Dayananda, S.; Nidamanuri, R.R.; Nautiyal, S.; Hanumaiah, N.; Buerkert, A.; Wachendorf, M. Estimation of vegetable crop parameter by multi-temporal uav-borne images. Remote Sens. 2018, 10, 805. [Google Scholar] [CrossRef]
  29. Lepetit, V.; Moreno-Noguer, F.; Fua, P. Epnp: An accurate o(n) solution to the pnp problem. Int. J. Comput. Vis. 2009, 81, 155–166. [Google Scholar] [CrossRef]
  30. Rusu, R.B.; Marton, Z.C.; Blodow, N.; Dolha, M.; Beetz, M. Towards 3d point cloud based object maps for household environments. Robot. Auton. Syst. 2008, 56, 927–941. [Google Scholar] [CrossRef]
  31. Moorfield, B.; Haeusler, R.; Klette, R. Bilateral filtering of 3d point clouds for refined 3d roadside reconstructions. In Proceedings of the International Conference on Computer Analysis of Images and Patterns, Helsinki, Finland, 2–4 September 2015; pp. 394–402. [Google Scholar] [CrossRef]
  32. Balta, H.; Velagić, J.; Bosschaerts, W.; Cubber, G.D.; Siciliano, B. Fast statistical outlier removal based method for large 3d point clouds of outdoor environments. IFAC-PapersOnLine 2018, 51, 348–353. [Google Scholar] [CrossRef]
  33. Zhang, W.; Qi, J.; Wan, P.; Wang, H.; Xie, D.; Wang, X.; Yan, G. An easy-to-use airborne lidar data filtering method based on cloth simulation. Remote Sens. 2016, 8, 501. [Google Scholar] [CrossRef]
  34. Coburn, T.C. Geostatistics for Natural Resources Evaluation. Technometrics 2000, 42, 437–438. [Google Scholar] [CrossRef]
Figure 1. The mobile field phenotyping platform. (a) Schematic diagram of the platform: 1—telescopic data collection frame; 2—spirit level; 3—checkerboard calibration board; 4—mini pan-tilt unit; 5—RealSense D435i camera (Intel Corporation, Santa Clara, CA, USA); 6—computing unit (laptop). (b) Photograph of the platform operating in the field.
Figure 1. The mobile field phenotyping platform. (a) Schematic diagram of the platform: 1—telescopic data collection frame; 2—spirit level; 3—checkerboard calibration board; 4—mini pan-tilt unit; 5—RealSense D435i camera (Intel Corporation, Santa Clara, CA, USA); 6—computing unit (laptop). (b) Photograph of the platform operating in the field.
Agriculture 16 00280 g001
Figure 2. Workflow of the proposed plant height measurement framework. (a) Data Input: RGB image, Depth image, and IMU data. (b) Point Cloud Map Reconstruction. (c) World Coordinate System Transformation. (d) Background Removal (Red: removed background). (e) Ground and Plant Segmentation (Deep green: plant point clouds). (f) Reference Ground Modeling (Kriging Interpolation). (g) Multi-Temporal Point Cloud Fusion. (h) Individual Plant Segmentation (Euclidean Clustering). (i) Measurement Area Delineation and Pose Correction (red: the measured plant points; light green: measurement area). (j) Plant Height Calculation.
Figure 2. Workflow of the proposed plant height measurement framework. (a) Data Input: RGB image, Depth image, and IMU data. (b) Point Cloud Map Reconstruction. (c) World Coordinate System Transformation. (d) Background Removal (Red: removed background). (e) Ground and Plant Segmentation (Deep green: plant point clouds). (f) Reference Ground Modeling (Kriging Interpolation). (g) Multi-Temporal Point Cloud Fusion. (h) Individual Plant Segmentation (Euclidean Clustering). (i) Measurement Area Delineation and Pose Correction (red: the measured plant points; light green: measurement area). (j) Plant Height Calculation.
Agriculture 16 00280 g002
Figure 3. Workflow for reconstructing the reference ground surface. (a) Original ground point cloud. (b) Cloth Simulation Filtering (CSF). (c) Filtered ground point cloud. (d) Reference Surface Reconstruction (Kriging Interpolation). (Red: removed points; blue: reconstructed reference surface).
Figure 3. Workflow for reconstructing the reference ground surface. (a) Original ground point cloud. (b) Cloth Simulation Filtering (CSF). (c) Filtered ground point cloud. (d) Reference Surface Reconstruction (Kriging Interpolation). (Red: removed points; blue: reconstructed reference surface).
Agriculture 16 00280 g003
Figure 4. Results of multi-temporal point cloud fusion for early and late growth stages. (a) Original point cloud (early stage). (b) Fused point cloud (early stage). (c) Original point cloud (late stage). (d) Fused point cloud (late stage). (Deep green: plant point clouds across different growth stages; blue: reconstructed reference surface).
Figure 4. Results of multi-temporal point cloud fusion for early and late growth stages. (a) Original point cloud (early stage). (b) Fused point cloud (early stage). (c) Original point cloud (late stage). (d) Fused point cloud (late stage). (Deep green: plant point clouds across different growth stages; blue: reconstructed reference surface).
Agriculture 16 00280 g004
Figure 5. Delineation and correction of the measurement region for individual plant height extraction. (a) Individual plant segmentation (early stage). (b) Initial circular region delineation (early stage). (c) Pose-corrected region (early stage). (d) Plant center locations (early stage). (e) Manual region delineation (late stage). (f) Pose-corrected region (late stage). (Light green: reference ground model; red: the measured plant points; purple box: illustrates the ground plane pose before and after correction.).
Figure 5. Delineation and correction of the measurement region for individual plant height extraction. (a) Individual plant segmentation (early stage). (b) Initial circular region delineation (early stage). (c) Pose-corrected region (early stage). (d) Plant center locations (early stage). (e) Manual region delineation (late stage). (f) Pose-corrected region (late stage). (Light green: reference ground model; red: the measured plant points; purple box: illustrates the ground plane pose before and after correction.).
Agriculture 16 00280 g005
Figure 6. Schematic of the plant height calculation method. (a) Top-down view (XY-plane) of the measurement region. (b) Vertical distance calculation between a canopy point and the interpolated ground surface. (Light green: reference ground model; red: the measured plant points).
Figure 6. Schematic of the plant height calculation method. (a) Top-down view (XY-plane) of the measurement region. (b) Vertical distance calculation between a canopy point and the interpolated ground surface. (Light green: reference ground model; red: the measured plant points).
Agriculture 16 00280 g006
Figure 7. Scatter plots of terrain interpolation errors: (a) Inverse Distance Weighting (IDW) method; (b) Kriging method.
Figure 7. Scatter plots of terrain interpolation errors: (a) Inverse Distance Weighting (IDW) method; (b) Kriging method.
Agriculture 16 00280 g007
Figure 8. Accuracy evaluation of terrain reconstruction: (a) Elevation Map. (b) Prediction Standard Error Map. (Warm colors indicate higher elevation (or larger error), while cool colors indicate lower values).
Figure 8. Accuracy evaluation of terrain reconstruction: (a) Elevation Map. (b) Prediction Standard Error Map. (Warm colors indicate higher elevation (or larger error), while cool colors indicate lower values).
Agriculture 16 00280 g008
Figure 9. Comparison between manually measured plant height and estimated plant height derived using the terrain-referenced point cloud–based plant height estimation algorithm: (a) early growth stage; (b) late growth stage.
Figure 9. Comparison between manually measured plant height and estimated plant height derived using the terrain-referenced point cloud–based plant height estimation algorithm: (a) early growth stage; (b) late growth stage.
Agriculture 16 00280 g009
Table 1. Component analysis of plant height estimation accuracy.
Table 1. Component analysis of plant height estimation accuracy.
MethodEarly-Stage MAE (mm)Late-Stage MAE (mm)R2 (Late Stage)
Full proposed method7.1918.450.855
Without local plane correction8.1324.630.692
Table 2. Performance comparison of plant height estimation methods using different sensing modalities and experimental setups.
Table 2. Performance comparison of plant height estimation methods using different sensing modalities and experimental setups.
StudySensor TypeExperimental EnvironmentCrop TypeGrowth StageMAE (mm)R2
Obanawa et al. [13]LiDARGrasslandItalian ryegrassSingle
Dong et al. [12]RGB-DFieldRow cropsMulti-temporal
Qiu et al. [18]RGB-DFieldMaizeJointing>0.95
Zhao et al. [20]RGB-DPotted (Indoor)LettuceEarly9.230.89
Zhang et al. [22]Multi-view RGB Open FieldOatSingle23.0 avg/22.4 max
Jiang et al. [23]RGB-DFieldCotton Canopy development 0.35–0.71
This studyRGB-DOpen FieldChoy SumFull Growth Cycle7.19/18.450.902/0.855
Table 3. Comparison of representative plant height measurement methods in terms of methodology and applicability.
Table 3. Comparison of representative plant height measurement methods in terms of methodology and applicability.
StudyMethodPlatformReference and AlignmentEfficiency and CostApplicability
Obanawa et al. [13]LiDAR/SfM geometry-basedUAVCanopy surface differencingHigh hardware costGrassland; not designed for dense leafy vegetables
Dong et al. [12]Spatio-temporal SfM/MVS (4D reconstructionUAVMulti-temporal 3D model alignmentLow sensor cost, high computationRow crops; sensitive to occlusion
Qiu et al. [18]RGB-D frame-level height extractionGroundVisible ground reference per frameLow cost, simple pipelineEarly growth stage only
Zhao et al. [20]Depth vision + instance segmentationGroundDepth-based bottom plane referenceLow cost, moderate computationLettuce; dense leafy vegetables (single stage)
Zhang et al. [22]CNN-based regression on depth imagesGroundLearned depth–height mappingLow sensor cost, high computationCereal crops; limited for leafy vegetables
Jiang et al. [23]RGB-D canopy profilingGroundCanopy surface relative to groundLow cost, moderate computationField crops; not optimized for dense canopies
This studyMulti-temporal RGB-D point cloud alignmentGroundPersistent reference ground + unified coordinate systemLow-cost sensor, moderate computationDense leafy vegetables; full growth cycle
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, Q.; Yuan, K.; Zhao, Z.; Luo, Y.; Shui, Y. Multi-Temporal Point Cloud Alignment for Accurate Height Estimation of Field-Grown Leafy Vegetables. Agriculture 2026, 16, 280. https://doi.org/10.3390/agriculture16020280

AMA Style

Wang Q, Yuan K, Zhao Z, Luo Y, Shui Y. Multi-Temporal Point Cloud Alignment for Accurate Height Estimation of Field-Grown Leafy Vegetables. Agriculture. 2026; 16(2):280. https://doi.org/10.3390/agriculture16020280

Chicago/Turabian Style

Wang, Qian, Kai Yuan, Zuoxi Zhao, Yangfan Luo, and Yuanqing Shui. 2026. "Multi-Temporal Point Cloud Alignment for Accurate Height Estimation of Field-Grown Leafy Vegetables" Agriculture 16, no. 2: 280. https://doi.org/10.3390/agriculture16020280

APA Style

Wang, Q., Yuan, K., Zhao, Z., Luo, Y., & Shui, Y. (2026). Multi-Temporal Point Cloud Alignment for Accurate Height Estimation of Field-Grown Leafy Vegetables. Agriculture, 16(2), 280. https://doi.org/10.3390/agriculture16020280

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop