1. Introduction
Sugarcane, an important economic crop, is widely cultivated in regions such as Guangxi and Yunnan in China [
1]. However, these areas are mostly hilly, and sugarcane planting is scattered and uneven, which makes sugarcane harvesters prone to blockages in the conveyor system when the feeding amount exceeds its maximum capacity during operation [
2]. The conveying capacity of the system is related to the rotational speed of the rollers in the conveying system [
3]. When the roller speed is too slow, a large amount of sugarcane remains in the conveying channel and cannot be discharged in time, which is likely to cause blockage [
4]; on the other hand, when the roller speed is too fast, it leads to an increase in energy consumption costs. However, the roller speeds are generally fixed, making it difficult to adjust them promptly. Therefore, there is an urgent need to explore efficient control methods for sugarcane harvesters, which is crucial for improving both harvesting quality and efficiency.
The selection of roller speeds is closely linked to changes in the feeding amount. Currently, research on feeding amount detection methods can be categorized into those based on torque, pressure, and power [
5,
6,
7,
8]. Jiang et al. [
9] performed decomposition, denoising, and reconstruction of the torque signal through wavelet transform analysis of the noise frequency range and established the fitting relationship between torque and feeding amount. Lu et al. [
10] and Jie et al. [
11] established a linear relationship between pressure and feeding volume by measuring the pressure on the closed hydraulic oil system or the inclined conveyor at the outlet using sensors. However, due to the complex terrain in field operations, the inclination of the combine harvester causes significant measurement errors in the pressure sensor. Sun et al. [
12] developed a mathematical model relating the power of the harvester’s drive shaft to the feeding amount by detecting the power of the harvester’s drive shaft, thereby enabling rapid detection of the feeding amount. However, these feeding amount detection techniques can only indirectly reflect changes in the feeding amount and require the establishment of a strict correlation with the feeding quantity.
In recent years, the rapid development of machine vision technologies has provided new solutions for feeding amount detection. Sadeghi-Tehran et al. [
13], Wang et al. [
14], and Xiong et al. [
15] demonstrated the feasibility of segmenting and counting crops. Dandrifosse et al. [
16], based on the YOLOv5 model and the DeepMAC segmentation method, used two RGB cameras to convert the number of wheat spikes in the images into spike density, further estimating the wheat feeding amount. Chen et al. [
17] predicted the feeding amount by extracting the pixel values of the rice in front. However, research on these feeding amount detection technologies has mainly focused on small-grain crops such as corn, wheat, and rice [
18,
19,
20,
21]. This approach has certain limitations when applied to sugarcane, where the top leaves are messy and easily obscure the main body of the crop, so it is difficult to accurately obtain the sugarcane density from above.
In contrast, calculating the sugarcane volume from 3D point cloud data and using it as the feeding amount standard seems feasible. Yan et al. [
22] used LiDAR to perform 3D scanning of canopy features and created a 3D point cloud. They proposed a slicing convex hull method for volume estimation, summing the volumes of each slice to obtain the total canopy volume. Experimental results showed that this method’s estimates were close to the real values. Mirbod et al. [
23] used two models (Faster R-CNN and Mask R-CNN) to segment the shape of apples and calculated their surface area and diameter using depth information obtained from a stereo camera. Qi et al. [
24] used YOLOv8-seg to obtain masks and bounding boxes for the buds and stems and combined RGB-D data to estimate the diameter of tomato stems. Previous studies have significantly contributed to the advancement of feeding amount detection technology. Unfortunately, there has been no research on feeding amount detection for sugarcane harvesters. Therefore, to address the blockage problem caused by the mismatch between the conveying capacity and feeding amount in sugarcane harvesters, this study adopts sugarcane volume as a measure of the feeding amount. The YOLOv8 instance segmentation model is used to extract the sugarcane mask, which is then combined with depth data obtained from a binocular camera to extract the 3D point cloud of sugarcane, thereby enabling the detection of the feeding volume. Additionally, by exploring the relationship between feeding volume and roller speeds, a PLC control system based on volume detection was designed. Through volume detection experiments and control performance tests, it was evaluated whether the system could accurately detect the sugarcane feeding volume and adjust the roller speed based on the feed rate to improve blockage conditions.
4. Discussion
The research results indicate that the system alleviates the conveyor blockage issue of the sugarcane harvester to some extent but still has potential for further optimization. The core of this system lies in the accurate detection of sugarcane volume. During the experiment, it was found that the stacking degree of the sugarcane and the obstruction caused by cane leaves severely affected the collection of point cloud data, leading to disordered or missing point cloud information in certain areas. To alleviate this situation, Gao et al. [
30] used two depth cameras instead of a binocular camera as point cloud acquisition devices to obtain RGB-D data. They performed a rough registration between the measured point cloud and the template point cloud to align the point clouds, which improved some defects in the point cloud data. Alternatively, Sarmad et al. [
31] applied reinforcement learning to construct a correspondence between the global feature vector of partial point clouds and the input random noise vector of GAN to complement point clouds. Therefore, multiple sensors can be considered to capture the point cloud information of sugarcane from different angles, or generative adversarial networks (GANs) can be used to supplement the missing point cloud data, thereby improving the loss of point cloud information caused by the occlusion or stacking of sugarcane leaves.
This system’s ability to significantly reduce the probability of blockage lies in the accurate relationship established between volume and speed. However, it also limits the roller speed to three fixed levels, which restricts the flexibility in adjusting the system’s operation. In addition, another key factor contributing to the significant reduction in the blockage rate is the selection of an appropriate initial speed. Since the system cannot be adjusted before the first feeding, it is necessary to collect volume data during the first feeding process. However, when the feeding amount varies significantly, the first feeding process may lead to blockage. The main cause of these issues lies in the discrete threshold-based control strategy designed in this study. The system only responds when the sugarcane feeding amount exceeds a preset threshold, and the response speed of this control strategy is relatively slow. In future research, consideration can be given to combining fuzzy logic control with PID control, dynamically adjusting based on the feeding amount and roller speeds and improving the system’s response speed.
These two aspects severely limit the application of machine vision in sugarcane harvesters and should be given high attention in future research.
5. Conclusions
This study proposes a machine vision-based conveying control approach for sugarcane harvesters to reduce blockage occurrences in the conveying channel. Firstly, an improved I-YOLOv8-Seg instance segmentation model is fused with depth data to rapidly extract 3D sugarcane point clouds from complex backgrounds. Subsequently, the slicing method was established for calculating the volume of sugarcane point clouds. Finally, the optimal roller speeds for different feeding volumes were determined through experiments, leading to the development of a control strategy based on feeding volume detection. Experimental results show a detection accuracy of 91.28% even under maximum feeding rates, and the maximum blockage rate is reduced from 17.76% to 4.32%. This approach ensures continuous harvester operation, improves harvesting efficiency and quality, and results in substantial time and labor cost savings.