1. Introduction
In the past, robots were mainly for increasing productivity and expanding production capacity under fast growing necessaries. But, in recent years, we are facing the necessity of developing a new concept for the introduction of robots into harvesting swiftlet nests. We understand that one of the new goals of the robots from social issues is to promote automated robots instead of labor increase. On the other hand, humans make mistakes when harvesting swiftlet nests. In this case, the decision to select a particular operation to be automated is made only by considering the technological possibility and economical desirability. Today, manual harvesting of swiftlet’s nests in the house causes many obstacles to the survival and reproduction of swiftlets. Moreover, manual harvesting completely depends on the experience of the harvester. The environment in the swiftlet farming area is quite toxic affecting human health, so the trends of automation in monitoring and harvesting of nests in the house is being researched and implemented. The edible bird nests are harvested from natural caves, and they are taken increasingly from purpose-built brick houses, which are designed and structured to attract swiftlets.
There are many species of avian that use its saliva grande as a component in nest buildings, but the swiftlet might be the extreme case where its nest is made entirely of the saliva. Out of numerous swiftlet species, namely the edible-nest swiftlet (
Aerodramus fuciphagus), the black-nest swiftlet (
Aerodramus maximus), the Atiu swiftlet (
Aerodramus sawtelli), and others, only some of them whose nests are edible belong to the family
Aerodramus fuciphagus. They frequently breed in the ceiling of caves or the edge of cliffs where they are not exposed to direct sunlight, but the majority of edible birds’ nests are now collected in artificial bird’s nest houses for higher productivity and lower hazard. The white translucent house’s nest is made of glutinous saliva solution that securely clings to vertical walls and is hardened over time. Nutritional studies [
1,
2] have shown EBN is mainly made of protein and carbohydrates with very low input of fat. Economically, EBN can fetch more than USD 2500 per kilogram, mostly produced in Southeast Asia, and the global bird’s nest industry is estimated to be worth USD 5 billion, mainly consumed in countries like China, Hong Kong, and Taiwan. The garnering of EBN in bird’s nest houses has yielded elevated income due to the nest’s expensive tag and reduced life-threatening risks related to cliff-climbing for farmers recently. However, manually harvesting the EBN, like most of the agricultural field, is repetitive, time-dependent, and labor-intensive, and can even have negative effects in maintaining the house bird’s population owing to the swiftlet’s dislikes of unfamiliar human odors. These difficulties can be resolved by an EBN harvesting robot since as proved by many real-world examples, utilizing a farming robot is faster, more accurate, and more efficient because of its inexhaustive and controllable characteristics.
Currently, in Southeast Asia, domestic swiftlet farming is being developed. Houses or factories for raising swiftlets are designed and built depending on investment costs and farming area. Like Indonesia [
3], most of the swiftlet houses built in Vietnam are brick houses built with wooden trusses for birds to nest (
Figure 1). The swiftlet house is built with a diverse scale with different number of floors from one to four floors. The minimum size of the nesting rooms is usually 8.0 × 16 m or 40 m. The height of each floor is usually 2 m, sometimes up to 2.5 to 3.0 m high. The formation of the nest begins with nesting and egg laying. The mother bird builds the nest by dripping saliva. When the nest is formed, the mother begins to lay eggs and then incubate. The chicks are incubated and hatched in the nest and cared for by their parents until they are ready to fly.
Since each agricultural product is harvested in a different environment, researchers have been developing solutions aimed at specific commodities. This is reasonable because each species has particular strength, size, and shape, which can make the method for one kind damaging for another. Although for many crops and fruits, agricultural robots are extensively studied and catered for, there is sparse research about harvesting robots that aim at EBN. Hence, it is of profound importance to develop a robotic system designed to automate the bird’s nest harvesting process of in-house environments [
4].
A robot consists of different sub-systems such as the mobile platform, manipulator, and end-effectors which are designed to match the task requirement and environment constraints. Each subsystem is reviewed from various literature, and this study’s automated system is presented thereafter. Smart agriculture is the use of smart technology applied to agriculture with the aim of saving labor and improving product quality and productivity. The caring for and raising of swiftlets for nesting is an agricultural profession developed in Vietnam. Therefore, the care, monitoring and harvesting should use AI with Machine learning to ensure sustainable development in agriculture.
A study [
5] makes a review of the design of manipulators used in harvesting robots, which shows that electric servo and stepper motors are mostly utilized when the weight load is light, whereas the custom electric actuator or hydraulic motor is used in the opposite situation. Additionally, [
5] also lists three requirements to consider when designing the manipulator, which are adaptable enough for the working environment, vigorous enough for the required forces, and sustainable enough against the environment’s condition. To correctly control the manipulator, kinematics and inverse kinematics models need to be calculated, so [
6] utilizes the Denavit–Hartenberg model for the path planning of cucumber harvesting. The end effector is the interaction part between the product and the robot, so its design needs to be delicate enough to not impair the product but powerful enough to extract the product from the main body. According to [
7], the fruit harvesting method commonly utilizes one made out of product grasping, stem grasping, rotation mechanism, cutting devices, and pressure suction as its main development. Grasping, suction, and rotation alone cannot be applied to EBN harvesting since the approach will easily break the nest due to the nest’s strong adhesion to the wooden cell of EBN house. The cutting mechanism, however, is harder to implement in fruit harvesting because good stem detection is required. Refs. [
8,
9] develop a thermal blades solution by creating a voltage between electrodes in which both uphold the gripper structures to stabilize the product before the cutting process. Meanwhile, reference [
10] chooses a high-speed mechanical rolling blade to cut off the pumpkin’s stem. Four-wheel vehicles are dominant in developing a harvesting robot [
11] because of their light weight and flexibility for dampish conditions. Likewise, track vehicles are less sensitive to the ground environment, so it can work well in swampy and wet ground quality [
12]. Due to more pressure on the ground, the crawler-type wheel has an advantage when working on rough terrain compared to that of the four-wheel [
13]. Six-wheel steering is also used when high maneuverability is demanded on unpredictable terrain. Additionally, ensuring that all wheels sustain contact with the ground in any circumstances is a crucial design requirement. Ref. [
14] designs a flexible platform frame that is slightly bendable when contacting with inclined ground. Many systems for fruit harvesting in a greenhouse use a rail mobile platform to travel due to the organized rows of crops, such as in [
12]; so the path is always determined and slipping is minimized compared to the wheel-type system. If the wheel-type or crawl-type wheel is used, path planning incorporated with landmark detection needs to be executed. As such, [
13] with crawl-type wheel uses the GNSS and LiDAR sensors to determine the location of ground turn and orchards row-turn respectively, and the robot’s path is tracked continuously. Path planning for a robot system working on farmland is developed in [
14]. Many systems in greenhouses use rail mobile platforms to travel due to the organized rows of crops, such as in [
15]. Therefore, the path is always determined and slipping is minimized compared to wheel-type system. If the wheel-type or crawl-type wheel is used, path planning incorporated with landmark detection needs to be executed. As such, [
16] with crawl-type wheels uses the GNSS and LiDAR sensors to determine the location of ground turn and orchards row-turn respectively, and the robot’s path is tracked continuously. Due to the flourishing of machine vision, a large number of harvesting solutions use vision-based methods. When combining color and geometric features of the target, fruits like tomato, apple, citrus, and lichi get correctly detected with a precision over 90% as mentioned in [
17] which provide a review of object recognition approach. Nevertheless, camera-based still faces difficulties in ambient light and various illumination conditions, so LiDAR-based is frequently selected as an alternative. Ref. [
18] applies cartographer algorithm in ROS for local SLAM and loop-back detection to LiDAR’s point cloud, and Publish Point function in ROS combined with Dijkstra algorithm is applied for global and local path planning. Path planning constrained by different limits for a robot system working on farmland is developed in [
19].
The swiftlet bird builds its nest from saliva strands, but only some of the swiftlet species’ nests are proved to be edible and harvested, which majorly comprises glycoprotein to which galactose, galactosamine, and other carbohydrates attach [
20]. Moreover, the nest cement has enough adhesive properties when dried up to hold against its weight and the bird’s egg. Specifically, the nest’s adhesive strength overall can load a vertical force of around 0.465
N considering the heaviest case scenario of two birds’, two eggs’, and the nest’s weights. Paper [
21] calculates the maximum stress in this scenario to be 0.56 MPa at the rim where the birds stand. However, there is a limited source for the adhesiveness of the nest to the wooden frame partly because the sample is normally analyzed in the laboratory. Therefore, the actual force to separate the nest in the harvesting site is not regenerated. When each component is brought together, a control unit fuses the data and decides the reaction of those peripherals. Ref. [
22] provides the general control system for apple harvesting robot, but the graph can be applied to most harvesting robots.
2. Swiftlet-Nest Harvesting Robot
This study introduces the design and planning of movement paths based on reinforcement deep learning to address the significant challenge of harvesting swiftlet nests in a combined environment of structured and unstructured spaces, ensuring that the robot can harvest targeted bird nests without colliding or interacting with neighboring nests. The recurrent neural network is applied to recollect and find the past state observed by the computer vision system and determine depth according to the gradient algorithm to yield the appropriate movement. The task of this study is to plan a collision-free path, which consists of a sequence of regression reference points, into the mixed environment after the position of the interested object is determined.
Figure 2 depicts the structure of the designed bird’s nest harvesting robot. It consists of a moving platform, vision system, and manipulator, which are the main parts of swiftlet-nest harvesting robotic system. The vision system and manipulator are both integrated and firmly fixed on the moving platform. The mobile platform is made of an aluminum platform with three wheels including two active wheels and a caster wheel moving according to differential drive. The moving platform, manipulator, and vision system are controlled by a central controller unit (CPU) that communicates with the computer via a universal asynchronous receiver–transceiver (UART). The moving platform carries the robot arm and vision system to the desired position under the control of the CPU. The robotic structure is designed to be suitable for the size of the birdhouse space as well as the workspace of the manipulator. The nests are mostly located on the walls of the swiftlet-house, so harvesting nests is a sequence of the tool’s movements to the position beneath the nest, followed by a lateral movement along the wall to perform the harvest. Thus, the required manipulator should have four degrees of freedom: three translational movements in cartesian space and one rotation about the vertical axis to be able to apply the tool’s force to the wall. To perform the harvesting function, a device, similar to a scraper, is used to separate the nest from the wall, which is considered as the end-effector of the manipulator. The bird’s nest separating tool resembles a shovel, while the cup mimics a human bird’s nest harvester. To expand the workspace of the manipulator, the tool is placed on a moving platform, effectively extending the operational space reach within the swiftlet-house. To perform the bird’s nest recognition, positioning, and navigating the robot, the camera system can translate in the vertical axis and is also mounted on the moving platform. Therefore, the field of view of the robot can be greatly expanded. Swiftlet’s nest has long been a culinary delight in many countries and tagged with a high price due to its supposed health benefits. Traditionally, they are harvested from natural cliffs, which is extremely dangerous and requires safety gears to collect high-built nests up in the cave’s ceiling. Therefore, man-made houses are constructed to duplicate the natural living condition of the swiftlet so that the bird is attracted to nest in this guarded space. The swiftlet first builds an anchor from its saliva gland that is hardened and adhered to the high wooden house’s frame when exposed to the air. Thereafter, it weaves the nest around the spot to form a semi-cup shape that firmly sticks to the frame along the nest’s boundary [
23]. To harvest bird nests, the collectors need to have a specialized knowledge of the suitable time and nest’s maturity level. Because of the nest’s persistent adhesiveness, the collectors usually use a thin scraping tool to gently detach the nest from the frame by delicately pushing and picking between the nest and its gluing surface. As such, the same method with a non-abrasive and fine tool is applied as the robot’s end effector to undisturbed applied force from the bottom of the nest. When there is sufficient force, the nest-frame adhesive bond is breakable, and the nest can be collected.
Due to the asynchronous formation of nests, not all of them can be harvested, which constitutes a significant challenge in this study. The selection of the candidate bird’s nest to harvest is based on its size and formation time to ensure quality and precise operation. The weight of the bird’s nest is very light, and it is very easy to break due to strong impact force. Therefore, careful attention, flexibility, and meticulousness must be given to ensure precise and accurate harvesting operations when employing a robot for this task. To harvest the bird’s nest, the end-effector, or a tool as simple as a blade, can separate the bird’s nest from the wooden frame of the house, safely assumed to be flat surfaces.
The separation process of harvested swiftlet nests from the surface of frames is completed by using the tool attaching manipulator. In order to analyze the process in detail, its two-dimensional modeling is presented in
Figure 3. In this idealized modeling the separating tool moves bottom to top along the swiftlet nest at a constant velocity V. The nest is removed from the frame by a tool based on the shearing of the nest’s adhesive layer continuously along the shear plane. The forces and power involved in separating operations is significantly important since power requirements must be known to enable the calculation of the manipulator’s dynamics with adequate power. Equally crucial, the separating forces are required without excessive distortion so that the nest’s shape is intact. The forces acting on the tool are shown in
Figure 3. The cutting force F
c acts in the direction of the removing speed V and supplies the energy required for the harvesting process. The thrust force
Ft acts in the direction normal to the separating velocity, that is, perpendicular to the frame. These two forces produce the resultant force R. The resultant force can be resolved into two components on the tool face: a friction force F along the tool-nest interface and a normal force N perpendicular to it. Another way is to resolve into the shear force
Fs along the shear plane and a normal shear force
Fs, n perpendicular to it. This paper assumes that the width of the adhesive layer before and after the shearing process is almost identical. Therefore, if the rake angle,
α, is between the tool’s surface and the perpendicular direction of the frame, then the shear angle,
β, is calculated by (1), and the shear force is determined by (2).
During the harvesting operation of the manipulator, the separating tool attaching to the manipulator can reach the swiftlet nests in its workspace. In this study, a manipulator named modified SCARA with four degrees of freedom was used as a harvesting manipulator. A modified SCARA manipulator is generally represented in the form of an open loop kinematics linkage. It includes four degrees of freedom presenting a multivariable mechanical system. The manipulator should copy the movements of the human arm harvesting the edible swiftlet’s bird in the house. The number of degrees of freedom of the arm is in the range of 6 ≥ n ≥ 3. The movement of two human arms holding the swiftlet separating tool for harvesting is quite flexible and has high DoFs. The number of DoFs is modeled, and each joint must have a reference system of axes that includes the z and x axes. There are several concepts, but the modified SCARA is selected with three rotating joints, and a translation one. We can model the coordinate system of the manipulator (
Figure 4) according to the rotation using the anatomy and representation of the coordinate system and the rotation joints in the human body. Each joint is given a set of coordinate axes.
Before a dynamic system is constructed, it is essential to carry out the analysis and synthesis on the mathematical model of the system. Like the concept presented in the previous section, it described an end-effector moving along a vertical wooden wall. The task space coordinate system is given in the
u and
v directions. In this case, the tool is moved along the surface in the
v direction and normal force is applied to the surface along the
u direction. The vector of task space is
x, the correlation defined the forward kinematics is defined as follows:
where
.
In this study, the coordinate systems for the joints and links of the 4-degrees-of-freedom manipulator follow the Denavit–Hartenberg (D–H) rule. Since the four axes of joints are parallel, the axes are assigned as shown in
Figure 4. During the bird’s nest harvesting operation of the robotic manipulator, the separating tool of manipulator can reach the nest locations so that it can be harvested in the workspace and perform harvesting operation in-sequence. With the configuration of the modified SCARA manipulator of 4 degrees of freedom, it can perform translational movements along the
x,
y,
z axes and rotation around the
z axis. The manipulator is generally presented in the form of an open loop kinematics. Theoretically, SCARA would be required to separate an object attaching the other one with a desired orientation to find the position of the end-effector based on the rotation angles of the manipulator’s joints and is a forward kinematics problem. The parameters of the D–H table are based on the poses of the coordinate attached to the joints. The coordinate transformation between two consecutive frames
i and frame
i + 1 is obtained as follows.
According to the parameters of the manipulator based on the D-H method, the forward kinematics equation of the manipulator is formed by multiplying the matrix of transformation matrices. The result of the forward kinematics equation is as below.
When the robot arm is in operation, the separating tool must be correctly controlled with the required speed of displacement and vertical upward movement. The impact forces and velocity of the tool are carefully calculated to ensure the quality of the harvested bird’s nest. After multiplying all the matrices together and simplifying the equation, the positions of the tool can be determined as follows:
For the task of the nest’s separation from wooden frames by the manipulator, the position controller gives a responding performance because the task only requires the robot to follow a predetermined required trajectory. However, the manipulator contacts with the environment, so there are the interaction forces between the separating tool mounted on the end-effector and the environment, so the interaction force and position of the separation tool need to be controlled. To separate the nests for harvesting, the position of the tool should be controlled. That is, insufficient press can result in adhesion that is not easily separable, while excessive pressure can lead to the nest’s breakage. Where
ϕ(
q) is forward kinematics, the derivative of x is defined.
The relationship between the speed of the manipulator’s end-effector and velocities of angular and linear joints can expressed with the following equation by Jacobian matrix.
In the study, the manipulator which uses the angular and linear joints can be shown.
where
Ji is the
ith column of Jacobian matrix,
p is the position vector of the origin in the coordinate system of the manipulator relative to the coordinate system {
i} in the base coordinate system. The
zi denotes the unit vector of the
z-axis of the coordinate system {
i} in the base coordinate system. The dynamic model of the manipulator with 4 degrees of freedom, rigid, with no friction at the joints is shown below.
The state vector
is modified as follows:
The Jacobian matrix in
n ×
n task space is defined as follows:
where,
I is the identity matrix and
T is transformation matrix.
In addition, we can determine the Jacobian matrix with four variables as joints. From the positions and orientation obtained, the relation between the velocities is shown in below.
And the Jacobian matrix is determined as follows:
where
is called the analytical Jacobian matrix, and the relation of acceleration also called the differential kinematics is as follows:
The inverse kinematics model of the harvesting manipulator allows for obtaining the joint positions
q in terms of the positions and orientation of tool attaching on the end-effector in the reference frame. The inverse kinematics is below:
where
and
.
The derivation of the inverse kinematics model is very complex and contrasts with the forward kinematics. The angular and linear accelerations and velocity of joints are determined according to recursive relationships. For each joint, inertial forces and the moments are calculated, and equations of kinematics are established. The configuration of the separating tool as the end-effector is described by a position and an orientation.
In order to reduce the impact force on the manipulator during operation as well as improve movement’s stability, it is required that the trajectory of the end-effector and active joints of the manipulator have to be continuous and smooth. Therefore it should not have any sudden change in velocity and acceleration. When the separating tool harvests the nests, the path of the end-effector is optimally designed based on the information provided by the LiDAR system. The LiDAR collects the point clouds and then calibrates the position and orientation to give a suitable map. Information about the nesting parameters is located in the three-dimensional coordinates of swiftlet nests in workspace. After completing harvesting for each nest, the robot system returns to its initial position. The operation of harvesting the swiftlet nest is divided into three stages as follows. First, the separating tool is moved to the target (below the nest). Secondly, it is moved along the vertical axis from bottom to top. Finally, the tool is moved back to its original position. At each stage in the harvesting process, path planning is performed based on quintic polynomial interpolation. To ensure that harvesting does not affect the quality of the bird’s nest, the speed of the tool must be controlled. The transport processes play an essential role at automated swiftlet nest harvesting, especially in moving the manipulator to suitable position determined by LiDAR’s data, when the mobile platform is to be transported after finishing one operation to the next to be performed. The mobile platform moves along the stationary wall in the house facilities’ handling of individual harvesting based on mapping from LiDAR. Works provided with a permanent internal transportation system are consequently forced to produce their products in long series and are so unable to adapt flexibly, without a cost increase, to the requirements of customers. The mobile robot has a virtual tracking system which conveys information as robot deviation from the direction and position given by the map to the control system.
In this study, a wheel mobile robot on the ground is used, its wheels move by only rotation and no slip in driving and lateral directions. There are two concepts for designing the mobile platform as differential and tracking drive. The differential drive is simpler than the other one. The differential drive is always in contact with the ground by three contact points. The drive has the advantages over the tracking drive due to larger contact surface between its wheels and ground, assuming no wheel slipping when changing direction. Therefore, a tracking drive is often used for rough ground with big obstacles which the robot cannot overcome. We need to integrate the robot’s movements over time to determine its position accurately. Additionally, this research section considers the dynamics of the mobile robot system to address the challenges of controlling the robot’s motion in an environment that involves impact forces. Even though wheel slip is a factor that reduces the accuracy of evaluating and estimating the robot’s motion, when traveling at low speed, the effect of wheel slip relative to the road surface can be ignored. Wheeled robots move along the
x-axis and the robot cannot move along the
y-axis, and the position of the robot can be determined as the midpoint between the two active wheels. When the robot is moving, we will assume rolling without slipping on flat ground at low speed (5 km/h); therefore, the force of inertia in all directions is less than the force of friction. Under the differential drive system, the motion of the vehicle is determined by the control of wheel velocities. In order for both drive wheels to roll properly, the robot must rotate around a point situated on the common axis of the two wheels. By adjusting the relative velocity between the two wheels, the location of this rotation point can be changed, allowing for different trajectories of the vehicle. From the forward kinematics, it is possible to calculate the pose of the robot at any time
t based on the control of the velocity of the left wheel and the right wheel. In general, the differential drive robot’s ability for moving in a specific direction
θ(
t) at a certain velocity is governed by the following equations.
Therefore, the angular and linear velocity is calculated in the equation below:
where linear and angular velocity are
V,
ω of the robot.
The dynamics of the robot are determined by the non-holonomic constraint. In this study, a differential drive mobile robot is the approached object. The robot’s center of gravity is placed in the midpoint between two wheels. Each wheel is controlled independently by the motor and the castor wheel has the ability to keep the robot’s balance while it is moving on the flat floor. Our robot is shown in (
Figure 5) with 2
b as the distance between two wheels,
R is the radius of each wheel,
θ is the angular between x-axis and robot orientation. Assume that the robot does not slide on any axis. From this assumption, the robot is described below.
The equation of dynamics is written by the following:
where
M(
q) is the inertial matrix,
AT is the Jacobi matrix,
B(
q) is the transpose matrix, τ is the moment, λ is the force vector,
is the matrix containing radial and rotational components,
G(
q) is the gravity matrix, τ
d is the disturbance elements, and
q is the status vector.
Scanning System Based on LiDAR
Dissimilar to the camera, LiDAR is not susceptible to dim or dark light conditions in the bird’s nest house, so it is employed in this study. A 3D map shown in
Figure 6 is created from mounding distinct height levels of 2D layers that are generated from LiDAR’s point cloud for the vision capabilities of the robot. Since the data can be obstructed by both the environment’s condition and its calibration, which introduce noise, outlier points are removed by counting the number of the surrounding points and comparing it to a limit. As each 2D layer of the wooden frame is obtained, the line segment algorithm is applied to detect the frame portion between two nests, and the nest intersection is the complement of the frame line. Consequently, a 3D segment of the bird’s nest is acquired, and the nest’s position can be interpolated for harvesting process.
Line segmenting from the point cloud algorithm has been researched for many years with different approaches, so utilizing which method is a matter of choice regarding complexity and accuracy. Reference [
22] reviews multiple research studies about line extraction from 2D sets of points and compares those algorithms on the same scanned LiDAR data. The incremental algorithm idea, as mentioned in [
23], is used in this study to extract points that are classified as belonging to a line. The pseudo code for the algorithm is as follows:
- -
An initial
1…n number of points is fit to a line, added to current set, and the RMSE is calculated by (8) and compared whether it is less than a user-specified threshold. In this study, the threshold is chosen based on the standard deviation of the statistical error of the LiDAR points.
- -
If 1. is true, the next point is added to the current set, and the change in RMSE is recalculated to test whether it belongs to the current set.
- -
If next point cannot be added, either the length of current set needs to be larger than a threshold to be considered a line segment, or the current set is discarded.
- -
If 1. is false, points 2…n + 1 are used instead of 1…n and step 1. is repeated.