Robotics · Article · Open Access · 27 April 2021

Prototype Development of Small Mobile Robots for Mallard Navigation in Paddy Fields: Toward Realizing Remote Farming †

1 Faculty of Software and Information Science, Iwate Prefectural University, Takizawa City 020-0693, Japan
2 Faculty of Systems Science and Technology, Akita Prefectural University, Yurihonjo City 015-0055, Japan
3 Faculty of Bioresource Sciences, Akita Prefectural University, Akita City 010-0195, Japan
4 Institute of Engineering Innovation, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan
This article belongs to the Special Issue Advances in Agriculture and Forest Robotics

Abstract

This study was conducted to develop three robot prototypes that navigate mallards to achieve highly efficient rice-duck farming. We examined two robot navigation approaches based on imprinting and feeding. For the first approach, we applied imprinting to baby mallards; after imprinting, they exhibited following behavior toward our first prototype. Experimentally obtained observation results revealed the importance of applying imprinting immediately after hatching: an attempt made one week after hatching failed. For the second approach, we placed feed on top of our second prototype. Experimentally obtained results showed that adult mallards exhibited wariness not only toward the robot but also toward the feeder. After more than one week of habituation to relieve this wariness, the adult mallards ate feed from the box on the robot; however, they ran away immediately at the slightest movement of the robot. Based on these findings, we developed the third prototype as an autonomous mobile robot intended for mallard navigation in a paddy field. Its body is narrower than the spacing between rice stalks. After checking the waterproof capability of the body waterproof box, we conducted an indoor driving test under manual operation. Moreover, we conducted outdoor evaluation tests of running on an actual paddy field. We developed indoor and outdoor image datasets using an onboard monocular camera. For the outdoor image datasets, our segmentation method based on SegNet achieved semantic segmentation for three semantic categories. For the indoor image datasets, our prediction method based on a CNN and LSTM achieved visual prediction for three motion categories.

1. Introduction

With the evolutionary development of machines and information technologies represented by robots, artificial intelligence (AI), and the internet of things (IoT), precision agriculture and smart farming have attracted attention in recent years [1,2,3,4,5,6,7]. Herein, we propose a novel agricultural concept, remote farming, as a fundamental technology for actualizing smart farming. Remote farming emphasizes not only the use of information and communications technology (ICT) to improve production efficiency and reduce costs, but also remote robot operation and remote field monitoring. For this study, we intend to develop a small robot that moves autonomously along paths between rice paddies and automatically monitors crop growth, disease infection, and natural-enemy attacks.
Existing large agricultural machines demand great effort to ensure safety during locomotion [8]. By contrast, small farming robots can accomplish proximity crop monitoring and various tasks in complex, narrow areas that are difficult for large machines to negotiate. Moreover, a safe small-robot system reduces damage from collisions with grown crops and with animals kept as livestock or work animals [9]. Nevertheless, smaller robots impose constraints on the devices and sensors that can be mounted. For this study, we used a monocular camera to recognize the surroundings so that our robot achieves autonomous locomotion as part of a monocular vision-based navigation system [10].
Laser imaging detection and ranging (LiDAR) sensors offer great potential in agriculture for measuring crop volumes and structural parameters [11]. Nevertheless, our robot uses no LiDAR sensor, to reduce the number of parts mounted on the body, the total cost, and the system complexity. Moreover, the field of view (FOV) of LiDAR is narrower than that of a normal camera. The rapid development of image-processing methods enhanced with deep-learning (DL) techniques [12] has been accelerated by open-source code libraries released on numerous websites [13]. Therefore, various technologies and approaches that elucidate objects, scenes, and their surroundings for autonomous locomotion can be applied to images obtained using an ordinarily available monocular camera [14].
We consider that time-series FOV images obtained from a frontal monocular camera while moving between rice paddies provide consistent image features because rice plants line up on both sides of the route. Moreover, we consider that machine learning (ML) algorithms are beneficial for processing these image features to accomplish the functions necessary for autonomous locomotion. However, the soil state of a paddy field changes considerably with the growth of the rice plants. In a typical Japanese rice crop, a paddy field is flooded to approximately 300 mm depth after rice planting in spring. The water depth then decreases as the rice grows in summer, and the water is completely drained from the field for harvesting in autumn.
In a paddy field, soil conditions change with the growth of the rice plants, the season, regional climate characteristics, wind direction and speed, and solar radiation. One important challenge for an autonomous mobile robot in a paddy field is therefore to maintain robustness and adaptability to environmental changes. We assume that incremental learning on field locomotion images based on ML algorithms has the potential to improve generalization capability. This study was conducted to develop three robot prototypes, depicted in Figure 1, that move among rice plants, and to implement and evaluate our proposed method based on time-series learning algorithms for vision-based autonomous locomotion.
Figure 1. Exterior photographs of the respective robot prototypes that move among rice plants for mallard navigation.
The contributions of this paper are as follows:
  • investigation of the possibility of using small robots to actualize highly efficient rice-duck farming,
  • verification of useful methods to navigate mallards using robot prototypes based on imprinting and feeding,
  • construction of image datasets of locomotion among rice paddies obtained using an onboard monocular camera,
  • and demonstration of segmentation and prediction based on deep-learning methods.
This paper is structured as follows. In Section 2, we briefly review related studies of existing rice-duck farming robots and navigation methods. Section 3 presents the first prototype, depicted in Figure 1a, used for imprinting-based navigation, together with the basic experimental results obtained with baby mallards. Subsequently, Section 4 presents the second prototype, depicted in Figure 1b, used for feeding-based navigation, together with the basic experimental results obtained with adult mallards. Section 5 and Section 6 respectively present the design of the third prototype, depicted in Figure 1c, as a fully customized mallard navigation robot, and the basic results of locomotion experiments in an actual paddy field based on visual navigation. Finally, Section 7 concludes and highlights future work. We previously proposed the basic method and our originally developed sensors in two conference proceedings [15,16]; this paper describes detailed results and discussion of the whole system.

3. Imprinting-Based Navigation

3.1. Experiment Setup

For this experiment, we attempted imprinting of baby mallards after hatching. We obtained eggs from a mallard farm, as depicted in Figure 3. The middle panel of Figure 3 depicts the inside of an aviary used for laying.
Figure 3. Mallard farm (left), inside of aviary (middle), and mallard eggs (right). We obtained eggs from this mallard farm.
The left panel of Figure 4 depicts the mallard eggs and the incubator (MX-20; Autoelex Co. Ltd., Kimhae, Korea) used for hatching. We divided 18 eggs into four groups to stagger the hatching dates at two-day intervals. We set the temperature and humidity in the incubator to 37.5 °C and 45.0%, respectively. The bottom plate of the incubator includes a slider that rolls the eggs periodically; we set the rolling frequency to one-hour intervals.
Figure 4. Mallard eggs in incubator (left), baby mallards after hatching (middle), and after drying of feathers in breeding cage (right).
The rolling was stopped approximately one week before the predicted hatching dates; simultaneously, the humidity was increased to 65%. The middle panel of Figure 4 depicts a baby mallard immediately after hatching. We kept the baby mallards in the incubator until their feathers were dry. After drying, we moved them to a breeding cage, as depicted in the right panel of Figure 4, and kept them warm using an electric heater.
The upper panel of Figure 1a depicts the 3D mallard model used for imprinting. After creating the model from acrylonitrile butadiene styrene using a 3D printer, we mounted it on the base of the first prototype, as depicted in the bottom panel of Figure 1a.

3.2. Experiment Results

We applied imprinting sequentially to three baby mallards, one at a time. The robot was moved manually in the cage for 3 h in front of each baby mallard. While observing the baby mallard, we moved the robot to lead it with random behavior patterns. After imprinting, the first baby mallard attempted to follow the robot. Subsequently, we observed that it changed its calling patterns according to the robot movements. The baby mallard moved near the electric heater when the robot stopped.
We applied imprinting to the second baby mallard, which hatched two days later, with similar stimulation patterns and procedures. Although the second baby mallard followed the robot, the first two mallards were also interested in each other; our observations suggest that their interest in each other was stronger than their interest in the robot. Therefore, we placed a partition in the cage to separate them. At first, they attempted to cross the partition. We showed the robot to each baby mallard in its separated zone for enhanced imprinting, and they continued the following behavior toward the robot. Subsequently, we added the third baby mallard to the cage after installing a partition. The imprinting procedure for the third baby mallard was similar to that used for the former two.
To observe the following behavior, we moved the three baby mallards and the robot from the cage to a test field of 2.0 m². Figure 5 depicts time-series images of this experiment. The four sides of the test field were surrounded by wooden boards. As intended after imprinting, the baby mallards followed the robot while it moved slowly, and they continued following it after the velocity was increased.
Figure 5. Following behavior displayed after imprinting. The baby mallards followed the robot as it moved slowly.
Imprinting among the baby mallards themselves occurred when they were kept together in the cage after hatching because they moved around together frequently. Based on the experiments on baby hybrid ducks conducted by Yamada et al. [25,26], we consider that separating the baby mallards enhanced imprinting on the robot because each spent more time alone with the robot. Moreover, imprinting individuals separately appeared effective because the calling volume of the separated baby mallards was greater than that of the gathered baby mallards.
We also attempted imprinting on baby mallards at one week after hatching using a similar procedure with the robot. We confirmed that these baby mallards falsely recognized the robot as an enemy; moreover, we observed that they avoided the robot with loud calls. Mallards reach the peak of the sensitive period approximately 15 h after hatching, after which the imprinting effect is reduced considerably. Wariness toward the robot arose in baby mallards more than one week after hatching because their imprinting sensitivity within the so-called critical period is far weaker than that of baby mallards several hours after hatching. The experimentally obtained results reveal that imprinting-based navigation is possible for baby mallards if imprinting is applied before the critical period ends.

4. Feeding-Based Navigation

4.1. Experiment Setup

For this experiment, feeding-based navigation tests were conducted with adult mallards using our second prototype. We used an off-the-shelf snow vehicle model (Blizzard FR1/12EP Belt Vehicle; Kyosho Corp., Tokyo, Japan), as depicted in Figure 1b. The body is 488 mm long, 320 mm wide, and 220 mm high. At the rear of the robot body, we installed a feeder and a water-protected monocular camera (CS-QR300; Planex Communications Inc., Tokyo, Japan). A 1/3 inch CMOS sensor (IMX225; Sony Corp.) is used as the imaging device. The video resolution is 1280 × 720 pixels at 15 fps.
For this experiment, we used nine adult mallards (six months old) without imprinting. The aviary used for this experiment is depicted in Figure 6. We conducted feed-learning experiments so that the mallards would not only become accustomed to the robot while eating, but would also follow the robot out of a desire for feed. To relieve their wariness, we changed the experimental conditions across 10 configurations in terms of the positional combination of the feeder and the robot, the feed type, and sound stimulation. Mainly, we used compound feed comprising corn, soybean, milo, bran, press cake, and fish flour. Table 1 lists the experimental conditions, including dates and hours. The positions of the robot and the feeder are depicted in Figure 7.
Figure 6. Exterior and inside of aviary.
Table 1. Experiment conditions for feed-based mallard navigation. The positions of the robot and the feeder are depicted in Figure 7.
Figure 7. Layout and assignment of robot and feeder.

4.2. Experiment Results

For the first experiment, we placed the robot and the feeder at P1. We played a sound clip of calling that had been recorded in advance using a microphone. Figure 8 depicts images obtained using the camera on the robot. The mallards gathered at a corner of the aviary. They did not attempt to approach the robot, presumably because they were alert. We observed that they seemed to watch the robot while keeping their distance. Note that we had given no feed to the mallards since the morning of the experiment day. Although they were hungry, they did not approach the feed because of their wariness. Compared with curious baby mallards, adult mallards were more vigilant toward anything they saw for the first time. We consider that this fear of natural enemies had been acquired by the adult mallards and serves them in their work in paddy fields.
Figure 8. Behavior and response patterns in the first experiment. The mallards gathered at a corner of the aviary and did not attempt to approach the robot, presumably because they were alert.
One week later, we conducted the second experiment. We had given the mallards no feed during the previous two days. The robot and the feeder were placed at the same position, and the same sound clip was played. As additional feed, we gave them cabbage and broccoli, which are vegetables preferred by mallards. The mallards gathered at a corner of the aviary and did not approach the feeder on the robot. After a few minutes, we observed that their overt wariness had disappeared: two mallards watched the robot from the pond. However, they did not approach the robot.
For the third experiment, the robot position was changed to P2, near the entrance, whereas the feeder remained at P1. We expected that the mallards would eat the feed irrespective of their wariness of the robot because the robot and the feeder were now located separately. However, they did not approach the feeder.
For the fourth experiment, the feeder position was changed to P3, whereas the robot remained at P2. Moreover, we changed the feed to rice, which has small granularity and which mallards prefer to compound feed. However, they did not approach the rice.
For the fifth experiment, we removed the robot, whereas the feeder remained at P3. Although the robot was no longer in their view, they did not approach the feeder.
For the sixth experiment, we placed the robot at P4, whereas the feeder remained at P3. During the previous five experiments, the mallards had frequently gathered at P4 and its surrounding area. In this experiment, the mallards gathered at a position between P3 and P4. In particular, they moved near P3 when the robot moved to P4. However, they did not eat the feed.
The seventh experiment was conducted eight days after the sixth. We set this interval so that the mallards could become accustomed to the feeder. The robot and the feeder were placed at P3. A one-hour observation demonstrated that the mallards did not eat the feed.
Before the eighth experiment, we left the robot in the aviary for two days. The robot and the feeder were placed at P3, the same position as in the previous experiment. Figure 9 depicts images obtained from the camera on the robot. The mallards swam in the pond with relieved wariness, as depicted in the upper two images. Twenty minutes later, one mallard approached the feed on the robot, as depicted in the middle and right images of Figure 9. Other mallards then gathered around the robot to eat the feed. We moved the robot slightly while they were eating, and they ran away immediately. Notably, they approached the robot from the rear. We consider that this behavior derives from an instinct to escape rapidly from dangerous situations.
Figure 9. Behavior and response patterns in the eighth experiment. The mallards swam in the pond with relieved wariness, as depicted in the upper two images. Twenty minutes later, one mallard approached the feed on the robot.
After stopping the robot, we waited for the mallards to approach again. Fifteen minutes later, another mallard approached and gradually shortened its distance to the feeder while other mallards were eating. We moved the robot forward slightly again. Although the mallards fled all at once, the distance they fled was shorter than the distance the robot had moved; apparently, their wariness had been relieved. Although a large open area lay behind the robot, the mallards gathered to eat the feed from a narrow region directly behind it.

4.3. Discussion

Results obtained from the eight experiments indicate that adult mallards have strong wariness. We demonstrated that at least two days are required for them to become accustomed to the robot and the feeder. Moreover, they do not always eat the feed; maintaining navigation using feed alone is expected to be difficult because mallards eat only several times a day. In daily husbandry, a breeder gives feed with the same behavior from the same place every day, and we infer that the mallards recognize feeding in combination with the breeder's behavior patterns. In this experiment, the feeder was placed in the aviary with the feed already loaded. We infer that reproducing the familiar behavior of pouring feed into an empty feeder mounted on the robot would contribute to relaxing the mallards' wariness. Moreover, we infer that their wariness of the robot's movements would be relieved if they recognized it as a harmless object.

5. Mallard Navigation Robot

Based on the above two experiments, we developed our third prototype as a fully customized mobile robot for mallard navigation. To confirm its mobility performance in a paddy field, we conducted a buoyancy evaluation and locomotion tests in indoor and outdoor environments. For the present experiment, we did not attach the motor waterproof boxes.

5.1. Overall Design and Specifications

For locomotion between rice plants, the restriction on the body size is strict because contact between the robot and the rice stems must be avoided. Figure 10 depicts the initial design of our third prototype, which satisfies this specification. The body is 270 mm long, 210 mm wide, and 250 mm high. We identified one point of difficulty regarding the strength of the driving unit. Therefore, we searched for a consumer product of a similar size as an alternative driving unit, with consideration of the strength of the whole body.
Figure 10. Initial design of the third prototype.
We selected an off-the-shelf tracked chassis with a suspension system (TS100-MT) as our alternative platform. Figure 11 depicts the exterior of this platform, which is made of aluminum.
Figure 11. Exterior of the driving platform, an off-the-shelf tracked chassis with a suspension system.

5.2. Waterproof Design

To actualize remote control as a part of autonomous locomotion, the robot carries a single-board computer, motors, sensors, and a battery. We designed two types of waterproof boxes: one to protect the motors inside the crawlers and one to protect the other electrical devices from water.
Figure 10c depicts the design of the body waterproof box, which contains four pillars forming a vertical sliding mechanism that lets the box float on water to prevent immersion. We molded the boxes from acrylonitrile butadiene styrene (ABS) resin using a 3D printer. The body waterproof box is 210 mm long, 163 mm wide, and 120 mm high. At a 50 mm draft, the buoyancy obtained from the box size is 1.68 kg, which is sufficient after loading the single-board computer and batteries.
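As a rough check, the supportable load at a 50 mm draft can be estimated from the volume of water displaced by the outer footprint of the box; the short sketch below performs this arithmetic. Under these simplified assumptions the result is approximately 1.71 kg, close to the 1.68 kg figure above, which presumably reflects the exact hull geometry.

```python
# Rough buoyancy check for the body waterproof box at a 50 mm draft.
# Assumes the outer footprint (210 mm x 163 mm) displaces fresh water uniformly;
# the 1.68 kg value in the text presumably accounts for the exact hull shape.
WATER_DENSITY = 1000.0  # kg/m^3

length, width, draft = 0.210, 0.163, 0.050    # m
displaced_volume = length * width * draft     # m^3
buoyancy_kg = WATER_DENSITY * displaced_volume

print(f"Displaced volume: {displaced_volume * 1e3:.2f} L")
print(f"Supported load:   {buoyancy_kg:.2f} kg")   # ~1.71 kg with these assumptions
```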

5.3. Assembly

We assembled the TS100-MT with reference to the manufacturer's illustrated instruction manual. Although the attached parts that make up the drive wheels had a different shape from the parts described in the manual, we judged them to be functionally equivalent for assembly. The assembly time for one person was approximately four hours. After molding each part of the body waterproof box using a 3D printer, we mounted the box to the top plate of the TS100-MT using bolts and nuts.
Figure 1c depicts the robot after installing all devices. The input voltages of the single-board computer and the motors are 5 V and 9 V, respectively. Although the best solution for supplying electric power would be a converter that provides these two voltages from a single 12 V battery, we used a 5 V battery for the single-board computer and the motor driver and a 9 V battery for the motors.

5.4. System Configuration

Figure 12a depicts the single-board computer (Raspberry Pi 3 Model B; Raspberry Pi Foundation) used for our system. We selected it for two reasons. First, this single-board computer provides sufficient peripherals in terms of a camera interface, multiple USB ports, and pins used as a sensor interface; for actualizing autonomous locomotion, the robot is controlled using sensors attached to these ports. Second, this single-board computer accommodates a Wi-Fi module for wireless communication. We used a Wi-Fi router (WXR-2533DHP; Buffalo Inc., Nagoya City, Japan) for actualizing remote control and robot status recognition.
Figure 12. Electrical devices.
Figure 12b depicts the motor driver (BTS7960; Infineon Technologies AG, Neubiberg, Germany) used to control the two DC motors. We selected this motor driver for three reasons. First, it accommodates one set of input ports and two sets of output ports, which makes motor control easy and simple; we used two motors to drive the respective crawlers, and both are controlled by a single motor driver board. Second, it copes with a wide range of input voltages, from 4.8 V to 35.0 V, which is suitable for covering the variation of battery voltage in actual applications. Third, it includes a heat-control function combined with a heat sink; for long-term operation in a paddy field over many hours, the risk of electric device failure from heating is otherwise unavoidable.
Figure 12c depicts the pins of the motor driver board. For motor control using pulse width modulation (PWM), the single-board computer transmits a duty ratio that determines the rotation speed and control signals that change the rotation direction. The power pins are wired to the 5.0 V power supply and ground pins; the other pins are used as general-purpose input and output (GPIO) pins. In particular, P2, P4, P35/P38, and P37/P40 are mapped, respectively, to the 5 V power supply, the ground, rotation direction control for the right/left motor, and rotation speed control for the right/left motor. The PWM duty ratio is set to 1.0 for motor rotation and 0.0 for a stop. The rotation direction is switched between positive and negative rotation.
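The sketch below illustrates this pin mapping using the RPi.GPIO library with physical (BOARD) pin numbering and software PWM. It is an illustration only, not the robot's published control code; the PWM frequency and the HIGH-means-forward convention are assumptions, and RPi.GPIO expresses the duty cycle in percent, so a duty ratio of 1.0 corresponds to 100.

```python
# Minimal motor-control sketch following the pin mapping described in the text
# (P35/P38: rotation direction for the right/left motor; P37/P40: speed via PWM).
import RPi.GPIO as GPIO

DIR_RIGHT, DIR_LEFT = 35, 38   # direction control pins (BOARD numbering)
PWM_RIGHT, PWM_LEFT = 37, 40   # speed control pins
PWM_FREQ_HZ = 100              # assumed PWM frequency

GPIO.setmode(GPIO.BOARD)
GPIO.setup([DIR_RIGHT, DIR_LEFT, PWM_RIGHT, PWM_LEFT], GPIO.OUT)

pwm_right = GPIO.PWM(PWM_RIGHT, PWM_FREQ_HZ)
pwm_left = GPIO.PWM(PWM_LEFT, PWM_FREQ_HZ)
pwm_right.start(0)
pwm_left.start(0)

def drive(duty_right, duty_left, forward_right=True, forward_left=True):
    """Set rotation direction and duty ratio (0.0-1.0) for both crawler motors."""
    GPIO.output(DIR_RIGHT, GPIO.HIGH if forward_right else GPIO.LOW)  # HIGH assumed forward
    GPIO.output(DIR_LEFT, GPIO.HIGH if forward_left else GPIO.LOW)
    pwm_right.ChangeDutyCycle(duty_right * 100.0)   # RPi.GPIO duty cycle is 0-100 %
    pwm_left.ChangeDutyCycle(duty_left * 100.0)

drive(1.0, 1.0)   # move forward at full duty ratio
drive(0.0, 0.0)   # stop
GPIO.cleanup()
```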

5.5. Buoyancy Evaluation

We first checked the friction of the sliding rails used in the body waterproof box because the friction had become large when the box was not held horizontally. Moreover, we checked the buoyancy of the body waterproof box on water by repeatedly sliding it vertically, pushing and releasing it by hand. Subsequently, we checked synchronized vertical movement in response to waves. To measure the draft, we attached a ruler to the side of the box. We sealed the gap at each corner of the body waterproof box using rubber tape. We changed the amount of water in a plastic bottle used as ballast; this experiment was conducted with water poured into the plastic bottle from 0.2 L to 1.0 L, in steps of 0.2 L.
First, we confirmed the change of draft with respect to weight. The experimentally obtained result revealed that the waterproof box floated back up without difficulty at any loading capacity after being pushed down with both hands. Moreover, the sliding mechanism moved vertically and worked without difficulty. We infer that the friction of the sliding mechanism was reduced because the water served as a lubricant.
Subsequently, we made waves by manually stirring the water in the tank. The waterproof box moved vertically along with the waves. A small amount of wave water entered the box because its top had no cover, and the top of the body waterproof box came close to the water surface when the loading capacity was increased and the draft rose. Water immersion was prevented after a top cover was attached. This preliminary experiment revealed that the slide mechanism functioned without difficulty.

5.6. Test Locomotion

In the indoor locomotion experiment, we confirmed three points: accessing the website for operation, running the robot according to instructions, and measuring the locomotion velocity. We set a fixed Internet Protocol (IP) address for the single-board computer. Although we confirmed that the fixed IP address provided smooth access to the website, the single-board computer sometimes denied access with warning messages indicating that opening the website took a long time. This warning disappeared after rebooting the server and allowing sufficient time to pass.
To operate the robot remotely, we developed a web-based tool with a touch-panel interface for a tablet computer. The assigned instructions, which comprise forward, right, left, and stop, are executed by clicking a button on the display. The single-board computer is addressed from this tool.
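The tool itself is not published, so the following Flask sketch is a hypothetical reimplementation showing how each button press can be mapped to one of the four instructions through an HTTP request; the route names, port, duty-ratio table, and the drive() helper (from the sketch in Section 5.4) are all assumptions.

```python
# Hypothetical sketch of a web-based remote-operation tool such as the one described:
# each touch-panel button issues an HTTP request that is mapped to one instruction.
from flask import Flask, abort

from motor_control import drive   # hypothetical module wrapping the GPIO sketch above

app = Flask(__name__)

# Duty ratios (right motor, left motor) per instruction; turning is assumed to be
# achieved by driving only one crawler.
COMMANDS = {
    "forward": (1.0, 1.0),
    "right":   (0.0, 1.0),
    "left":    (1.0, 0.0),
    "stop":    (0.0, 0.0),
}

@app.route("/cmd/<name>")
def command(name: str):
    if name not in COMMANDS:
        abort(404)
    duty_right, duty_left = COMMANDS[name]
    drive(duty_right, duty_left)
    return f"executed: {name}"

if __name__ == "__main__":
    # Served at the fixed IP address assigned to the single-board computer.
    app.run(host="0.0.0.0", port=8080)
```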
When we first executed the respective instructions, the robot turned without moving forward or backward because the rotation direction of one motor was reversed; the output terminals of the motor driver for that motor had been connected in reverse. After correcting the wiring, the robot moved forward and backward normally. We measured the locomotion velocity: the robot moved a distance of 100 m in 5 min 33 s, which corresponds to 0.30 m/s.
In the outdoor locomotion experiment, we confirmed two points. The first is a performance evaluation of running on rough, uneven roads. The second is measurement of the effective wireless communication range for sending instructions using a mobile router.
In this experiment, the effective range of wireless instruction transmission was up to 41 m. According to the specifications of the mobile router, the interference distance limit is 40 m, which is insufficient for moving in a paddy field. We infer the necessity of an alternative high-power router or a multi-router system for remote control. Remote operation is unnecessary in a fully autonomous locomotion use case; however, long-distance communication remains fundamentally important because monitoring and the transmission of emergency stop commands are indispensable. We infer that one solution is to use cellular communication networks as an alternative to Wi-Fi communication. An important difficulty of stepping on the rice plants arose from the late response to instructions. We infer that this delay in instruction execution can be eliminated by implementing autonomous locomotion functions.
Figure 13 depicts time-series images of the outdoor locomotion tests on a snowy road. For a step larger than 50 mm, as depicted in the bottom panels, the robot needed much time to climb over because of insufficient motor torque. Overall, we confirmed that the robot ran without problems in this environment. In contrast, a time difference was found between operation on the tablet computer and execution of the instructions: although we sent an instruction to turn right, the robot continued to move forward for a moment. We also confirmed that a rotational difference between the left and right motors arose after several locomotion runs; the robot meandered slightly even under forward instructions.
Figure 13. Time-series images of outdoor locomotion on a snowy road.
We confirmed that the third prototype has sufficient locomotion capability not only on a gravel road but also on a snowy road when controlled by remote operation through wireless communication. Although the robot spent much time surmounting a step approximately 50 mm high, no outright shortage of motor torque was apparent because the wheels and crawler belts continued to rotate. In contrast, a difficulty was apparent in that the motor revolutions differed between the left and right sides. The motor driver includes a feedback function for motor revolutions. We infer that the rotational difference can be eliminated by having the single-board computer correct the commands using the actual numbers of rotations obtained from this function.
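A minimal sketch of such a correction is shown below, assuming that per-motor revolution counts are available for each control period (from the driver's feedback function or from encoders) and using an illustrative proportional gain; it is not the authors' implementation.

```python
# Sketch of the proposed correction: equalize left/right crawler speeds using
# measured motor revolutions per control period. The gain value is illustrative.
def corrected_duties(duty_cmd, rev_right, rev_left, gain=0.05):
    """Return (duty_right, duty_left) nudged so both motors converge to the same rate."""
    error = rev_right - rev_left                           # positive: right motor runs fast
    duty_right = min(max(duty_cmd - gain * error, 0.0), 1.0)
    duty_left = min(max(duty_cmd + gain * error, 0.0), 1.0)
    return duty_right, duty_left

# Example: the right crawler is measured running slightly faster than the left one.
print(corrected_duties(1.0, rev_right=52, rev_left=50))    # -> (0.9, 1.0)
```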

6. Autonomous Locomotion Experiment

6.1. Outdoor Experiment

This experiment was conducted outdoors in an actual paddy field. The experimentally obtained results verify the basic locomotion capabilities of our robot and provide an evaluation of the semantic segmentation accuracy for time-series FOV images obtained by the onboard camera.

6.1.1. Locomotion among the Rice Plants

Figure 14 depicts images of the locomotion of our robot on two dates. The time-series images in Figure 14a depict the paddy condition two months after rice planting, when the rice plants had grown to approximately 300 mm. Because of the low rice plant density, the robot moved easily among the rice plants.
Figure 14. Robot locomotion on two days.
Figure 14b depicts the paddy condition three months after rice planting, when the rice plants had grown to approximately 500 mm. The locomotion can be seen only slightly, through small gaps in the rice plants. The radius of each bunch of rice plants is approximately 30 mm. The mean margin was 140 mm wide because the robot was 200 mm wide and the passage was 300 mm wide. This narrow margin meant that the robot could barely be allowed to meander. In this experiment, the most common locomotion pattern was forward movement; if the robot meandered to the left or right, then moving backward was chosen rather than steering in the opposite direction back to the proper path.
Despite this situation, we obtained time-series FOV images, as depicted in Figure 15, using the onboard monocular camera (Raspberry Pi High Quality Camera; Raspberry Pi Foundation) with a CMOS imaging sensor (IMX477R; Sony Corp., Tokyo, Japan). The maximum image resolution is 4056 × 3040 pixels. Considering the processing cost and memory size, we obtained images at 1016 × 760-pixel resolution.
Figure 15. Time-series FOV images obtained using an onboard monocular camera.

6.1.2. Segmentation Results

Kirillov et al. [56] divided image segmentation tasks into two types: semantic segmentation and instance segmentation. Semantic segmentation treats thing classes as stuff, which is amorphous and uncountable, whereas instance segmentation detects each object and delineates it with a bounding box or segmentation mask. For this study, our segmentation targets comprise the ground, paddy, and other categories, which belong to stuff. Numerous methods have been proposed for semantic segmentation, including PSPNet, ICNet, and PSANet by Zhao et al. [57,58,59], ESPNet by Mehta et al. [60], MaskLab and DeepLabv3+ by Chen et al. [61,62], AdaptSegNet by Tsai et al. [63], Auto-DeepLab by Liu et al. [64], Gated-SCNN by Takikawa et al. [65], RandLA-Net by Hu et al. [66], and PolarNet by Zhang et al. [67]. For this study, we used SegNet [68] to segment the time-series FOV images because this classical method is well established for segmenting road scene images such as those in the CamVid Database [69].
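SegNet's defining feature is that the decoder reuses the encoder's max-pooling indices for upsampling. The following PyTorch sketch illustrates that mechanism at a reduced scale for the three stuff categories used here; it is an illustration only, not the authors' trained network, and the channel widths and depth are assumptions (the original SegNet [68] uses a deeper VGG16 encoder).

```python
# Minimal SegNet-style encoder-decoder sketch for three stuff categories
# (ground, paddy, other), showing pooling-index reuse in the decoder.
import torch
import torch.nn as nn
import torch.nn.functional as F

def conv_bn_relu(c_in, c_out):
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, kernel_size=3, padding=1),
        nn.BatchNorm2d(c_out),
        nn.ReLU(inplace=True),
    )

class MiniSegNet(nn.Module):
    def __init__(self, n_classes=3):
        super().__init__()
        self.enc1 = conv_bn_relu(3, 32)
        self.enc2 = conv_bn_relu(32, 64)
        self.dec2 = conv_bn_relu(64, 32)
        self.dec1 = nn.Conv2d(32, n_classes, kernel_size=3, padding=1)
        self.pool = nn.MaxPool2d(2, stride=2, return_indices=True)

    def forward(self, x):
        x = self.enc1(x)
        x, idx1 = self.pool(x)                              # keep pooling indices
        x = self.enc2(x)
        x, idx2 = self.pool(x)
        x = F.max_unpool2d(x, idx2, kernel_size=2, stride=2)  # decoder reuses indices
        x = self.dec2(x)
        x = F.max_unpool2d(x, idx1, kernel_size=2, stride=2)
        return self.dec1(x)                                 # per-pixel class scores

# Dummy frame at the resolution used in this study (1016 x 760 pixels).
frames = torch.randn(1, 3, 760, 1016)
logits = MiniSegNet()(frames)      # shape: (1, 3, 760, 1016)
```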
Table 2 presents details of the number of images assigned to training, validation, and testing. We annotated all the pixels of 1160 images. The annotation labels comprise three types: ground pixels, paddy pixels, and other pixels.
Table 2. Numbers of images assigned to training, validation, and testing [images].
Figure 16 depicts the loss and accuracy curves of the learning process. Both the training and validation losses converged as the epochs progressed, and both the training and validation accuracies likewise converged. Comparing the two, the validation accuracy is higher than the training accuracy. We consider that this reversal resulted from the ratio of training images to validation images, as denoted in Table 2.
Figure 16. Loss and accuracy curves in the learning processes.
Figure 17 depicts two examples of segmentation results with SegNet. The left, center, and right panels respectively represent the original images, ground truth (GT) images, and segmentation images. As a local tendency, the images include several false positive and false negative pixels, especially near the category boundaries. As an overall tendency, segmentation images are consistent with the GT images.
Figure 17. Segmentation results with SegNet.
Table 3 presents detailed segmentation results for each category. The accuracies for the ground, paddy, and other pixels are 99.1%, 92.4%, and 44.8%, respectively. The mean accuracy is 97.0%. The effect of the other pixels on the overall accuracy was slight because they occupied merely 1.5% (=110,838/7,257,600) of the total pixels.
Table 3. Segmentation results [pixels].
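For reference, the per-class accuracies in Table 3 correspond to the diagonal of a confusion matrix divided by the ground-truth pixel count of each class, and the reported 97.0% mean appears consistent with a pixel-count-weighted (overall) accuracy rather than a simple average of the per-class values. A minimal helper, assuming NumPy and a rows-as-ground-truth confusion matrix, is sketched below.

```python
# Per-class and overall pixel accuracy from a confusion matrix whose rows are
# ground-truth labels (ground, paddy, other) and whose columns are predictions.
import numpy as np

def pixel_accuracies(confusion: np.ndarray):
    """Return (per-class accuracy, overall pixel accuracy) for a square confusion matrix."""
    per_class = np.diag(confusion) / confusion.sum(axis=1)   # recall per category
    overall = np.diag(confusion).sum() / confusion.sum()     # pixel-count-weighted accuracy
    return per_class, overall
```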

6.2. Indoor Experiment

This experiment was conducted indoors on an imitation paddy field. The experimentally obtained results provide an evaluation of the behavior prediction accuracy for navigation based on visual processing of time-series FOV images obtained using the onboard camera.

6.2.1. Experiment Setup

Figure 18 depicts the exterior of the field built in imitation of a paddy field. The field is 2.0 m wide and 4.4 m long. The white objects are imitations of rice plants created from plastic bottles covered with white paper. The object intervals in the longitudinal and lateral directions are 0.6 m and 0.4 m, respectively; these lengths are twice the actual intervals.
Figure 18. Experiment field resembling a paddy field for obtaining FOV time-series images from our robot.
Unlike an actual muddy field, this experimental field provided steady locomotion because of the sufficiently high friction between the floor surface and the robot crawlers. Consequently, the proportion of images annotated as moving forward tended to increase. To ensure an effective number of images annotated as right and left turns, the initial heading was offset from the moving direction, and the initial position was shifted from the centerline between the rice plants.
Considering the robot speed and the annotation burden, the image sampling rate was reduced from 60 fps to 10 fps. We manually annotated all images. Table 4 presents the number of obtained images and detailed information for the three behavior categories: move forward (MF), turn right (TR), and turn left (TL).
Table 4. Numbers of obtained images and their detailed information of three behavior categories.
We obtained 4626 images in total. The numbers of MF, TR, and TL images are 2419, 1347, and 860, respectively. The data ratios of MF and TL are, respectively, 52.3% (the maximum) and 18.6% (the minimum). At the ends of the experimental field, the robot turned right to change its path to the next path between rice plants; this locomotion increased the number of TR images relative to TL images.
Table 5 shows the fundamental network parameters of our implemented model, which combines a CNN with a long short-term memory (LSTM) [70] network. Figure 19 depicts the network structure of the LSTM, in which repeating modules are connected in a cascade.
Table 5. Set values of fundamental network parameters (E1).
Figure 19. Network structure of LSTM.
The filter sizes of the first convolutional layer and the second through fifth convolutional layers are, respectively, 5 × 5 and 3 × 3. The pooling widths for the respective layers are 2 × 2. Herein, the number of hidden layers is a dominant LSTM parameter.
Letting α and n_bs respectively represent the learning coefficient and the batch size, and letting n_lb be the look-back parameter, the input sequence length of the LSTM, i.e., the length of its long-term memory, is controlled by n_lb. For generalization, we employed the Adam algorithm [71], a stochastic gradient descent (SGD) method that incorporates low-order moments. The default values of α, n_bs, and n_lb were set to 0.01, 2 [72], and 50 [73], respectively. The ratio of training data to validation data is 80:20.
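As an illustration of how such a model can be assembled, the following PyTorch sketch stacks five convolutional layers (a 5 × 5 filter followed by 3 × 3 filters, each with 2 × 2 pooling) and an LSTM over a look-back window of n_lb frames, classifying the last time step into MF/TR/TL. The channel widths, input resolution, and LSTM hidden size are assumptions of this sketch; the authors' exact architecture and training pipeline are not reproduced here.

```python
# Simplified CNN + LSTM behavior-prediction sketch reflecting the parameters in the text:
# five conv layers (5x5 then 3x3 filters, 2x2 pooling), LSTM over n_lb frames, 3 classes.
import torch
import torch.nn as nn

class CnnLstmPredictor(nn.Module):
    def __init__(self, n_classes=3, hidden_size=128):
        super().__init__()
        chans = [3, 16, 32, 64, 64, 64]          # assumed channel widths
        layers = []
        for i in range(5):
            k = 5 if i == 0 else 3               # first layer 5x5, the rest 3x3
            layers += [
                nn.Conv2d(chans[i], chans[i + 1], kernel_size=k, padding=k // 2),
                nn.ReLU(inplace=True),
                nn.MaxPool2d(2),                 # 2x2 pooling after each conv layer
            ]
        self.cnn = nn.Sequential(*layers, nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.lstm = nn.LSTM(input_size=chans[-1], hidden_size=hidden_size, batch_first=True)
        self.head = nn.Linear(hidden_size, n_classes)

    def forward(self, frames):                   # frames: (batch, n_lb, 3, H, W)
        b, t = frames.shape[:2]
        feats = self.cnn(frames.flatten(0, 1))   # per-frame CNN features
        out, _ = self.lstm(feats.view(b, t, -1))
        return self.head(out[:, -1])             # classify from the last time step

model = CnnLstmPredictor()
optimizer = torch.optim.Adam(model.parameters(), lr=0.01)   # alpha = 0.01, as in E1
clip = torch.randn(2, 50, 3, 96, 128)            # n_bs = 2, n_lb = 50, assumed frame size
logits = model(clip)                             # shape: (2, 3)
```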
The parameter values that maximize the behavior prediction accuracy vary among datasets. For this experiment, we conducted the following nine experiments, labeled E1–E9 below, with different combinations of parameters and setting values. Considering the computational burden and time, we set 100 training epochs as a common value.
E1: 
Default parameter values are set to the initial network model.
E2: 
The sampling frame rate is changed from 10 fps to 5 fps.
E3: 
n b s is changed from 2 to 5.
E4: 
n l b is changed from 50 to 10.
E5: 
The input-layer depth is changed from 3 dimensions to 6 dimensions to accommodate segmentation images with SegNet in addition to the original images.
E6: 
An additional dropout layer between the first and second convolutional layers disables 25% of the connections between the two layers.
E7: 
A new dropout layer that disables 25% of the connections is added to each layer after the second convolutional layer.
E8: 
A new pathway is appended to bypass the third and fourth convolution layers.
E9: 
The pooling width of the pooling layers is changed to 4 × 4.

6.2.2. Training Results

Figure 20 depicts the transitions of the training and validation accuracies. For the training datasets, the global accuracy improved constantly and steadily. In particular, the curves of E1, E2, E4, E5, and E6 converge at around the 20th epoch, whereas those of E3, E7, and E9 converge at around the 40th epoch. In contrast, the curve of E8 shows a slow convergence tendency with a marked drop at around the 60th epoch. The convergence for the validation datasets is insufficient compared with that for the training datasets; the validation curves saturate at around 0.70.
Figure 20. Transitions of the training and validation accuracies.

6.2.3. Locomotion Prediction Results

Figure 21 presents the prediction results of the respective experiments. The accuracy of E6 was the highest at 62.4%, whereas that of E8 was the lowest at 56.1%; the difference between them was 6.3 percentage points. The accuracies of E1, E4, E6, and E9 were all greater than the mean accuracy of 59.7%.
Figure 21. Locomotion prediction results for nine experimental conditions.
Table 6 presents the prediction accuracy obtained for each locomotion pattern. The accuracy for MF was the maximum at 81.6%, whereas that for TL was the minimum at 25.8%.
Table 6. Locomotion prediction accuracy.

6.2.4. Analysis and Discussion

The uneven numbers of images are attributable to a limitation of the data collection method related to the locomotion mechanism of the robot. Figure 22 portrays an example of a TR path. Although the robot is turning left globally, MF is selected locally and briefly because the robot must recover to the correct path. The upper-middle panel depicts an FOV image of the robot moving toward the rice plants, which is annotated as MF instead of TL. The accumulation of such situations increased the number of images annotated as MF.
Figure 22. Inconsistency between the actual purpose of robot motion and annotated labels.
To predict global locomotion patterns, we consider it necessary to understand the scene context [74] as autonomous agents do [75]. Moreover, we could assign more detailed annotation labels, such as “forward locomotion along the center of a route between rice plants” and “locomotion adjusting back to the center of a route”. However, it is unrealistic to assign such complex labels manually. Therefore, we infer the necessity of implementing an automatic annotation function that recognizes the scene context.

7. Conclusions

This paper presented a small robot for the realization of remote farming. To actualize highly efficient rice-duck farming, this study presented experimentally obtained results verifying useful methods of navigating mallards using robot prototypes based on imprinting and feeding. The results revealed that imprinted baby mallards followed our first prototype. We consider that adult mallards require more than two days to become accustomed to the robot and the feeder because they have strong wariness of unknown objects. Although we did not actualize mallard navigation in an actual field, we found that mallards ate feed from the feeder on our second prototype. Based on this preliminary result, we developed our third prototype as a fully customized mobile robot for mallard navigation. We conducted an indoor driving test to assess manual operation and outdoor evaluation tests of running on an actual paddy field. We developed indoor and outdoor image datasets using an onboard monocular camera. For the outdoor image datasets, our segmentation method based on SegNet achieved 97.0% mean segmentation accuracy for three semantic categories. For the indoor image datasets, our prediction method based on a CNN and LSTM achieved 59.7% mean prediction accuracy for three motion categories.
As future work, we intend to quantify the effect on imprinting of the time shared between the robot and the mallards during breeding. Moreover, we intend to conduct navigation experiments in a paddy field using several types of feed with imprinted baby mallards. For autonomous locomotion, we will implement motor control incorporating a feedback mechanism, and we would like to combine autonomous locomotion with navigation based on imprinting and feeding. We must also statistically analyze the trajectories of the baby mallards. To avoid overfitting caused by overtraining, we will reconsider the loss function so that the training and validation processes converge. We intend to improve the segmentation and prediction accuracies for fully autonomous locomotion in actual paddy fields of various types. Finally, we would like to conduct statistical analyses of the experimentally obtained results based on the concept of animal–robot interaction (ARI).

Author Contributions

Conceptualization, S.Y.; methodology, H.W.; software, H.M.; validation, S.N.; formal analysis, K.S.; investigation, S.Y.; resources, Y.N.; data curation, H.W.; writing—original draft preparation, H.M.; writing—review and editing, H.M.; visualization, S.N.; supervision, K.S.; project administration, Y.N.; funding acquisition, H.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Japan Society for the Promotion of Science (JSPS) KAKENHI Grant Numbers 17K00384 and 21H02321.

Institutional Review Board Statement

The study was conducted according to the guidelines approved by the Institutional Review Board of Akita Prefectural University (Kendaiken–91; 27 April 2018).

Data Availability Statement

The datasets presented in this study are available on request to the corresponding author.

Acknowledgments

We would like to show our appreciation to Takuma Watanabe and Mizuho Kurosawa, who are graduates of Akita Prefectural University, for their great cooperation in the experiments.

Conflicts of Interest

The authors declare that they have no conflicts of interest. The funders had no role in the design of the study, in the collection, analyses, or interpretation of data, in the writing of the manuscript, or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
ABS: Acrylonitrile butadiene styrene
ACVs: Air-cushion vehicles
AI: Artificial intelligence
ARI: Animal–robot interaction
BDDV: Berkeley DeepDrive video dataset
CNN: Convolutional neural network
DL: Deep learning
GPIO: General-purpose input and output
GT: Ground truth
FOV: Field of view
FPV: First-person view
HRI: Human–robot interaction
ICT: Information and communications technology
IoT: Internet of things
IP: Internet protocol
IR: Infrared
LiDAR: Laser imaging detection and ranging
LRF: Laser range finder
LSTM: Long short-term memory
MF: Move forward
PWM: Pulse width modulation
RCNN: Recurrent convolutional neural network
SGD: Stochastic gradient descent
TL: Turn left
TR: Turn right
UAV: Unmanned aerial vehicles

References

  1. Wolfert, S.; Ge, L.; Verdouw, C.; Bogaardt, M.J. Big Data in Smart Farming: A Review. Agric. Syst. 2017, 153, 69–80. [Google Scholar] [CrossRef]
  2. Farooq, M.S.; Riaz, S.; Abid, A.; Abid, K.; Naeem, M.A. Survey on the Role of IoT in Agriculture for the Implementation of Smart Farming. IEEE Access 2019, 7, 156237–156271. [Google Scholar] [CrossRef]
  3. Abbasi, M.; Yaghmaee, M.H.; Rahnama, F. Internet of Things in agriculture: A survey. In Proceedings of the Third International Conference on Internet of Things and Applications (IoT), Isfahan, Iran, 17–18 April 2019; pp. 1–12. [Google Scholar]
  4. Gia, T.N.; Qingqing, L.; Queralta, J.P.; Zou, Z.; Tenhunen, H.; Westerlund, T. Edge AI in Smart Farming IoT: CNNs at the Edge and Fog Computing with LoRa. In Proceedings of the IEEE AFRICON 2019, Accra, Ghana, 25–27 September 2019; pp. 1–6. [Google Scholar]
  5. Alreshidi, E. Smart Sustainable Agriculture (SSA) Solution Underpinned by Internet of Things (IoT) and Artificial Intelligence (AI). Int. J. Adv. Comput. Sci. Appl. 2019, 10, 93–102. [Google Scholar] [CrossRef]
  6. Dahane, A.; Benameur, R.; Kechar, B.; Benyamina, A. An IoT Based Smart Farming System Using Machine Learning. In Proceedings of the International Symposium on Networks, Computers and Communications, Montreal, QC, Canada, 20–22 October 2020; pp. 1–6. [Google Scholar]
  7. Balafoutis, A.T.; Evert, F.K.V.; Fountas, S. Smart Farming Technology Trends: Economic and Environmental Effects, Labor Impact, and Adoption Readiness. Agronomy 2020, 10, 743. [Google Scholar] [CrossRef]
  8. Yaghoubi, S.; Akbarzadeh, N.A.; Bazargani, S.S.; Bamizan, M. Autonomous Robots for Agricultural Tasks and Farm Assignment and Future Trends in Agro Robots. Int. J. Mech. Mechatron. Eng. 2017, 13, 1–6. [Google Scholar]
  9. Noguchi, N.; Barawid, O.C. Robot Farming System Using Multiple Robot Tractors in Japan Agriculture. In Proceedings of the 18th World Congress of the International Federation of Automatic Control, Milano, Italy, 28 August–2 September 2011; pp. 633–637. [Google Scholar]
  10. Chang, C.K.; Siagian, C.; Itti, L. Mobile Robot Monocular Vision Navigation Based on Road Region and Boundary Estimation. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Algarve, Portugal, 7–12 October 2012; pp. 1043–1050. [Google Scholar]
  11. Yandun, F.; Narvaez, F.Y.; Reina, G.; Torres-Torriti, M.; Kantor, G.; Cheein, F.A. A Survey of Ranging and Imaging Techniques for Precision Agriculture Phenotyping. IEEE/ASME Trans. Mechatron. 2017, 22, 2428–2439. [Google Scholar] [CrossRef]
  12. Niitani, Y.; Ogawa, T.; Saito, S.; Saito, M. ChainerCV: A Library for Deep Learning in Computer Vision. In Proceedings of the 25th ACM International Conference on Multimedia, Mountain View, CA, USA, 23–27 October 2017; pp. 1217–1220. [Google Scholar]
  13. Bradski, G.; Kaehler, A.; Pisarevsky, V. Learning-Based Computer Vision with Intel’s Open Source Computer Vision Library. Intel Technol. J. 2005, 9, 119–130. [Google Scholar]
  14. Madokoro, H.; Woo, H.; Nix, S.; Sato, K. Benchmark Dataset Based on Category Maps with Indoor–Outdoor Mixed Features for Positional Scene Recognition by a Mobile Robot. Robotics 2020, 9, 40. [Google Scholar] [CrossRef]
  15. Madokoro, H.; Yamamoto, S.; Woo, H.; Sato, K. Mallard Navigation Using Unmanned Ground Vehicles, Imprinting, and Feeding. In Proceedings of the International Joint Conference on JSAM and SASJ and 13th CIGR VI Technical Symposium Joining FWFNWG and FSWG Workshops, Sapporo, Japan, 3–6 September 2019; pp. 1–8. [Google Scholar]
  16. Watanabe, T.; Madokoro, H.; Yamamoto, S.; Woo, H.; Sato, K. Prototype Development of a Mallard Guided Robot. In Proceedings of the 19th International Conference on Control, Automation and Systems, Jeju, Korea, 15–18 October 2019; pp. 1566–1571. [Google Scholar]
  17. Pernollet, C.A.; Simpson, D.; Gauthier-Clerc, M.; Guillemain, M. Rice and Duck, A Good Combination? Identifying the Incentives and Triggers for Joint Rice Farming and Wild Duck Conservation. Agric. Ecosyst. Environ. 2015, 214, 118–132. [Google Scholar] [CrossRef]
  18. Ball, D.; Ross, P.; English, A.; Patten, T.; Upcroft, B.; Fitch, R.; Sukkarieh, S.; Wyeth, G.; Corke, P. Robotics for Sustainable Broad-Acre Agriculture. In Field and Service Robotics; Springer: Cham, Switzerland, 2015; Volume 105, pp. 439–453. [Google Scholar]
  19. Velasquez, A.E.B.; Higuti, V.A.H.; Guerrero, H.B.; Becker, M. HELVIS: A Small-scale Agricultural Mobile Robot Prototype for Precision Agriculture. In Proceedings of the 13th International Conference on Precision Agriculture, St. Louis, MO, USA, 31 July–3 August 2016; Volume 1, pp. 1–18. [Google Scholar]
  20. Yasuda, K.; Takakai, F.; Kaneta, Y.; Imai, A. Evaluation of Weeding Ability of Brush-Roller Type Paddy Weeding Robot and Its Influence on the Rice Growth. J. Weed Sci. Technol. 2017, 62, 139–148. [Google Scholar] [CrossRef]
  21. Mitsui, T.; Kagiya, T.; Ooba, S.; Hirose, T.; Kobayashi, T.; Inaba, A. Development of a Small Rover (AIGAMO ROBOT) to Assist Organic Culture in Paddy Fields: Field Experiment Using Robot for Weeding in 2007. In Proceedings of the JSME Annual Conference on Robotics and Mechatronics, Nagoya, Japan, 5–7 June 2008; pp. 1–3. [Google Scholar]
  22. Fujii, K.; Tabata, K.; Yokoyama, T.; Kutomi, S.; Endo, Y. Development of a Small Weeding Robot “AIGAMO ROBOT” for Paddy Fields. Tech. Rep. Gifu Prefect. Res. Inst. Inf. Technol. 2015, 17, 48–51. [Google Scholar]
  23. Sori, H.; Inoue, H.; Hatta, H.; Ando, Y. Effect for a Paddy Weeding Robot in Wet Rice Culture. J. Robot. Mechatron. 2018, 30, 198–205. [Google Scholar] [CrossRef]
  24. Nakai, S.; Yamada, Y. Development of a Weed Suppression Robot for Rice Cultivation: Weed Suppression and Posture Control. Int. J. Electr. Comput. Energetic, Electron. Commun. Eng. 2014, 8, 1879–1883. [Google Scholar]
  25. Yamada, Y.; Yamauchi, S. Study on Imprinting and Guidance of a Duck Flock by an Autonomous Mobile Robot (Effectiveness of Visual and Auditory Stimuli). J. Jpn. Soc. Des. Eng. 2018, 53, 691–704. [Google Scholar]
  26. Yamada, Y.; Yamauchi, S.; Tsuchida, T. Study on Imprinting and Guidance of a Duck Flock by an Autonomous Mobile Robot (Effectiveness of Imprinting after Critical Period and Effectiveness of Feeding and Auditory Stimulus). J. Jpn. Soc. Des. Eng. 2018, 53, 855–868. [Google Scholar]
  27. Miklósi, Á.; Korondi, P.; Matellán, V.; Gácsi, M. Ethorobotics: A New Approach to Human-Robot Relationship. Front. Psychol. 2017, 8, 958. [Google Scholar] [CrossRef]
  28. Korondi, P.; Korcsok, B.; Kovács, S.; Niitsuma, M. Etho-Robotics: What Kind of Behaviour Can We Learn from the Animals? In Proceedings of the 11th IFAC Symposium on Robot Control, Salvador, Brazil, 26–28 August 2015; pp. 244–255. [Google Scholar]
  29. Kim, J.; Choi, S.; Kim, D.; Kim, J.; Cho, M. Animal-Robot Interaction for Pet Caring. In Proceedings of the IEEE International Symposium on Computational Intelligence in Robotics and Automation, Daejeon, Korea, 15–18 December 2009; pp. 159–164. [Google Scholar]
  30. Romano, D.; Donati, E.; Benelli, G.; Stefanini, C. A Review on Animal—Robot Interaction: From Bio-Hybrid Organisms to Mixed Societies. Biol. Cybern 2019, 113, 201–225. [Google Scholar] [CrossRef] [PubMed]
  31. Romano, D.; Stefanini, C. Individual Neon Tetras (Paracheirodon Innesi, Myers) Optimise Their Position in the Group Depending on External Selective Contexts: Lesson Learned from a Fish-Robot Hybrid School. Biosyst. Eng. 2021, 204, 170–180. [Google Scholar] [CrossRef]
  32. Macri, S.; Karakaya, M.; Spinello, C.; Porfiri, M. Zebrafish Exhibit Associative Learning for an Aversive Robotic Stimulus. Lab Anim. 2020, 49, 259–264. [Google Scholar] [CrossRef]
  33. Karakaya, M.; Macri, S.; Porfiri, M. Behavioral Teleporting of Individual Ethograms onto Inanimate Robots: Experiments on Social Interactions in Live Zebrafish. Iscience 2020, 23, 101418. [Google Scholar] [CrossRef]
  34. Clément, R.J.; Macri, S.; Porfiri, M. Design and Development of a Robotic Predator as a Stimulus in Conditioned Place Aversion for the Study of the Effect of Ethanol and Citalopram in Zebrafish. Behav. Brain Res. 2020, 378, 112256. [Google Scholar] [CrossRef] [PubMed]
  35. Polverino, G.; Karakaya, M.; Spinello, C.; Soman, V.R.; Porfiri, M. Behavioural and Life-History Responses of Mosquitofish to Biologically Inspired and Interactive Robotic Predators. J. R. Soc. Interface 2019, 16, 158. [Google Scholar] [CrossRef] [PubMed]
  36. Butail, S.; Ladu, F.; Spinello, D.; Porfiri, M. Information Flow in Animal-Robot Interactions. Entropy 2014, 16, 1315–1330. [Google Scholar] [CrossRef]
  37. Spinello, C.; Yang, Y.; Macri, S.; Porfiri, M. Zebrafish Adjust Their Behavior in Response to an Interactive Robotic Predator. Front. Robot. AI 2019, 6, 38. [Google Scholar] [CrossRef] [PubMed]
  38. Romano, D.; Benelli, G.; Stefanini, C. Opposite Valence Social Information Provided by Bio-Robotic Demonstrators Shapes Selection Processes in the Green Bottle Fly. J. R. Soc. Interface 2021, 18, 176. [Google Scholar] [CrossRef]
  39. Datteri, E. The Logic of Interactive Biorobotics. Front. Bioeng. Biotechnol. 2020, 8, 637. [Google Scholar] [CrossRef]
  40. Vaughan, R.; Sumpter, N.; Henderson, J.; Frost, A.; Cameron, S. Experiments in Automatic Flock Control. Robot. Auton. Syst. 2000, 31, 109–117. [Google Scholar] [CrossRef]
  41. Henderson, J.V.; Nicol, C.J.; Lines, J.A.; White, R.P.; Wathes, C.M. Behaviour of Domestic Ducks Exposed to Mobile Predator Stimuli. 1. Flock Responses. Br. Poult. Sci. 2001, 42, 433–438. [Google Scholar] [CrossRef]
  42. Chen, Z.; Wang, S.; Wang, J.; Xu, K.; Lei, T.; Zhang, H.; Wang, X.; Liu, D.; Si, J. Control Strategy of Stable Walking for a Hexapod Wheel-Legged Robot. ISA Trans. 2021, 108, 367–380. [Google Scholar] [CrossRef]
  43. Li, J.; Wang, J.; Peng, H.; Hu, Y.; Su, H. Fuzzy-Torque Approximation-Enhanced Sliding Mode Control for Lateral Stability of Mobile Robot. IEEE Trans. Syst. Man Cybern. Syst. 2021, 1–10. [Google Scholar] [CrossRef]
  44. Chen, J.; Du, C.; Zhang, Y.; Han, P.; Wei, W. A Clustering-Based Coverage Path Planning Method for Autonomous Heterogeneous UAVs. IEEE Trans. Intell. Transp. Syst. 2021, 1–11. [Google Scholar] [CrossRef]
  45. Hess, E.H. Imprinting. Science 1959, 130, 133–141. [Google Scholar] [CrossRef] [PubMed]
  46. Karlson, P.; Butenandt, A. Pheromones (Ectohormones) in Insects. Annu. Rev. Entomol. 1959, 4, 39–58. [Google Scholar] [CrossRef]
  47. Yurtsever, E.; Lambert, J.; Carballo, A.; Takeda, K. A Survey of Autonomous Driving: Common Practices and Emerging Technologies. IEEE Access 2020, 8, 58443–58469. [Google Scholar] [CrossRef]
  48. Wang, Z.; Wu, Y.; Niu, Q. Multi-Sensor Fusion in Automated Driving: A Survey. IEEE Access 2020, 8, 2847–2868. [Google Scholar] [CrossRef]
  49. Murase, T.; Hirakawa, T.; Yamashita, T.; Fujiyoshi, H. Self-State-Aware Convolutional Neural Network for Autonomous Driving. In Proceedings of the IEICE Technical Report of Pattern Recognition and Media Understanding, Kumamoto, Japan, 12–13 October 2017; pp. 85–90. [Google Scholar]
  50. Kamiya, R.; Kawaguchi, T.; Fukui, H.; Ishii, Y.; Otsuka, K.; Hagawa, R.; Tsukizawa, S.; Yamashita, K.; Yamauchi, T.; Fujiyoshi, H. Self-Motion Identification Using Convolutional Recurrent Neural Network. In Proceedings of the 22nd Symposium on Sensing via Image Information, Yokohama, Japan, 8–10 June 2016. [Google Scholar]
  51. Kanade, T.; Hebert, M. First-Person Vision. Proc. IEEE 2012, 100, 2442–2453. [Google Scholar] [CrossRef]
  52. Lai, S.; Xu, L.; Liu, K.; Zhao, J. Recurrent Convolutional Neural Networks for Text Classification. In Proceedings of the AAAI Conference on Artificial Intelligence, Austin, TX, USA, 25–30 January 2015; pp. 2267–2273. [Google Scholar]
  53. Xu, H.; Gao, Y.; Yu, F.; Darrell, T. End-to-end Learning of Driving Models from Large-scale Video Datasets. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 2174–2182. [Google Scholar]
  54. Yu, F.; Chen, H.; Wang, X.; Xian, W.; Chen, Y.; Liu, F.; Madhavan, V.; Darrell, T. BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 16–18 June 2020; pp. 2633–2642. [Google Scholar]
  55. Kamilaris, A.; Prenafeta-Boldú, F.X. Deep Learning in Agriculture: A Survey. Comput. Electron. Agric. 2018, 147, 70–90. [Google Scholar] [CrossRef]
  56. Kirillov, A.; He, K.; Girshick, R.; Rother, C.; Dollár, P. Panoptic Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 16–20 June 2019; pp. 9396–9405. [Google Scholar]
  57. Zhang, P.; Liu, W.; Lei, Y.; Wang, H.; Lu, H. RAPNet: Residual Atrous Pyramid Network for Importance-Aware Street Scene Parsing. IEEE Trans. Image Process. 2020, 29, 5010–5021. [Google Scholar] [CrossRef] [PubMed]
  58. Zhao, H.; Qi, X.; Shen, X.; Shi, J.; Jia, J. ICNet for Real-Time Semantic Segmentation on High-Resolution Images. In Proceedings of the European Conference on Computer Vision, Munich, Germany, 8–14 September 2018; pp. 405–420. [Google Scholar]
  59. Zhao, H.; Zhang, Y.; Liu, S.; Shi, J.; Loy, C.; Lin, D.; Jia, J. PSANet: Point-wise Spatial Attention Network for Scene Parsing. In Proceedings of the European Conference on Computer Vision, Munich, Germany, 8–14 September 2018; pp. 267–283. [Google Scholar]
  60. Mehta, S.; Rastegari, M.; Caspi, A.; Shapiro, L.; Hajishirzi, H. ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation. In Proceedings of the European Conference on Computer Vision, Munich, Germany, 8–14 September 2018; pp. 552–568. [Google Scholar]
  61. Chen, L.; Hermans, A.; Papandreou, G.; Schroff, F.; Wang, P.; Adam, H. MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 4013–4022. [Google Scholar]
  62. Chen, L.; Papandreou, G.; Kokkinos, I.; Murphy, K.; Yuille, A.L. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 2018, 40, 834–848. [Google Scholar] [CrossRef] [PubMed]
  63. Tsai, Y.; Hung, W.; Schulter, S.; Sohn, K.; Yang, M.; Chandraker, M. Learning to Adapt Structured Output Space for Semantic Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 7472–7481. [Google Scholar]
  64. Liu, C.; Chen, L.-C.; Schroff, F.; Adam, H.; Hua, W.; Yuille, A.L.; Fei-Fei, L. Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 82–92. [Google Scholar]
  65. Takikawa, T.; Acuna, D.; Jampani, V.; Fidler, S. Gated-SCNN: Gated Shape CNNs for Semantic Segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Korea, 27 October–2 November 2019; pp. 5228–5237. [Google Scholar]
  66. Hu, Q.; Yang, B.; Xie, L.; Rosa, S.; Guo, Y.; Wang, Z.; Trigoni, N.; Markham, A. RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 16–18 June 2020; pp. 11108–11117. [Google Scholar]
  67. Zhang, Y.; Zhou, Z.; David, P.; Yue, X.; Xi, Z.; Gong, B.; Foroosh, H. PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 16–18 June 2020; pp. 9601–9610. [Google Scholar]
  68. Badrinarayanan, V.; Kendall, A.; Cipolla, R. SegNet: A Deep Convolutional Encoder–Decoder Architecture for Image Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 2481–2495. [Google Scholar] [CrossRef]
  69. Brostow, G.J.; Fauqueur, J.; Cipolla, R. Semantic Object Classes in Video: A High-Definition Ground Truth Database. Pattern Recognit. Lett. 2009, 30, 88–97. [Google Scholar] [CrossRef]
  70. Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
  71. Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
  72. Saud, A.S.; Shakya, S. Analysis of Look Back Period for Stock Price Prediction with RNN Variants: A Case Study of Banking Sector of NEPSE. Procedia Comput. Sci. 2020, 167, 788–798. [Google Scholar] [CrossRef]
  73. Byeon, W.; Breuel, T.M.; Raue, F.; Liwicki, M. Scene Labeling With LSTM Recurrent Neural Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 3547–3555. [Google Scholar]
  74. Joubert, O.R.; Rousselet, G.A.; Fize, D.; Fabre-Thorpe, M. Processing Scene Context: Fast Categorization and Object Interference. Vis. Res. 2007, 47, 3286–3297. [Google Scholar] [CrossRef] [PubMed]
  75. Naseer, M.; Khan, S.; Porikli, F. Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey. IEEE Access 2019, 7, 1859–1887. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
