Article

Mobile Robot Navigation Based on Embedded Computer Vision

by Alberto Marroquín 1, Gonzalo Garcia 1, Ernesto Fabregas 2, Ernesto Aranda-Escolástico 3 and Gonzalo Farias 1,*
1 Escuela de Ingeniería Eléctrica, Pontificia Universidad Católica de Valparaíso, Av. Brasil 2147, Valparaíso 2362804, Chile
2 Departamento de Informática y Automática, Universidad Nacional de Educación a Distancia, Juan del Rosal 16, 28040 Madrid, Spain
3 Departamento de Ingeniería de Software y Sistemas Informáticos, Universidad Nacional de Educación a Distancia, Juan del Rosal 16, 28040 Madrid, Spain
* Author to whom correspondence should be addressed.
Mathematics 2023, 11(11), 2561; https://doi.org/10.3390/math11112561
Submission received: 29 April 2023 / Revised: 30 May 2023 / Accepted: 31 May 2023 / Published: 2 June 2023

Abstract: Current computational advances allow the development of technological solutions using tools such as mobile robots and programmable electronic systems. We present a design that integrates the Khepera IV mobile robot with an NVIDIA Jetson Xavier NX board. This system executes a navigation control algorithm based on computer vision and a model for object detection. Among the functionalities that this integration adds to the Khepera IV for guided driving are trajectory tracking for safe navigation and the detection of traffic signs for decision making. We built a robotic platform to test the system in real time and compared it with a digital model of the Khepera IV in the CoppeliaSim simulator. The navigation control results show significant improvements over previous works, both in the maximum navigation speed and in the hit rate of the traffic sign detection system. The navigation control achieved an average success rate of 93%. The architecture allows testing new control techniques or algorithms based on Python, facilitating future improvements.

1. Introduction

Process automation is made possible by technological tools, such as robots and embedded systems. Robots are automated machines capable of performing tasks with a certain degree of autonomy or under human supervision. Embedded systems, on the other hand, are electronic systems programmed to perform one or more specific tasks, adding functionality to other systems. Both tools contribute to the creation of technological solutions in important sectors, such as agriculture, manufacturing, and domestic assistance [1,2,3,4]. In this sense, they seek to replace human intervention in repetitive or heavy tasks, increasing the productivity (tasks solved in less time) and quality (less error in the expected results) of the processes. The same occurs in bio-inspired mobile robots, where the use of artificial neural networks and centralized patterns is applicable [5,6].
There are different devices for the locomotion of a mobile robot; among these are wheels, tracks, and articulated legs [7,8,9]. The differential drive mobile robot (DDMR) is the easiest to implement and control; it includes two motorized wheels (one on each side of its base) and a freewheel as a support point (at the front of its base). With this configuration, the robot's movement and orientation change according to the contribution of the speeds of both wheels, allowing it to move on smooth surfaces with good energy efficiency [10,11]. Its applications extend from academia, for the development and testing of navigation algorithms, to industrial environments for transporting raw materials. The above requires that the internal controller of the DDMR have computational resources for the following functions: perception (sensor reading), planning (use of algorithms), and execution (actuator control) [12,13].
Navigation control is necessary for planning [14]. It is an algorithm executed by the controller in charge of manipulating the actuators (in real time) for the safe movement of the mobile robot. It can use intelligent control techniques to balance the response speed and computational cost [15]. The navigation control intelligence determines the mobile robot autonomy to explore scenarios, either previously established (with walls or lines on the ground) or unknown (in open field). In addition, for the application of the DDMR control, functions such as following trajectories, avoiding obstacles, or moving objects may be required [16]. Therefore, we study these topics: (a) configurations that enable the controller to interpret its environment from sensor information, (b) algorithms that enhance controller intelligence in decision making, and (c) technologies that add functionality and reduce controller workload [17,18,19].
Several controllers used for autonomous navigation are mentioned in the literature, among which PID control, model predictive control, and fuzzy logic control stand out. The selection of the controller will depend on the type of robot, the environment where it is operated, and the task to be carried out [20,21,22]. Due to computational advances, it is also possible to integrate Machine Learning (ML) models into navigation controls. These models allow a computational system to learn to solve a task without the need to receive explicit programming on how to do it, for example, classification (separate objects in two or more classes), clustering (collecting objects with similar characteristics), and regression (establishing the relationship between two or more variables). An ML-based controller can be trained to learn to navigate in specific conditions or to adapt to dynamic environments from training. This consists of feeding a large dataset into a model and indicating the expected results (case study) to find characteristic patterns that allow it to make predictions when new data are involved, with a probability of being right (inference) [23,24,25].
Electronic development boards and microcomputers can be used to prototype technological solutions and carry out proofs of concept, that is, to verify the technical feasibility of a proposal through experiments [26,27]. Among these, the Arduino UNO and the Raspberry Pi stand out because they popularized the concept of open-source hardware. This refers to technological projects whose design documentation is publicly accessible so that it can be studied, freely modified, and redistributed without purchasing a license or subscription [28]. This has led manufacturers of electronic devices to create development boards such as the ESP32-DevKit and microcomputers such as the Jetson Nano, which have functionalities to implement embedded systems with applications ranging from the Internet of Things (IoT) to computer vision. ML models usually require considerable computational resources and memory, but with advances in optimization techniques, an embedded system can use these ML models to make inferences [29].
At the Pontificia Universidad Católica de Valparaíso (PUCV), Chile, work has been carried out to provide greater intelligence to the Khepera IV mobile robot from the company K-TEAM. For example, [30] shows the development of a platform that obtains the absolute position of one or more Khepera robots within a given workspace. This solution requires a PC that captures images of the base of the platform using a camera and SwisTrack v4 (OpenCV-based software) that detects and locates the robots present in the images. For all the robots to communicate with the PC, Easy Java Simulations (EJS) software is used, which allows each robot to query, through a TCP socket, its absolute position and error so that it can correct its speeds and orientation. In [31], a methodology to calibrate the internal sensors of the Khepera using neural networks is presented, and in [32], the Q-learning technique is used, where the model learns by exploring an unknown environment to determine the actions to execute by trial and error. Both works require repeating a series of steps many times to adjust their models. To avoid wearing out the useful life of the Khepera, they use the CoppeliaSim simulator and a digital model of the Khepera developed at PUCV [33,34,35] to train their models in a controlled environment, as required by each application. This opens up two possibilities: performing repetitive tests, depending on the application, using the Khepera digital model, and obtaining reliable results before testing on a real Khepera, extending its useful life.
The Khepera IV is a DDMR with two spherical support wheels, a diameter of 14 cm, and a height of 5.8 cm. It has remarkable performance in both simulation and real life, being sufficiently powerful to read its sensors, control its actuators, and navigate in real time. However, its computational power is insufficient for computer vision (it is not fitted with a GPU). To take advantage of the Khepera's potential mentioned above, an embedded system can be integrated that adds the ability to detect objects using ML or even Deep Learning (DL) models, as in [36].
After analyzing the state of the art of embedded systems, research was carried out on NVIDIA development boards for AI solutions. Due to the workload that image processing and model inference can involve, the Jetson Xavier NX was selected. It is compatible with model creation environments such as TensorFlow [37], PyTorch [38], and Caffe [39]. Since it also features a CPU, it provides several interfaces, including HDMI, GPIO, USB ports, and Ethernet, which opens possibilities for other types of projects. In addition, its dimensions are 7 cm × 4.5 cm, making it suitable for placing on top of the Khepera.
The motivation of this work is to integrate both systems into an embedded system that allows following trajectories using computer vision techniques and detecting objects in real time. To achieve this, a TCP socket connection is first established between both systems through WiFi, and then an algorithm is developed that provides the functionality to segment and follow trajectories and to make decisions when detecting objects using ML or DL models. This navigation algorithm was tested in three experiments, both in real time and in simulation.
The main contributions of this work are the following:
  • The development of an embedded system that integrates the Khepera and the Jetson Xavier NX, with both systems exchanging information by TCP socket. To the best of our knowledge, our work is the first solution that links the Khepera and the Jetson Xavier NX to implement DDMR navigation based on trajectory tracking and object detection.
  • The construction of a robotic platform for testing algorithms based on image processing for trajectory tracking and real-time object detection models.
  • The verification of the technical feasibility of using the YOLOv5 architecture and its potential to generate models using transfer learning so that they can be used by an embedded system with GPU, managing to create robust intelligent systems in a shorter amount of time.
  • The proposed system presents an improvement with respect to previous works, as our approach increases the navigation speed of the robot while improving the success rate of object recognition.
Carrying out this solution is a challenge due to the complexity involved in establishing communication between two systems so that they interact as a single system, developing navigation algorithms, and generating object detection models and integrating them into those algorithms. The experiments carried out both in simulation and in real time allowed us to verify the potential of the solution.
This article is organized as follows: Section 2 defines the problem and the solution approach. Section 3 shows the final implementation of the system and the experiments carried out to verify the scope of the system. Section 4 presents conclusions and possible future improvements.

2. Problem Statement and Proposed Solution

An autonomous DDMR requires sensors that deliver essential information about its environment so that it can navigate safely. One of the most popular in industry is the LiDAR sensor [40], which, based on lasers, measures the distance to various points in the environment and creates a map of objects and surfaces; it is very accurate but expensive, and the controller requires more computational resources to process the map. A less expensive option, which still uses considerable computational resources, is computer vision for guided path tracking and real-time object detection for decision making, based on a camera, segmentation algorithms, and a detection model.
The work presented in [41] uses the platform of [30] to test navigation control based on trajectory tracking and traffic sign detection, where the Khepera sends an image of the road to a server, which runs the algorithm to process it. This algorithm included both trajectory tracking using OpenCV and sign detection using Haar Cascade, a basic ML technique used to determine whether or not something in an image belongs to a specific class. After processing these images, the server uses EJS to send the new linear and angular velocity commands to the Khepera. This approach obtained an average success rate of 88.8% for the traffic sign detection model and a maximum speed of 2.7 cm/s for the trajectory tracking, probably due to the use of Haar Cascade models to perform the entire object detection task. Moreover, due to the number of separate components and programs it comprises, it is less flexible to modify and extend with new functionalities.
The trajectory tracking and traffic sign detection functionalities can instead be integrated into a single embedded system using one model, which significantly reduces the complexity of the whole system and allows the solution to scale to future improvements as required by the programmer; these tasks can be executed with the Jetson Xavier NX.

2.1. Kinematic Model

The kinematic model of a differential robot is the mathematical modeling of its movement without considering the external forces involved. It can estimate with high precision the position and orientation of the mobile robot, considering the following:
  • The robot moves on a flat surface.
  • The wheel slip must be negligible.
  • The robot is rigid with no flexible parts.
  • There are internal control loops to achieve the commanded speeds.
These conditions are met by the Khepera IV, which makes it possible to use the model to describe its motion through space, seeking to control its linear velocity (V) and angular velocity (ω). The equations that define the differential kinematic model are

$$V = \frac{(\omega_r + \omega_l)\,R}{2}, \qquad (1)$$
$$\omega = \frac{(\omega_r - \omega_l)\,R}{2L}, \qquad (2)$$

where ω_r and ω_l are the angular velocities of each robot motor, R is the radius of the wheels, and L is the distance between the wheels. Solving for ω_r and ω_l, we obtain

$$\omega_r = \frac{2V + \omega L}{2R}, \qquad (3)$$
$$\omega_l = \frac{2V - \omega L}{2R}. \qquad (4)$$

The proposed navigation control will use the kinematic model to make estimates and send ω_r and ω_l to the Khepera IV so that it adjusts them in real time. For the Khepera IV to move from a starting point to an endpoint, the controller must synchronously vary the angular velocities of its wheels to form an efficient trajectory, as shown in Figure 1.
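For illustration, Equations (3) and (4) can be implemented directly as a short Python function; the wheel radius and axle length below are approximate placeholder values rather than the exact Khepera IV parameters.

```python
def wheel_speeds(v, omega, wheel_radius=0.021, axle_length=0.105):
    """Convert a commanded linear velocity v (m/s) and angular velocity omega (rad/s)
    into right/left wheel angular velocities (rad/s), following Equations (3) and (4)."""
    w_r = (2 * v + omega * axle_length) / (2 * wheel_radius)
    w_l = (2 * v - omega * axle_length) / (2 * wheel_radius)
    return w_r, w_l

# Example: advance at 0.10 m/s while turning at 0.5 rad/s
print(wheel_speeds(0.10, 0.5))
```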

2.2. Path Tracking

With the Jetson Xavier NX, it is possible to create scripts in Python, an open-source programming language that is well supported with libraries and functionalities. Among the available libraries is OpenCV, which is dedicated to image processing and computer vision. For path tracking, a camera is installed pointing forward at the front of the Khepera IV. It has a viewing angle of 60°, so it captures more than just the sector of the path to follow; therefore, before running the algorithm, the image is cropped to show only the portion of the path directly in front of the robot (a sample of the path). The controller then performs the following steps using OpenCV:
  • Filter the areas that are not the color of the road because they will not be analyzed.
  • Select the area of the road color with the largest area.
  • Identify the center of that region and take its x coordinate (this is the direction along which the error varies).
  • Calculate the error between the location of the center of the road and the center of the camera (e); this is normalized to take values between 0 and 1.
  • Estimate the new angular speed (ω) needed to adjust the direction of the Khepera, as well as the linear speed (V) needed for its advance, using the following equations:

$$\omega_n = \omega \cdot e \cdot k_p, \qquad (5)$$
$$v_n = v \cdot (1 - |e|), \qquad (6)$$

where ω_n and v_n are the updated values of the angular and linear velocities, respectively. Equation (5) implements a P controller responsible for adjusting the angular speed of the Khepera according to the value of e. The gain k_p was adjusted to 0.9 after several experiments. Equation (6) couples the linear velocity to the angular correction through the error: if the calculated error e is very large, the linear velocity tends to 0; otherwise, it tends to its nominal value. This ensures that the orientation adjustment has priority over the forward motion, which is important in path tracking. Figure 2 illustrates the route tracking conceptual model and the results obtained by implementing it in the controller.
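A minimal sketch of these steps using OpenCV is given below; the road color range, the cropped region, and the nominal speeds are illustrative assumptions that would need to be tuned for the real platform.

```python
import cv2
import numpy as np

K_P = 0.9          # proportional gain (Equation (5))
V_NOM = 0.10       # nominal linear velocity (m/s), assumed
W_NOM = 1.0        # nominal angular velocity (rad/s), assumed

def path_tracking_step(frame_bgr, lower_hsv=(0, 0, 0), upper_hsv=(180, 80, 80)):
    """Return (v_n, w_n) from one camera frame. The HSV range is a placeholder
    for the road color and must be tuned for the real platform."""
    # Keep only the lower part of the image (the road just in front of the robot)
    h, w, _ = frame_bgr.shape
    roi = frame_bgr[int(0.6 * h):, :]

    # 1. Filter out everything that is not road-colored
    hsv = cv2.cvtColor(roi, cv2.COLOR_BGR2HSV)
    mask = cv2.inRange(hsv, np.array(lower_hsv), np.array(upper_hsv))

    # 2. Select the largest road-colored region
    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    if not contours:
        return 0.0, 0.0                      # no road found: stop
    largest = max(contours, key=cv2.contourArea)

    # 3. Center of that region (x coordinate only)
    m = cv2.moments(largest)
    if m["m00"] == 0:
        return 0.0, 0.0
    cx = m["m10"] / m["m00"]

    # 4. Normalized error between road center and image center
    #    (here in [-1, 1]; the sign encodes the turning direction, an assumption of this sketch)
    e = (cx - w / 2) / (w / 2)

    # 5. Equations (5) and (6): steer proportionally, slow down on large errors
    w_n = W_NOM * e * K_P
    v_n = V_NOM * (1 - abs(e))
    return v_n, w_n
```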

2.3. Object Detection

When using ML or DL models, it is possible to create models from scratch; however, certain architectures (neural network layer structures) that achieve good results with a reduced use of resources are usually employed, and these are continuously studied and improved, producing new versions. Among these architectures are MobileNet SSD, Faster R-CNN, and YOLO. YOLOv5 [42,43] was selected for its performance in object detection in real environments and its support for generating models using transfer learning, which helps reduce training time because the architecture is not trained from scratch. This architecture is based on PyTorch, a framework compatible with Python and the Jetson Xavier NX.
Transfer learning (TL) is used to create new models by taking a previously trained model (trained with a large dataset) and re-training it with a custom (and possibly smaller) dataset. In this way, the ability to recognize patterns or features is inherited from the first model, and only the custom dataset is needed, which is more efficient than training the model from scratch. In the case of YOLOv5, its models are pre-trained with the COCO dataset, which has more than 330,000 images labeled with 80 different categories. Another advantage of the architecture is that it has different versions that can be selected according to the available computational resources; for microcomputers, YOLOv5s is recommended.
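As a quick feasibility check, the pre-trained YOLOv5s model can be loaded through torch.hub and run on an arbitrary image before any transfer learning; the image path in the sketch below is a placeholder.

```python
import torch

# Download the small YOLOv5 variant pre-trained on the COCO dataset (80 classes)
model = torch.hub.load("ultralytics/yolov5", "yolov5s", pretrained=True)

# Run inference on a sample image (placeholder path) and print the detections
results = model("test_image.jpg")
results.print()
```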
The prototype of the robotic platform was created using CoppeliaSim. This allowed repetitive tests to verify the trajectory tracking and adjust the controller before testing it in the Khepera IV. Later, within the same environment, the traffic signal models to be detected were created. They are shown in Figure 3. Table 1 shows the list of traffic signs to be detected. The first column has the assigned name, the second column describes the task that the navigation system will perform when detected, and the last column shows its corresponding view.
Then, the robotic platform was used to generate the dataset of the traffic signs to be detected (custom dataset). To do this, groups of 4 traffic signs were placed at strategic points, and their positions were varied randomly between experiments (3 were carried out). Finally, the Khepera, running the navigation control for path tracking, traveled the platform taking captures such as the one shown in Figure 4. A total of 3 videos were recorded at 6 fps with a duration of 30 s each. Decomposing them into frames yielded 540 images, from which the best 135 were manually selected and labeled (indicating where the traffic signs were located and of what type they were) in the YOLOv5 format, using the RoboFlow online tool.
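The decomposition of the recorded videos into frames can be done with a few lines of OpenCV, as in the sketch below; the video file names are placeholders, and the selection and labeling of the 135 best frames were performed manually in RoboFlow.

```python
import cv2
import os

def video_to_frames(video_path, out_dir):
    """Split a recorded run (e.g., 30 s at 6 fps) into numbered JPEG frames."""
    os.makedirs(out_dir, exist_ok=True)
    cap = cv2.VideoCapture(video_path)
    idx = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        cv2.imwrite(os.path.join(out_dir, f"frame_{idx:04d}.jpg"), frame)
        idx += 1
    cap.release()
    return idx

# Hypothetical usage for the three recorded runs
for i in range(1, 4):
    n = video_to_frames(f"run_{i}.mp4", f"frames_run_{i}")
    print(f"run_{i}: {n} frames")
```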
With the custom dataset ready, we proceeded to apply TL to the YOLOv5 model pre-trained with the COCO dataset. This model has in-depth knowledge of pattern recognition and feature detection in images. To apply TL, the convolutional layers of the model, where this knowledge is encoded, were kept, and the weights of the output layers were re-initialized and trained with the traffic sign dataset. This produces a model whose task is the detection of 6 classes of traffic signs. The process is illustrated in Figure 5.
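In practice, this transfer learning step can be launched with the training script provided in the YOLOv5 repository; the following sketch is indicative only, with placeholder paths and hyperparameters (dataset YAML, image size, epochs, number of frozen layers) rather than the exact configuration used in this work.

```python
import subprocess

# Indicative call to the YOLOv5 repository's train.py (run from inside the cloned repo).
# All values below are placeholders for illustration.
subprocess.run([
    "python", "train.py",
    "--img", "640",                    # input image size
    "--batch", "16",
    "--epochs", "100",
    "--data", "traffic_signs.yaml",    # custom dataset exported from RoboFlow
    "--weights", "yolov5s.pt",         # COCO pre-trained small model
    "--freeze", "10",                  # keep the backbone layers frozen (transfer learning)
], check=True)
```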
For this training, it was necessary to use a server with the following characteristics:
  • CPU: Intel Core i9-10900X.
  • GPU: NVIDIA RTX 2080 with 8 GB of GDDR6 memory.
  • RAM: 64 GB.
  • Storage: 1 TB.
At the end of the training and validation, an object detection model was obtained with a confusion matrix showing 100% success for each class. Figure 6 shows examples of the traffic sign detection model working in real time.

3. Final Implementation of Navigation Control and Experimentation

First, the conceptual model of the components that make up the solution was established. This is shown in Figure 7a while Figure 7b shows its physical implementation.
Then, all the algorithms proposed in the solution were integrated into a single navigation control algorithm, shown in Figure 8a: the Jetson Xavier NX establishes the connection with the Khepera IV via TCP socket, and then the navigation control keeps running, following trajectories until traffic signs are detected and the corresponding actions are performed. These traffic signs must be at a distance of 20 cm (or less) and located on the right side of the trajectory; the distance between the mobile robot and a traffic sign is estimated from the area of its detection bounding box.
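A hedged sketch of this decision step is given below: a detection whose bounding-box area exceeds a calibrated threshold is treated as a sign within roughly 20 cm, and the action from Table 1 is applied. The area threshold, the custom weights path, and the chosen speed values are assumptions for illustration, not the exact values used on the robot.

```python
import torch

# Custom traffic-sign model produced by transfer learning (path is a placeholder)
model = torch.hub.load("ultralytics/yolov5", "custom", path="best.pt")

AREA_THRESHOLD = 9000.0  # bbox area (px^2) roughly corresponding to 20 cm; must be calibrated

def sign_action(frame, v, w):
    """Return possibly modified (v, w) according to the closest detected traffic sign."""
    det = model(frame).xyxy[0]          # tensor of [x1, y1, x2, y2, conf, class]
    for x1, y1, x2, y2, conf, cls in det.tolist():
        area = (x2 - x1) * (y2 - y1)
        if area < AREA_THRESHOLD:
            continue                     # sign still too far away
        name = model.names[int(cls)]
        if name == "v120":
            v = 0.10                     # maximum speed
        elif name == "v60":
            v = 0.05                     # half maximum speed
        elif name == "stop":
            v, w = 0.0, 0.0              # stop (the caller handles the pause)
        elif name in ("left", "right"):
            w = 1.0 if name == "left" else -1.0
        elif name == "pucv":
            v, w = 0.0, 0.0              # arrival point: end of route
    return v, w
```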
In parallel, the main Khepera program, shown in Figure 8b, is executed; after establishing the connection via TCP socket, it keeps polling for available data with which to update the angular velocities of its wheels.
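On the robot side, this loop can be sketched as a small TCP client that polls the socket for new wheel speeds. The port, the message format (comma-separated wheel speeds terminated by a newline), and the set_wheel_speeds() call are illustrative assumptions; the actual Khepera motor API is not reproduced here.

```python
import socket

def khepera_velocity_loop(host, port=50000):
    """Receive 'w_r,w_l' messages over TCP and apply them to the wheels (hypothetical protocol)."""
    with socket.create_connection((host, port)) as sock:
        sock.settimeout(0.05)            # keep polling even if no data is available
        buffer = ""
        while True:
            try:
                chunk = sock.recv(64).decode()
            except socket.timeout:
                continue                  # no new data: keep the last commanded speeds
            if not chunk:
                break                     # connection closed by the Jetson
            buffer += chunk
            while "\n" in buffer:
                line, buffer = buffer.split("\n", 1)
                w_r, w_l = (float(x) for x in line.split(","))
                set_wheel_speeds(w_r, w_l)   # hypothetical call to the Khepera motor API

def set_wheel_speeds(w_r, w_l):
    print(f"wheel speeds: right={w_r:.2f}, left={w_l:.2f}")  # stand-in for the real actuator call
```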
A robotic platform was built with a workspace of 1.5 × 1.2 m², on which traffic signs can be placed to create different scenarios. Figure 9a shows a scenario within the CoppeliaSim simulator, while Figure 9b shows its implementation on the real platform.

3.1. Scene 1

Scene 1 is shown in Figure 10. It consists of the robot following the trajectory, and every time it detects a traffic sign 20 cm away, the system stops for 2 s without executing the sign action. This is done to measure the success rate of the robot navigation control. For the 10 attempts made both in simulation and in real life, 100% success was obtained. A light blue line is drawn to indicate the trajectory that the robot must follow, and the x–y axes are also placed to show the reference of the robot within its workspace. The same applies to the next test scenes.
Figure 11 shows the behavior of the speeds according to the detected signal, with Figure 11a showing the behavior of the navigation control in simulation and Figure 11b for the real test. Both cases are very similar and demonstrate that the vehicle navigates at a maximum linear velocity of 10 cm/s. Figure 12a shows the trajectory generated until the “pucv” signal is detected, which indicates the end of the route. Figure 12b shows the distance between the mobile robot and the point of arrival, the traffic signal “pucv”, while navigating the route for 60 s.

3.2. Scene 2

Scene 2 is shown in Figure 13. It consists of the robot taking the action corresponding to the traffic signal detected at 20 cm while it is traversing the trajectory. For the 10 attempts in this scenario, a 100% success rate was obtained.
Figure 14a shows the speed changes according to the traffic signal detected, at 20 cm, within the CoppeliaSim simulation. Figure 14b shows the same but for the real implementation. Figure 15a shows the trajectory generated until reaching the end of the route when detecting the “pucv” signal while Figure 15b shows the distance between the mobile robot and the point of arrival, the traffic signal “pucv”, while navigating the route for 30 s.

3.3. Scene 3

Scene 3 is shown in Figure 16. Now, the robot follows the trajectory, and when a traffic sign is found, it carries out the corresponding action. For the 10 attempts of this experiment, 80% success was obtained, because on 2 occasions certain turns left the robot away from the center of the road, and while it was re-adjusting to the center, it did not adequately detect one of the traffic signs.
Figure 17a shows the speed changes according to the traffic signal detected at 20 cm within the simulation in CoppeliaSim, and Figure 17b shows the same but for the real implementation. Figure 18a shows the trajectory generated until reaching the end of the route when detecting the “pucv” signal while Figure 18b shows the distance between the mobile robot and the point of arrival, the traffic signal “pucv”, while navigating the route for 45 s.
Looking at the results of the experiments, it can be observed that the maximum linear velocity was maintained at 10 cm/s, the average hit rate for object detection was 100%, and the average hit rate for navigation control was 93%. This is summarized in Table 2. These results show an improvement over those obtained in [41].
Table 3 includes the overall performance of both the simulation and real tests in comparison with an ideal motion. This optimal motion consists of following the same trajectories in terms of paths and traffic signs but considering perfect detection, identification, and control. Each scene includes a column with the accumulated absolute error (AAE) with respect to the ideal motion and a column with the total time counted from the start to the reaching of the target. The AAE is calculated as follows:

$$AAE = \sum_{k=1}^{N} \big( |x(k) - x_T(k)| + |y(k) - y_T(k)| \big), \qquad (7)$$

where (x_T, y_T) is the ideal trajectory, (x, y) is the trajectory under study, and N is the number of sample points.
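Equation (7) can be computed directly from the logged and ideal trajectories, as in the short sketch below; the sample points are toy values for illustration only.

```python
def accumulated_absolute_error(traj, traj_ideal):
    """AAE between a logged trajectory and the ideal one, both lists of (x, y) samples."""
    return sum(abs(x - xt) + abs(y - yt)
               for (x, y), (xt, yt) in zip(traj, traj_ideal))

# Toy example with three sample points
ideal = [(0.0, 0.0), (0.1, 0.0), (0.2, 0.0)]
logged = [(0.0, 0.01), (0.11, 0.0), (0.2, -0.02)]
print(accumulated_absolute_error(logged, ideal))  # approximately 0.04
```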
On the one hand, ideal behavior in all scenarios takes less time, which is to be expected. On the other hand, the comparison of the AAE between the simulated and real tests demonstrates the similarity of both test outcomes, which is also predictable because of the good quality of the simulator engine.
To complement the previous results, a more complex scenario, with more traffic signs and longer paths, was tested in simulation. No real test was performed, since the previous figures and results, summarized in Table 3, indicate a close similarity between the simulation and real outcomes.
Figure 19 shows the arena where several different traffic signs and paths are used for this case. This scenario was designed based on [41], replicating the scenario shown in Figure 14 of that work. The results of this test are shown in Figure 20.
Figure 20a shows the behavior of the linear and angular speeds of the robot navigating throughout the scenario. It reaches a maximum displacement speed of 10 cm/s and a low trajectory tracking error, as illustrated in Figure 20b, taking 120 s to complete the journey.

4. Conclusions

In this work, the development of a system that integrates a mobile robot (Khepera IV) and an embedded system (Jetson Xavier NX) was presented. The latter adds functionalities to the Khepera IV in such a way that its navigation potential is fully used and improved with the computational resources of the Jetson Xavier NX board. This solution benefits from the use of the Python programming language, which is supported by computer vision libraries and frameworks for the use of object detection models.
To verify the operation of the system, a robotic platform was built, and the CoppeliaSim simulator was used as a complement. Both were used to carry out experiments and analyze the response of the navigation control of the Khepera IV in terms of maximum displacement speed, object detection hit rate, and navigation control hit rate. The obtained results surpass previous works, contributing to the state of the art in the use of embedded systems to provide functionalities to mobile robots.
The comparison between the operation of the navigation control in the simulator and in the real environment showed very similar results, which establishes high confidence in using the simulator to test and modify any algorithm before running it on the real mobile robot, thus extending its useful life. The differences observed in the comparison are due to imperfections in the real environment, both in the base of the platform and in the structure of the Khepera IV, which cause friction during navigation that is not captured by the simulator.
The experimental tests allowed us to observe the strengths and limitations of the system's algorithms, showing that the navigation control needs improvement, specifically in some turns where the robot moved away from the center of the road and missed traffic signs. For future work, the use of advanced control techniques is recommended, allowing the Khepera IV to move at higher speeds while following the trajectory with lower overall error and minimizing the error in detecting traffic signs.

Author Contributions

Conceptualization, A.M. and G.F.; methodology, G.G., A.M., G.F. and E.F.; software and hardware, A.M.; validation, E.F. and E.A.-E.; formal analysis, A.M., G.G. and G.F.; investigation, G.G., A.M. and G.F.; resources, G.F.; data curation, G.G. and A.M.; writing—original draft, A.M., G.G., G.F., E.A.-E. and E.F.; writing—review and editing, G.G., A.M., G.F., E.A.-E. and E.F.; visualization, G.G. and A.M.; supervision, G.F. and E.F.; project administration, G.F. and E.F.; funding acquisition, G.F., E.A.-E. and E.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by the Chilean Research and Development Agency (ANID) under Project FONDECYT 1191188, the National University of Distance Education under Projects 2021V/-TAJOV/00 and OPTIVAC 096-034091 2021V/PUNED/008, and the Ministry of Science and Innovation of Spain under Projects PID2019-108377RB-C32 and PID2020-112658RB-I00.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

Thanks to Julio Rivera from SPOT, Chile, for his technical support in building the prototype for this work.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Lee, I. Service Robots: A Systematic Literature Review. Electronics 2021, 10, 2658. [Google Scholar] [CrossRef]
  2. Montoya-Cavero, L.E.; Díaz de León Torres, R.; Gómez-Espinosa, A.; Escobedo Cabello, J.A. Vision systems for harvesting robots: Produce detection and localization. Comput. Electron. Agric. 2022, 192, 106562. [Google Scholar] [CrossRef]
  3. Kim, J.; Mishra, A.K.; Limosani, R.; Scafuro, M.; Cauli, N.; Santos-Victor, J.; Mazzolai, B.; Cavallo, F. Control strategies for cleaning robots in domestic applications: A comprehensive review. Int. J. Adv. Robot. Syst. 2019, 16, 1729881419857432. [Google Scholar] [CrossRef] [Green Version]
  4. Sun, Z.; Xie, H.; Zheng, J.; Man, Z.; He, D. Path-following control of Mecanum-wheels omnidirectional mobile robots using nonsingular terminal sliding mode. Mech. Syst. Signal Process. 2021, 147, 107128. [Google Scholar] [CrossRef]
  5. Chen, G.; Yang, X.; Xu, Y.; Lu, Y.; Hu, H. Neural network-based motion modeling and control of water-actuated soft robotic fish. Smart Mater. Struct. 2022, 32, 015004. [Google Scholar] [CrossRef]
  6. Chen, G.; Xu, Y.; Yang, C.; Yang, X.; Hu, H.; Chai, X.; Wang, D. Design and control of a novel bionic mantis shrimp robot. IEEE/ASME Trans. Mechatron. 2023. early access. [Google Scholar] [CrossRef]
  7. Rubio, F.; Valero, F.; Llopis-Albert, C. A review of mobile robots: Concepts, methods, theoretical framework, and applications. Int. J. Adv. Robot. Syst. 2019, 16, 1729881419839596. [Google Scholar] [CrossRef] [Green Version]
  8. Montenegro, G.; Chacón, R.; Fabregas, E.; Garcia, G.; Schröder, K.; Marroquín, A.; Dormido-Canto, S.; Farias, G. Modeling and Control of a Spherical Robot in the CoppeliaSim Simulator. Sensors 2022, 22, 6020. [Google Scholar] [CrossRef]
  9. Schröder, K.; Garcia, G.; Chacón, R.; Montenegro, G.; Marroquín, A.; Farias, G.; Dormido-Canto, S.; Fabregas, E. Development and Control of a Real Spherical Robot. Sensors 2023, 23, 3895. [Google Scholar] [CrossRef]
  10. Stefek, A.; Pham, T.V.; Krivanek, V.; Pham, K.L. Energy Comparison of Controllers Used for a Differential Drive Wheeled Mobile Robot. IEEE Access 2020, 8, 170915–170927. [Google Scholar] [CrossRef]
  11. Fabregas, E.; Farias, G.; Aranda-Escolástico, E.; Garcia, G.; Chaos, D.; Dormido-Canto, S.; Bencomo, S.D. Simulation and experimental results of a new control strategy for point stabilization of nonholonomic mobile robots. IEEE Trans. Ind. Electron. 2019, 67, 6679–6687. [Google Scholar] [CrossRef]
  12. Moysiadis, V.; Tsolakis, N.; Katikaridis, D.; Sørensen, C.G.; Pearson, S.; Bochtis, D. Mobile Robotics in Agricultural Operations: A Narrative Review on Planning Aspects. Appl. Sci. 2020, 10, 3453. [Google Scholar] [CrossRef]
  13. Gravina, R.; Alinia, P.; Ghasemzadeh, H.; Fortino, G. Multi-sensor fusion in body sensor networks: State-of-the-art and research challenges. Inf. Fusion 2017, 35, 68–80. [Google Scholar] [CrossRef]
  14. Sun, J.; Zhao, J.; Hu, X.; Gao, H.; Yu, J. Autonomous Navigation System of Indoor Mobile Robots Using 2D Lidar. Mathematics 2023, 11, 1455. [Google Scholar] [CrossRef]
  15. Mújica-Vargas, D.; Vela-Rincón, V.; Luna-Álvarez, A.; Rendón-Castro, A.; Matuz-Cruz, M.; Rubio, J. Navigation of a Differential Wheeled Robot Based on a Type-2 Fuzzy Inference Tree. Machines 2022, 10, 660. [Google Scholar] [CrossRef]
  16. Kim, J.; Kim, B.K. Cornering Trajectory Planning Avoiding Slip for Differential-Wheeled Mobile Robots. IEEE Trans. Ind. Electron. 2020, 67, 6698–6708. [Google Scholar] [CrossRef]
  17. Dell’Agnola, F.; Jao, P.K.; Arza, A.; Chavarriaga, R.; Millán, J.d.R.; Floreano, D.; Atienza, D. Machine-Learning Based Monitoring of Cognitive Workload in Rescue Missions With Drones. IEEE J. Biomed. Health Inform. 2022, 26, 4751–4762. [Google Scholar] [CrossRef] [PubMed]
  18. Abdelwahab, M.; Parque, V.; Fath Elbab, A.M.R.; Abouelsoud, A.A.; Sugano, S. Trajectory Tracking of Wheeled Mobile Robots Using Z-Number Based Fuzzy Logic. IEEE Access 2020, 8, 18426–18441. [Google Scholar] [CrossRef]
  19. Štefek, A.; Pham, V.T.; Krivanek, V.; Pham, K.L. Optimization of Fuzzy Logic Controller Used for a Differential Drive Wheeled Mobile Robot. Appl. Sci. 2021, 11, 6023. [Google Scholar] [CrossRef]
  20. Babunski, D.; Berisha, J.; Zaev, E.; Bajrami, X. Application of Fuzzy Logic and PID Controller for Mobile Robot Navigation. In Proceedings of the 2020 9th Mediterranean Conference on Embedded Computing (MECO), Budva, Montenegro, 8–11 June 2020. [Google Scholar] [CrossRef]
  21. Luis, C.E.; Vukosavljev, M.; Schoellig, A.P. Online Trajectory Generation With Distributed Model Predictive Control for Multi-Robot Motion Planning. IEEE Robot. Autom. Lett. 2020, 5, 604–611. [Google Scholar] [CrossRef] [Green Version]
  22. Yudha, H.M.; Dewi, T.; Hasana, N.; Risma, P.; Oktarini, Y.; Kartini, S. Performance Comparison of Fuzzy Logic and Neural Network Design for Mobile Robot Navigation. In Proceedings of the 2019 International Conference on Electrical Engineering and Computer Science (ICECOS), Batam, Indonesia, 2–3 October 2019. [Google Scholar] [CrossRef]
  23. Wang, T.; Chen, B.; Zhang, Z.; Li, H.; Zhang, M. Applications of machine vision in agricultural robot navigation: A review. Comput. Electron. Agric. 2022, 198, 107085. [Google Scholar] [CrossRef]
  24. Jiang, H.; Wang, H.; Yau, W.Y.; Wan, K.W. A Brief Survey: Deep Reinforcement Learning in Mobile Robot Navigation. In Proceedings of the 2020 15th IEEE Conference on Industrial Electronics and Applications (ICIEA), Kristiansand, Norway, 9–13 November 2020. [Google Scholar] [CrossRef]
  25. Pokle, A.; Martín-Martín, R.; Goebel, P.; Chow, V.; Ewald, H.M.; Yang, J.; Wang, Z.; Sadeghian, A.; Sadigh, D.; Savarese, S.; et al. Deep Local Trajectory Replanning and Control for Robot Navigation. In Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 20–24 May 2019. [Google Scholar] [CrossRef] [Green Version]
  26. De Micco, L.; Vargas, F.L.; Fierens, P.I. A Literature Review on Embedded Systems. IEEE Lat. Am. Trans. 2020, 18, 188–205. [Google Scholar] [CrossRef]
  27. Branco, S.; Ferreira, A.G.; Cabral, J. Machine Learning in Resource-Scarce Embedded Systems, FPGAs, and End-Devices: A Survey. Electronics 2019, 8, 1289. [Google Scholar] [CrossRef] [Green Version]
  28. Karalekas, G.; Vologiannidis, S.; Kalomiros, J. EUROPA: A Case Study for Teaching Sensors, Data Acquisition and Robotics via a ROS-Based Educational Robot. Sensors 2020, 20, 2469. [Google Scholar] [CrossRef] [PubMed]
  29. Ajani, T.S.; Imoize, A.L.; Atayero, A.A. An Overview of Machine Learning within Embedded and Mobile Devices–Optimizations and Applications. Sensors 2021, 21, 4412. [Google Scholar] [CrossRef]
  30. Farias, G.; Fabregas, E.; Peralta, E.; Vargas, H.; Dormido-Canto, S.; Dormido, S. Development of an Easy-to-Use Multi-Agent Platform for Teaching Mobile Robotics. IEEE Access 2019, 7, 55885–55897. [Google Scholar] [CrossRef]
  31. Farias, G.; Fabregas, E.; Peralta, E.; Vargas, H.; Hermosilla, G.; Garcia, G.; Dormido, S. A Neural Network Approach for Building An Obstacle Detection Model by Fusion of Proximity Sensors Data. Sensors 2018, 18, 683. [Google Scholar] [CrossRef] [Green Version]
  32. Farias, G.; Garcia, G.; Montenegro, G.; Fabregas, E.; Dormido-Canto, S.; Dormido, S. Reinforcement Learning for Position Control Problem of a Mobile Robot. IEEE Access 2020, 8, 152941–152951. [Google Scholar] [CrossRef]
  33. Fabregas, E.; Farias, G.; Peralta, E.; Vargas, H.; Dormido, S. Teaching control in mobile robotics with V-REP and a Khepera IV library. In Proceedings of the 2016 IEEE conference on Control Applications (CCA), Buenos Aires, Argentina, 19–22 September 2016; pp. 821–826. [Google Scholar]
  34. Peralta, E.; Fabregas, E.; Farias, G.; Vargas, H.; Dormido, S. Development of a Khepera IV Library for the V-REP Simulator. IFAC-PapersOnLine 2016, 49, 81–86. [Google Scholar] [CrossRef]
  35. Farias, G.; Fabregas, E.; Peralta, E.; Torres, E.; Dormido, S. A Khepera IV library for robotic control education using V-REP. IFAC-PapersOnLine 2017, 50, 9150–9155. [Google Scholar] [CrossRef]
  36. Farias, G.; Garcia, G.; Montenegro, G.; Fabregas, E.; Dormido-Canto, S.; Dormido, S. Position control of a mobile robot using reinforcement learning. IFAC-PapersOnLine 2020, 53, 17393–17398. [Google Scholar] [CrossRef]
  37. Shukla, N.; Fricklas, K. Machine Learning with TensorFlow; Manning: Greenwich, UK, 2018. [Google Scholar]
  38. Paszke, A.; Gross, S.; Massa, F.; Lerer, A.; Bradbury, J.; Chanan, G.; Killeen, T.; Lin, Z.; Gimelshein, N.; Antiga, L.; et al. Pytorch: An imperative style, high-performance deep learning library. Adv. Neural Inf. Process. Syst. 2019, 32, 1–12. [Google Scholar]
  39. Jia, Y.; Shelhamer, E.; Donahue, J.; Karayev, S.; Long, J.; Girshick, R.; Guadarrama, S.; Darrell, T. Caffe: Convolutional architecture for fast feature embedding. In Proceedings of the 22nd ACM International Conference on Multimedia, Orlando, FL, USA, 3–7 November 2014; pp. 675–678. [Google Scholar]
  40. Tee, Y.K.; Han, Y.C. Lidar-Based 2D SLAM for Mobile Robot in an Indoor Environment: A Review. In Proceedings of the 2021 International Conference on Green Energy, Computing and Sustainable Technology (GECOST), Miri, Malaysia, 7–9 July 2021; pp. 1–7. [Google Scholar] [CrossRef]
  41. Farias, G.; Fabregas, E.; Torres, E.; Bricas, G.; Dormido-Canto, S.; Dormido, S. A Distributed Vision-Based Navigation System for Khepera IV Mobile Robots. Sensors 2020, 20, 5409. [Google Scholar] [CrossRef] [PubMed]
  42. Yang, G.; Feng, W.; Jin, J.; Lei, Q.; Li, X.; Gui, G.; Wang, W. Face Mask Recognition System with YOLOV5 Based on Image Recognition. In Proceedings of the 2020 IEEE 6th International Conference on Computer and Communications (ICCC), Chengdu, China, 11–14 December 2020; pp. 1398–1404. [Google Scholar] [CrossRef]
  43. Güney, E.; Bayilmiş, C.; Çakan, B. An Implementation of Real-Time Traffic Signs and Road Objects Detection Based on Mobile GPU Platforms. IEEE Access 2022, 10, 86191–86203. [Google Scholar] [CrossRef]
Figure 1. Displacement of the differential robot.
Figure 2. Path tracking algorithm: (a) Conceptual model of the algorithm. (b) Algorithm working in Khepera navigation controller.
Figure 3. Traffic signs for object detection.
Figure 4. Using CoppeliaSim to generate traffic signal dataset.
Figure 5. Transfer learning process: (A) Pre-trained model generation. (B) Using pre-trained model knowledge to generate a custom model.
Figure 6. Controller view when using real-time traffic sign detection model.
Figure 7. Proposed solution: (a) Conceptual model of the system. (b) The real implementation of the system.
Figure 8. System algorithm: (a) Jetson Xavier NX algorithm. (b) Khepera algorithm.
Figure 9. Robotic platform for testing computer vision algorithm: (a) CoppeliaSim simulation view. (b) Real implementation view.
Figure 10. First test scenario.
Figure 11. Angular and linear velocities of the Khepera: (a) CoppeliaSim simulation. (b) Real platform.
Figure 12. Results comparison: (a) Khepera position. (b) Distance between Khepera and target.
Figure 13. Second test scenario.
Figure 14. Angular and linear velocities of the Khepera: (a) CoppeliaSim simulation. (b) Real platform.
Figure 15. Results comparison: (a) Khepera position. (b) Distance between Khepera and target.
Figure 16. Third test scenario.
Figure 17. Angular and linear velocities of the Khepera: (a) CoppeliaSim simulation. (b) Real platform.
Figure 18. Results comparison: (a) Khepera position. (b) Distance between Khepera and target.
Figure 19. Comparison scenario.
Figure 20. Simulation results for comparison: (a) Angular and linear velocities of the Khepera. (b) Khepera position.
Table 1. Traffic signs descriptions.

ID    | Action             | Traffic Signal
v120  | maximum speed      | (sign image)
left  | turn left          | (sign image)
v60   | half maximum speed | (sign image)
stop  | stop tour for 3 s  | (sign image)
right | turn right         | (sign image)
pucv  | arrival point      | (sign image)
Table 2. Experiment results.

                  | Maximum Linear Velocity (cm/s) | Object Detection Hit Rate | Navigation Control Hit Rate
Scene 1           | 10                             | 100%                      | 100%
Scene 2           | 10                             | 100%                      | 100%
Scene 3           | 10                             | 100%                      | 80%
Average           | 10                             | 100%                      | 93%
Results from [41] | 2.7                            | 88.8%                     | no data
Table 3. Path tracking error and total time.

           |            Scene 1             |            Scene 2             |            Scene 3
           | Accumulated Error  | Time (s)  | Accumulated Error  | Time (s)  | Accumulated Error  | Time (s)
Ideal      | 0                  | 53        | 0                  | 25        | 0                  | 41
Simulation | 3.7                | 55.5      | 10.6               | 27        | 12.3               | 44.4
Real       | 4.0                | 54.7      | 10.4               | 26.6      | 12.3               | 42.6