1. Introduction
Automated Guided Vehicles (AGVs) are self-navigating systems that operate through a combination of onboard sensors and software algorithms [1]. They can follow paths using a variety of approaches, including laser triangulation, magnetic tapes, grids, wires, and optical guidance. These vehicles do not require a driver or operator, making them ideal for automating repetitive transport tasks in structured environments [1].
The first credited commercially available AGV dates back to 1954, when Arthur Barrett modified a tow truck so that it followed a wire on the floor. He named his invention “Guide-O-Matic”, and it was offered by his company, Barrett Electronics [1]. Back then, AGV applications were limited by the available sensors and computers, but nowadays, AGVs are widely used in manufacturing applications due to their practicality and autonomous operation in dynamic environments [2]. Examples of current AGV applications include transport systems in warehouses and container terminals [3], and crop height measurement in the agricultural field [4].
In contrast, Autonomous Mobile Robots (AMRs) can perceive and interpret their environment, enabling them to navigate more flexibly than AGVs [5]. Unlike AGVs, which rely on fixed guidance infrastructure, AMRs build a virtual model of their surroundings to localize and plan navigation paths dynamically. Under this approach, the system creates a path plan to follow, instead of relying on external guidance like AGVs [6]. Due to their level of autonomy, these systems can transport material (similarly to AGVs) and perform more complex tasks such as surveillance and human–robot collaboration (e.g., with operators) [2].
In the 1990s, many control and navigation methods were implemented in AMR systems. For instance, a navigation method based on estimating a path from a sensed environment was proposed in [7]. Static and moving obstacle avoidance has also been influenced by Fuzzy control [8], which has excelled by facilitating heuristic control, offering stability, and being easy to implement [9]. Other improvements have focused on controlling several AMR systems at the same time, organized hierarchically under traffic control [10]. Advanced approaches in the AGV field have allowed significant improvements in precision in applications that were previously impossible; e.g., Shijian et al. [11] proposed the use of magnetic tracks to increase AGV position accuracy, reducing the error from ±5 mm to ±1.69 mm. This improvement has allowed the implementation of AGV systems in industrial applications that require almost millimeter precision, such as navigating narrow aisles in a warehouse.
In recent years, the development of complex AMR systems has increased. Gatesichapakorn et al. [12] developed a system running the Robot Operating System (ROS), a 2D LiDAR, and an RGB-D camera. In this work, two robot configurations were used to evaluate obstacle-avoidance performance in an indoor environment with a public space and a narrow passage: one running on a Raspberry Pi 3 with a 2D LiDAR, and the other consisting of an Intel NUC, a 2D LiDAR, and an RGB-D camera. Iqbal et al. [4] presented a mobile field robot running ROS and a 2D LiDAR to navigate an occluded crop; the proposed navigation strategy, based on a nodding LiDAR configuration, achieved results similar to commonly used LiDAR configurations. Recent works have integrated reinforcement learning to enhance adaptive navigation and velocity control in AMRs [13]. Others have employed deep Q-learning with transfer learning for robust path planning in dynamic settings [14].
In 2020, due to the COVID-19 pandemic, the idea of using AGVs for hospital applications began to spread widely. The vehicle’s tasks could include monitoring patients’ health parameters such as oxygenation level, temperature, and heart rate; transporting medicine from nurses to patients; and disposing of waste material. One AGV implementation in this context was based on an infrared sensor to follow a wire on the floor and an ultrasonic sensor to avoid collisions [15]. Such implementations demonstrated how functional an AGV system could be when physical contact between people must be avoided, such as in hospital settings. When the first COVID-19 cases started to appear on the American continent in early 2020 [16], several universities around the world took early action to minimize the risk of propagation among students and university collaborators. The first step was to cancel onsite classes and migrate to an online model for academic continuity in early 2020; furthermore, some universities, such as Tecnologico de Monterrey, started to develop service robots as surveillance systems for social distancing [17].
Other universities around the world implemented similar AMR systems focused on the surveillance of aspects such as social distancing, the use of hand sanitizer, and the correct use of face masks. For example, the University of Maryland monitored social distance maintenance with the help of a 2D LiDAR, an RGB-D camera, and closed-circuit television (CCTV) [18]. The Ricardo Palma University in Peru designed “ROHNI-1”, a 3-wheeled robot with two anthropomorphic hands that detects face masks with AI-trained cameras, dispenses hand sanitizer, and monitors vital signs [19]. The Handong Global University in Korea designed a contactless office-assistant service robot capable of using image recognition, user tracking, and voice recognition to offer automated assistive service to students and staff on the university’s campus [20]. While these academic institutions developed successful designs, certain limitations constrain their practical feasibility: many of these solutions remain at the conceptual stage, lack autonomy capability, or offer only simple applications for autonomous mobile platforms.
Outside the context of the pandemic, numerous researchers have proposed AMR systems that solve problems using commonly used algorithms. These solutions employed navigation techniques and advanced algorithms, highlighted relevant problems with common components, or offered comparisons of common SLAM packages. For instance, Köseoğlu et al. [21] designed an AMR running ROS and LiDAR systems; in their work, they highlighted a problem with the relation between the LiDAR’s sample rate and the robot’s speed, as it nonlinearly influenced the mapping quality when running the mapping package in ROS. Shavanas et al. [6] implemented a surveillance system on an AMR running ROS and a LiDAR system capable of mapping interiors, recording suspicious activities, and interacting with humans through a built-in display. Cheng et al. [22] verified the performance of two Simultaneous Localization And Mapping (SLAM) algorithms in a robotic system running ROS and LiDAR. Other authors have developed AMR systems running ROS with LiDARs that use the move_base package with the Adaptive Monte Carlo Localization (AMCL) approach [23,24,25]. The aforementioned works showed innovative navigation solutions and solved complex problems in obstacle avoidance; however, they did not show any advanced applications integrating vision algorithms.
Computer vision has transformed industrial and robotic systems, evolving from basic image processing to advanced real-time surveillance applications [26,27]. In robotics, vision enables object detection, obstacle avoidance, and decision-making based on dynamic environmental understanding [28]. A particularly innovative application in this field is the utilization of neural networks for specialized tasks, such as the detection of face masks, a critical concern in the wake of global health crises [29,30,31]. While these proposals have achieved great results, they remain limited to static and selected images.
This paper builds upon an AMR system of our own design and manufacture, preliminarily introduced in [17], to explore the efficacy of two navigation controllers for real-time health protocol enforcement, such as face mask detection and social distancing. While individual navigation methods and vision systems have been extensively studied, their integration in real-world service robotics presents unique challenges that remain understudied. This work offers a system-level assessment of two widely used navigation methods (Navstack and SFC) in service robots, emphasizing how they integrate with vision-based surveillance components in semi-structured environments. Our contribution lies in detailing how established methods perform, and sometimes struggle, under real deployment conditions, highlighting the practical trade-offs and sensor constraints that emerge when navigation and on-board vision must function together. These insights are particularly valuable for the growing field of service robotics, where the gap between laboratory performance and real-world deployment often determines a system’s practical utility. Through an ANOVA analysis, this work assessed the impact of the velocities of both people and the robot on surveillance accuracy, providing indirect insights into key performance metrics. The findings contribute to a deeper understanding of how navigation strategies and environmental dynamics interact with vision-based surveillance in AMRs, providing insight for future optimizations in real-world applications.
This manuscript is organized as follows: Section 2 introduces the system’s design, manufacture, and integration, including a mechanical, electrical, and software overview. The design of experiments for navigation and smart surveillance purposes is presented in Section 3, whereas the obtained results are shown in Section 4. An in-depth discussion of the results is presented in Section 5. Finally, Section 6 concludes the paper and details future work.
2. Design, Manufacture, and Integration
Building upon the preliminary results showcased in [17], this paper extends the implementation of an AMR system. PiBot, a differential drive mobile robot detailed in Figure 1, supports advanced navigation, smart surveillance, and research into control strategies and human–robot biometrics (temperature, distance, etc.). It demonstrates capabilities such as people counting, face mask validation, temperature measurement, and environmental monitoring in both closed and open scenarios. Notably, the MaskNet model previously tested on PiBot achieved a peak accuracy of 70% for face mask detection in the static environment of the Tecnologico de Monterrey’s library, involving scenarios with pedestrian traffic [32]. This work furthers the exploration of PiBot’s utility in dynamic interaction scenarios, highlighting its potential for real-world surveillance and interaction applications.
Figure 2 illustrates the mechatronic structure, showing the hierarchical organization and interactions among components. This architecture enables the robot to effectively process sensor data and execute both navigation and surveillance tasks.
In the following subsections, various aspects of the system are presented. Section 2.1 details the system’s physical properties, such as its dimensions, weight, and kinematic model. Section 2.2 discusses the hardware, connections, and components’ main tasks. Lastly, Section 2.3 examines the autonomous navigation, control, and vision system algorithms.
2.1. Mechanical Specifications
In this subsection, the mechanical specification of the proposed AMR is presented, focusing on its dimensions, weight, spatial configuration in Cartesian coordinates, kinematic model, odometry characterization, and actuators, providing a comprehensive understanding of the mechanical aspects of the system.
The robot has a rectangular aluminum chassis with a length of 55 cm, a width of 45 cm, a total height of 95 cm, and a wheel diameter of 15 cm. The unloaded vehicle weighs approximately 25 kg. The configuration space of the differential drive mobile robot is given by two Cartesian coordinates, $(x, y) \in \mathbb{R}^2$, defined over the real numbers, and a third coordinate, $\theta \in S^1$, defined only over the angles of the unit circle. This can be represented as $\mathcal{C} = \mathbb{R}^2 \times S^1$, which is a mathematical abstraction and formalism of the physical states that the robot can achieve at any given time. A plan-view schematic of the robot is shown in Figure 3, where the three pose variables are indicated.
2.1.1. Odometry Characterization
The transducers used in the odometry system are four Hall effect encoders, one located at each wheel, which provide incremental ticks for position estimation. Position estimation is carried out using the count of the digital incremental encoders [33]. Given $N$, the number of encoder ticks per wheel revolution, the distance traveled by the robot in Cartesian space can be calculated using the following set of equations:

$$d_L = \frac{2\pi R\,\Delta n_L}{N}, \qquad d_R = \frac{2\pi R\,\Delta n_R}{N}, \qquad d_C = \frac{d_L + d_R}{2}, \qquad \Delta\theta = \frac{d_R - d_L}{L},$$

where $d_C$, $d_L$, and $d_R$ are the distances traveled by the center point of the robot, the left wheel, and the right wheel, respectively, during a given time interval or movement, $L$ is the distance between the wheels of the robot on the same axis, $R$ is the radius of the wheels, and $\Delta n_L$ and $\Delta n_R$ are the increments in the count for the left and right encoders, respectively.
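As an illustration, the following Python sketch applies the odometry relations above to update the pose estimate from encoder tick increments. The ticks-per-revolution value and the starting pose are hypothetical, and projecting the displacement with the mid-interval heading is a common refinement rather than necessarily the exact discretization used on PiBot.

```python
import math

def odometry_update(x, y, theta, dn_left, dn_right,
                    ticks_per_rev=1024, R=0.075, L=0.30):
    """Incremental odometry update from encoder tick counts.

    dn_left, dn_right : tick increments of the left/right encoders
    ticks_per_rev     : encoder ticks per wheel revolution (hypothetical value)
    R, L              : wheel radius and track width in meters
    """
    # Distance rolled by each wheel during the sampling interval
    d_left = 2.0 * math.pi * R * dn_left / ticks_per_rev
    d_right = 2.0 * math.pi * R * dn_right / ticks_per_rev

    # Distance traveled by the robot's center point and change in heading
    d_center = 0.5 * (d_left + d_right)
    d_theta = (d_right - d_left) / L

    # Project the center displacement onto the Cartesian frame
    x += d_center * math.cos(theta + 0.5 * d_theta)
    y += d_center * math.sin(theta + 0.5 * d_theta)
    theta = (theta + d_theta) % (2.0 * math.pi)
    return x, y, theta

# Example: robot starts at the origin and both wheels advance 200 ticks
print(odometry_update(0.0, 0.0, 0.0, 200, 200))
```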
2.1.2. Kinematic Model
The model used for the differential drive mobile robot is presented in state-space form as follows:

$$\dot{x} = \frac{R}{2}\left(\omega_R + \omega_L\right)\cos\theta, \qquad \dot{y} = \frac{R}{2}\left(\omega_R + \omega_L\right)\sin\theta, \qquad \dot{\theta} = \frac{R}{L}\left(\omega_R - \omega_L\right), \qquad (2)$$

where $\dot{x}$ is the linear velocity of the robot in the $x$ direction, $\dot{y}$ is the linear velocity in the $y$ direction, and $\dot{\theta}$ is the angular speed of the model referenced to the inertial frame. The model states depend on $\omega_R$ and $\omega_L$, the angular speeds of the right and left wheels, respectively, which can be computed as

$$\omega_R = \frac{2v + \omega L}{2R}, \qquad \omega_L = \frac{2v - \omega L}{2R},$$

where $v$ and $\omega$ are the linear and angular velocities of the robot’s center point, respectively. These variables depend on the system’s current Cartesian pose. In this paper, the control strategy for the four-wheeled differential drive mobile robot capitalizes on the system’s intrinsic symmetry. Mobile robots with differential steering locomotion and classic wheel configurations can use the same equations for two- and four-wheel drive platforms. This approach is effective due to the similar angular velocities exhibited by both wheels on each side of the robot [34].
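A minimal sketch of these kinematic relations is given below, mapping the center-point velocities to wheel angular speeds and back to the state derivatives. The default R and L values follow the parameters reported later for the controller (0.075 m and 0.30 m); this is an illustrative helper, not the implementation running on PiBot.

```python
import math

def wheel_speeds(v, omega, R=0.075, L=0.30):
    """Map the center-point velocities (v, omega) to wheel angular speeds."""
    omega_right = (2.0 * v + omega * L) / (2.0 * R)
    omega_left = (2.0 * v - omega * L) / (2.0 * R)
    return omega_right, omega_left

def state_derivative(theta, omega_right, omega_left, R=0.075, L=0.30):
    """Differential-drive kinematics in the state-space form of Equation (2)."""
    v = 0.5 * R * (omega_right + omega_left)
    x_dot = v * math.cos(theta)
    y_dot = v * math.sin(theta)
    theta_dot = R * (omega_right - omega_left) / L
    return x_dot, y_dot, theta_dot
```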
2.1.3. Actuators
The physical control actuator is a permanent magnet DC motor with a maximum rotational speed of 200 RPM and a nominal voltage of 12 V. The actuator’s speed is controlled internally by a motor control board, which enables speed control of the actuator as a linear system. The actuator dynamics can be modeled as a first-order system, where the output is the motor’s rotational speed $\omega_m$ (in RPM) and the input is the electric current signal delivered by the motor control board. The actuator control diagram is shown in Figure 4.
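For illustration, a first-order actuator model of this kind can be simulated with a simple Euler step; the gain and time constant below are placeholders, not identified values for the actual motor.

```python
def first_order_step(omega_m, u, dt, K=20.0, tau=0.15):
    """One Euler step of a first-order actuator model:
    tau * d(omega_m)/dt + omega_m = K * u

    omega_m : motor speed (RPM)
    u       : control signal from the motor control board
    K, tau  : illustrative gain and time constant (assumed, not identified)
    """
    return omega_m + dt * (K * u - omega_m) / tau
```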
2.2. Electrical and Electronic Specifications
In this subsection, the hardware components used, together with their interconnections, applications, interfaces, and tasks, are thoroughly explored, providing a comprehensive understanding of the electrical and electronic aspects of the system under investigation.
Hardware and Connections
The system’s components consist of a Jetson TX2 (Nvidia) computer, which plays the role of the main computer running ROS, receives the messages from the different sensors and actuators, and performs computations on the acquired data for decision-making. This computer is connected to a USB Hub, which allows it to establish a serial connection with an Arduino Mega (Arduino), a LiDAR (Slamtec), and a 3D Camera (Intel). Additionally, the system includes a touch screen (ViewSonic) to interact with people, display messages, and function as a Human–Machine Interface (HMI).
The Arduino Mega executes a set of codes that process the information received from the Jetson TX2 to control the motors with a set of Roboclaw motor controllers (Basicmicro). The LiDAR is also connected to the USB Hub and sends data about the local environment and the robot’s estimated localization and orientation to the navigation algorithm (this process is explained in Section 2.3). The 3D camera, which is also connected to the USB Hub, is capable of detecting people and the correct usage of face masks, and of sending messages to the Jetson TX2 to readjust the path plan. The component connection diagram is shown in Figure 5. The main components required for the navigation system are the encoders on each motor, the LiDAR system, the 3D camera, and the Jetson TX2. These components and their locations on the robot are shown in Figure 6. The whole robot is powered by two LiPo batteries: the first is a 4s 14.8 V LiPo battery and is the main power source for the digital and control electronics, while the second is a 6s 22.4 V LiPo battery that energizes the power electronics as well as the motors.
The main hardware component of the vision system is the camera [35]. In this robot, an Intel RealSense Depth Camera D435i was chosen, which includes an IMU (Inertial Measurement Unit) that helps maintain depth perception while the camera moves. Because this vision system works with artificial intelligence, a “digital brain” is required to carry out all the operations and processes necessary to perform the different tasks, which involve very demanding calculations, while offering maximum performance at relatively low power consumption [36]. The robot uses an NVIDIA Jetson TX2 for this purpose, whose robust hardware explains its high performance capacity; the board integrates a Gigabit Ethernet port, WiFi and Bluetooth connectivity, 8 GB of RAM, and 32 GB of eMMC storage. It can also process two 4K video streams at 60 frames per second while simultaneously managing six cameras, and it includes a 64-bit processor and a 256-core Pascal GPU [36].
2.3. Software Specifications
Lastly, in this subsection, the two navigation algorithms, Navstack and the State Feedback Controller, are described and compared, with Navstack being the primary choice for this robot. Additionally, the processing of the vision system is detailed.
2.3.1. NavStack
The Navigation Stack package is an open-source, cross-platform algorithm capable of running on systems ranging from very simple robots to highly complex ones, offering the capability of autonomous navigation. The navigation process diagram is showcased in Figure 7. The algorithm utilizes data from the odometry system, the built-in sensors, and a goal pose, represented by a point position $(x, y, z)$ and a quaternion orientation $(x, y, z, w)$. It is important to note that the $z$ value of the desired pose is always set to 0 because the navigation algorithm only considers the $x$ and $y$ components, disregarding the height ($z$) value of maps. Consequently, the velocity commands generated by the algorithm are coherent with this planar assumption and are dispatched to the mobile base of the system. This mobile base is incorporated into the move_base package, which executes actions such as moving towards and reaching a goal. The move_base package employs both a local cost map (derived from the current LiDAR readings) and a global cost map (a previously created map of the environment), represented as an occupancy grid $M$, to achieve its navigation tasks [37].
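As a reference for how such a goal is typically dispatched in ROS 1, the example below sends a planar pose (position plus quaternion, with z left at 0) to move_base through an actionlib client. The node name, frame, and coordinate values are illustrative and may differ from PiBot’s actual configuration.

```python
#!/usr/bin/env python
import rospy
import actionlib
from move_base_msgs.msg import MoveBaseAction, MoveBaseGoal

def send_goal(x, y, qz, qw):
    """Send a planar navigation goal to the move_base action server."""
    client = actionlib.SimpleActionClient('move_base', MoveBaseAction)
    client.wait_for_server()

    goal = MoveBaseGoal()
    goal.target_pose.header.frame_id = 'map'
    goal.target_pose.header.stamp = rospy.Time.now()
    # Planar goal: z stays at 0, orientation given as a quaternion (x, y, z, w)
    goal.target_pose.pose.position.x = x
    goal.target_pose.pose.position.y = y
    goal.target_pose.pose.orientation.z = qz
    goal.target_pose.pose.orientation.w = qw

    client.send_goal(goal)
    client.wait_for_result()
    return client.get_state()

if __name__ == '__main__':
    rospy.init_node('goal_sender')
    send_goal(2.0, 1.5, 0.0, 1.0)  # example goal, coordinates are illustrative
```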
For a system to run the Navigation Stack package, a set of requirements must be met:
A planar laser must be mounted on the system for map building.
The robot shape must be circular or square, as the package was not designed for arbitrary shapes.
The system must be running ROS.
A transform tree must be provided that defines the relative location of each component.
The system must publish its sensor data using ROS message types.
The algorithm works as follows: the desired goal, a set of coordinates and an orientation, is sent to the goal manager listener. The system automatically generates a path using the stored map of the environment saved in the global_costmap variable; Dijkstra’s algorithm then calculates the shortest available path from the robot’s current position, which serves as the global plan [38]. With this global plan to follow, the local planner generates velocity commands to send to the mobile base using the Dynamic Window Approach (DWA) [39].
The local planner calculates the required speed $v$ and angular speed $\omega$ for the system to reach the desired goal following a structured methodology. First, the system discretely samples the robot’s control space ($\dot{x}$, $\dot{y}$, and $\dot{\theta}$) and, for each sampled velocity, performs a short-time forward simulation to predict the outcome. These simulated trajectories are then scored according to criteria such as proximity to the goal, proximity to the global plan (based on the map rather than current sensor readings), and proximity to obstacles. The highest-scored trajectory is selected and executed, and the procedure is repeated. The velocity commands $v$ and $\omega$ are re-adjusted using the local cost map generated continuously from the LiDAR measurements and the system’s current position and orientation estimated from the LiDAR with the Adaptive Monte Carlo Localization (AMCL) algorithm [40]. Once the velocity commands have been sent to the base controller, which requires data such as the wheelbase, track width, and wheel radius, it drives the wheels at the required speed according to the planned trajectory. The resulting change in position and orientation is estimated by the odometry system, to be considered by the AMCL algorithm and used to readjust, if required, the trajectory following the DWA approach [41].
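The selection step of the DWA can be summarized with the conceptual sketch below. It is not the ROS implementation: the simulate and scoring callables are stand-ins for the trajectory rollout and cost-map scoring described above, and the weights and horizon are placeholder values.

```python
import math

def dwa_select(v_samples, w_samples, simulate, score_goal, score_path, score_obst,
               weights=(1.0, 1.0, 1.0), horizon=1.5, dt=0.1):
    """Conceptual sketch of the DWA selection step.

    simulate(v, w, horizon, dt) -> predicted trajectory (list of poses)
    score_goal / score_path / score_obst rate a trajectory's proximity to the
    goal, to the global plan, and to obstacles in the local cost map
    (all hypothetical stand-ins for the actual planner internals).
    """
    best_score, best_cmd = -math.inf, (0.0, 0.0)
    for v in v_samples:
        for w in w_samples:
            traj = simulate(v, w, horizon, dt)            # short forward simulation
            score = (weights[0] * score_goal(traj)
                     + weights[1] * score_path(traj)
                     + weights[2] * score_obst(traj))     # penalize near-obstacle paths
            if score > best_score:
                best_score, best_cmd = score, (v, w)
    return best_cmd   # (v, omega) dispatched to the base controller
```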
2.3.2. Feedback Controller
The proposed controller is a variation of the control scheme implemented in [42], as well as the one proposed in [7,43], where exponential stabilization is achieved via real State Feedback Control (SFC) using a geometric transformation of Cartesian coordinates to polar coordinates. To implement this control scheme, the kinematic model in Equation (2) can be transformed into a point-in-plane model:

$$\dot{x} = v\cos\theta, \qquad \dot{y} = v\sin\theta, \qquad \dot{\theta} = \omega.$$

The position error $e_x$ is defined as the difference between the goal position $x_d$ and the actual estimated position $x$; the same operation applies for the desired $y$ coordinate, yielding $e_y$. With these position errors as outputs, the model input for the speed is calculated as in Equation (5) and for the angular speed as in Equation (6):

$$v = k_v \sqrt{e_x^2 + e_y^2}, \qquad (5)$$

$$\omega = k_\omega \left(\theta_d - \theta\right), \qquad (6)$$

where $k_v$ and $k_\omega$ refer to the gains of the controller and $\theta_d$ is the desired orientation angle in the control loop, computed as a function of the goal coordinates $x_d$ and $y_d$ as $\theta_d = \operatorname{atan2}(e_y, e_x)$. A geometric interpretation of this control scheme can be seen in Figure 8, where the velocity vector $\vec{v}$, pointing towards the goal, is represented as a function of the position error vector.
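A compact sketch of this go-to-goal law, following the form of Equations (5) and (6), is given below. The gains, the goal tolerance, and the heading-error wrapping are illustrative choices rather than the tuned values and exact implementation used on PiBot.

```python
import math

def sfc_command(x, y, theta, x_goal, y_goal, k_v=0.5, k_w=1.5, tol=0.05):
    """Go-to-goal state feedback law sketch.

    k_v, k_w : controller gains (illustrative values)
    tol      : tolerance on the norm of the position error
    Returns (v, omega, goal_reached).
    """
    e_x, e_y = x_goal - x, y_goal - y
    rho = math.hypot(e_x, e_y)              # norm of the position error
    if rho <= tol:
        return 0.0, 0.0, True               # goal reached -> next state in the FSM

    theta_d = math.atan2(e_y, e_x)          # desired heading toward the goal
    e_theta = math.atan2(math.sin(theta_d - theta),
                         math.cos(theta_d - theta))  # wrap heading error to (-pi, pi]

    v = k_v * rho                           # Eq. (5): speed proportional to error norm
    omega = k_w * e_theta                   # Eq. (6): rotate toward the goal heading
    return v, omega, False
```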
Parameter Choices and Model Scope: In our setup, the wheel radius R is 0.075 m, matching the physical wheels on PiBot and providing stable traction on high-friction indoor floors. The wheelbase L is set at 0.30 m, corresponding to the actual distance between the left and right wheels on the same axis. We assume negligible wheel slip under these lab-controlled conditions; more complex slip modeling would be required for uneven or low-friction terrains. Additionally, our sensor suite (encoders, LiDAR) is assumed to operate within nominal specifications, free of major signal dropouts or occlusions.
Standard Use of Cartesian-to-Polar Transformation: The conversion from Cartesian coordinates $(x, y)$ to polar coordinates is a well-recognized practice in non-holonomic mobile robot control, helping to simplify the derivation of feedback control laws. We adopt it here for convenience and consistency with established techniques, rather than claiming a novel transformation. The overall implementation of this type of controller can be represented using the block diagram in Figure 9A. The robot model to be controlled can be seen as a block called PiBot (the name of the physical robot), from which the three states (position and orientation) are obtained as a function of the odometry position estimate. On top of the go-to-goal low-level control, a finite state machine regulates an arbitrary number of coordinates to reach the control system goal. The finite state machine, shown in Figure 9B, implements a basic Go-to-goal scheme in which the logic state of Goal reached conditions the next dynamic state using a tolerance on the norm of the position error with respect to the goal point, according to $\|e\| = \sqrt{e_x^2 + e_y^2} \le \epsilon$.
2.3.3. Vision System
The vision system of PiBot, detailed extensively in [32], operates through a streamlined process involving person detection, image cropping, and face mask recognition. Utilizing YOLOv4 for efficient person detection through bounding boxes trained on the COCO database, the system employs the D435i camera’s sensors to assess social distancing between individuals. The synergy between these technologies underpins PiBot’s advanced vision capabilities, which are explored in greater detail in the preceding vision work [32].
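As a rough illustration of how the depth camera can support the social-distancing check (the actual pipeline is detailed in [32]), the sketch below back-projects the centers of hypothetical person bounding boxes to 3D points and flags pairs closer than a threshold. The detection format, intrinsics dictionary, depth units, and the 2 m threshold are assumptions, not the system’s real interfaces.

```python
import itertools
import numpy as np

def social_distance_violations(detections, depth_frame, intrinsics, min_dist=2.0):
    """Flag pairs of detected people closer than min_dist meters.

    detections  : list of dicts with a person bounding box ('bbox' = x1, y1, x2, y2),
                  as produced by a person detector such as YOLOv4 (hypothetical format)
    depth_frame : aligned depth image as a 2D numpy array (assumed in millimeters)
    intrinsics  : dict with pinhole parameters 'fx', 'fy', 'cx', 'cy'
    """
    points = []
    for det in detections:
        x1, y1, x2, y2 = det['bbox']
        u, v = int((x1 + x2) / 2), int((y1 + y2) / 2)   # bounding-box center pixel
        z = depth_frame[v, u] / 1000.0                   # depth in meters
        # Back-project the pixel to a 3D point with the pinhole model
        x = (u - intrinsics['cx']) * z / intrinsics['fx']
        y = (v - intrinsics['cy']) * z / intrinsics['fy']
        points.append(np.array([x, y, z]))

    violations = []
    for (i, p), (j, q) in itertools.combinations(enumerate(points), 2):
        if np.linalg.norm(p - q) < min_dist:
            violations.append((i, j))
    return violations
```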
4. Experimental Results
4.1. Experiment 1: Navigation Tests
One outlier run toward G4 produced a small loop near the goal (Figure 12A), which inflated the overall Navstack mean error to 0.26 ± 0.24 m. Removing that single run yielded a more representative error of 0.19 ± 0.10 m.
On the other hand, the trajectories during the 12 executions with the State Feedback Control algorithm are shown in Figure 12B. This algorithm offered better precision in reaching the desired position, following consistent and repeatable trajectories; in this case, the average error was 0.0412 ± 0.0014 m. Expressed as MSE, Navstack’s result over the full set of 12 runs was, as mentioned above, highly influenced by the last run dedicated to reaching G4, whereas the SFC controller’s MSE was two orders of magnitude lower. The top-left graph in Figure 13 shows both algorithms’ average MSE and standard deviation per goal.
The average IAE with the Navstack algorithm was 80.70495 ± 36.30217 m·s throughout all runs. The State Feedback Control algorithm achieved an average IAE of 38.8357 ± 0.6929 m·s. The top-right graph in Figure 13 shows the average IAE and standard deviation for both algorithms per goal. Regarding the ITAE, the Navstack algorithm accumulated an average of 2840.400532 ± 4610.664885 m·s² across all of its runs, while the SFC algorithm gathered an average of 445.4025 ± 14.1986 m·s². This metric, just like the MSE metric, was highly affected by the error recorded in the last run reaching G4, shown as an erratic path forming loops near the destination in Figure 12A. The bottom-left graph in Figure 13 displays the average ITAE and standard deviation per goal for both algorithms.
Finally, the average IAU index of the runs with the Navstack algorithm was 11.3139 ± 6.2265. In contrast, the SFC algorithm achieved an average index of 5.1986 ± 0.0657. The bottom-right graph in Figure 13 shows the average IAU and standard deviation per goal for both algorithms. The SFC algorithm generated manipulation variables with lower magnitudes, i.e., a softer actuation. Statistical analysis revealed substantial practical differences between the algorithms across all metrics. Table 2 lists the means, standard deviations, 95% confidence intervals, and bias-corrected Cohen’s d for each goal. SFC outperformed Navstack by one to almost three orders of magnitude in MSE (large effects for G1–G3, moderate for G4) and lowered the overall manipulation effort (IAU) by about half on average. SFC satisfied the accuracy criterion in all 12 runs, whereas Navstack did so in 7, confirming superior point-to-point accuracy, trajectory repeatability, and actuation efficiency in this semi-structured indoor test.
4.2. Experiment 2: Vision Tests
Based on Experiment 2, the vision system performance was evaluated to examine the interaction between robots and humans under various conditions, specifically focusing on four metrics: accuracy, precision, recall, and $F_1$ score. The analysis involved factorial regression models and ANOVA to determine the significant effects of the robot speed (static or moving) and person speed (walking or running) factors. First, confusion matrices were produced for all configurations and for the overall runs, presenting the true positive (TP), true negative (TN), false positive (FP), and false negative (FN) counts for each condition. Figure 14 displays an example frame from the vision system, depicting how people were detected and classified based on mask wearing, with confidence scores shown for both masked and unmasked individuals. These real-time classifications were compared against manual ground truth annotations to generate the confusion matrices. Table 3 presents the confusion matrices for all possible configurations according to Table 1: case A considered the static platform with walking people, case B corresponded to the static robot with running people, case C to the moving robot with walking people, and case D to the moving robot with running people. Lastly, Table 4 presents the confusion matrix considering all the runs together, giving the overall count of TP, TN, FP, and FN.
The findings of this 2^k factorial design were separated into the four metrics employed, the Accuracy, Precision, Recall, and F1 score, presenting the findings for each metric when combined with all the terms and their interactions.

Accuracy. The “People Speed” factor proved significant at the 95% confidence level. The main-effect factorial plots show a broader difference between levels for the “People Speed” factor, with a difference of 8.21% between levels: the “Walking” level was around 73% accurate, while the “Running” level decreased to around 65%. Both ANOVA assumptions were met, with the normality-of-residuals test achieving a p-value of 0.723 and the homoscedasticity test a p-value of 0.879.

Precision. At the 95% confidence level, no factor or term proved significant for the Precision metric. The main-effect factorial plots show a slight difference between levels in both factors, but no significant effect was detected. Both ANOVA assumptions were met, with the normality-of-residuals test achieving a p-value of 0.085 and the homoscedasticity test a p-value of 0.476.

Recall. The “People Speed” factor proved significant at the 95% confidence level. The main-effect factorial plots show a broader difference between levels for the “People Speed” factor, with a difference of 25.13% between levels: the “Walking” level achieved around 70% Recall, while the “Running” level decreased to around 44%. Both ANOVA assumptions were met, with the normality-of-residuals test achieving a p-value of 0.359 and the homoscedasticity test a p-value of 0.486.

F1 Score. The “People Speed” factor proved significant at the 95% confidence level. The main-effect factorial plots show a broader difference between levels for the “People Speed” factor, with a difference of 16.58% between levels: the “Walking” level achieved around a 72% F1 score, while the “Running” level decreased to around 55%. Both ANOVA assumptions were met, with the normality-of-residuals test achieving a p-value of 0.087 and the homoscedasticity test a p-value of 0.868.
5. Discussion
Of the two navigation algorithms evaluated, State Feedback Control (SFC) consistently outperformed Navstack in positional accuracy. This finding matters for service robotics in environments demanding precise navigation and reliable vision. The performance gap (Table 2) reveals a trade-off: Navstack avoids obstacles but sacrifices precision, while SFC excels in precision but ignores obstacles.
For each individual goal, SFC produced significantly lower mean position errors. On average, Navstack’s mean error was roughly 6.3 times larger than that of SFC. The consistently large effect sizes (most d > 0.8) across metrics quantify the substantial performance gap between approaches. While both algorithms achieved their goals, SFC demonstrated a remarkably consistent performance, evidenced by tight confidence intervals across all metrics. In contrast, Navstack showed high variability, particularly for G4, where the wide confidence intervals reflect an inconsistent performance.
Additionally, the SFC navigation algorithm generated more consistent trajectories: under the same conditions, the trajectories followed using this algorithm were nearly identical, except for one run dedicated to reaching G3, located behind the home position, as shown in Figure 12B. However, this exception was not critical, as both sides generated mirrored trajectories. On the other hand, the trajectories followed throughout the runs with the Navstack algorithm lacked consistency, as they tended to vary between different runs for the same desired goal, as shown in Figure 12A. It should be noted that a critical factor for these navigation algorithms is the set of sensors from which they obtain the data used to estimate position and orientation. The Navstack algorithm used both the motor encoders and the LiDAR global and local cost maps to estimate its position, while the SFC navigation algorithm relied only on the encoders. The encoders used in the system are highly accurate, and the tires exhibited little, if any, slippage, which would explain the excellent precision achieved with the SFC algorithm. Conversely, oscillation of the system’s structure when moving, or poor extrinsic calibration of the LiDAR shifting its relative position and orientation, could explain the larger errors obtained with the Navstack algorithm [48]. Regarding the amount of information used to perform the navigation, it is safe to assume that the SFC could drive the system to the desired goal using less information than the Navstack local controller.
Unlike Navstack, the SFC controller cannot autonomously detect obstacles and re-plan to avoid them. This gives rise to the idea of a hybrid algorithm that runs the SFC controller until an obstacle is encountered; the Navstack algorithm would then override the system to evade the obstacle before relinquishing control back to SFC. The resulting fusion would combine the best of both navigation algorithms, achieving the best possible trajectory in terms of precision and consistency while being able to detect and evade obstacles. Additionally, the performance of the SFC algorithm can be explained by the appropriate selection of gains. On the other hand, the Navstack controller was configured following the ROS Navigation Tuning Guide [49] for indoor navigation and cornering behavior, with the DWA planner parameters optimized for our robot platform. However, not every parameter was optimized, which might explain the imperfect behavior observed; further fine-tuning of additional parameters could improve performance.
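The proposed switching idea can be outlined as in the sketch below; it is purely conceptual, and the controller and obstacle-detection callables are placeholders rather than existing interfaces in the current system.

```python
def hybrid_step(pose, goal, obstacle_detected, navstack_cmd, sfc_cmd):
    """Conceptual switching logic for the hybrid navigation idea discussed above.

    navstack_cmd(pose, goal) and sfc_cmd(pose, goal) are stand-ins for the two
    controllers; obstacle_detected would come from the LiDAR local cost map.
    """
    if obstacle_detected:
        # Navstack temporarily overrides to plan around the obstacle
        return navstack_cmd(pose, goal)
    # Otherwise SFC drives the precise point-to-point motion
    return sfc_cmd(pose, goal)
```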
The navigation experiment was performed to demonstrate the capabilities for pure point-to-point motion, and one inherent limitation is that obstacle-avoidance techniques and metrics remained unevaluated. A similar setup could be mirrored to replicate the experiment under the assumption that the navigation system detects one or more obstacles or enters obstacle-avoidance mode; in that case, the results would depend on the obstacle field and on the method used to avoid those obstacles.
The vision experiment extended the investigation into the performance of the vision system under various conditions of mobility, specifically comparing scenarios where the subject was either walking or running while the robot was either static or navigating forward, focusing on the impact of different speeds on the performance of the MaskNet model. The findings indicate a significant dependency of the vision system’s accuracy on the speed of the observed individuals. The mask detection accuracy and the rate of false negatives were notably better when subjects were walking as opposed to running. When comparing walking to running, there was a noticeable drop in accuracy, recall, and F1 score, with drops in performance of 8.21%, 25.13%, and 16.58%, respectively. The data suggest that the model’s ability to correctly identify and classify face masks decreased as the environment became more dynamic with people running.
Interestingly, while the robot’s speed (static vs. moving) did not significantly impact the overall performance, there was a slight, though not statistically significant, improvement in all metrics when the robot was in motion. This could be attributed to the varying angles and distances when capturing images while moving, aiding the vision system in more accurate detection, yet the improvement was minimal. The significant drop in accuracy during high-speed scenarios suggests the need for two key enhancements: first, implementing sequential analysis that considers consecutive frame inferences to reduce false positives, allowing the system to leverage temporal information from previous detections to validate current predictions; and second, expanding the training dataset to include more examples of subjects in motion at different speeds, which would help the model better recognize features under dynamic conditions.
6. Conclusions
PiBot showcases the integration of diverse sensors and components tailored for tasks such as environmental monitoring and autonomous navigation. The inclusion of AI kits within PiBot aids in processing vision models, reducing the computational load on its central Jetson TX2 computer. In its current form, PiBot demonstrates capabilities for sensor integration and data processing through mechanical and software integration. While the experimental validation focused on controlled indoor environments with optimal floor conditions (hospitals, universities, manufacturing warehouses, etc.), future research should expand testing to variable terrain conditions before deployment in demanding outdoor or dynamic settings.
Experiments showed that PiBot achieved an average navigation error of 0.041 m with the SFC controller (versus 0.26 m for Navstack) and 71% mask-detection accuracy in static-walking scenarios, confirming the platform’s potential for service robot tasks. The State Feedback Control (SFC) algorithm showed high precision and repeatability in navigation, but it lacks obstacle sensing and path adjustment when obstacles appear, making it suited only for obstacle-free environments. Further tuning of the navigation stack and calibration of the LiDAR sensor could enhance performance. To address SFC’s obstacle detection limitations, future work could use a hybrid approach: leveraging SFC’s precision in open spaces while allowing Navstack to override for obstacle evasion when required.
This potential arises from PiBot’s hardware and software structure being similar to the systems presented in [29], which have achieved accuracy rates greater than 90%. The experiment in dynamic environments revealed that the robot’s speed did not significantly impact the vision system’s effectiveness when navigating toward walking people. However, the speed of the individuals did influence the results, particularly affecting the detection of face masks and leading to an increased rate of false negatives, with a confidence level of 95%. These findings highlight the challenges the MaskNet model faces in environments with varying degrees of human activity. To address these limitations, future work will pursue several parallel improvements: first, exploring lightweight, optimized architectures that can maintain high accuracy while working within the current hardware constraints; second, fine-tuning the existing model with a broader dataset that includes varied lighting conditions, movement speeds, and capture angles. The model remains sensitive to people’s speed, a condition that service robots constantly face in real-world dynamic scenarios.