Next Article in Journal
MURM: Utilization of Multi-Views for Goal-Conditioned Reinforcement Learning in Robotic Manipulation
Next Article in Special Issue
Dual-Quaternion-Based SLERP MPC Local Controller for Safe Self-Driving of Robotic Wheelchairs
Previous Article in Journal
Tethered Unmanned Aerial Vehicles—A Systematic Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Online Motion Planning for Safe Human–Robot Cooperation Using B-Splines and Hidden Markov Models

1
Engineering Department Enzo Ferrari, University of Modena and Reggio Emilia, 41125 Modena, Italy
2
Ideativa, 41125 Modena, Italy
*
Author to whom correspondence should be addressed.
Robotics 2023, 12(4), 118; https://doi.org/10.3390/robotics12040118
Submission received: 28 June 2023 / Revised: 8 August 2023 / Accepted: 16 August 2023 / Published: 18 August 2023
(This article belongs to the Special Issue Motion Trajectory Prediction for Mobile Robots)

Abstract

:
When humans and robots work together, ensuring safe cooperation must be a priority. This research aims to develop a novel real-time planning algorithm that can handle unpredictable human movements by both slowing down task execution and modifying the robot’s path based on the proximity of the human operator. To achieve this, an efficient method for updating the robot’s motion is developed using a two-fold control approach that combines B-splines and hidden Markov models. This allows the algorithm to adapt to a changing environment and avoid collisions. The proposed framework is thus validated using the Franka Emika Panda robot in a simple start–goal task. Our algorithm successfully avoids collision with the moving hand of an operator monitored by a fixed camera.

1. Introduction

In modern industries, the demand for collaborative robots is constantly increasing because of their versatility and precision in a wide variety of applications. In a collaborative cell, humans and robots work together, sharing the same workspace. Therefore, the robot must adapt to external events while complying with safety regulations [1]. Safe human–robot cooperation can be achieved if the control system can gather information about the state of the robot as well as the state of the worker inside the collaborative workspace. Obstacle avoidance algorithms are employed whenever the presence of a target, such as an object or an operator, should be accounted for in the proper and safe operation of the robot. The latter must be able to adapt its behavior in real time with respect to dynamically changing environments [1,2].
In general obstacle avoidance applications [3], the robot uses the position of a target object to modify its path and avoid colliding with it. Although safety distance margins can be imposed on the target object, unexpected movements can cause the robot to collide if the algorithm is not reactive enough to prevent the impact [4]. On the other hand, the planning algorithm should consider restoring the original path once the operator is outside the workspace so the robot can run its task in nominal conditions [5].
In this work, the moving obstacle is represented by the hand of a human operator who shares the same workspace as a cobot and whose position is monitored by a fixed camera. The robot must repeat a preplanned task, but according to the proposed framework, when the risk of a collision occurs, it can either modify the geometry of the path or reduce its velocity according to the requirements of the specific application. Then, when the obstacle is no longer within the robot’s reach, the programmed motion is restored.
The literature is full of methods that guarantee obstacle avoidance and the safe coexistence of humans and robots. To highlight the contribution of our proposal, the main state-of-the-art techniques are reviewed and briefly discussed in the following.

1.1. Related Work

Collision avoidance methods for robotic systems operating in dynamic environments with obstacles moving at high velocity typically rely on a combination of real-time trajectory planning and reactive control. These methods may vary in their level of integration, ranging from pure re-planning to the use of repulsive forces that act directly on the robot.
Collision avoidance motion planning problems can be, first of all, treated as an optimal control problem [6]. These approaches are widely used because they allow for the minimization of cost functions subject to disparate constraints, such as danger fields, temporal aspects, or even metrics for evaluating safety in collaborative manufacturing [4,7]. Optimization methods can be applied to both motion planning and control problems. A noteworthy optimization-based algorithm is CHOMP [8], i.e., Covariant Hamiltonian Optimization for Motion Planning, which guarantees collision-free and locally optimal trajectory generation by means of a covariant gradient update rule. Another example is presented in [9], where a parallel optimization scheme is successfully used to manipulate a cable-towed load with multiple collaborative quadrupeds.
Probabilistic inference methods [10,11,12] optimize trajectories subject to task constraints, goals, and motion priors, replacing classical cost functions with joint distributions formulated as conditional dependencies. In [13], the author introduces a Learning by Demonstration (LbD) framework that exploits Gaussian Mixture Models (GMM) to encode multiple tasks and retrieve the associated skill by means of regression techniques.
A different class of methods for motion planning and collision avoidance is composed of so-called sample-based planning algorithms [14]; according to these approaches, the environment is randomly sampled using techniques such as probabilistic road maps or exploring random trees. An example is given in [15], where sampled via-points are transformed into a set of candidate collision-free trajectories subject to predefined kinodynamic limits. In [16], if a moving obstacle collides with the manipulator, the motion is locally re-planned by using a Bi-RRT-based algorithm connecting the current robot position with a target on the unperturbed trajectory.
While the above-mentioned algorithms mainly work at the planning levels, an approach for reactive collision avoidance that acts directly on the robot control relies on potential field methods, originally introduced in [17]. Virtual repulsive and attractive fields are utilized to move the robot towards a target while avoiding obstacles. Specifically, repulsive fields are associated with obstacles, while attractive fields are associated with the target. The original concept has been further refined and applied to properly modify the shape of motion trajectories, which are seen as elastic bands subject to virtual forces [18], and is now a standard technique for real-time trajectory planning [3,19].
If all the techniques cited above modify the geometry of the robot motion, state-of-the-art methods in the industrial practice for guaranteeing safety in human–robot applications are based on a completely different philosophy [20]. They basically consist of stopping or slowing down robot motion in the presence of humans. For instance, according to ISO/TS 15066, the velocity and acceleration of the robot must be set to safe values based on the minimum distance from the operator [21]. This approach is commonly used in collaborative applications, where the position of human workers is continuously monitored with different types of cameras and the timing along the curve is properly scaled. See, for example, Refs. [7,22,23], among many other publications on this topic. Note that, in the case of dynamic obstacles, this technique based on the velocity modulation of the robot can also be a way to avoid obstacles that are currently on the desired geometric path but could move in the following instants.

1.2. Methodology and Contributions

In this work, we propose a novel framework for collision avoidance that merges the modification of the geometric path of the robot with a speed adaptation mechanism. The basic idea, derived from observations reported in [24], is to split the task representation into two fundamental categories: the trajectory and the symbolic level. The former comprises continuous signals that change over time, such as the position or orientation of the end effector (EE); the latter uses sequential or hierarchical information to establish a discrete set of movements with predefined rules. The goal of this work is to provide evidence, through an obstacle avoidance application, that task performance can benefit from the merging of these two domains.
The two basic tools used to implement this framework are B-splines, which encode spatial data and modify the path in Cartesian space [25], and hidden Markov models (HMMs), which encode temporal and sequential information by scaling down task velocity [26].
The main goals and contributions of this research work are to:
  • Design an online controller that can fit generic tasks and smoothly avoid collisions with dynamic obstacles.
  • Include the possibility to restore the original task whenever the robot is not prone to any collision.
  • Exploit a probabilistic framework to gather information about the obstacle and modify the robot’s velocity accordingly.
  • Combine trajectory and symbolic domains in a unified framework.
The paper is structured as follows: Section 2 details the theoretical background of the proposed solution. Section 3 describes the online control algorithm conceived in this work. Section 4 describes the experimental setup and validates the results of our framework. Finally, Section 5 concludes this work by discussing the results and possible future steps.

2. Task Encoding

The simplest way to define a robotic task is by specifying the trajectory, which is a mapping between time and space, that the end effector or joints must track. However, a task can also be interpreted as a sequence of action units or elements that follow loops or rules, as is the case with a symbolic representation as seen in [27,28].
This work attempts to bridge and combine these two domains for an obstacle avoidance application. This is achieved through a control system that modifies the nominal robot path defined by B-spline functions [25] while also varying the phase of the task, and thus its velocity, using an HMM [26]. The following paragraphs provide a brief review of the theoretical background of these techniques.

2.1. Spatial Encoding Based on B-Splines

Spline functions are extensively used for generating smooth and optimal time trajectories in robotic applications [15,29,30]. B-splines are a particular representation of generic spline curves based on a linear combination of N basis functions [29], i.e.,
p ( s ) = i = 1 N p i β i d ( s ) , s m i n s s m a x
where β i d ( s ) are basis functions of degree d, which only depends on the phase variable s (that in many applications coincides with time t), while p i are called control points and determine the geometric shape of the curve. As shown in Figure 1, a B-spline is basically a smooth approximation of the control polygon defined by the control points. Usually, they are defined by imposing some interpolation condition p ( s i ) = p i , i = 1 , , N , depending on the desired task.
A noteworthy property of the basis functions β i d ( s ) is that their value is zero everywhere except in the interval defined by knots [ s i , s i + d + 1 ] . As a consequence, a change in the i-th control point p i only influences the B-spline in this interval, allowing for local modifications in the trajectory described. Another useful property of B-splines is time-scaling, i.e., the possibility of changing the velocity along the curve by simply applying the transformation s ^ = α s to the knot vector s, where α is a constant. For more details, see [25].

2.2. Temporal and Sequential Encoding Based on Hidden Markov Models

Hidden Markov models (HMMs) are based on a Markov chain, which describes the probabilities of sequences of random variables, called states, that take values from a defined set [31]. In many applications, such as gesture and speech recognition [26,28], HMMs are used to model human behavior, with the aim of identifying gestures or predicting the most likely pattern of subsequent control states. Specifically, given a set of hidden states H = h 1 , h 2 , , h M and a sequence of T observations O = o 1 , o 2 , , o T , the purpose of HMMs is to analyze sequences of events in terms of a reduced set of parameters λ = [ Π , A , B ] , where:
  • Π is called the prior distribution, which tells us about the probability of starting a sequence in state h i ;
  • A is the transition probability matrix, where each element a i , j encodes the probability of moving from state i to state j;
  • B are the observation likelihoods, also called emission probabilities, where each element b i ( o t ) expresses the probability of an observation o t being generated from state i.
Note that, according to the usual definition of HMMs, λ describes a model in terms of discrete observation likelihoods [32]. However, since the proposed framework deals with a continuous observation space, i.e., trajectories in Cartesian space, the HMM can be represented as
λ = [ Π , A , μ i , Σ i ] i = 1 , , M
where M is the number of components characterized by mean and covariance values ( μ i , Σ i ) of the multivariate Gaussian Mixture Model (GMM). Thus, the λ parameters can be learned to encode the desired robot task, such that the regression of the model resembles the desired end effector path [33].
The proposed methodology can be explained as follows: using control points p i as hidden states h i , a forward-chained left–right HMM is adopted (see Figure 2) to account for the evolution of the trajectory [34]. With this model, in nominal conditions (i.e., no obstacles in the workspace), π 1 = 1 and transition probabilities a i , i + 1 1 , meaning that the robot evolves from one control point to the other with a probability close to one. In particular, let us analyze the transition matrix A for the HMM in Figure 2:
A = a 11 a 12 0 0 0 0 a 22 a 23 0 0 0 0 0 a M 1 M 1 a M 1 M 0 0 0 0 a M M
Here, the elements of each row of A sum up to 1. Parameter a i , i is the probability of staying in state h i , while a i , i + 1 is the probability of moving from state h i to state h i + 1 . Thus, the transition probabilities a i , i + 1 can be used to slow down or eventually stop the robot every time their values warn the system about a potential collision, as is explained in the next section.

3. Proposed Method

We consider a typical industrial scenario in which a robot is required to perform a specific task within a monitored workspace, using a fixed camera. When an obstacle enters the workspace, such as the hand of a human operator, its actual position is tracked, filtered through a Kalman filter, and sent to the controller that implements the proposed framework in order to prevent possible collisions. The functional blocks of the entire algorithm are displayed in the scheme shown in Figure 3, while the working principles are detailed below.

3.1. Spatial Modulation with Dynamical Control Points

Given the presence of moving obstacles, we assume that the task is modified only if they interfere with the nominal trajectory. However, if the obstacles move far from the robot, the task should be restored to its original trajectory. To achieve this, we use B-spline functions to induce a reversible deformation on the original trajectory. Specifically, a dynamical system is associated with each control point p i [5]:
M i , p ˜ ¨ i + B i , p ˜ ˙ i + K i , p ˜ i = F r e p , i
where, p ˜ i = p ( i ) p i represents the displacement of the actual position p ( i ) of the i-th control point with respect to its original location p i . The matrices M i = diag { m i } , B i = diag { b i } , and K i = diag { k i } are 3-by-3 diagonal matrices, where m i , b i , and k i are positive constants such that the system in (4) is critically damped. This can be performed by choosing, for example, m i = m ^ i and k i = k ^ i , then computing b i as b ^ i = 2 m ^ i · k ^ i . Finally, F r e p , i is a virtual repulsive force applied in the direction from the obstacle to the i-th control point.
This idea is explained in Figure 4. The module of the repulsive force F r e p , i acting on the i-th control point does not depend on its Cartesian distance from the obstacle but varies in intensity according to the Mahalanobis distance:
D M ( x , μ , Σ ) = ( x μ ) T Σ 1 ( x μ )
that accounts for the occupation probability of the obstacle p ^ o b s N ( μ o b s , Σ o b s ) , caused by uncertainties in the estimation of its location [6,35]. The module of the force can therefore be computed as follows [6]
| F r e p , i | = χ i D M ( p ( i ) ; μ o b s , Σ o b s )
where χ i is a free parameter that enables the repulsive force field intensity to be varied. Finally, the path is encoded with a third-order B-spline such that its derivatives are continuous until the acceleration, with limited jerk. This is of fundamental importance and should be designed carefully as, from a practical point of view, the internal inverse kinematics algorithm could lead to a joint acceleration discontinuity error if the trajectory overcomes the robot’s limitations [29].

3.2. Temporal Modulation with Varying Transition Probabilities in HMM

In our framework, the path of the task is encoded by p ( s ) in (1), while velocity is varied through the HMM’s parameters λ , accounting for the temporal modulation.
The connection between B-splines and the HMM domains is established through the continuous observation likelihoods N ( μ i , Σ i ) = b i ( o t ) , i = 1 , , M , as shown in (2). In particular, ( μ i , Σ i ) are the components of a GMM that encodes the likelihood of a point r R 3 being described by the HMM’s model λ :
B P ( r ) = i = 1 M W i ( r ) N ( r ; μ i , Σ i )
where the weighting factor W i with i = 1 M W i = 1 is called the mixing coefficient and represents the influence of the i-th component [13]. Gaussian distributions are encoded such that their regression corresponds to the nominal trajectory p ( s ) [36]. Furthermore, in (7), B accounts for the variability introduced by the movement of control points, thus giving a stochastic topological representation of the task. Figure 5 illustrates a graphical representation of this approach. It is worth noting that the number of states M of the HMM is generally different from the number of control points N that define the B-spline. For instance, it is possible to encode with the same state a segment of the trajectory curve, corresponding to a subtask, composed of several control points. However, for the sake of simplicity, it is convenient to assume M = N .
As explained in Section 2, the evolution of the B-spline trajectory can be modified by applying the transformation s ^ = α s to the knots vector of p ( s ) . In this way, the desired velocity s ˙ is multiplied by a factor α [ 0 , 1 ] and reduced accordingly. To implement a mechanism that is able to slow down the robot and eventually stop it in the case of a collision risk, the transition probabilities of the HMM are directly mapped into proper values of α , as explained below. Assume that the robot’s trajectory is in state h i (associated with the control point p i ), and consider the transition probability a i , i + 1 to the next state h i + 1 . For the purposes of this project, it is required that:
  • If a i , i + 1 = 1.0 , the robot moves at nominal velocity.
  • If 0.0 < a i , i + 1 < 1.0 , the robot slows down.
  • If a i , i + 1 = 0.0 , the robot stops.
By computing the transition coefficient as a Sigmoid function ϕ based on the Mahalanobis distance D M ( p ^ o b s ; μ i , Σ i ) between the obstacle position and the observation space of the HMM, i.e.,
a i , i + 1 = ϕ ( D M ) = 1 1 + e ( D M + γ )
its value changes according to the probability of colliding with the obstacle. Note that the free parameter γ can be used to tune the sensitivity of the robot with respect to the presence of an obstacle in the workspace. For example, using positive values of γ , the Sigmoid function ϕ ( D M ) is shifted right; thus, by tuning this parameter, we can decide for which values of D M the transition coefficient decays to zero. Finally, if the scaling factor α is computed as
α = a i , i + 1
the nominal velocity along the B-spline trajectory is modulated as required by the specifications reported above.
In this way, the coefficients a i , i + 1 acquire a new meaning, which is the probability of passing from control point p i to p i + 1 without collision. The symbolic control can therefore be explained as follows. The path is encoded as a B-spline, which means using a superposition of basis functions weighted by the so-called control points. Every basis function is responsible for a local encoding of the trajectory, together with the corresponding control point; therefore, we set the control points themselves as the hidden state of the HMM. The only way to access information regarding a hidden state is by the parameter B, explained in Section 2.2 and initialized as a GMM in (7). In our case, B basically gives a probability mapping that tells us, being the hand in a position p ^ o b s , which is the control point p i responsible for the piece of trajectory closest to the hand. Once this is known, we can use (8) to vary the transition probability between p i and p i + 1 , and thus modulate the nominal velocity with α calculated in (9). It follows that when α 0 , the robot stops, and then when the obstacle moves again out of the robot task space α 1 , the nominal velocity is restored. Figure 6 reports an example of 2D encoding merging the two techniques introduced in the previous paragraphs.

3.3. Switching between Trajectory and Symbolic Domains

The proposed motion planner combines two different control domains. However, there could be situations where operating at the trajectory level is preferred over the symbolic one, or vice versa. The algorithm is provided with a knob parameter, ξ [ 0 , 1 ] , so the user can choose the influence of each controller. Specifically, we expect the following behaviors:
  • ξ = 0.0 only the time-scaling mechanism based on HMM is active;
  • 0.0 < ξ < 1.0 B-spline and HMM cooperate with different proportions;
  • ξ = 1.0 only the B-spline modification algorithm is applied.
To achieve this, the parameters ( m i , b i , k i ) in (4) for trajectory control, and α in (9) for symbolic control, are modified according to the following laws:
( m i , b i , k i ) = ( m ^ i , b ^ i , k ^ i ) 2 ξ if 0 ξ < 0.5 ( m ^ i , b ^ i , k ^ i )     if 0.5 ξ 1
α = α ^ if 0 ξ 0.5 ( 2 α ^ 1 ) ( 1 ξ ) + ξ if 0.5 < ξ 1
where ( m ^ i , b ^ i , k ^ i ) are the default values assigned in (4), and α ^ is the result of (9). In this way, as ξ approaches 0, the values of the parameters ( m i , b i , k i ) increase rapidly (in practical experiments an upper bound on the three parameters is imposed), making the system in (4) extremely rigid and thus insensitive to external (virtual) forces. As a consequence, no spatial modifications of the trajectory are observed. On the other hand, as ξ approaches 1, the spatial modification mechanism is restored, but in this case, α 1 and therefore no changes in the velocity profile occur. Finally, when ξ 0.5 , both the trajectory and symbolic level controllers work together, producing spatial and temporal modulation of the trajectory.

4. Experimental Validation

In this section, the experimental setup is described and the results of the proposed framework are shown.

4.1. Experimental Setup and Methodology

For the experimental tests, we used a Franka Emika Panda collaborative robot controlled through a Cartesian pose controller developed in C++. The controller ran on a standard PC equipped with an Intel i7 8-core CPU and 8 GB RAM. The trajectory/symbolic motion control was implemented in Matlab/Simulink on the same PC. The C++ controller updated the robot states at a frequency of 1 kHz. To handle sharp trajectory deviations and potential discontinuities during inverse kinematics computation [37], we used a second-order filter to calculate the acceleration profile. The filtered acceleration profile was then double-integrated to update the end effector (EE) position. For the target object, a human’s hand represented the moving obstacle, sharing the same workspace with the robot. The hand’s Cartesian position was continuously tracked by a fixed camera, specifically an Intel RealSense D435. The camera’s output was processed on a second PC with similar characteristics to the first one. All software components were connected using ROS to facilitate communication among them [38]. In particular, to simplify the computation of the hand’s pose from the camera data, we used an ArUco marker directly placed on the operator’s hand [39]. The position coordinates derived from the camera images were sent through a multistep Kalman filter to predict the future positions of the hand [35,40]. This step proved valuable when the camera’s view of the scene was obstructed. The Kalman filter output, denoted as x ^ , was associated with a uniformly distributed random variable [41], i.e., x ^ N ( μ x , Σ x ) , which justified the use of the Mahalanobis distance in modifying the task (as shown in (6)) [6].
As we focused on collaborative tasks where the manipulator’s speed is limited by law, the encoded trajectory was based on positions and velocities, not explicitly on manipulator dynamics. Motor torques/acceleration were not used in the proposed planning/control approach. However, parameters such as spline control point dynamics and the GMM covariance matrix were chosen to make collision avoidance feasible by considering the typical motion velocity of a human being [4,6,27]. For the tests, a cubic B-spline with N = 7 control points in the x y z plane was used at the trajectory level. We selected ( m i , b i , k i ) in (4) to ensure high stiffness at the first and last control points to maintain their positions constantly. Specifically, we empirically chose the parameters ( m i , b i , k i ) to be
  • m i = 20 kg , b i = 310 N s m , k i = 1200 N m for i { 1 ; N } ;
  • m i = 5 kg , b i = 60 N s m , k i = 180 N m for i 2 , 3 , , N 1 ;
At the symbolic level, the task was encoded into a Gaussian Mixture Model (GMM) with 7 components (i.e., M = N ) in 3D space. These components served as observation likelihood parameters in (2) to construct a left–right hidden Markov model (HMM). We adjusted (8) with γ = 10 to increase the sensitivity of the symbolic controller even for higher Mahalanobis distance values computed in (5).
During the experiments, the robot followed a desired trajectory while an operator moved his hand close to it to simulate collisions. We varied the parameter ξ in (10) to individually analyze the B-spline control, the HMM, and the combination of both controllers. This allowed us to demonstrate the improvements achieved through the integration of the two techniques and validate our novel planning methodology for obstacle avoidance. We evaluated the performance using four parameters:
  • The average computation time for a single iteration T c o m p .
  • The normalized average path deviation Δ L , calculated as the mean deviation of the traveled distance normalized over the nominal path.
  • The average stop time, T s t o p , calculated as the average time taken for parameter α in (9) to reach zero (i.e., stop the robot) once the hand collision was perceived.
  • The success rate, ρ , defined as the percentage of cases where the distance did not go below a given safety threshold (equal to 10 cm in our experiments) compared to the total number of situations where an incipient collision was detected.
Figure 7 displays snapshots of the real experiments conducted on the robot to test the B-spline–HMM-based controller.

4.2. Results

In Figure 8 and Table 1, we report the results of the experiments that are discussed further in the following paragraphs, while in Figure 9, we present a comparison of the trajectory of the hand recorded and virtually reproduced in three different tests using B-spline, HMM, and B-spline–HMM-based controllers. The XY projections on the right show path modifications, while temporal modulation along the time axis is observed. To assess the performance of the controllers under different scenarios, several experiments were conducted where the robot executed the same task repeatedly. These experiments involved varying parameters such as ξ (balance between trajectory and symbolic control), task execution speed, and acceleration.

4.2.1. Controller at Trajectory Level Only ( ξ = 1.0 )

In this test, the controller operated solely at the trajectory level, with ξ = 1.0 . The average computation time for a single iteration was T c o m p = 3.515 ms. Across six different experiments, the robot encountered a total of 59 possible collision situations. The success rate ( ρ ) achieved was 77.97%, and the normalized average path deviation ( Δ L ) was 0.590 . Since the controller only focused on spatial control without temporal modulation, no value of T s t o p was reported as the robot never stops.

4.2.2. Controller at Symbolic Level Only ( ξ = 0.0 )

In the second test, the controller operated exclusively at the symbolic level, with ξ = 0.0 . It employed an HMM with an observation space described by a GMM with seven components. The average computation time for a single iteration was T c o m p = 1.480 ms. Across five different experiments, the robot faced 58 possible collision situations. The success rate ( ρ ) achieved was 89.66%, with an average stop time ( T s t o p ) of 0.431 s and Δ L = 0.0 since no spatial controller was implemented.

4.2.3. Controller at Both Trajectory and Symbolic Levels ( ξ = 0.5 )

In the final validation test, the task was encoded using a control at both the trajectory and symbolic levels, with ξ = 0.5 . Similar configuration parameters as the previous tests were maintained. The average computation time for a single iteration was T c o m p = 4.649 ms, which was slightly higher than the previous cases but still met real-time requirements within 16 ms [3,4]. Across 10 different experiments, the robot encountered 143 possible collision situations. The success rate ( ρ ) achieved was 94.41%, with a normalized average path deviation ( Δ L ) of 0.304 m and an average stop time ( T s t o p ) of 0.514 s.

4.2.4. Performance Analysis

In the first scenario with only B-spline modification as the control ( ξ = 1.0 ), the performance was relatively poor ( ρ = 77.97 % ). The controller lacked velocity control, leading to difficulties in rapidly modifying the robot’s path to avoid collisions. Additionally, tuning ( m i , b i , k i ) parameters could tighten or relax Δ L , but the controller’s overall performance was unsatisfactory due to the absence of temporal modulation. Similarly, in the controller employing only HMM control ( ξ = 0.0 ), performance was improved compared to B-spline control ( ρ = 89.66 % ). However, this controller’s behavior was inadequate when the operator’s hand moved towards the robot. By integrating the B-spline and HMM controllers, the combined B-spline–HMM controller achieved a higher success rate ( ρ = 94.41 % ). Although the average stop time ( T s t o p ) was higher than the HMM controller alone, real-time requirements were still satisfied [3,4] and the B-spline–HMM controller consistently slowed down and modified the robot’s path until the safety distance was maintained before stopping completely. The normalized average path deviation ( Δ L ) was lower compared to the trajectory level only controller, as the robot now reduced its velocity towards the hand and stopped before any collisions. This feature was critical in avoiding potential singular configurations that could occur with the B-spline controller alone.

5. Conclusions and Future Work

In this paper, a novel framework for obstacle avoidance applications is introduced. After several real-world experiments, we believe that our methodology is suitable for human–robot shared environments, meeting real-time requirements for safe cooperation. The proposed work introduces a motion planning algorithm that can be adapted to different scenarios with moving and static obstacles, achieving a success rate of 94.41 % . By combining the trajectory and symbolic domains, our framework can smoothly adapt the Cartesian path and slow down the execution of the task in the proximity of a human operator. Moreover, by associating a dynamical system with each control point, the controller autonomously resets to its original task, and while using the HMM, the robot never stops after overtaking the operator’s hand, ensuring correct and complete task execution. Clarifications should be made w.r.t. to all the situations where a collision occurs. The B-spline–HMM architectures ensures safety as long as the position of the obstacle is known. A positive feature of our controller is that the robot is halted every time the obstacle is too close, strongly reducing the impact in case of a collision.
Future work might involve the use of multiple cameras to span over a wider workspace and improve the detection of obstacles, no longer limited to the operator’s hand. Triangulation techniques could be, in this case, a solution to enhance the marker measurement’s precision, as during experiments, we registered an accuracy sometimes lower than 4 cm. Moreover, it could be desirable to improve the algorithm that predicts the motion of moving obstacles and operators to obtain a larger time horizon, more consistent with a smooth obstacle avoidance application. While the algorithm was validated on one task, additional experiments and validations involving various test scenarios are needed. Preferably, these tests should apply the algorithm to the joint space, including new techniques for obstacle–robot distance measurements, that might be extended to the whole body of the manipulator rather than just the end effector.

Author Contributions

Writing—original draft, G.B.; conceptualization, G.B.; methodology, G.B.; software, G.B., M.T. and L.B.; validation, G.B.; supervision, L.B.; writing—review and editing, G.B., L.B., M.T. and F.P. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been partially funded by the European Union—NextGenerationEU under the ECOSISTER Project—Spoke 3 and by the University of Modena and Reggio Emilia through the FARD-2022 Project.

Data Availability Statement

Not applicable.

Acknowledgments

We acknowledge the support of Ideativa for lending us the camera used during the experimental evaluation of this research; we also want to thank the members of the Autolab group for their constant patience and support during this project.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
LbDLearning by Demonstration
GMMGaussian Mixture Models
EEEnd effector
HMMHidden Markov models
ROSRobot Operative System

References

  1. Khansari-Zadeh, S.M.; Khatib, O. Learning Potential Functions from Human Demonstrations with Encapsulated Dynamic and Compliant Behaviors. Auton. Robot. 2017, 41, 45–69. [Google Scholar] [CrossRef]
  2. Scalera, L.; Giusti, A.; Vidoni, R.; Gasparetto, A. Enhancing fluency and productivity in human-robot collaboration through online scaling of dynamic safety zones. Int. J. Adv. Manuf. Technol. 2022, 121, 6783–6798. [Google Scholar] [CrossRef]
  3. Liu, H.; Qu, D.; Xu, F.; Du, Z.; Jia, K.; Song, J.; Liu, M. Real-Time and Efficient Collision Avoidance Planning Approach for Safe Human-Robot Interaction. J. Intell. Robot. Syst. 2022, 105, 93. [Google Scholar] [CrossRef]
  4. Merckaert, K.; Convens, B.; Wu, C.-J.; Roncone, A.; Nicotra, M.M.; Vanderborght, B. Real-Time Motion Control of Robotic Manipulators for Safe Human-Robot Coexistence. Robotics 2022, 73, 102223. [Google Scholar] [CrossRef]
  5. Chiaravalli, D.; Califano, F.; Biagiotti, L.; De Gregorio, D.; Melchiorri, C. Physical-Consistent Behavior Embodied in B-Spline Curves for Robot Path Planning. IFAC-PapersOnLine 2018, 51, 306–311. [Google Scholar] [CrossRef]
  6. Kanazawa, A.; Kinugawa, J.; Kosuge, K. Adaptive Motion Planning for a Collaborative Robot Based on Prediction Uncertainty to Enhance Human Safety and Work Efficiency. IEEE Trans. Robot. 2019, 35, 817–832. [Google Scholar] [CrossRef]
  7. Zanchettin, A.M.; Ceriani, N.M.; Rocco, P.; Ding, H.; Matthias, B. Safety in Human-Robot Collaborative Manufacturing Environments: Metrics and Control. IEEE Trans. Autom. Sci. Eng. 2015, 13, 882–893. [Google Scholar] [CrossRef]
  8. Zucker, M.; Ratliff, N.; Dragan, A.D.; Pivtoraiko, M.; Klingensmith, M.; Dellin, C.M.; Bagnell, J.A.; Srinivasa, S.S. Chomp: Covariant Hamiltonian Optimization for Motion Planning. Int. J. Robot. Res. 2013, 32, 1164–1193. [Google Scholar] [CrossRef]
  9. Yang, C.; Sue, G.N.; Li, Z.; Yang, L.; Shen, H.; Chi, Y.; Rai, A.; Zeng, J.; Sreenath, K. Collaborative Navigation and Manipulation of a Cable-Towed Load by Multiple Quadrupedal Robots. IEEE Robot. Autom. Lett. 2022, 7, 10041–10048. [Google Scholar] [CrossRef]
  10. Mukadam, M.; Dong, J.; Yan, X.; Dellaert, F.; Boots, B. Continuous-Time Gaussian Process Motion Planning via Probabilistic Inference. Int. J. Robot. Res. 2018, 37, 1319–1340. [Google Scholar] [CrossRef]
  11. Toussaint, M.; Goerick, C. A Bayesian View on Motor Control and Planning. In From Motor Learning to Interaction Learning in Robots; Springer: Berlin/Heidelberg, Germany, 2010; pp. 227–252. [Google Scholar] [CrossRef]
  12. Fisac, J.F.; Bajcsy, A.; Herbert, S.L.; Fridovich-Keil, D.; Wang, S.; Tomlin, C.J.; Dragan, A.D. Probabilistically Safe Robot Planning with Confidence-Based Human Predictions. arXiv 2018, arXiv:1806.00109. [Google Scholar]
  13. Calinon, S.; Guenter, F.; Billard, A. On Learning, Representing, and Generalizing a Task in a Humanoid Robot. IEEE Trans. Syst. Man Cybern. Part Cybern. 2007, 37, 286–298. [Google Scholar] [CrossRef]
  14. Kavraki, L.E.; LaValle, S.M. Motion Planning. In Springer Handbook of Robotics; Springer: Berlin/Heidelberg, Germany, 2016; pp. 139–162. [Google Scholar] [CrossRef]
  15. Jankowski, J.; Brudermüller, L.; Hawes, N.; Calinon, S. VP-STO: Via-Point-Based Stochastic Trajectory Optimization for Reactive Robot Behavior. arXiv 2022, arXiv:2210.04067. [Google Scholar]
  16. Han, D.; Nie, H.; Chen, J.; Chen, M. Dynamic Obstacle Avoidance for Manipulators Using Distance Calculation and Discrete Detection. Robot. Comput.-Integr. Manuf. 2018, 49, 98–104. [Google Scholar] [CrossRef]
  17. Khatib, O. Real-time Obstacle Avoidance for Manipulators and Mobile Robots. In Proceedings of the 1985 IEEE International Conference on Robotics and Automation, St. Louis, MO, USA, 25–28 March 1985; Volume 2, pp. 500–505. [Google Scholar] [CrossRef]
  18. Quinlan, S.; Khatib, O. Elastic Bands: Connecting Path Planning and Control. In Proceedings of the [1993] Proceedings IEEE International Conference on Robotics and Automation, Atlanta, GA, USA, 2–6 May 1993; pp. 802–807. [Google Scholar] [CrossRef]
  19. Flacco, F.; Kröger, T.; Luca, A.D.; Khatib, O. A Depth Space Approach to Human-Robot Collision Avoidance. In Proceedings of the 2012 IEEE International Conference on Robotics and Automation, St. Paul, MN, USA, 14–18 May 2012; pp. 338–345. [Google Scholar] [CrossRef]
  20. Secil, S.; Ozkan, M. A collision-free path planning method for industrial robot manipulators considering safe human-robot interaction. Intell. Serv. Robot. 2023, 16, 323–359. [Google Scholar] [CrossRef]
  21. ISO/TS 15066; Robots and Robotic Devices—Collaborative Robots. International Organization for Standardization: Geneva, Switzerland, 2016.
  22. Rosenstrauch, M.J.; Pannen, T.J.; Krüger, J. Human Robot Collaboration—Using Kinect v2 for ISO/TS 15066 Speed and Separation Monitoring. Procedia CIRP 2018, 76, 183–186. [Google Scholar] [CrossRef]
  23. Lagomarsino, M.; Lorenzini, M.; Constable, M.D.; De Momi, E.; Becchio, C.; Ajoudani, A. Maximising Coefficiency of Human-Robot Handovers through Reinforcement Learning. IEEE Robot. Autom. Lett. 2023, 8, 4378–4385. [Google Scholar] [CrossRef]
  24. Calinon, S.; Billard, A.; Dillmann, R.; Schaal, S. Robot Programming by Demonstration. In Springer Handbook of Robotics; Siciliano, B., Khatib, O., Eds.; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar] [CrossRef]
  25. Biagiotti, L.; Melchiorri, C. Trajectory Planning for Automatic Machines and Robots; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2008. [Google Scholar] [CrossRef]
  26. Pentland, A.; Liu, A. Modeling and Prediction of Human Behavior. Neural Comput. 1999, 11, 229–242. [Google Scholar] [CrossRef]
  27. Hovland, G.E.; Sikka, P.; McCarragher, B.J. Skill Acquisition from Human Demonstration Using a Hidden Markov Model. In Proceedings of the IEEE International Conference on Robotics and Automation, Minneapolis, MN, USA, 22–28 April 1996; Volume 3, pp. 2706–2711. [Google Scholar] [CrossRef]
  28. Roveda, L.; Magni, M.; Cantoni, M.; Piga, D.; Bucca, G. Human-Robot Collaboration in Sensorless Assembly Task Learning Enhanced by Uncertainties Adaptation via Bayesian Optimization. Robot. Auton. Syst. 2021, 136, 103711. [Google Scholar] [CrossRef]
  29. Biagiotti, L.; Melchiorri, C. Online Trajectory Planning and Filtering for Robotic Applications via B-Spline Smoothing Filters. In Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, Tokyo, Japan, 3–7 November 2013; pp. 5668–5673. [Google Scholar] [CrossRef]
  30. Sayols, N.; Sozzi, A.; Piccinelli, N.; Hernansanz, A.; Casals, A.; Bonfè, M.; Muradore, R. Global/Local Motion Planning Based on Dynamic Trajectory Reconfiguration and Dynamical Systems for Autonomous Surgical Robots. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; pp. 8483–8489. [Google Scholar] [CrossRef]
  31. Jurafsky, D.; Martin, J. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition; Prentice Hall PTR: Upper Saddle River, NJ, USA, 2008; Volume 2, Available online: https://dl.acm.org/doi/10.5555/555733 (accessed on 9 August 2023).
  32. Mongillo, G.; Deneve, S. Online Learning with Hidden Markov Models. Neural Comput. 2008, 20, 1706–1716. [Google Scholar] [CrossRef]
  33. Vasquez, D.; Fraichard, T.; Laugier, C. Incremental Learning of Statistical Motion Patterns with Growing Hidden Markov Models. IEEE Trans. Intell. Transp. Syst. 2009, 10, 403–416. [Google Scholar] [CrossRef]
  34. Walter, M.; Psarrou, A.; Psarrou, R.; Gong, S. Learning Prior and Observation Augmented Density Models for Behaviour Recognition. In Proceedings of the BMVC 1999—British Machine Vision Conference 1999, Nottingham, UK, 13–16 September 1999. [Google Scholar]
  35. Meuter, M.; Iurgel, U.; Park, S.-B.; Kummert, A. The Unscented Kalman Filter for Pedestrian Tracking from a Moving Host. In Proceedings of the 2008 IEEE Intelligent Vehicles Symposium, Eindhoven, The Netherlands, 4–6 June 2008; pp. 37–42. [Google Scholar] [CrossRef]
  36. Wiest, J.; Höffken, M.; Kreßel, U.; Dietmayer, K. Probabilistic Trajectory Prediction with Gaussian Mixture Models. In Proceedings of the 2012 IEEE Intelligent Vehicles Symposium, Madrid, Spain, 3–7 June 2012; pp. 141–146. [Google Scholar] [CrossRef]
  37. Gaz, C.; Cognetti, M.; Oliva, A.; Robuffo Giordano, P.; De Luca, A. Dynamic Identification of the Franka Emika Panda Robot with Retrieval of Feasible Parameters Using Penalty-Based Optimization. IEEE Robot. Autom. Lett. 2019, 4, 4147–4154. [Google Scholar] [CrossRef]
  38. Quigley, M.; Gerkey, B.; Smart, W.D. Programming Robots with ROS: A Practical Introduction to the Robot Operating System; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2015. [Google Scholar]
  39. Garrido-Jurado, S.; Muñoz-Salinas, R.; Madrid-Cuevas, F.J.; Marín-Jiménez, M.J. Automatic Generation and Detection of Highly Reliable Fiducial Markers Under Occlusion. Pattern Recognit. 2014, 47, 2280–2292. [Google Scholar] [CrossRef]
  40. Kohler, M. Using the Kalman Filter to Track Human Interactive Motion: Modelling and Initialization of the Kalman Filter for Translational Motion; Citeseer: Pennsylvania, PA, USA, 1997. [Google Scholar]
  41. Julier, S.J.; Uhlmann, J.K. New Extension of the Kalman Filter to Nonlinear Systems. In Proceedings of the Signal Processing, Sensor Fusion, and Target Recognition VI, Orlando, FL, USA, 21–24 April 1997; Volume 3068, pp. 182–193. [Google Scholar] [CrossRef]
Figure 1. Example of B-spline trajectory: p ( s ) (black line) is an approximation of the control polygon (red dashed line) defined by the control points p i .
Figure 1. Example of B-spline trajectory: p ( s ) (black line) is an approximation of the control polygon (red dashed line) defined by the control points p i .
Robotics 12 00118 g001
Figure 2. Example of forward-chained left–right HMM: being in state h i , it is only possible to move to the next state h i + 1 or remain in the current state.
Figure 2. Example of forward-chained left–right HMM: being in state h i , it is only possible to move to the next state h i + 1 or remain in the current state.
Robotics 12 00118 g002
Figure 3. Block scheme representation of the proposed framework for collision avoidance.
Figure 3. Block scheme representation of the proposed framework for collision avoidance.
Robotics 12 00118 g003
Figure 4. B-spline with associated dynamical systems to control points p i : when an external force F r e p , 2 acts on p 2 , it locally changes the desired path.
Figure 4. B-spline with associated dynamical systems to control points p i : when an external force F r e p , 2 acts on p 2 , it locally changes the desired path.
Robotics 12 00118 g004
Figure 5. B-spline and HMM combined together: while the former describes the nominal path p ( s ) in the Cartesian domain, the latter provides a probabilistic map of it as described in (7), together with transition probabilities a i , j among control points p i computed such that a left–right HMM is obtained, thus ensuring always going from a start position to a goal one.
Figure 5. B-spline and HMM combined together: while the former describes the nominal path p ( s ) in the Cartesian domain, the latter provides a probabilistic map of it as described in (7), together with transition probabilities a i , j among control points p i computed such that a left–right HMM is obtained, thus ensuring always going from a start position to a goal one.
Robotics 12 00118 g005
Figure 6. Control at both trajectory and symbolical level using B-splines and HMM ( ξ = 0.5 ). A planar task, also used in the experiments, is considered for illustration purposes. It is encoded using a B-spline exploiting 7 control points (with fixed height z = 0.3 m), and an HMM with observation space described by a GMM with 7 components.
Figure 6. Control at both trajectory and symbolical level using B-splines and HMM ( ξ = 0.5 ). A planar task, also used in the experiments, is considered for illustration purposes. It is encoded using a B-spline exploiting 7 control points (with fixed height z = 0.3 m), and an HMM with observation space described by a GMM with 7 components.
Robotics 12 00118 g006
Figure 7. Snapshots from experiments conducted on the B-spline–HMM-based controller. The control points corresponding to the nominal path are displayed in magenta. In sequence (A), the hand holding the marker moves slightly above the end effector, inducing the robot to slow down and modify its path to pass under the hand. In sequence (B), the hand suddenly moves in front of the end effector, giving the robot no time to modify its path. Therefore, the robot controller decides to stop and wait until the hand changes position. Here is a link to the related video: https://www.youtube.com/watch?v=z6HBSz7o4qo (accessed on 15 March 2023).
Figure 7. Snapshots from experiments conducted on the B-spline–HMM-based controller. The control points corresponding to the nominal path are displayed in magenta. In sequence (A), the hand holding the marker moves slightly above the end effector, inducing the robot to slow down and modify its path to pass under the hand. In sequence (B), the hand suddenly moves in front of the end effector, giving the robot no time to modify its path. Therefore, the robot controller decides to stop and wait until the hand changes position. Here is a link to the related video: https://www.youtube.com/watch?v=z6HBSz7o4qo (accessed on 15 March 2023).
Robotics 12 00118 g007
Figure 8. Box plots of the values reported in Table 1; in plots (ac) the values of T c o m p , Δ L , and T s t o p are displayed, respectively.
Figure 8. Box plots of the values reported in Table 1; in plots (ac) the values of T c o m p , Δ L , and T s t o p are displayed, respectively.
Robotics 12 00118 g008
Figure 9. Three experiments were conducted using the same obstacle trajectory (represented by the black dashed line). Each experiment virtually reproduced the trajectory, with the nominal path shown in blue and the modified path in red. The XY coordinates of the end effector were observed over time with respect to the obstacle position. The overall trajectory was projected onto the XY plane on the right-hand side for better visualization of geometric path modifications. In experiment (a), only B-spline control was used ( ξ = 1.0 ), resulting in path modifications but no change in task duration. The nominal and modified trajectories were aligned in time. In experiment (b), only HMM control was used ( ξ = 0.0 ). The end effector trajectory slowed down when close to the obstacle, but no path modification was observed as the XY projections of the nominal and modified trajectories perfectly overlapped. In experiment (c), control was implemented at both the trajectory and symbolic levels using B-splines and HMM ( ξ = 0.5 ). When the obstacle approached the robot, both spatial and temporal evolution were modified.
Figure 9. Three experiments were conducted using the same obstacle trajectory (represented by the black dashed line). Each experiment virtually reproduced the trajectory, with the nominal path shown in blue and the modified path in red. The XY coordinates of the end effector were observed over time with respect to the obstacle position. The overall trajectory was projected onto the XY plane on the right-hand side for better visualization of geometric path modifications. In experiment (a), only B-spline control was used ( ξ = 1.0 ), resulting in path modifications but no change in task duration. The nominal and modified trajectories were aligned in time. In experiment (b), only HMM control was used ( ξ = 0.0 ). The end effector trajectory slowed down when close to the obstacle, but no path modification was observed as the XY projections of the nominal and modified trajectories perfectly overlapped. In experiment (c), control was implemented at both the trajectory and symbolic levels using B-splines and HMM ( ξ = 0.5 ). When the obstacle approached the robot, both spatial and temporal evolution were modified.
Robotics 12 00118 g009
Table 1. Validation tests results in terms of average computation time for single iteration T c o m p , normalized average path deviation Δ L , average stop time T s t o p , and success rate ρ .
Table 1. Validation tests results in terms of average computation time for single iteration T c o m p , normalized average path deviation Δ L , average stop time T s t o p , and success rate ρ .
T c o m p (ms) Δ L T s t o p (s) ρ
B-spline3.515 ± 1.530.590 ± 0.08/77.97%
HMM1.480 ± 2.700.000 ± 0.000.431 ± 0.0189.66%
B-spline+HMM4.649 ± 0.550.304 ± 0.070.514 ± 0.0194.41%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Braglia, G.; Tagliavini, M.; Pini, F.; Biagiotti, L. Online Motion Planning for Safe Human–Robot Cooperation Using B-Splines and Hidden Markov Models. Robotics 2023, 12, 118. https://doi.org/10.3390/robotics12040118

AMA Style

Braglia G, Tagliavini M, Pini F, Biagiotti L. Online Motion Planning for Safe Human–Robot Cooperation Using B-Splines and Hidden Markov Models. Robotics. 2023; 12(4):118. https://doi.org/10.3390/robotics12040118

Chicago/Turabian Style

Braglia, Giovanni, Matteo Tagliavini, Fabio Pini, and Luigi Biagiotti. 2023. "Online Motion Planning for Safe Human–Robot Cooperation Using B-Splines and Hidden Markov Models" Robotics 12, no. 4: 118. https://doi.org/10.3390/robotics12040118

APA Style

Braglia, G., Tagliavini, M., Pini, F., & Biagiotti, L. (2023). Online Motion Planning for Safe Human–Robot Cooperation Using B-Splines and Hidden Markov Models. Robotics, 12(4), 118. https://doi.org/10.3390/robotics12040118

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop