Volumetric Obstacle Avoidance Based on Dynamic Movement Primitives for Robot Path Planning in Human–Robot Collaboration

Sosa-Ceron, Arturo Daniel; Gonzalez-Hernandez, Hugo G.; Reyes-Avendaño, Jorge Antonio

doi:10.3390/app16052531

Open AccessArticle

Volumetric Obstacle Avoidance Based on Dynamic Movement Primitives for Robot Path Planning in Human–Robot Collaboration

by

Arturo Daniel Sosa-Ceron

,

Hugo G. Gonzalez-Hernandez

^*

and

Jorge Antonio Reyes-Avendaño

School of Engineering and Sciences, Tecnologico de Monterrey, Ave. Eugenio Garza Sada 2501 Sur, Col. Tecnológico, Monterrey 64700, NL, Mexico

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2026, 16(5), 2531; https://doi.org/10.3390/app16052531

Submission received: 6 May 2025 / Revised: 19 June 2025 / Accepted: 10 February 2026 / Published: 6 March 2026

(This article belongs to the Special Issue Human–Robot Interaction and Control)

Download

Browse Figures

Versions Notes

Abstract

Human–robot collaboration (HRC) can be defined as the close interaction between a human user and a robot working together to accomplish a specific task. True collaboration, however, can only be realized when humans and robots can share the same workspace simultaneously and move freely within it. To address these problems, Learning from Demonstrations (LfD) helps robots become competent in solving plenty of complicated tasks, greatly reducing programming times and allowing task generalization. However, complex robot tasks require complex path planning modeling for a robot to move from one place to another in a heavily constrained workspace following a collision-free path. To this end, a robot programming framework based on Dynamic Movement Primitives (DMPs) is proposed. The framework derives and implements a solution for robot path planning and includes a new DMP formulation with volumetric obstacle avoidance for robot LfD. The formulation equips robotic systems with the capability of online adaptation in the presence of dynamic obstacles. Quantitative evaluations demonstrate high success rates (>96% in tested scenarios) in collision avoidance and typical trajectory adaptation times in the order of milliseconds (<5 ms), supporting its applicability. These methods have been applied in both simulation and real robotic scenarios using a UR10e collaborative robot from Universal Robots for testing and validation purposes. The results indicate that the proposed approach can effectively make the robot follow a user-defined trajectory and learn how to adapt it to avoid collisions with volumetric obstacles of different shapes and poses in an unconstrained human–robot collaborative environment.

Keywords:

human–robot collaboration; learning from demonstrations; Dynamic Movement Primitives (DMPs); volumetric obstacle avoidance

1. Introduction

Human–robot collaboration (HRC) can be defined as the close interaction between a human user and a robot working together to accomplish a specific task. True collaboration, however, is only achieved when humans and robots can share the same workspace simultaneously and can perform tasks concurrently during production operation [1]. One of the biggest challenges in collaborative robotics resides in how to achieve full collaboration between humans and robots. Even when collaborative robots (cobots) have gained popularity in industrial settings, it is still uncommon to find them working in close interaction with human operators due to a lack of intuitive robot programming, limiting non-expert operators to create and alter robot programs quickly and intuitively [2]. To overcome these programming complexities and enable robots to learn new skills and adapt to tasks demonstrated by humans within collaborative settings, Learning from Demonstration (LfD) offers a promising paradigm.

LfD aims to equip robots with the ability to acquire new skills by observing movements performed by an expert to solve the same task. This concept is rooted in neuroscience, serving as a means to emulate the human learning process in robots [3]. The LfD process encompasses five stages: Instructor Selection, Data Acquisition, Data Modeling, Task Execution, and Learning Refinement. Initially, the expert who will instruct the task resolution is selected. Typically, the human user who possesses the precise knowledge to solve the task assumes this role. Next, the Data Acquisition phase involves determining how task information will be recorded and how task-specific information will be mapped to the robot’s configuration space. The essence of LfD lies in the Data Modeling process, where a suite of algorithms is employed to learn task features for future reproducibility. Subsequently, during the Task Execution stage, the system’s performance is assessed. Many algorithms utilized in the data modeling phase are designed to be applicable in scenarios not explicitly taught during demonstrations, enabling the evaluation of learning effectiveness. Finally, if necessary, adjustments can be made in the Learning Refinement stage. This stage allows for the fine-tuning of learning parameters or the addition of new information to demonstrations to enhance task execution.

Of course, another major concern is safety. Safety is a crucial aspect of autonomous dynamical systems expected to operate in unknown and unstructured environments. Regarding system properties, safety ensures that undesirable states or events do not occur [4]. Cobots have critical safety requirements due to HRC typically occurring in unstructured and dynamic environments. The coexistence of humans and cobots in shared spaces creates hazardous situations that require appropriate mechanisms to prevent uncontrolled physical contact between them [5]. Avoiding such contact can be viewed as a path-planning problem involving obstacle avoidance. While cobots are designed with inherent safety features, achieving seamless and efficient collaboration in dynamic environments shared with humans remains a significant hurdle. Many existing HRC systems rely on predefined paths or simplistic reactive strategies (e.g., stopping upon collision detection), which can limit fluidity and task efficiency, especially when unexpected obstacles such as moving human co-workers are present. Consequently, there is a pressing need for more advanced techniques that facilitate intuitive robot programming and enable robust, real-time adaptation to dynamic workspaces [2]. Active obstacle avoidance in the LfD framework can enhance cobots’ ability to complete tasks safely and effectively, significantly expanding the LfD application field [6].

This work proposes a LfD programming framework for human–robot collaborative scenarios, designed to enhance collaboration speed and safety by enabling adaptive robot movement in human presence. The framework introduces an improved method of volumetric obstacle avoidance based on Dynamic Movement Primitives (DMPs) formulation [7,8,9], which is a popular LfD algorithm used to represent and generate complex movements, and superquadrics [10], which are a family of geometric shapes used to model a variety of 3D shapes. This enables a robotic manipulator to perform tasks in dynamic, unconstrained environments. The major contribution of this work is the implementation of a complete robotic system for rapid robot programming and task generalization. This includes perception, hand–eye robot coordination, trajectory learning, task adaptation, and obstacle avoidance of objects modeled with superquadric shapes with varying sizes and poses. The robotic system is tested in a collaborative cell, considering human actions alongside robot movements. This contributes to improving the integration of collaborative robots in settings requiring human intervention and action.

The rest of this work is organized as follows: Section 2 presents a literature review on obstacle avoidance in the context of the LfD framework. Then, in Section 3, the basic formulation of DMPs and superquadrics are explained. In Section 4, the proposed programming framework algorithms and implementation steps are thoroughly described. Section 5 is dedicated to validating the experiment design through theoretical and simulation validity before applying the framework to a real human–robot scenario. Section 6 demonstrates the results obtained via experimental evaluation on a real HRC task. Finally, Section 7 discusses the overall performance of the proposed programming framework, its advantages over similar methods, and future research directions and opportunities.

2. Literature Review

In the LfD framework, the learning process occurs during the Data Modelling stage, where the framework gives a way to learn various goal-directed movement skills in high-dimensional continuous state-action spaces directly from human demonstration. As pointed out in [11], there exists a great variety of algorithms focused on LfD. In the context of primitive movement learning, it is possible to classify the learning algorithms into two categories: Deterministic and Probabilistic methods.

The most representative solution of the deterministic methods in LfD is known as DMPs. The term was first introduced by Ijspeert et al. in [7], and further improved in their later work [9]. DMPs have their origin in biological systems, particularly in motor control, and refer to a framework for trajectory learning based on second-order ODEs of the spring-mass-damping type with a forcing term. Furthermore, in the context of the robot obstacle avoidance problem, customized obstacle-avoidance algorithms have been devised utilizing DMPs [12].

While DMP-based methods offer strong generalization capabilities from a few demonstrations, other learning paradigms such as Reinforcement Learning (RL) have also been extensively explored for robot path planning in dynamic environments. For instance, RL enables robots to learn complex behaviors and navigation strategies through interaction and trial-and-error, sometimes even without explicit global information [13]. However, RL approaches often require significant training time and data, and ensuring safety during the learning process in physical HRC scenarios presents a considerable challenge. In contrast, the LfD framework, particularly when augmented with DMPs, aims for faster skill acquisition while providing a structured mathematical basis for incorporating modifications such as the proposed volumetric obstacle avoidance.

To ensure both obstacle avoidance and stability within the DMP framework, a perturbing term is incorporated. This term is typically constructed using potential functions [14], or the scheme of steering angle presented in [15], to facilitate efficient obstacle avoidance while preserving the overall stability of the system. The problem with these solutions is that the coupling term considers point-like obstacles, which are impractical in some situations.

The literature details several attempts to extend the DMP formulation for volumetric obstacle avoidance. One approach, for instance, represents obstacles as point clouds, utilizing the nearest point to the robot in the steering-angle-based obstacle avoidance formulation [16]. Other examples of modeling volumes as point clouds can be found in [17], which addresses issues related to trajectory jittering, ineffective obstacle avoidance in specific scenarios, and better preservation of teaching intentions in dynamic environments for enhanced performance. Meanwhile, ref. [18] employs a method of hyperparameter optimization based on RL to learn both the profiles of potentials and the shape parameters of a motion.

However, as presented by [19], some of the drawbacks of modeling volumes with point clouds are the high computational time due to the density of the point cloud and non-smooth behaviors due to the constantly changing nearest point between the robot and the obstacle. To tackle this, ref. [19] enhanced DMPs to support volumetric obstacle avoidance using superquadric functions in scenarios where obstacles have known and unknown shapes. The superquadrics’ implicit formulation was modified to propose a superquadric potential function based on superellipses that model the shape of the obstacle potential field. This solution was originally developed for static obstacles and was expanded later in [20], showing promising results in dynamic obstacle avoidance in different robotic scenarios. The main drawback of this solution is that convergence and stability of the DMPs plus the potential field is not guaranteed. To avoid these problems, a solution composed of steering angle avoidance and superquadric models was adopted in [21]. The stability and convergence of the method are proven and obstacle avoidance is guaranteed for multiple volumetric obstacles; still, dynamic obstacle avoidance was not tested.

Thus, the proposed solution in this work will extend the work of Ginesi et al. [20] and Liu et al. [21] to:

Leverage LfD to reduce programming time and enable task generalization for complex robot tasks in constrained workspaces.
Include superquadrics in the general pose to model static and dynamic obstacles.
Tackle some of the major drawbacks of the steering angle scheme by addressing limitations like the “dead zone” issue and extending the approach to volumetric obstacles using a modified Mollifier function.
The framework was successfully tested and validated in both simulations and real-world scenarios using a collaborative robot, demonstrating effective trajectory learning, adaptation, and collision avoidance with static and dynamic obstacles (including human arms) in an HRC setting.

3. Materials and Methods

This section introduces the fundamental concepts underpinning the proposed framework for robot path planning with volumetric obstacle avoidance in HRC. We begin by detailing the standard formulation of DMPs, a prominent technique within LfD for encoding and generalizing trajectories. Following this, we describe superquadrics, the geometric modeling approach used to represent obstacles volumetrically. These core methodologies form the basis for the enhanced DMP formulation and implementation presented in Section 4, which enables reactive collision avoidance in dynamic environments.

3.1. Dynamic Movement Primitives

DMPs aim to generalize a trajectory, even if the initial and final positions change, while maintaining the shape of the original trajectory by modeling its forcing term. In first-order notation, the DMP formulation proposed in [7,9] is:

\{\begin{matrix} τ \dot{v} = K (x_{g} - x) - D v + (x_{g} - x_{0}) f (z) \\ τ \dot{x} = v \end{matrix}

(1)

where

x, v, x_{g}, x_{0} \in R^{n}

are, respectively, the position and velocity of the system, goal and initial positions, and

f \in R^{n}

is the non-linear forcing term. Matrices

K, D \in R^{n \times n}

are diagonal matrices representing the elastic and damping terms, usually chosen to make the system critically damped (

D = 2 \sqrt{K}

) for

x_{0}

to monotonically converge toward

x_{g}

. Parameter

τ \in R^{+}

is a temporal scaling factor, and parameter

z \in R

is a re-parametrization of time, governed by the so-called canonical system. The introduction of this term allows DMPs to synchronize dynamical systems with multiple degrees of freedom, where each degree of freedom can have its equation system but share a common phase.

τ \dot{z} = - α z, α > 0 .

(2)

where

α \in R^{+}

determines the exponential decay.

The forcing term,

f (z) = {[f_{1} (z), f_{2} (z), \dots, f_{N} (z)]}^{T}

, is expressed using a series of basis functions. Each component

f_{j} (z)

,

j = 1, 2, \dots, N

is written as

f_{j} (z) = \frac{\sum_{i = 1}^{N} ω_{i} ψ_{i} (z)}{\sum_{i = 1}^{N} ψ_{i} (z)} z

(3)

where

ω_{i} \in R

are adjustable weights, and

{ψ_{i} (z)}_{i = 1}^{N}

is a set of

N

exponential basis functions. In the literature, Radial Gaussian basis functions are used: given a set of centers

{c_{i}}_{i = 1}^{N}

and a set of positive widths

{h_{i}}_{i = 1}^{N}

, we have

ψ_{i} (z) = exp (- h_{i} {(z - c_{i})}^{2}) .

(4)

where

c_{i}

are the centers of Gaussian basis functions distributed along the phase of the movement and

h_{i}

are their widths. We can define

c_{i} = exp (- α \frac{i - 1}{N - 1}), h_{i} = \frac{1}{{(c_{i + 1} - c_{i})}^{2}}

and

h_{N} = h_{N - 1}

with

i = 1, \dots, N

.

The learning process consists of the computation of the weights

ω_{i} \in R

that best approximate the desired forcing term

f_{d} (z)

, obtained by solving (1) for

f (z)

. Given a desired trajectory

x_{d} (t)

,

t \in [0, T]

with velocity

v_{d} (t) = τ {\ddot{x}}_{d} (t)

, this results in:

f_{d} (z (t)) = γ [τ^{2} {\ddot{x}}_{d} (t) - τ D {\dot{x}}_{d} (t) - K (x_{g} - x_{d} (t)]

(5)

where

z (t) = exp (- α t)

,

γ = \frac{1}{x_{g} - x_{0}}

and

x_{g} = x_{d} (T), x_{0} = x_{d} (0)

. We can compute the vector

ω = [ω_{1}, ω_{2}, \dots, ω_{N}]

by solving the linear system:

Φ ω = f_{d}

(6)

where

Φ = [\begin{matrix} \frac{ψ_{1} (z_{0})}{\sum_{i = 1}^{N} ψ_{i} (z_{0})} z_{0} & \dots & \frac{ψ_{N} (z_{0})}{\sum_{i = 1}^{N} ψ_{i} (z_{0})} z_{0} \\ ⋮ & ⋱ & ⋮ \\ \frac{ψ_{1} (z_{T})}{\sum_{i = 1}^{N} ψ_{i} (z_{T})} z_{T} & \dots & \frac{ψ_{N} (z_{T})}{\sum_{i = 1}^{N} ψ_{i} (z_{T})} z_{T} \end{matrix}]

(7)

3.2. Obstacle Avoidance for DMPs

Another advantage of the DMP formulation is its ability to incorporate additional forcing terms for modeling desired behaviors during trajectory reproduction; for instance, forcing the trajectory to reach specific via points not present during the demonstration. This capability is leveraged in [14] to introduce a coupling term based on potential fields, enabling the implementation of obstacle avoidance within the DMP framework. However, a drawback of potential fields arises when dealing with multiple obstacles, as it can result in the system getting trapped in local minima, and in consequence, no movement can be generated, compromising system stability and convergence. To address such limitations, Hoffmann et al. [15] proposed a formulation based on a steering angle. This formulation was later adopted by [9] in their comprehensive review of DMPs, inspired empirically by human behavior during obstacle avoidance. The original formulation (1) is extended by adding an additional coupling term

p (x, v)

to the first differential equation:

τ \dot{v} = K (x_{g} - x) - D v + (x_{g} - x_{0}) f (z) + p (x, v)

(8)

The differential equation that models human obstacle avoidance behavior is described by the change of the steering angle

ϑ

according to

\dot{ϑ} = γ ϑ exp (- β | ϑ |)

(9)

where

γ, β \in R_{+}

are constant gains. As shown in Figure 1 the steering angle can be calculated as

ϑ = arccos (\frac{〈 o - x, v 〉}{∥ o - x ∥ ∥ v ∥})

(10)

where

o

is the position of the obstacle. We can relate the change in the steering angle to the necessary direction change in the velocity vector

v

of the robot to avoid the obstacle as

\dot{v} = Rv \dot{ϑ}

(11)

Matrix

R

is a rotational matrix of angle

π / 2

with the axis generated by

(o - x) \times v

. This velocity change can be used as the coupling term

p (x, v)

as

p (x, v) = γ Rv ϑ exp (- β | ϑ |)

(12)

The issue with the earlier formulation lies in the assumption that each obstacle is treated as a point in space. In practical robotics applications, this presents a challenge because neglecting the shape of obstacles can result in unintended behaviors and collisions. Thus, another approach found in the literature, and upon which the presented solution is based, involves modeling obstacles using superquadric models.

Superquadrics are a family of shapes that includes superellipsoids, supertoroids, and superhyperboloids that can model any shape with a small number of parameters [10]. Superquadrics’ definition is modeled as superellipsoids with varying exponents:

{({(\frac{x}{a_{1}})}^{\frac{2}{ϵ_{2}}} + {(\frac{y}{a_{2}})}^{\frac{2}{ϵ_{2}}})}^{\frac{ϵ_{2}}{ϵ_{1}}} + {(\frac{z}{a_{3}})}^{\frac{2}{ϵ_{1}}} = 1

(13)

where

a_{1}, a_{2}

and

a_{3} \in R_{+}

determine the size of the superquadric and

ϵ_{1}

and

ϵ_{2} \in R_{+}

decide the shape of it. The explicit representation of it is

x (η, ω) = [\begin{matrix} a_{1} {cos}^{ϵ_{1}} (η) {cos}^{ϵ_{2}} (ω) \\ a_{2} {cos}^{ϵ_{1}} (η) {sin}^{ϵ_{2}} (ω) \\ a_{3} {sin}^{ϵ_{1}} (η) \end{matrix}]

(14)

where

η

and

ω

are bounded

- π / 2 \leq η \leq π / 2

and

- π \leq ω < π

. So, we can define any volumetric shape using only five parameters. In Figure 2, we present some examples of volumetric shapes that can be generated with the abovementioned equation.

4. Implementation

Let us consider (8),

τ \dot{v} = K (x_{g} - x) - D v + (x_{g} - x_{0}) f (z) + p (x, v)

where

p (x, v)

represents a coupling term in the DMP dynamical system formulation. As described in Section 3, the steering angle approaches the coupling term, as described by (12).

p (x, v) = γ Rv ϑ exp (- β | ϑ |)

This term can be rewritten as

p (x, v) = γ Rv Θ

(15)

where

Θ = ϑ F_{o}

describes the influence of a force generated by an obstacle on the steering angle. This term

F_{o}

will be modeled after the general pose description of a superquadric model. Usually, a superquadric centered in a local coordinate system

(x_{s}, y_{s}, z_{s})

can be defined by only five parameters (

a_{1}, a_{2}, a_{3}, ϵ_{1},

and

ϵ_{2}

). When describing the same superquadric in the general pose, it is necessary to include six additional parameters describing the pose vector relative to the center of the world coordinate system

(x_{w}, y_{w}, z_{w})

[22]. Thus, it is required to apply a frame coordinate transformation in the form of

[\begin{matrix} x_{w} \\ y_{w} \\ z_{w} \\ 1 \end{matrix}] = T [\begin{matrix} x_{s} \\ y_{s} \\ z_{s} \\ 1 \end{matrix}]

(16)

where

T = [\begin{matrix} r_{11} & r_{12} & r_{13} & p_{x} \\ r_{21} & r_{22} & r_{23} & p_{y} \\ r_{31} & r_{32} & r_{33} & p_{z} \\ 0 & 0 & 0 & 1 \end{matrix}]

(17)

with

r_{i j}

, with

i, j \in R^{+} = {1, 2, 3}

as the rotation components and

p_{x}, p_{y}, and p_{z}

as the translation vector. Since it is needed to express this relation in superquadric-centered coordinates, these coordinates are computed as

[\begin{matrix} x_{s} \\ y_{s} \\ z_{s} \\ 1 \end{matrix}] = T^{- 1} [\begin{matrix} x_{w} \\ y_{w} \\ z_{w} \\ 1 \end{matrix}]

(18)

with

T^{- 1} = [\begin{matrix} r_{11} & r_{21} & r_{31} & p_{1} \\ r_{12} & r_{22} & r_{32} & p_{2} \\ r_{13} & r_{23} & r_{33} & p_{3} \\ 0 & 0 & 0 & 1 \end{matrix}]

(19)

where

p_{1} = - (p_{x} r_{11} + p_{y} r_{21} + p_{z} r_{31}), p_{2} = - (p_{x} r_{12} + p_{y} r_{22} + p_{z} r_{32}), and p_{3} = - (p_{x} r_{13} + p_{y} r_{23} + p_{z} r_{33})

.

By substituting (18) and (19) into (13), the implicit function representation for superquadrics in the general pose is obtained.

\begin{matrix} f_{s} (x_{s}, y_{s}, z_{s}) = \\ ({(\frac{r_{11} x_{w} + r_{21} y_{w} + r_{31} z_{w} + p_{1}}{a_{1}})}^{\frac{2}{ϵ_{2}}} \\ + {(\frac{r_{12} x_{w} + r_{22} y_{w} + r_{32} z_{w} + p_{2}}{a_{2}})}^{\frac{2}{ϵ_{2}}})^{\frac{ϵ_{2}}{ϵ_{1}}} \\ + {(\frac{r_{13} x_{w} + r_{23} y_{w} + r_{33} z_{w} + p_{3}}{a_{3}})}^{\frac{2}{ϵ_{1}}} \end{matrix}

(20)

Then, considering the work of [21], function

F_{o}

can be formulated as

F_{o} = {log}_{b}^{- 1} (f_{s})

(21)

where

b \in R^{+}

represents the base of the logarithmic function. As a constant, b determines the magnitude of

F_{o}

. Unfortunately, dealing with the correct selection of a constant b adds complexity to the formulation. Instead, to reduce the number of extra parameters to tune, Equation (22) is proposed:

F_{o} = exp ({log}^{- 1} (f_{s}))

(22)

The selection of logarithmic function to describe (22) is simple to understand. When the robot is close to the surface of the obstacle, the function

f_{s}

will decrease to 1, and

F_{o}

will increase to

F_{o} \to \infty

, forcing the robot to move away from the obstacle, which is the desired behavior. While the use of the exponential function will preserve its behavior of being strictly increasing and convex, it will grow extremely fast for values of

f_{s} \to 1^{+}

. So the new DMP coupling term can be expressed as

p (x, v) = γ Rv ϑ exp ({log}^{- 1} (f_{s}))

(23)

This term can be further improved following the directions of [16,23]. Both works add an extra term considering the influence of the distance between the robot and the obstacle. If the robot is moving away from the obstacle, the effect of the coupling term should reduce to zero and vice versa; if the robot moves toward the obstacle, the effect of the coupling term should be maximized. Another drawback of the original steering angle formulation is called the “dead zone” in [23]. As the authors described it, the dead zone problem involves a “heading range towards the obstacle for which the system becomes incoherently less reactive." To avoid it, the authors proposed an additional term based on a Gaussian function to solve the dead zone problem. This can be visualized in Figure 3.

Therefore, the coupling term can be enhanced by adding two exponential terms to (15), one

exp (- k d^{2})

to regulate the coupling term based on the robot-obstacle distance, and the second,

sign (ϑ) exp (- ϑ^{2} / φ^{2})

, to tackle the dead zone problem,

p (x, v) = γ Rv sign (ϑ) exp (- k d^{2}) exp (\frac{- ϑ^{2}}{φ^{2}}) Θ

(24)

where

γ

and k

\in R

are constants. However, this solution is meant to be applied to point obstacles where it is possible to ignore an obstacle when

ϑ > π / 2

. When working with volumetric obstacles, angles greater than

π / 2

can produce collisions depending on the shape of the obstacle. Considering convex obstacles only, complete obstacle avoidance can be guaranteed only when

ϑ \geq π

(because the robot will be moving in the opposite direction from the obstacle); it is necessary to modify the Gaussian function to include the interval

π / 2 < ϑ < π

. Nonetheless, the coupling term does not reduce to zero when

ϑ \to π

, as shown in Figure 4a. For this reason, the proposed solution implements a slight modification to the formulation; the Gaussian function is replaced by a Mollifier function obtaining the final form of the proposed coupling term

p (x, v) = \{\begin{matrix} γ Rv ϑ_{m} exp ({log}^{- 1} (f_{s})), & if - φ < ϑ < φ \\ 0, & otherwise \end{matrix}

(25)

where

ϑ_{m} = sign (ϑ) exp (\frac{- 1}{1 - {|\frac{ϑ}{φ}|}^{2}}) exp (- k d^{2})

.

With this modification, the properties of the Gaussian function are preserved and

p (x, v) \to 0

when

ϑ \to π

as demonstrated in Figure 4b.

Algorithm 1 shows how obstacle avoidance is implemented in the proposed solution.

Algorithm 1 Dynamic Movement Primitives with obstacle avoidance

Input:: $x_{d}$ cartesian trajectory vector, ${\dot{x}}_{d}$ velocity vector (optional), ${\ddot{x}}_{d}$ acceleration vector (optional), $K$ stiffness matrix, $α$ canonical system constant, $τ$ time rescaling factor, T duration of the movement, N number of basis functions, $a_{1}, a_{2}, a_{3}, ϵ_{1}, ϵ_{2}, p_{s}$ superquadric parameters and pose vector, $γ, k, v_{o} obstacle velocity (optional, if not given 0)$

1:: $z \leftarrow$ canonical system $exp (- \frac{α}{τ} * t)$
2:: $x_{g} \leftarrow x_{d} (T)$
3:: $x_{0} \leftarrow x_{d} (0)$
4:: $z \leftarrow$ canonical system $exp (- \frac{α}{τ} * t)$
5:: $ψ_{i} (z) \leftarrow exp (- h_{i} {(z - c_{i})}^{2})$ where $c_{i} = exp (- α \frac{i - 1}{N - 1})$ , $h_{i} = \frac{1}{{(c_{i + 1} - c_{i})}^{2}}$ , and $i = 1, \dots, N$
6:: $γ \leftarrow \frac{1}{x_{g} - x_{0}}$
7:: $f_{d} \leftarrow$ desired forcing term $γ [τ^{2} {\ddot{x}}_{d} - τ D {\dot{x}}_{d} - K (x_{g} - x_{d})]$
8:: $ω \leftarrow 1$
9:: Initialize $Φ$ , with $\frac{ψ_{i} (z_{j})}{\sum_{i = 1}^{N} ψ_{i} (z_{j})} z_{j}$ where $i = 1, \dots, N$ and $j = 0, \dots, T$
10:: Solve $ω$ do
11:: Locally weighted regression on $ω = Φ^{- 1} f_{d}$
12:: $P_{J} = \frac{1}{λ} (P_{J - 1} - \frac{P_{J - 1} φ_{J} φ_{J}^{⊤} P_{J - 1}}{λ + φ_{J}^{⊤} P_{J - 1} φ_{J}})$ where $λ$ is the forgetting factor and $φ_{J}$ is the ${}_{J}{-th}$ transposed row of $Φ$
13:: $ω_{J} = ω_{J - 1} + (f_{d} - φ_{J}^{⊤} ω_{J - 1}) P_{J} φ_{J}$
14:: end Solve
15:: $f \leftarrow Φ ω$
16:: $o \leftarrow p_{s x}, p_{s y}, p_{s z}$ obstacle position components
17:: $v \leftarrow \dot{x} - v_{o}$
18:: $R_{o} \leftarrow r o t (p_{s φ}, p_{s θ}, p_{s ψ})$ obstacle rotation matrix from obstacle orientation components
19:: for step $t \in T$ do
20:: $f_{s} \leftarrow$ (20) with superquadric parameters and $R_{o}$
21:: $R \leftarrow Rot (π / 2, (o - x) \times v)$
22:: $ϑ = arccos (\frac{〈 o - x, v 〉}{∥ o - x ∥ ∥ v ∥})$
23:: $d \leftarrow$ distance to obstacle
24:: $p (x, v) \leftarrow γ Rv ϑ_{m} exp ({log}^{- 1} (f_{s})), if - φ < ϑ < φ else 0$
25:: $τ \ddot{x} (t) \leftarrow K (x_{g} - x_{d} (t)) - D {\dot{x}}_{d} (t) + (x_{g} - x_{0}) f (z) + p (x, v)$
26:: $τ v (t) = \dot{x} (t)$
27:: $\dot{x} (t) \leftarrow \dot{x} (t) + τ \ddot{x} (t) * step t$
28:: $x (t) \leftarrow x (t) + τ v (t) * step t$
29:: end for

Output:: Learned $x$ cartesian trajectory vector, $\dot{x}$ velocity vector, $\ddot{x}$ acceleration vector

5. Validation

In this section, the proposed DMP extended formulation will be analyzed from two perspectives: first, mathematical convergence of the system will be proven through formal analysis of Lyapunov stability for dynamical systems, followed by multiple simulation scenarios with multiple obstacles and different trajectories.

5.1. Proof of Stability and Convergence of the System

As it was done in [15], to prove the stability and convergence of the system after the new couple term is added, it is necessary to evaluate the system after a long time has passed. This is

t \to \infty

. As the forcing term depends on z, it decays to 0 exponentially

f (z) \to 0

, so (1) can be rewritten as

\{\begin{matrix} \dot{v} = K (x_{g} - x) - D v + p (x, v) \\ \dot{x} = v \end{matrix}

(26)

A Lyapunov candidate function

V (x, v)

is proposed based on the energy function of a damped-spring-mass dynamical system with unitary mass.

V (x, v) = \frac{1}{2} {(x_{g} - x)}^{T} K (x_{g} - x) + \frac{1}{2} {\dot{v}}^{T} \dot{v}

(27)

To guarantee convergence,

\dot{V} < 0

when

v \neq 0

,

\begin{matrix} \dot{V} & = \nabla_{x} V^{T} \dot{x} + \nabla_{v} V^{T} \dot{v} \\ = - {(x_{g} - x)}^{T} K \dot{x} + v^{T} \dot{v} \\ = - {(x_{g} - x)}^{T} K v + v^{T} K (x_{g} - x) - \\ {\dot{x}}^{T} D \dot{x} + γ v^{T} Rv Θ \\ = - v^{T} D v \end{matrix}

(28)

The term

v^{T} Rv

is 0, since the rotation matrix

R

represents a rotation by

π / 2

. Thus,

v^{T} Rv = | v^{T} | | v | cos π / 2 = 0

, also to guarantee damping; the damping matrix

D

is chosen to be positive definite. If

v = 0

and

x \neq x_{g}

, then

\dot{V} = 0

; however, if

x \neq x_{g}

then

\dot{v} \neq 0

, and

\dot{V}

changes. Therefore, from LaSalle’s invariance principle [24],

x

converges to

x_{g}

, and so it determines the proposed solution using the new coupling term.

5.2. Synthetic Experiments

The newly proposed coupling term is evaluated across multiple simulation scenarios. These tests aim to compare the trajectories generated by the formulation in the presence of obstacles, which vary in shape, pose, and velocity. The initial test utilizes a minimum jerk trajectory (MJT) [25] to synthesize an unconstrained point-to-point movement of a human hand. The position of this straight-line segment is defined by:

x = x_{0} + (g - x_{0}) (6 τ^{5} - 15 τ^{4} + 10 τ^{3})

(29)

where

x

is the cartesian vector describing the coordinates of the trajectory as a function of time,

τ

is the normalized time

t / t_{f}

and

0 \leq τ \leq 1

, and

x_{0}

and

g

are the initial and final positions of the generated movement.

Subsequently, the MJT trajectory is learned by DMPs with elastic and damping constants

K = K I

and

D = D I

, respectively, where

I

is the identity matrix and

K = 1050

and

D = 2 \sqrt{K}

. The hyperparameters for the coupling term are

γ = 10, k = 0.1, and - π \leq φ \leq π

, and for the validation tests, the obstacle axes are kept symmetric with values of

(0.125, 0.125, 0.125)

m, and the center of the obstacles will be fixed in the point

(- 0.1, - 0.775, 0.135)

m, which is located inside the learned trajectory. The orientation and shape of the obstacles vary to test the validity of the crafted couple-term; for the validation, the

ϵ

parameters of the superquadrics are in the interval of

[0.1, 1.5]

, respectively. Figure 5 illustrates the volumetric obstacle avoidance trajectory. Subsequent figures depicting obstacle avoidance trajectories will be presented in three distinct views: the XY-plane, the XZ-plane, and a 3D Cartesian representation. In these figures, the dashed black line corresponds to the original trajectory learned by the DMPs, while the solid red line represents the recalculated trajectory adjusted for obstacles within the robot’s workspace. The initial and goal positions of the trajectory are indicated by different shape markers: a square denotes the initial position, and a star signifies the goal position.

The rationale for selecting an obstacle whose center point aligns with the learned trajectory is to demonstrate the elimination of the dead-zone issue in the formulation. This is achieved by introducing a mollifier function, which maximizes the impact of the coupling term when the steering angle is zero. It is possible to visualize this effect in Figure 6, which compares the values of the 2-norm speed of the dynamic system represented by the DMPs (hereon, it will be referred to as the robot end-effector), in the presence of obstacles with different geometries, particularly a cube and a sphere. Both speed curves have similar maximum values, but the cube presents a sharper curve; this is correlated to the sharpness of the edges of the geometry.

Additionally, the effectiveness of the proposed coupling term is validated by applying arbitrary rotations to the geometry, demonstrating the extension of the formulation to encompass general pose superquadric functions. Furthermore, the formulation’s robustness is tested against a moving obstacle that follows an MJT for 5 s within the same workspace. Figure 7a,b illustrate flawless obstacle avoidance in both scenarios.

Finally, a more realistic simulated scenario is designed to validate the general applicability of the proposed solution. In this scenario, a trajectory is taught by a human using kinaesthetic teaching, and both single and multiple obstacles are taken into account. Figure 8 displays the trajectories generated for obstacle avoidance in simulations involving single and multiple obstacles. It is evident that the proposed solution exhibits effective obstacle avoidance behavior across various realistic scenarios.

To quantitatively assess the performance of the proposed volumetric obstacle avoidance, all the previously described scenarios were generated randomly across a total of 221 simulated trials. During these tests, the system demonstrated high reliability, achieving a success rate of 97.7% (Table 1). The only failure mode considered was collision, which occurred in 2.3% of the trials. These collision events were typically observed in highly constrained scenarios where the obstacle’s initial position was in immediate proximity to the start or end of the trajectory.

The mean peak positional deviation from the baseline trajectory was 735.55 mm (Table 2) and 740.08 mm (Table 3) for the successful and failed trials, respectively. The reactive planner maintained high computational efficiency, with a mean 99th percentile latency computation time of 1.88 ms (Table 2).

6. Results

The proposed solution was tested on a UR10e collaborative robot from Universal Robots (Odense, Denmark). The robot has six DoF and 1300 mm of reach, capable of lifting 12.5 kg and reaching speeds up to 3.142 rad/s in its joint configuration. The robot is equipped with a wrist camera and a two-finger gripper model 2F85 of the brand Robotiq (Levis, QB, Canada), as shown in Figure 9.

Likewise, is important to select the robot’s external sensors for collecting environmental data. As volumetric information about the environment is necessary, the ZED 2 stereo camera from StereoLabs (San Francisco, CA, USA) is selected. The ZED 2 stereo camera has the advantage of capturing at 60 fps in HD resolution or 30 fps in full-HD resolution, and additionally comes with some built-in object and human tracking features through its SDK, which are useful for deployment.

The workstation used for the implementation possesses an AMD Ryzen 5600G processor capable of running up to 4.4 GHz of clock frequency, and also has 32 GB of RAM and an RTX3060 NVIDIA GPU with 12 GB of DDR6 dedicated memory as its main components.

The experiments will utilize the same basic experimental setup, centered on the UR10e manipulator as shown in Figure 10. The task involves a pick-and-place operation where the robot will move a red cube of 2.5 cm from an arbitrary point in its workspace to a short pipe segment of 7.5 cm diameter, also arbitrarily positioned within the workspace. This operation can be described by a simple state machine with five actions/states: (1) visually locating the object to grasp and the target placement position; (2) moving to the object; (3) grasping it; (4) moving to the target position; and (5) releasing the object. The critical part for evaluating the algorithm’s effectiveness is the movement performed after grasping the object of interest until reaching the target position. Key performance metrics for the algorithm will include the computation time for each robot movement step and the position error relative to the calculated DMP trajectory in an obstacle-free environment.

For testing purposes, we prepared five distinct trajectories to simulate unconstrained point-to-point human hand movements. One trajectory was modeled using a MJT to represent straight-line segments. Two trajectories were directly recorded from a human user’s right-hand movements via the ZED 2 camera. Each recorded trajectory was then smoothed by fitting a five-degree polynomial (e.g., Figure 11). The remaining two trajectories were derived by flattening the recorded and smoothed trajectories.

The Cartesian vector describing these trajectories is defined in the global space frame, with its origin at the robot’s base link. Here,

x_{0}

and

g

denote the initial and goal positions for the pick-and-place operation. These positions are determined by localizing the object to be grasped (a small red cube) and the target position within the pipe segment, as illustrated in Figure 12. Cases where obstacles obstruct either the initial or goal positions during execution are excluded from this study. Such scenarios consistently yield erroneous results and pose potential risks to both the user and the robot, thus falling outside the scope of this research. Each experiment utilizes one of these five distinct trajectory shapes.

For each experiment, the trajectory is learned by a DMP with

N = 100

as the number of basis functions, and elastic and damping constants

K = K I

and

D = D I

, respectively, where

I

is the identity matrix and

K = 3550

and

D = 2 \sqrt{K}

. The hyperparameters for the coupling term are

γ = 10, k = 0.1, and - π \leq φ \leq π

. To describe the volumetric shapes of the obstacles, the

ϵ

parameters of the superquadrics are in the interval of

[0.1, 1.0]

, and the axes will be determined by the dimensions of the bounding boxes around the detected objects. The work table of the robot is described by a superquadric shape with

ϵ = [0.1, 0.1]

parameters, volume axes

[0.97, 0.5, 0.05]

, and a volume center in

[0, - 0.615, - 0.010]

.

The framework’s performance was validated across 77 real-world trials, achieving an overall success rate of 96.1% (Table 4). Failures (3.9%) were categorized into two modes: direct collisions were detected in 1.3% of trials, and precautionary stops due to exceeding the acceleration safety limit occurred in 2.6% of scenarios. The latter represents cases where the algorithm computed a valid, aggressive maneuver that was intentionally halted to ensure the physical integrity of the operator and the robotic system.

On successful runs, the mean peak deviation was 167.32 mm, and the mean 99th percentile latency computation time was 2.939 ms per step (Table 5), confirming real-time feasibility. The worst-case computation time observed across all trials was 5.51 ms, indicating robust and predictable performance under the tested conditions.

Conversely, performance metrics for unsuccessful trials, including a mean 99th percentile latency of 3.022 ms, are detailed in Table 6 for comparison. These results suggest that failures were likely due to physical constraints or trajectory issues rather than computational bottlenecks.

Furthermore, in experiments involving humans, only the right arm of the user will be visually tracked. This restriction is implemented to streamline computational processes and enable the left arm of the user to adjust the goal position dynamically during execution without compromising system performance. Additionally, the cobot maximum speed was limited to 250 mm/s, and precautionary stops due to exceeding the acceleration safety limit were kept in place, as described previously.

In these experiments, the human right arm is treated as an obstacle, and the corresponding obstacle avoidance trajectories are depicted in Figure 13. Figure 13a–c illustrate different scenarios where the right arm of the human is represented using various superquadric shapes. Different colors on the geometries indicate the initial and final positions of the right arm tracked by the ZED 2 camera: purple denotes the initial position and green denotes the final position, respectively.

As previously mentioned, the user has the capability to adjust the goal position of the trajectory during execution to demonstrate the system’s ability to converge to this new goal position. In each trajectory, a star marker indicates how the goal target position is modified during task execution, while the movements of the human right arm are represented by solid black lines.

Figure 13d illustrates the execution timesteps of the pick-and-place operation, demonstrating successful obstacle avoidance and online adaptation within the shared workspace in the presence of a human.

7. Discussion & Conclusions

This work presents a robot programming framework designed for robot LfD within human–robot collaborative environments. The framework has been developed, implemented, validated, and tested both in simulated environments and on a physical robotic station. It incorporates an enhanced formulation of DMPs that includes volumetric obstacle avoidance capabilities in Cartesian space.

The formulation implemented not only considers the geometric shape of obstacles but also their relative position, orientation, and velocity in relation to the robot. This advancement extends traditional obstacle avoidance techniques from point obstacles to volumetric obstacles, applicable to various shapes using superquadric functions as its fundamental approach. A key advantage of this method is its ability to maintain the stability and convergence properties of the original DMP formulation, even in scenarios involving dynamic obstacles.

The physical robotic station consists of an industrial collaborative manipulator equipped with a visual perception system for object and human detection and tracking. The volumetric obstacle avoidance was tested on a human–robot environment, where the robot must complete a pick-and-place operation while the user moves around in the shared workspace, and therefore the solution gives insights into human–robot interaction (HRI) in HRC task scenarios.

The quantitative results presented (Table 2 and Table 5) underscore the effectiveness of our proposed formulation. In contrast to traditional DMP obstacle avoidance techniques that often simplify obstacles to points or use potential fields prone to local minima with multiple obstacles, our approach directly models and reacts to volumetric obstacles of varying shapes and poses using superquadrics in a general pose. The demonstrated success in avoiding dynamic obstacles, including human limbs in HRC scenarios (Table 4), with an average computation time of 2.939 ms per adaptation step, highlights its practical applicability. Unlike point-cloud-based avoidance methods, which can be computationally demanding due to high data density and may result in non-smooth trajectories from rapidly changing nearest-point calculations, our superquadric modeling offers a compact and efficient representation that facilitates smoother and more predictable avoidance maneuvers.

The main drawback of the proposed method is the number of necessary hyperparameters for tuning, an activity that is critical depending on the volume shape used for modeling the obstacles. For example, the proposed coupling term is primarily influenced by the parameter

γ

, which significantly affects movement generation. Varying

γ

results in different performances in obstacle avoidance, and this behavior was qualitatively observed during the tuning of the algorithm for the experiments. Specifically, a smaller

γ

may cause movement failure in obstacle-avoidance scenarios, while a larger

γ

can lead to poor performance in tracking the desired trajectory. This can result in unrealistic acceleration values that are either unattainable physically or compromise the safety of the robot and its user. This behavior could be minimized with the correct selection of the hyperparameters, but still needs to be proven.

To address this, future research will explore options for automated hyperparameter optimization. Techniques such as RL, Bayesian optimization, or evolutionary algorithms could be investigated to systematically tune these parameters, potentially enhancing performance consistency across different tasks and reducing the manual setup effort.

Another limitation is the orientation of the robot’s end-effector, which has not been taken into account during the robot learning phase. In both path planning methods, the robot end-effector is considered a rigid point-like robot, and its volume and dimensions have been omitted during the experiments. Rather than maintaining a static end-effector configuration, the orientation should be adaptively adjusted in real-time, taking into consideration factors such as the demonstrated trajectory, the type and position of obstacles, and the object being grasped. To address this limitation, options for integrating the volumetric obstacle avoidance in quaternion DMP formulation to include the end-effector movements during the trajectory adaptation will be explored.

It is also important to acknowledge that while superquadrics provide a versatile and computationally efficient method for modeling many common object shapes, representing highly complex, non-convex, or finely articulated geometries, such as the full human body, with a single superquadric might have limitations in terms of geometric fidelity. Future investigations could explore two possible solutions. First, utilizing multiple superquadrics to form more intricate composite object representations could effectively model non-convex geometries, allowing the algorithm to treat these obstacles as chains of simpler superquadric shapes, thereby improving avoidance success rates. Second, hybrid approaches that combine superquadrics for intricate shapes with other techniques (e.g., point clouds or mesh segments) could be employed to capture local detail where high precision is crucial.

Future work will also focus on validating the system’s robustness against a wider array of dynamic obstacle behaviors. This includes testing with non-linear obstacle trajectories, variable and unpredictable speeds, and more complex articulated movements to further assess the algorithm’s performance limits and adaptability in highly dynamic and unstructured HRC environments.

While the presented real-time adaptation times are promising, a complete assessment of safety in HRC requires a detailed evaluation of emergency stop response time and adherence to minimum safety distances. Our measured 99th percentile latency computation time of 2.939 ms per step for real successful trials provides a crucial component of the overall response. However, the total emergency stop response time also incorporates factors such as sensor refresh rates, communication latency within the robot control architecture, and the physical braking capabilities of the UR10e manipulator. Future work aims to quantify this end-to-end response time through dedicated experiments, focusing on the total duration from the detection of an unsafe condition (for example, human intrusion into a predefined safety zone) to the robot achieving a complete halt or a verified safe state. Furthermore, while our system exhibited a low collision rate of 1.3% in real-world trials and utilizes ’precautionary stops’ when acceleration limits are exceeded, explicit definition and verification of the maintained minimum safety distance is essential. This will involve establishing quantifiable safety envelopes based on relevant safety standards (ISO 10218 [26], ISO/TS 15066 [1]) and employing precise tracking methods to confirm that the robot’s adapted trajectory consistently respects these distances, even under aggressive maneuvers. The influence of the

γ

parameter in the coupling term on implicitly maintaining these safety distances will also be a subject of further investigation.

Regardless of these limitations and future improvements, the initial objective of developing an LfD programming framework on a collaborative robot to improve HRC in a manufacturing task involving reactive path planning and volumetric obstacle avoidance has been accomplished. The findings of this study will help lay the foundation for further enhancements in the LfD model and better interactions between humans and robots in unconstrained environments.

Author Contributions

Conceptualization, A.D.S.-C., H.G.G.-H. and J.A.R.-A.; methodology, A.D.S.-C., H.G.G.-H. and J.A.R.-A.; software, A.D.S.-C.; validation, A.D.S.-C., H.G.G.-H. and J.A.R.-A.; formal analysis, A.D.S.-C., H.G.G.-H. and J.A.R.-A.; investigation, A.D.S.-C., H.G.G.-H. and J.A.R.-A.; resources, A.D.S.-C., H.G.G.-H. and J.A.R.-A.; data curation, A.D.S.-C., H.G.G.-H. and J.A.R.-A.; writing—original draft preparation, A.D.S.-C.; writing—review and editing, A.D.S.-C., H.G.G.-H. and J.A.R.-A.; visualization, A.D.S.-C., H.G.G.-H. and J.A.R.-A.; supervision, H.G.G.-H. and J.A.R.-A.; project administration, H.G.G.-H. and J.A.R.-A.; and funding acquisition, H.G.G.-H. and J.A.R.-A. All authors have read and agreed to the published version of the manuscript.

Funding

The APC was funded by Tecnologico de Monterrey. This work was partially supported by scholarship grant number 646996 of Secretaría de Ciencia, Humanidades, Tecnología e Innovación (Secihti), Mexico.

Institutional Review Board Statement

This work complies with the Reglamento de la Ley General de Salud en Materia de Investigación para la Salud (General Health Law Regulation on Health Research) in Mexico. According to Article 17, the present work falls under Category I: Research without risk, as it involves no intervention or intentional modification of physiological, psychological, or social variables of human participants. Human involvement in this study is limited to non-invasive motion demonstrations and visual tracking for robotic learning and obstacle modeling purposes, strictly within an engineering and robotics context. No biomedical experimentation, clinical data collection, or health-related measurements were conducted. Therefore, in accordance with Mexican regulations, this type of research is exempt from formal approval by an institutional ethics committee.

Informed Consent Statement

Informed consent was obtained from all individual participants included in the study, and informed consent for publication was obtained from all identifiable human participants.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

During the preparation of this manuscript/study, the authors used generative AI language modeling (Gemini 1.5 Flash, developed by Google) to improve language and readability. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

DMPs	Dynamic Movement Primitives
HRC	Human–Robot Collaboration
HRI	Human–Robot Interaction
LfD	Learning from Demonstration
MJT	Minimum Jerk Trajectory
RL	Reinforcement Learning

References

ISO 15066:2016(E); Robots and Robotic Devices–Collaborative Robots. International Organization for Standardization: Geneva, Switzerland, 2016.
Zaatari, S.E.; Marei, M.; Li, W.; Usman, Z. Cobot programming for collaborative industrial tasks: An overview. Robot. Auton. Syst. 2019, 116, 162–180. [Google Scholar] [CrossRef]
Chernova, S.; Thomaz, A.L. Robot learning from human teachers. In Synthesis Lectures on Artificial Intelligence and Machine Learning; Springer: Berlin/Heidelberg, Germany, 2014; Volume 28, pp. 1–121. [Google Scholar] [CrossRef]
Ames, A.D.; Coogan, S.; Egerstedt, M.; Notomista, G.; Sreenath, K.; Tabuada, P. Control Barrier Functions: Theory and Applications. In Proceedings of the 2019 18th European Control Conference (ECC), Naples, Italy, 25–28 June 2019; pp. 3420–3431. [Google Scholar] [CrossRef]
Bi, Z.M.; Luo, M.; Miao, Z.; Zhang, B.; Zhang, W.J.; Wang, L. Safety assurance mechanisms of collaborative robotic systems in manufacturing. Robot.-Comput.-Integr. Manuf. 2021, 67, 102022. [Google Scholar] [CrossRef]
Xie, Z.W.; Zhang, Q.; Jiang, Z.N.; Liu, H. Robot learning from demonstration for path planning: A review. Sci. China Technol. Sci. 2020, 63, 1325–1334. [Google Scholar] [CrossRef]
Ijspeert, A.J.; Nakanishi, J.; Schaal, S. Learning rhythmic movements by demonstration using nonlinear oscillators. In Proceedings of the IEEE International Conference on Intelligent Robots and Systems, Lausanne, Switzerland, 30 September–4 October 2002; Volume 1, pp. 958–963. [Google Scholar] [CrossRef]
Schaal, S.; Peters, J.; Nakanishi, J.; Ijspeert, A. Learning movement primitives. In Springer Tracts in Advanced Robotics; Springer: Berlin/Heidelberg, Germany, 2005; Volume 15, pp. 561–572. [Google Scholar] [CrossRef]
Ijspeert, A.J.; Nakanishi, J.; Hoffmann, H.; Pastor, P.; Schaal, S. Dynamical movement primitives: Learning attractor models for motor behaviors. Neural Comput. 2013, 25, 328–373. [Google Scholar] [CrossRef] [PubMed]
Solina, F. Volumetric Models in Computer Vision—An Overview. J. Comput. Inf. Technol. 1994, 2, 155–166. [Google Scholar]
Sosa-Ceron, A.D.; Gonzalez-Hernandez, H.G.; Reyes-Avendaño, J.A. Learning from Demonstrations in Human-Robot Collaborative Scenarios: A Survey. Robotics 2022, 11, 126. [Google Scholar] [CrossRef]
Saveriano, M.; Abu-Dakka, F.J.; Kramberger, A.; Peternel, L. Dynamic movement primitives in robotics: A tutorial survey. Int. J. Robot. Res. 2023, 42, 1133–1184. [Google Scholar] [CrossRef]
Cheng, C.; Zhang, H.; Sun, Y.; Tao, H.; Chen, Y. A cross-platform deep reinforcement learning model for autonomous navigation without global information in different scenes. Control. Eng. Pract. 2024, 150, 105991. [Google Scholar] [CrossRef]
Park, D.H.; Hoffmann, H.; Pastor, P.; Schaal, S. Movement reproduction and obstacle avoidance with dynamic movement primitives and potential fields. In Proceedings of the Humanoids 2008—8th IEEE-RAS International Conference on Humanoid Robots, Daejeon, Republic of Korea, 1–3 December 2008; pp. 91–98. [Google Scholar] [CrossRef]
Hoffmann, H.; Pastor, P.; Park, D.H.; Schaal, S. Biologically-inspired dynamical systems for movement generation: Automatic real-time goal adaptation and obstacle avoidance. In Proceedings of the IEEE International Conference on Robotics and Automation, Kobe, Japan, 12–17 May 2009; pp. 2587–2592. [Google Scholar] [CrossRef]
Chi, M.; Yao, Y.; Liu, Y.; Zhong, M. Learning, Generalization, and Obstacle Avoidance with Dynamic Movement Primitives and Dynamic Potential Fields. Appl. Sci. 2019, 9, 1535. [Google Scholar] [CrossRef]
Zhai, D.H.; Xia, Z.; Wu, H.; Xia, Y. A Motion Planning Method for Robots Based on DMPs and Modified Obstacle-Avoiding Algorithm. IEEE Trans. Autom. Sci. Eng. 2022, 20, 2678–2688. [Google Scholar] [CrossRef]
Li, A.; Liu, Z.; Wang, W.; Zhu, M.; Li, Y.; Huo, Q.; Dai, M. Reinforcement Learning with Dynamic Movement Primitives for Obstacle Avoidance. Appl. Sci. 2021, 11, 11184. [Google Scholar] [CrossRef]
Ginesi, M.; Meli, D.; Calanca, A.; Dall’Alba, D.; Sansonetto, N.; Fiorini, P. Dynamic movement primitives: Volumetric obstacle avoidance. In Proceedings of the 2019 19th International Conference on Advanced Robotics, ICAR 2019, Belo Horizonte, Brazil, 2–6 December 2019; pp. 234–239. [Google Scholar] [CrossRef]
Ginesi, M.; Meli, D.; Roberti, A.; Sansonetto, N.; Fiorini, P. Dynamic Movement Primitives: Volumetric Obstacle Avoidance Using Dynamic Potential Functions. J. Intell. Robot. Syst. Theory Appl. 2021, 101, 79. [Google Scholar] [CrossRef]
Liu, Z.; Fang, Y. A Superquadrics-Based Steering Angle Obstacle Avoidance Method of DMPs. In Proceedings of the 2023 42nd Chinese Control Conference (CCC), Tianjin, China, 24–26 July 2023; pp. 4273–4279. [Google Scholar] [CrossRef]
Jaklič, A.; Leonardis, A.; Solina, F. Superquadrics and Their Geometric Properties. In Segmentation and Recovery of Superquadrics; Springer: Dordrecht, The Netherlands, 2000; pp. 13–39. [Google Scholar] [CrossRef]
Pairet, E.; Ardón, P.; Mistry, M.; Petillot, Y. Learning Generalizable Coupling Terms for Obstacle Avoidance via Low-Dimensional Geometric Descriptors. IEEE Robot. Autom. Lett. 2019, 4, 3979–3986. [Google Scholar] [CrossRef]
Khalil, H. Nonlinear Control Global Edition; Pearson Deutschland: München, Germany, 2014; p. 400. [Google Scholar]
Sharkawy, A.N. Minimum Jerk Trajectory Generation for Straight and Curved Movements: Mathematical Analysis. arXiv 2021, arXiv:2102.07459. [Google Scholar] [CrossRef]
ISO 10218-1:2011; Robots and Robotic Devices—Safety Requirements for Industrial Robots—Part 1: Robots. International Organization for Standardization: Geneva, Switzerland, 2011.

Figure 1. Steering angle

ϑ

.

Figure 1. Steering angle

ϑ

.

Figure 2. Some examples of superquadric shapes. In the first row (from left to right), we present some basic structures like a sphere, a cylinder, and a cube. In the second row, we randomly generated parameters

ϵ_{1}

and

ϵ_{2}

with values between 0 and 2. For the final row, we selected

ϵ_{1}

and

ϵ_{2}

with values between 2 and 4.

Figure 2. Some examples of superquadric shapes. In the first row (from left to right), we present some basic structures like a sphere, a cylinder, and a cube. In the second row, we randomly generated parameters

ϵ_{1}

and

ϵ_{2}

with values between 0 and 2. For the final row, we selected

ϵ_{1}

and

ϵ_{2}

with values between 2 and 4.

Figure 3. The dead zone issue in obstacle avoidance by steering angle. (a) Original change of the steering angle

\dot{ϑ}

as described in [15]. (b) Dead zone issue in the original formulation (black), and proposed solution by [23] (purple).

Figure 3. The dead zone issue in obstacle avoidance by steering angle. (a) Original change of the steering angle

\dot{ϑ}

as described in [15]. (b) Dead zone issue in the original formulation (black), and proposed solution by [23] (purple).

Figure 4. Proposed solution to deal with steering angles greater than

π

. (a) Gaussian function when

φ = π

does not reduce to zero. (b) Gaussian function (black), and the proposed Mollifier function (purple). With

φ = π

, the Mollifier reduces to zero.

Figure 4. Proposed solution to deal with steering angles greater than

π

. (a) Gaussian function when

φ = π

does not reduce to zero. (b) Gaussian function (black), and the proposed Mollifier function (purple). With

φ = π

, the Mollifier reduces to zero.

Figure 5. Volumetric obstacle avoidance trajectories obtained with the new proposed coupling term with different superquadric shapes. (a) Obstacle described by

ϵ_{1} = 0.1

and

ϵ_{2} = 0.1

shape parameters (a cube) of the superquadric function. (b) Obstacle described by

ϵ_{1} = 1

and

ϵ_{2} = 1

shape parameters (a sphere) of the superquadric function.

Figure 5. Volumetric obstacle avoidance trajectories obtained with the new proposed coupling term with different superquadric shapes. (a) Obstacle described by

ϵ_{1} = 0.1

and

ϵ_{2} = 0.1

shape parameters (a cube) of the superquadric function. (b) Obstacle described by

ϵ_{1} = 1

and

ϵ_{2} = 1

shape parameters (a sphere) of the superquadric function.

Figure 6. Comparison of robot expected velocities in the presence of different geometries during obstacle avoidance. (a) (Left) Obstacle avoidance, volume

ϵ_{1} = 0.1

and

ϵ_{2} = 0.1

. (Right) Estimated robot end-effector velocity curve. (b) (Left) Obstacle avoidance, volume

ϵ_{1} = 1

and

ϵ_{2} = 1

. (Right) Estimated robot end-effector velocity curve.

Figure 6. Comparison of robot expected velocities in the presence of different geometries during obstacle avoidance. (a) (Left) Obstacle avoidance, volume

ϵ_{1} = 0.1

and

ϵ_{2} = 0.1

. (Right) Estimated robot end-effector velocity curve. (b) (Left) Obstacle avoidance, volume

ϵ_{1} = 1

and

ϵ_{2} = 1

. (Right) Estimated robot end-effector velocity curve.

Figure 7. Volumetric obstacle avoidance trajectories using general-pose superquadric shapes with static and dynamic obstacles. (a) Static obstacle described by volume

ϵ_{1} = 0.1

and

ϵ_{2} = 0.1

and rotation by

γ = {33.0}^{\circ}, θ = {48.0}^{\circ}, ϕ = {15.0}^{\circ}

in

z y z

Euler angle representation. (b) Dynamic obstacle represented by volume

ϵ_{1} = 0.8

and

ϵ_{2} = 1.3

and rotation by

γ = {15.0}^{\circ}, θ = {0.0}^{\circ}, ϕ = {0.0}^{\circ}

in

z y z

Euler angle representation.

Figure 7. Volumetric obstacle avoidance trajectories using general-pose superquadric shapes with static and dynamic obstacles. (a) Static obstacle described by volume

ϵ_{1} = 0.1

and

ϵ_{2} = 0.1

and rotation by

γ = {33.0}^{\circ}, θ = {48.0}^{\circ}, ϕ = {15.0}^{\circ}

in

z y z

Euler angle representation. (b) Dynamic obstacle represented by volume

ϵ_{1} = 0.8

and

ϵ_{2} = 1.3

and rotation by

γ = {15.0}^{\circ}, θ = {0.0}^{\circ}, ϕ = {0.0}^{\circ}

in

z y z

Euler angle representation.

Figure 8. Volumetric obstacle avoidance trajectories in a simulated LfD setup. (a) Volumetric obstacle avoidance is performed on a single obstacle described by volume

ϵ_{1} = 1.2

and

ϵ_{2} = 1.5

, over a custom trajectory taught by a human. (b) Volumetric obstacle avoidance is performed on multiple obstacles described by volume

ϵ_{1} = 0.1

and

ϵ_{2} = 1

, over a custom trajectory taught by a human.

Figure 8. Volumetric obstacle avoidance trajectories in a simulated LfD setup. (a) Volumetric obstacle avoidance is performed on a single obstacle described by volume

ϵ_{1} = 1.2

and

ϵ_{2} = 1.5

, over a custom trajectory taught by a human. (b) Volumetric obstacle avoidance is performed on multiple obstacles described by volume

ϵ_{1} = 0.1

and

ϵ_{2} = 1

, over a custom trajectory taught by a human.

Figure 9. Robot configuration used in the experimental setup.

Figure 10. Basic experiment setup on a collaborative robot UR10e.

Figure 11. Comparison of an Original Recorded Human Hand Trajectory and its 5th-Degree Polynomial Smoothed Version. The plot illustrates a raw recorded trajectory (grey dotted line) captured from a human user’s right hand via the ZED 2 camera. The ’Smoothed (Poly Degree 5)’ curve (purple solid line) represents the same trajectory after applying a 5th-degree polynomial fit for noise reduction and continuity. The black star indicates the start point, and the black circle marks the end point of the trajectory.

Figure 12. Visual d etection of task parameters for robotic manipulation. The red cube (right) represents the target object to be manipulated, while the pipe segment enclosed in the green bounding box (left) denotes the goal position. Red and purple dots indicate the calculated centroids used for spatial localization and trajectory planning.

Figure 13. Volumetric obstacle avoidance in a real HRC setup. The human arm was modeled using different volumes: (a) Obstacle described by volume

ϵ_{1} = 0.5

and

ϵ_{2} = 1

. (b) Obstacle described by volume

ϵ_{1} = 0.5

and

ϵ_{2} = 1

. (c) Obstacle described by volume

ϵ_{1} = 1

and

ϵ_{2} = 1

. (d) Execution sequence of a pick-and-place operation with obstacle avoidance and online adaptation.

Figure 13. Volumetric obstacle avoidance in a real HRC setup. The human arm was modeled using different volumes: (a) Obstacle described by volume

ϵ_{1} = 0.5

and

ϵ_{2} = 1

. (b) Obstacle described by volume

ϵ_{1} = 0.5

and

ϵ_{2} = 1

. (c) Obstacle described by volume

ϵ_{1} = 1

and

ϵ_{2} = 1

. (d) Execution sequence of a pick-and-place operation with obstacle avoidance and online adaptation.

Table 1. Success and Failure Rates (Simulated).

Metric	Occurrence (%)
Success Rate	97.74%
Failure Rate	2.26%

Table 2. Descriptive Statistics of Key Performance Metrics for Successful Trials (Simulated).

	Mean	Std. Dev.	Median	Min	Max
Peak Position Error (mm)	735.548	6.609	737.834	723.853	744.996
Tracking Error (RMSE, mm)	14.026	12.996	11.671	1.252	76.306
Max Computation Time (ms)	2.507	0.795	2.344	1.323	7.631
99th Percentile Latency (ms)	1.882	0.626	1.862	1.171	3.358
Computational Overhead (%)	97.890	0.398	98.147	97.105	98.378

Table 3. Descriptive Statistics of Key Performance Metrics for Failed Trials (Simulated).

	Mean	Std. Dev.	Median	Min	Max
Peak Position Error (mm)	740.080	9.192	744.996	723.853	744.996
Tracking Error (RMSE, mm)	2.748	1.588	2.357	1.156	5.422
Max Computation Time (ms)	3.011	0.562	3.210	2.340	3.707
99th percentile latency (ms)	2.191	0.549	1.927	1.884	3.163
Computational Overhead (%)	98.435	0.389	98.598	97.742	98.648

Table 4. Success and Failure Rates (Real).

Category	Breakdown	Occurrence (%)
Failure Rate	Acceleration Limit	2.60%
	Collision	1.30%
	Total	3.90%
Success Rate		96.10%

Table 5. Descriptive Statistics of Key Performance Metrics for Successful Trials (Real).

	Mean	Std. Dev.	Median	Min	Max
Peak Position Error (mm)	167.324	53.403	178.951	1.117	276.873
Tracking Error (RMSE, mm)	8.180	2.460	8.222	3.511	20.251
Max Computation Time (ms)	3.861	0.264	3.856	3.549	5.509
99th Percentile Latency (ms)	2.939	0.123	2.897	2.742	3.175
Computational Overhead (%)	48.237	0.251	48.281	47.645	48.829

Table 6. Descriptive Statistics of Key Performance Metrics for Failed Trials (Real).

	Mean	Std. Dev.	Median	Min	Max
Peak Position Error (mm)	123.202	206.338	6.479	1.683	361.444
Tracking Error (RMSE, mm)	6.810	7.069	3.043	2.422	14.965
Max Computation Time (ms)	3.676	0.579	3.575	3.153	4.299
99th Percentile Latency (ms)	3.022	0.242	2.921	2.846	3.299
Computational Overhead (%)	40.021	5.180	40.023	34.840	45.200

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sosa-Ceron, A.D.; Gonzalez-Hernandez, H.G.; Reyes-Avendaño, J.A. Volumetric Obstacle Avoidance Based on Dynamic Movement Primitives for Robot Path Planning in Human–Robot Collaboration. Appl. Sci. 2026, 16, 2531. https://doi.org/10.3390/app16052531

AMA Style

Sosa-Ceron AD, Gonzalez-Hernandez HG, Reyes-Avendaño JA. Volumetric Obstacle Avoidance Based on Dynamic Movement Primitives for Robot Path Planning in Human–Robot Collaboration. Applied Sciences. 2026; 16(5):2531. https://doi.org/10.3390/app16052531

Chicago/Turabian Style

Sosa-Ceron, Arturo Daniel, Hugo G. Gonzalez-Hernandez, and Jorge Antonio Reyes-Avendaño. 2026. "Volumetric Obstacle Avoidance Based on Dynamic Movement Primitives for Robot Path Planning in Human–Robot Collaboration" Applied Sciences 16, no. 5: 2531. https://doi.org/10.3390/app16052531

APA Style

Sosa-Ceron, A. D., Gonzalez-Hernandez, H. G., & Reyes-Avendaño, J. A. (2026). Volumetric Obstacle Avoidance Based on Dynamic Movement Primitives for Robot Path Planning in Human–Robot Collaboration. Applied Sciences, 16(5), 2531. https://doi.org/10.3390/app16052531

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Volumetric Obstacle Avoidance Based on Dynamic Movement Primitives for Robot Path Planning in Human–Robot Collaboration

Abstract

1. Introduction

2. Literature Review

3. Materials and Methods

3.1. Dynamic Movement Primitives

3.2. Obstacle Avoidance for DMPs

4. Implementation

5. Validation

5.1. Proof of Stability and Convergence of the System

5.2. Synthetic Experiments

6. Results

7. Discussion & Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI