1. Introduction
With the rapid advancement of industrial automation, robot programming has become increasingly critical across diverse domains, including manufacturing, healthcare, agriculture, and smart home applications [
1,
2]. As robots are expected to operate in increasingly complex and dynamic environments, efficient and flexible programming paradigms have become essential for enabling the rapid deployment of robotic systems. Substantial progress has been achieved in programming methodologies, such as online programming [
3], Offline Programming (OLP) [
4], Learning from Demonstration (LfD) [
5], and Virtual Reality (VR)-based programming [
6]. These approaches provide different levels of flexibility for task specification, ranging from traditional code-based programming to more intuitive human–robot interaction interfaces. Despite these advances, most approaches still heavily rely on expert knowledge or large-scale datasets, limiting their scalability and hindering the wider adoption of robotic automation [
7,
8]. For example, traditional programming methods require extensive domain expertise, while data-driven learning approaches often demand a large number of demonstrations or training samples. Therefore, there is a growing need for intuitive and efficient programming paradigms that can reduce the complexity of task instruction and enable the seamless transfer of human expertise to robotic execution. In particular, enabling robots to learn motion skills from a small number of human demonstrations and generalize them to new task situations remains a key challenge in robotic learning.
Learning from Demonstration (LfD), also known as Programming by Demonstration (PbD) or Imitation Learning, has emerged as one of the most effective methods for teaching robots. LfD allows a robot to acquire skills by observing and imitating a human or another robot [
9], thus removing the need for specialized programming expertise and enabling non-experts to demonstrate tasks directly [
10]. More importantly, the learned motion knowledge should be able to generalize across tasks with different spatial constraints or environmental configurations. To address this challenge, researchers have employed probabilistic modeling techniques, such as Gaussian Mixture Models (GMMs) combined with Gaussian Mixture Regression (GMR), to encode demonstrated trajectories and generate smooth, generalized motions [
11]. These probabilistic approaches provide compact statistical representations of human motion and allow smooth trajectory reproduction from demonstrations. However, these models are typically task-specific and may struggle to generalize to novel tasks or varying intermediate and terminal positions [
12].
To overcome these limitations, various extensions of GMM/GMR have been proposed [
13,
14], most of which adjust only Gaussian centers to accommodate new environments [
15]. For example, Hersch et al. [
16] integrate dynamical system control with statistical learning to handle variations in initial conditions and environmental perturbations, while Calinon et al. [
17] optimize imitation metrics to generalize task skills across contexts. Other methods, such as those of Xiao et al. [
18] and Feng et al. [
19], incorporate force control strategies or enhanced optimization techniques to improve trajectory smoothness and learning accuracy from limited demonstrations. Task-Parameterized GMM (TP-GMM) [
20] further extends GMMs to modulate both means and covariances according to task frames. Nevertheless, TP-GMM relies on human-defined frames, which may be redundant or omit critical task constraints [
21]. Kernelized Movement Primitives (KMPs) [
22] and motion segmentation approaches [
23] offer alternatives for adapting skills, but often lack explicit probabilistic representation or require manual design of conditional distributions [
24]. Although these approaches improve the adaptability of learned trajectories, most of them focus on trajectory adaptation rather than explicitly representing transferable motion knowledge across tasks.
Moreover, in real-world demonstrations, human teachers often focus on immediate task completion without fully exploiting robotic capabilities, while task constraints and environmental uncertainties are complex or unknown [
25]. Consequently, demonstrations collected from a source task may not be directly applicable to related tasks with different spatial configurations or constraint conditions. These factors exacerbate difficulties in skill transfer. Therefore, developing motion representations that capture inherent relationships among task constraints becomes crucial for enabling effective knowledge transfer across tasks. Capturing inherent relationships among variable constraints across tasks through parametric modeling can enable the effective transfer of motion knowledge from one task to related tasks.
Motivated by these challenges, this paper proposes a motion knowledge transference method to generalize learned trajectories from a source task to related target tasks under varying task constraints. The proposed approach integrates Learning from Demonstration (LfD) with a motion knowledge transference mechanism to capture transferable motion characteristics embedded in human demonstrations. In this framework, human motions are represented as a sequence of elementary actions encoded by Gaussian Mixture Models (GMMs), where each Gaussian component corresponds to a local motion primitive [
26]. To facilitate knowledge transfer across tasks, both the statistical characteristics of individual motion primitives and the geometric relationships between adjacent primitives are preserved, thereby maintaining the structural consistency of the learned motion representation. Based on this representation, motion knowledge obtained from a source task is adapted to new task configurations through a GMM/GMR-based motion planning process combined with task-constraint adaptation. The effectiveness of the proposed method is validated through experiments conducted on a robotic manipulation platform, including handwriting motion learning and pick-and-place tasks under varying task configurations. The main contributions of this study are as follows:
(1) A Motion Knowledge Transferring Gaussian Mixture Model (MKT-GMM) framework is proposed to enable trajectory generalization across related tasks. In the proposed framework, transferable motion knowledge is represented by jointly preserving the statistical characteristics of elementary actions and the geometric relationships among adjacent actions, allowing effective skill transfer without requiring additional demonstrations or model retraining.
(2) An integrated parameter adaptation strategy is developed by combining affine transformation estimation with task-constraint error minimization. Through this strategy, the parameters of the learned Gaussian mixture model can be systematically adjusted according to new task constraints, ensuring that the generated trajectories remain smooth while satisfying task-specific requirements.
(3) A pseudo-via-point mechanism is introduced to extend the applicability of the proposed framework to both simple and complex motion structures. By automatically generating intermediate constraints when explicit via-points are unavailable, the proposed method provides a unified parameter adaptation scheme that improves the robustness of trajectory transfer across diverse motion categories.
The rest of this paper is organized as follows: In
Section 2, the motivation of considering an adaptive model is presented to interpret the human motion in
. The overall framework for the motion knowledge transference is presented in
Section 2.
Section 3 concentrates on the recognition of the source tasks, the motion knowledge representation and how to transfer the previously-explored human motion knowledge representation to the target task. In
Section 4, the experimental validation of the method is performed with several experiments. Finally, the conclusion and future work are presented in
Section 5.
2. Motivation
LfD has been widely recognized as an effective approach to improving the practicality and usability of robotic systems across diverse applications [
27]. A key requirement of LfD is the ability to generalize knowledge from prior demonstrations and to transfer skills seamlessly to new tasks. In other words, if a human operator can perform a task, the learned knowledge should allow the robot to replicate the demonstrated motion and adapt it to accomplish the target task [
28].
Achieving such generalization requires a high degree of flexibility in robot programming. Pre-programmed routines alone are often insufficient to handle tasks with varying constraints, as robots must be capable of adapting to different environments and unforeseen situations [
1,
29]. LfD provides this essential adaptability by encoding human demonstrations or explicit instructions into structured motion knowledge representations. These representations then serve as the basis for generating appropriate robot actions in the target task environment. The concrete problem of this paper is to adapt the GMM model learned from source demonstrations to new via-point constraints without re-demonstration.
Crucially, the effectiveness of this approach depends on how human motion is interpreted and structured. In LfD, human motion is commonly perceived as a sequence of elementary actions, or action primitives [
30,
31]. To facilitate generalization, these primitives should be extracted and refined from demonstration data. As illustrated in
Figure 1, human motion is often constrained by task-specific factors, including start points, intermediate via-points, and end points [
32]. The most informative cues for generalizing motion are the transitions between successive primitives [
33], while the relationships among task parameters allow each primitive to be appropriately scaled dilated, compressed, or rotated according to task requirements [
23]. By focusing on structured motion knowledge representation, robots can leverage this information to infer task constraints accurately and generate suitable trajectories for new tasks.
In this section, some notations and definitions used in this paper are introduced as follows:
denotes the task of the robot, and refer to the source task and target task, respectively.
denotes the task constraints, namely the locations of intermediary via-points. and refer to the task constraints of and , respectively.
denotes one Gaussian Mixture Model, and its parameters are mainly composed of the prior probability , mean vector , and covariance matrix .
2.1. Problem Formulation
Assume that a robotic system has acquired prior human motion knowledge from a source task , on which it has previously applied distinct exploratory motion knowledge representations corresponding to different task constraints . These motions are typically composed of feature observations captured by the observation models from . When the robot is tasked with learning the motion of a target task , the shared properties between and can be leveraged to improve learning efficiency. Following this rationale, the procedure is defined as follows:
A set of M via-points is designed to encode the task constraints and for and , respectively.
The Gaussian Mixture Model (GMM) parameters for the source task are defined as , derived from the demonstration dataset , where t and s denote temporal and spatial variables, respectively. K is manually set only for controlled experiments, and can be automatically determined by BIC or AIC criteria in practical applications.
The robot maintains a prior motion knowledge representation associated with .
Within the MKT-GMM framework, the robot actively constructs reliable models according to the mapping
, thereby generating smooth generalized trajectories
for
. Observations for both
and
can be obtained in advance. The MKT-GMM framework comprises three primary phases: learning, motion knowledge transferring, and execution. An overview of the framework is presented in
Figure 2. In this framework, human motion knowledge is represented through two essential shared properties for
and
: statistical properties and geometric properties. The GMM parameters for
are estimated through the process of active prior motion knowledge transfer, after which the UR5 manipulator executes the target task.
During the learning phase, human demonstrators perform the source task according to the specified task constraints, while the robot end-effector trajectory is recorded as a function of time. The demonstrations are subsequently temporally aligned, and the demonstration data are encoded into a GMM. In the motion knowledge transferring phase, the robot extracts the essential task correlations between source and target tasks. Each Gaussian component is interpreted as a motion primitive, and the geometric relationships between adjacent components are regarded as critical motion knowledge. Additionally, a transition matrix is derived based on and . Finally, the GMM parameters for the target task are estimated using , the motion primitives, the geometric properties, and the transition matrix . In the execution phase, the robot generates smooth, generalized trajectories for through Gaussian Mixture Regression (GMR). This process leverages the automatically learned temporal dependencies from demonstrations, while reusing prior motion knowledge to guide trajectory planning.
2.2. What to Imitate and Transfer
When a human demonstrates an exploratory action to a robotic system, multiple feature observations are perceived. The prior motion knowledge is established based on the source task constraint and the corresponding Gaussian Mixture Model (GMM) of . To learn the GMM for the target task , two fundamental properties are delineated to be preserved between and : (1) preservation of the statistical properties of elementary actions, and (2) preservation of the geometric relationships among adjacent elementary actions.
(1) Objective 1—Statistical Properties: Human actions are typically perceived as a sequence of distinct elementary actions. Previous research [
26] has demonstrated that each Gaussian component in the GMM/GMR framework can be directly regarded as an elementary action. For the corresponding Gaussian components of
and
, an affine matrix
is defined to project each covariance matrix
to
while preserving the correlation structure. This ensures that the statistical characteristics of individual elementary actions are maintained during knowledge transfer.
(2) Objective 2—Geometric Properties: To reproduce anthropomorphic motion from human demonstrations, the GMM of , denoted as , must preserve the geometric relationships between adjacent Gaussian components. For the source task , is defined as the set of junction factors between adjacent Gaussian components. These factors encode the relative positional relationships among neighboring components, ensuring that the spatial structure of the demonstrated motion is retained in the target task.
2.3. How to Imitation and Transfer
Based on the prior motion knowledge representation
of
, the robot can learn the GMM parameters
by associating its target task
. As for the GMM parameters
, their estimation process will be detailed in
Section 3.
4. Experimental and Results
To validate the effectiveness of the proposed framework, four groups of experiments were conducted. In the first experiment, four instructors were invited to demonstrate the handwriting motion of the character “w”. The demonstrations were carried out on a UR5 robot equipped with an ATI Gamma force sensor mounted at the end-effector and a Kinect 2.0 sensor installed at the robot head, as illustrated in
Figure 9. The motion skills acquired from these demonstrations were encoded using the proposed method, and different motion knowledge representations were comparatively analyzed to assess their effectiveness.
Building upon this, the second experiment examined the generalization capability of the framework in Case 1. Subsequently, the third experiment further validated the generalization performance in Case 2 by employing the handwriting motion of the character “C” as the demonstration task. Finally, to evaluate the applicability of the framework beyond handwriting tasks, a pick-and-place experiment was performed. In this scenario, the robot was required to execute the task under varying initial and target positions, thereby demonstrating the robustness of the method in handling different task configurations. Only The right robotic arm was utilized for task execution; the left arm remained stationary.
4.1. Experimental 1
In this experiment, the motion knowledge represented by the proposed method was investigated. Four instructors were asked to demonstrate the handwriting motion of the letter “w” under the task constraint
. The motion data were captured using Kinect 2.0, as illustrated in the first column of
Figure 10, and recorded as the nib trajectory through kinesthetic teaching. The demonstrated motions were subsequently modeled using Gaussian Mixture Models (GMMs) based on the collected data, and reproduction was performed via Gaussian Mixture Regression (GMR). In this experiment, GMMs with
components were manually specified.
As shown in
Figure 10, the handwriting motion datasets
,
,
, and
, corresponding to the four instructors, exhibit noticeable variations for the same task constraint
. Consequently, the corresponding motion models encoded by GMM also differ. The primary differences among these models lie in their statistical and geometric properties, specifically the covariance matrices
of the Gaussian components and the geometric relationship parameters
between adjacent components, as illustrated in the third column of
Figure 10. The parameter
encodes the relative positional relationship between two consecutive Gaussian components, indicating whether or not their regions overlap. In particular, if
, the via-point
is located within the Gaussian ellipsoid along the
x dimension, and analogous conditions hold for
,
, and
. Conversely, if these inequalities are not satisfied, the corresponding via-point lies outside the Gaussian ellipsoid in that dimension.
The reproduced trajectories obtained via GMR for the four instructors are presented in
Figure 11, and quantitative analyses are summarized in
Table 3. The correlation coefficient
and the transfer factor
of each Gaussian component for each instructor are reported for the handwriting task. A positive value of
indicates a positive correlation between the
x and
y dimensions, whereas a negative value denotes a negative correlation. The transfer factor
plays a critical role in defining the relative positional relationship between adjacent Gaussian components and directly determines the shape of the transfer curve at the corresponding via-point.
4.2. Experimental 2
To demonstrate the generalization capability of the proposed method, the handwriting of the letter “W” was selected as the experimental task, implemented under varying task constraints, such as different positions of five via-points, using kinesthetic teaching. The task involved moving a pen, mounted on the robot end-effector, from the initial point to the endpoint while passing through intermediate points.
Figure 12 presents the experimental results. The first row illustrates the task constraints for the source task and the various target tasks. Specifically,
denotes the task constraint for the source task
, while
,
, and
correspond to the task constraints for the different target tasks
. The positions of the via-points in each task were captured using a Kinect sensor, as shown in the second column.
The statistical and geometric properties of the source task
, which were transferred to the target tasks, are summarized in
Table 4, while the corresponding affine matrices
for each target task relative to
are presented in
Table 5. In the fourth row of
Figure 12, the Gaussian Mixture Models (GMMs) of the different target tasks share the same statistical and geometric properties as the source task. In the third row, only the source task provides sample data, and the task constraint error minimization procedure is employed to compute the prior probabilities
for the target tasks. The reproduced trajectories, shown in the fourth row, maintain consistent statistical and geometric properties during the transitions from one Gaussian component to the next. The corresponding Gaussian Mixture Regression (GMR) results for the four target tasks are presented in the fifth row, demonstrating that smooth, generalized trajectories can successfully complete each task. A comparative analysis of the reproduced trajectories across the four tasks is provided in
Figure 13.
of the target tasks
is evaluated with Equation (
10) to minimize the via-point error. At this time, the significance of
lies in reducing errors, rather than just the meaning of probability in GMM.
4.3. Experimental 3
In this experiment, the generalization capability of
was evaluated using the LASA Handwriting Dataset library (Available in:
https://gitlab.idiap.ch/rli/pbdlib-matlab, accessed on 16 March 2026), which contains 30 human handwriting motion patterns in two-dimensional space. To examine
, the human motion corresponding to the letter “C” was selected. The selected motion data were first encoded using
(manually specified), as illustrated in the first column of
Figure 14. Via-points and pseudo via-points are depicted in the fourth row, first column. For the target tasks
,
, and
, the associated task constraints are shown in the first row, columns two through four, respectively. Different task constraints correspond to different affine matrices
. Pseudo via-points and Gaussian means were sequentially computed based on the first via-point, from which the parameters of the target tasks were derived, as presented in
Figure 14.
As shown in the fifth row of
Figure 14, the proposed method successfully generalizes
. Under the given task constraints, pseudo via-points serve as key reference points, such as the connection points between adjacent Gaussian components. The generalized motions preserve the essential properties of the source task, including both statistical and geometric characteristics. This demonstrates that the proposed approach effectively transfers critical skills from the source demonstration to the robot, enabling reliable adaptation to new task conditions.
4.4. Experimental 4
A three-dimensional pick-and-place task was demonstrated in this experiment, involving two pick locations,
and
, and three place locations,
,
, and
(see
Figure 15). The motion from
to
was designated as the source task,
, and served as the basis for transferring motion skills to other desired pick-and-place pairs, defined as target tasks,
, i.e.,
,
,
,
, and
. The OptiTrack optical motion capture system was employed to record the positions of
and to capture the human demonstration of the motion
(see
Figure 16).
Initially, a rigid object equipped with four markers was placed at each pick and place location,
, to establish reference positions using the OptiTrack system. Subsequently, the object was moved from
to
, and the corresponding trajectory was recorded as the source task motion. The human demonstration trajectory and its Gaussian Mixture Model (GMM) representation for
are presented in
Figure 17. In this experiment, GMMs with
components were manually specified.
The GMM parameters for the target tasks,
, were then estimated using the proposed motion knowledge transfer method. The resulting generalized motions for each target task are illustrated in
Figure 18, while
Figure 19 depicts the generalized trajectories for all evaluated pick-and-place pairs. For each specified pick-and-place pair, the motion skills encoded in
were effectively represented and transferred to the corresponding
. The generated target task trajectories retained the common properties of the source task, demonstrating that the proposed approach can reproduce human demonstration skills while preserving the underlying motion knowledge. This method thus enables robotic assembly tasks to be executed based on human demonstrations while maintaining the essential characteristics of the demonstrated motion.
4.5. Discussion
This work set out to represent and transfer human motion knowledge so that a robot can reproduce related tasks without new demonstrations. The four experiments collectively show that the proposed MKT-GMM achieves that goal by (1) extracting meaningful, task-relevant motion primitives from demonstrations, (2) adapting those primitives to new task constraints, and (3) reproducing smooth, constraint-compliant trajectories in both planar and 3D settings.
First,
confirms that the representation captures demonstrator-specific statistical and geometric properties. GMMs fitted to different instructors encode different covariance structures and inter-component relations, and these differences are reflected in the GMR reproductions (see
Figure 11 and quantitative summaries in
Table 3). This shows the method effectively decomposes demonstrated behavior into elementary actions whose statistical signatures are informative for transfer.
Second,
and
demonstrate two complementary adaptation mechanisms. For motions falling into
, simple affine scaling computed from via-point pairs suffices to adapt Gaussian covariances and means to new constraints, producing smooth target trajectories (
; illustrated in
Figure 12). For
, where the intermediate structure is not directly constrained, the pseudo via-point strategy converts the problem into
and enables similar adaptation (
; illustrated in
Figure 14). In both cases the adaptation is automatic and does not require new demonstrations or manual retuning of model parameters.
Third,
shows the approach generalizes beyond 2D handwriting to a 3D pick-and-place task. The adapted GMMs preserve the source task’s motion characteristics when transferred to new pick/place pairs, producing plausible 3D trajectories (
Figure 18 and
Figure 19). This indicates the framework’s potential utility in practical manipulation tasks where spatial constraints vary.
At the same time, several practical limitations and sensitivities were identified. The current pipeline is essentially open-loop: once GMM parameters for a target task are estimated, trajectory reproduction uses GMR without online correction, which limits robustness to unmodeled disturbances or perception errors. The pseudo via-point selection relies on heuristic timing and component-weight analyses; while effective for the tested examples, this heuristic may require refinement for more heterogeneous motion classes. The affine scaling and sign decisions (which affect covariance orientation and implied correlations) can also be sensitive to noisy via-point estimates. Finally, experiments used controlled optical sensing; real-world deployment will demand robustness to noisier/occluded perception.
In summary, the experiments validate that preserving both statistical properties of elementary actions and geometric relationships between adjacent actions provides a practical route for transferring demonstrated skills. The proposed MKT-GMM is an offline trajectory generalization framework, which is suitable for structured industrial scenarios with small disturbances. In future work, closed-loop visual or force feedback will be integrated to realize online trajectory correction for higher robustness. Compared with TP-GMM, MKT-GMM does not require manually defined task frames. Compared with DMP, it focuses on motion knowledge transfer across similar tasks. The proposed method will be extended to 6D pose transfer by representing orientation with quaternions.
5. Conclusions
This paper proposes a Motion Knowledge Transferring Gaussian Mixture Model (MKT-GMM) for robot learning from demonstration. The proposed framework preserves both the statistical characteristics of elementary actions and the geometric relationships between successive actions, enabling robust skill transfer from a source task to related target tasks. An integrated parameter adaptation framework, combining affine transformation estimation with task-constraint error minimization, is developed to generate smooth trajectories that satisfy task-specific constraints without requiring additional demonstrations. In addition, a pseudo via-point mechanism is introduced to provide a unified solution for handling both simple and complex motion structures, further improving the versatility of the proposed approach. Extensive experimental evaluations were conducted on a robotic manipulation platform through handwriting motion learning and pick-and-place tasks under varying task configurations. Comparative experiments were performed to evaluate the capability of the proposed framework in capturing demonstration variability, transferring motion knowledge across tasks, and adapting to different task constraints. In the handwriting experiments, demonstrations from multiple instructors were analyzed to examine the statistical characteristics of human motion, and the results show that the proposed representation effectively captures the variability and distinctive motion patterns among different demonstrators. In the trajectory generalization experiments, the proposed MKT-GMM was compared with direct trajectory reproduction and affine transformation–based adaptation methods, demonstrating improved accuracy in satisfying task constraints while preserving the structural consistency of the demonstrated motion. Furthermore, pick-and-place experiments under varying spatial configurations verify that the proposed method can reliably generate smooth and feasible trajectories in previously unseen task scenarios. Overall, the results indicate that the proposed framework enables effective motion knowledge transfer and achieves robust trajectory generalization under varying task constraints.
Future work will address the current limitations by developing closed-loop extensions that incorporate real-time feedback to improve accuracy and robustness. In addition, investigations will focus on cross-dimensional generalization, such as extending knowledge learned from 2D demonstrations to 3D executions, and on partial-to-complete task generalization. These directions are anticipated to broaden the applicability of the framework to increasingly diverse and complex robotic tasks, thereby advancing the state of robot learning from demonstration and contributing to the development of more autonomous and intelligent robotic systems.