Next Article in Journal
Quantitative Assessment of Biomechanical Deviations in Hybrid III Dummy Response Caused by Accessory Lumbar Supports
Previous Article in Journal
Distributed Deep Learning in IoT Sensor Network for the Diagnosis of Plant Diseases
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Collaborative Control for a Robot Manipulator via Interaction-Force-Based Impedance Method and Extremum Seeking Optimization

School of Information and Control Engineering, Southwest University of Science and Technology, Mianyang 621010, China
Sensors 2025, 25(24), 7648; https://doi.org/10.3390/s25247648
Submission received: 15 November 2025 / Revised: 9 December 2025 / Accepted: 11 December 2025 / Published: 17 December 2025
(This article belongs to the Section Intelligent Sensors)

Abstract

This paper introduces an adaptive impedance control strategy for robotic manipulators, developed through the extremum seeking technique. A model-based disturbance observer (DOB) is employed to estimate contact forces, removing the dependency on torque sensors. An impedance vector is constructed to correct the errors arising from motor uncertainties and unknown couplings, without considering the threshold value of the control parameters. Joint tracking errors and fluctuations in contact force are incorporated into the cost function. For various tasks, suitable control parameters are adaptively optimized in real time using an extremum seeking approach, which continuously evaluates the cost function. A rigorous analysis is conducted on the stability of the proposed controller. Compared to conventional approaches, the proposed adaptive impedance control offers a more streamlined design for adjusting the manipulator’s contact impedance. Experimental results confirm that the extremum seeking strategy successfully tuned the controller parameters online according to variations in the cost function.

1. Introduction

Robot manipulators are primarily designed to aid humans in carrying out tasks in intricate industrial processes, such as assembly and drilling [1,2], which typically involve environmental contact. Controlling the contact force is therefore key to ensuring rapid and precise tracking performance [3,4]. However, the presence of unmodeled disturbances and system uncertainties can hinder the robot’s ability to follow desired trajectories accurately when influenced by environmental forces.
Hence, some robust methods were proposed to avoid inaccurate and unstable motions of the manipulator, such as dynamic coupling switching [5], state space [6], and trajectory optimization [7]. Moreover, an adaptive robust control method was developed to address environmental uncertainties in the manipulator’s contact force regulation [8]. The impedance control method, serving as an alternative for manipulator contact force regulation, has also been presented [9,10]. When a robot contacts its environment, the interaction was typically modeled as a mass–spring–damper system [11].
To tackle the aforementioned issues, researchers have introduced adaptive control strategies for accurate robot impedance regulation, such as sliding-mode control (SMC) [12], time-delay control (TDC) [13], and neural network controls [14]. In [15], a distributed model-free adaptive control scheme was employed to investigate the uncertain dynamics of systems subjected to deception attacks. By leveraging the contraction mapping principle, the boundedness of the uncertain dynamics under external disturbances was rigorously demonstrated without relying on global topological information. However, the parameters of the control method were set in advance and many tests were required to achieve good control performance. In [16], an adaptive triggering mechanism with time-varying thresholds was proposed. Corresponding control protocols were designed to operate without global information, therefore ensuring that the control strategy remains effective. However, the variation in the adaptive law was fixed and could not be changed for different working conditions. In [17], the leaderless consensus problem of system with uncertain dynamics was studied. Group model learning was accomplished for Euler–Lagrange systems operating over directed networks, even under persistent unknown disturbances. However, the limits of unknown disturbances should be given in advance. In [18], a model reference adaptive consensus strategy was proposed and relied solely on position information, with the relative positions being exchanged. A drawback, however, was that the adaptive law’s parameters were fixed and require extensive testing.
SMC was more robust against the system’s nonlinear uncertainties and unknown disturbances [19]. If the SMC switching gain was larger than the highest level of nonlinear uncertainty, accurate motion regulation could be ensured. Nevertheless, it was usually impossible to define such bounds precisely [20]. When the switching gains largely exceed the upper bound, severe chattering can occur. Consequently, the robot vibrates around the sliding manifold, leading to energy loss and reduced tracking performance. A time-delay estimation (TDE) method was developed to mitigate the effects of coupled dynamics and unmodeled disturbances. Time-delay control (TDC) was a simple control scheme that had been extensively applied in autopilot systems [21] and robot control [22]. Nevertheless, TDE errors still existed, which degraded the control performance. To address this issue, appropriate control gains of the TDC were selected [23]. If the gains were excessively large or too small, the controller’s effectiveness was weakened. To decrease the effort to achieve suitable gains, neural networks were adaptively utilized to address dynamic uncertainties [24]. A supervisory fuzzy sliding mode controller was introduced for surgical robots [25]. This supervisory fuzzy control enabled real-time dynamic adjustment of the controller gains, thereby minimizing the time needed for tuning. Within a predefined range of control gains, the TDE error was successfully eliminated.
To adaptively optimize robotic motion performance, a cost function was commonly formulated for adjusting control parameters. In [26], the continues phase variable was used to regulate the impedance performance across walking modes, with a metabolic cost-based function tuning joint parameters—though updates occurred slowly. In [27], a data-driven method was conducted for stance phase with the kinematic control for swing phase. The parameters were updated to achieve the minor cost function. In [28], a novel impedance controller was designed to optimize system energy consumption. While, the optimization process required several hours. In [29], a human-in-the-loop approach was adopted to identify personalized assistance strategies, yet experiments revealed that a metabolic rate-based cost function was unsuitable for real-time adaptation. In [30], a composite cost function incorporating tracking error and energy consumption was proposed to evaluate contact force variations. Nevertheless, this function suffered from significant updated delays, highlighting the need for efficient real-time optimization methods. In [31], an extremum seeking method was employed to minimize the cost function and adjust control parameters for achieving desired robotic motion. However, the cost function relied solely on tracking error, limiting its adaptability to diverse working conditions. In [32], an extremum seeking strategy enabled real-time parameter tuning without prior knowledge or expert intervention, optimizing the cost function by adjusting manipulator stiffness in real time. Nonetheless, it neglected the influence of force variations on stiffness regulation. In [33], extremum seeking was applied to optimize a composite metric of muscular activity and spring stiffness, enabling autonomous adjustment of joint stiffness across different gait conditions. However, the control parameters responded too slowly to dynamic environmental changes. In [34], extremum seeking was utilized to adaptively adjust the control law’s parameters deduced from functional neuromuscular electrical stimulation. Although it achieved online optimization of the cost function equilibrium, the function design remained relatively simplistic, primarily based on tracking error variations.
Central Pattern Generators (CPGs), identified in both invertebrate and vertebrate species, offered an effective mechanism for producing rhythmic locomotion patterns [35,36]. Their notable features comprised self-sustained rhythmicity, bounded cycle convergence, and smooth modulation of output trajectories [37]. Typically implemented through oscillator networks, CPGs provided a tractable framework for generating and synchronizing joint trajectories. Furthermore, a CPG-based approach has been developed to estimate gait cycle percentage and derive virtual knee angles via interpolation from a healthy knee’s motion profile [38].
This paper established an adaptive impedance control scheme for robot manipulators. An impedance vector is introduced to formulate a simplified control structure. The traditional adaptive impedance control strategies usually only consider tracking the desired contact force and do not take into account the suppression of joint tracking errors. This paper proposed a comprehensive cost function J that considers both the tracking of desired contact forces and the suppression of joint tracking errors, and an extremum seeking strategy is employed to dynamically optimize the manipulator’s control parameters in response to cost function changes.
The key contributions of this study are summarized as follows: (1) the design of a model-based disturbance observer (DOB) for contact force estimation; (2) the formulation of a cost function integrating joint tracking errors and contact force variations; (3) the application of an extremum seeking approach to optimize control parameters by minimizing the cost function; (4) the adaptive control framework’s performance in manipulator motion tasks was verified experimentally.

2. Materials and Methods

2.1. Dynamic Model of Robot Manipulator

The 4-DOF robotic manipulator incorporates two active joints. As illustrated in Figure 1, the rotation angles of joints 1 and 2 are denoted as q1 and q2, respectively. The joint-space dynamics of the manipulator are expressed as follows:
M q ¨ + C q ˙ + G = τ + ϖ e
where q = q 1 , q 2 T 2 represents the rotation angle of the robot joint. τ 2 denotes the control signal of the robot joint. ϖ e 2 denotes the interaction torque between the level ground and robot. M 2 × 2 denotes the inertia matrix, M = m 1 d 1 2 + I 1 + I 2 + P 11 m 2 d 2 2 + I 2 + P 12 m 2 d 2 2 + I 2 + P 21 m 2 d 2 2 + P 22 , P 11 = m 2 ( l 1 2 + d 2 2 + 2 l 1 l 2 cos q 2 ) , P 12 = m 2 l 2 d 2 cos q 2 , P 21 = m 2 l 2 d 2 cos q 2 , P 22 = I 2 ; C 2 × 2 denotes the centripetal-Coriolis matrix, C = m 2 l 1 d 2 sin q 2 q ˙ 2 m 2 l 1 d 2 sin q 2 ( q ˙ 1 + q ˙ 2 ) m 2 l 1 d 2 sin q 2 q ˙ 1 0 ; G 2 is the gravitational torque, G = Q 1 + l 1 cos q 1 Q 2 , Q 1 = m 1 d 1 g cos q 1 + m 2 d 2 g cos ( q 1 + q 2 ) , Q 2 = m 2 d 2 g cos ( q 1 + q 2 ) .
The parameters of robotic manipulator can be designed as: m 1 = 3.5 kg, representing the mass of link 1; m 2 = 3.5 kg, representing the mass of link 2; d 1 = 0.25 m, connecting the center to joint 1; d 2 = 0.25 m, connecting the center to joint 2; I 1 = 1.3 kg·m2, representing the inertia of link 1; I 2 = 1.9 kg·m2, representing the inertia of link 2; l 1 = 0.52 m, connecting joint 1 to joint 2; l 2 = 0.52 m, connecting joint 2 to the end-effector; and g = 9.8 m/s2, representing gravitational acceleration. The primary variable notations are summarized in Table 1.
This method aims to adaptively adjust the robot’s joint trajectory q d to achieve a balance among tracking errors, contact force variations, and changes in ϖ e . The tuning process for robot manipulator motion is illustrated in Figure 2. Hence, the CPG model was employed to generate joint trajectories for various motion tasks. The joint trajectory can be immediately regulated by changing the state values of the CPG model.
According to [39], the functional mechanism of the CPG model can be summarized as:
q d i = r 0 + j = 1 N r j sin ( ω j t + s j )
ω ˙ j = g j g j 4 Ω j ω j ω j
s ˙ j = g j g j 4 ( S j s j ) s j
r ˙ j = g j g j 4 ( R j r j ) r j
r ˙ 0 = g 0 g 0 4 ( R 0 r 0 ) r 0
where q d i representing the desired trajectory of the manipulator’s i-th joint, is generated by superimposing N oscillators. r 0 represents the total oscillator offset. N is the total number of oscillators. j serves as the oscillator’s index. The variables r j , ω j , and s j denote the amplitude, frequency, and phase of j-th oscillator, respectively. Constants g j and g 0 set the convergence speed. Furthermore, Ω j , R j and S j represent the sine function of ω j , r j , s j , respectively; R 0 signifies its total offset. According to the Fourier series expansion, we have
Ω j = j ω 1
S j = arctan a j b j
R j = a j 2 + b j 2
R 0 = a 0
a j = 2 T 0 T q i 0 cos j w 1 t d t
b j = 2 T 0 T q i 0 cos j w 1 t d t
a 0 = 2 T 0 T q i 0 d t
where w 1 = 2 π T , T represents the motion cycle, q i 0 represents the initial trajectory. Ω j , R j , and S j can be calculated, ω j , r j , and s j will converge to Ω j , R j and S j respectively [39].

2.2. Extremum Seeking Optimization

The composite cost function integrates trajectory tracking errors and contact force variations, formulated as follows:
J = λ T i m ( q d i q i ) 2 + 1 λ T ( k m s k 2 + α i m ϖ e , i 2 )
where i , m , and k represent the index of time point, s k denotes the strength of the contact force at the k-th time point, α denotes a positive gain. The initial term guarantees that the manipulator follows the generated trajectory, and the second term aids in lowering the manipulator’s interaction torque with the environment. The weight between the two terms is predefined by the constant gain λ ( 0 , 1 ) . When λ = 0 , the emphasis of optimization is on trajectory alignment. When λ = 1 , the emphasis transfers to optimizing the contact force ϖ e .
An extremum seeking strategy is introduced to optimize the composite cost function J . The selected matrices and their associated costs are collated into W p a s t and J p a s t respectively. Minimizing (14) yields a variable and individualized trajectory to help the manipulator match the correct and appropriate tracking trajectory. The initial weight matrix was specified based on historical data. The extremum seeking method is illustrated in Figure 2. A sinusoidal probing signal, formulated as d ( t ) = a sin ( ω t ) , was employed to estimate the critical parameter l . According to (18) and (19), l serves as the critical variable for estimating ϖ ^ e . l governs the behavior of the motor control at the joint. Subsequently, the cost function J, derived from Equation (14), undergoes signal processing to extract the key parameter l . The main process for the extremum seeking optimization is formulated as follows: first, the resulting signal J η is calculated; secondly, η ˙ , ξ ˙ , and l ˙ are updated according to η ˙ = ω h ( J η ) , ξ ˙ = ω l ξ + ω l ( J η ) a sin ( ω t ) , and l ˙ = k ξ , ω 1 and ω h denote positive gains, l = l p r e + l ˙ , ξ = ξ p r e + ξ ˙ , and η = η p r e + η ˙ , l p r e , ξ p r e and η p r e represent pre-estimated values of each corresponding variable; then, Λ is calculated from Λ = l I ; finally, ϖ ^ e can be regulated.
The control architecture was illustrated in Figure 2, consisting of three primary functional modules: (1) the reference trajectory q r denotes the input trajectory and synthesized based on q d and τ e ; (2) through optimization, q r is adaptively adjusted to reduce the cost function J ; (3) parameter estimation enables the controller to maintain accurate tracking.

2.3. Control Development

2.3.1. Interaction-Force-Based Impedance Method

We assume that s k denotes the strength of k-th time point of the contact force. The weight function is designed as follows:
ω f o r c e = c 1 tan h ( k m s k 2 ι 1 + ι 2 ) + c 2
where c 1 and c 2 denote the positive gains to regulate the range and median of ω f o r c e , respectively. ι 1 and ι 2 denote the positive gains to regulate the range and median of k m s k 2 .
Subsequently, the impedance model based on the contact force can be expressed as:
D d ( q ˙ q ˙ d ) + P d ( q q d ) = 1 ω f o r c e ϖ e
where D d and P d represent the damping and stiffness matrices, respectively. (16) permits the robot joint to deviate from the reference trajectory q d and adjust the tracking trajectory accordingly. When the change in the contact force increases, ω f o r c e is reduced, thus lowering the impedance parameters and resulting in a large deviation.
Hence, we designed an impedance vector based on the interaction torque ϖ e as:
z = q ˙ q ˙ r = q ˙ q ˙ d + D d 1 P d q q d 1 ω f o r c e D d 1 ϖ e
where q ˙ r = q ˙ d D d 1 P d ( q q d ) + 1 ω f o r c e D d 1 ϖ e denotes the designed value. According to (17), when z 0 , (16) achieves the reference tracking performance.
Moreover, to avoid the use of a force sensor to measure ϖ e , the estimation ϖ e was established by a model-based disturbance observer (DOB):
y ˙ = B ( q ˙ , q ) y B ( q ˙ , q ) C q ˙ g + p ϖ ^ e = y + p
where ϖ ^ e denotes the estimated ϖ e , y and p denote the designed values, and B ( q ˙ , q ) denotes the gain matrix. Gain matrix B ( q ˙ , q ) and vector p can be formulated as follows:
B ( q ˙ , q ) = Λ 1 M 1 p = Λ 1 q ˙
where Λ denotes an invertible gain matrix, Λ = l I , l denotes the positive gain to be determined by l = l p r e + l ˙ . We assumed ϖ ˜ e to be the observation error, that is ϖ ˜ e = ϖ ^ e ϖ . The adaptive law for ϖ ^ e can be expressed as:
ϖ ˜ ˙ = B ( q ˙ , q ) ϖ ˜ ϖ ˙ e
The overall control input is designed as:
τ = K z z ϖ ^ e K g sgn ( z ) + M q ¨ r + C q ˙ r + G
where k g denotes the positive constant gain, and K z nxn denotes the positive constant matrix. sgn ( ) denotes the sign function, sgn ( z ) = 1 ,   z > 0 0 , z = 0 1 , z < 0 . By substituting (17) and (21) into (1), we obtain:
M z ˙ + C z = K z z ϖ ˜ e k g sgn ( z )

2.3.2. Stability Analysis

To guarantee the stability of the control system, the overall system should be bounded, and the quasi-steady-state system should be exponentially stable. The stability of the control system can be analyzed using the selected Lyapunov function as follows:
γ 1 = 1 2 z T M z
Combining (22) and (23), the following equation can be obtained:
γ ˙ 1 = z T K z z z T ϖ ˜ e k g z T sgn ( z )
when ϖ ˙ e ς , the observation error is also bounded, and:
γ ˙ 1 z T K z z ( k g κ ) z
κ 2 ς λ max ( M ) Λ 2 ρ λ min ( Γ )
where κ denotes the limitation of the uncertainty 2 ς λ max ( M ) Λ 2 ρ λ min ( Γ ) , k g denotes a positive value. When k g > κ , we have
γ ˙ 1 z T K z z < 0
when γ ˙ 1 < 0 , the system was exponentially stable.
Moreover, we consider the following candidate Lyapunov function.
γ = γ 1 + ϖ ˜ e T Λ T M Λ ϖ ˜ e
Combining (28) and (20), the following equation can be obtained:
γ ˙ = γ ˙ 1 ϖ ˜ e T ( Λ + Λ T Λ T M ˙ Λ ) ϖ ˜ e + ϖ ˙ e T Λ T M Λ ϖ ˜ e + ϖ ˜ e T Λ T M Λ ϖ ˙ e
Considering the bounds of ϖ ˙ e , ϖ ˙ e < γ , γ denotes the bound. (29) can be formulated as follows:
γ ˙ γ ˙ 1 λ min ( Γ ) ϖ ˜ 2 + 2 γ λ max ( M ) Λ 2 ϖ ˜ e = γ ˙ 1 ( 1 )   λ min ( Γ ) ϖ ˜ e 2 λ min ( Γ ) ϖ ˜ e 2 + 2 γ λ max ( M ) Λ 2 ϖ ˜ e
where ( 0 , 1 ) , Γ = Λ + Λ T Λ T M ˙ Λ . Because γ γ 1 = ϖ ˜ e T Λ T M Λ ϖ ˜ e   λ max ( M ) Λ 2 ϖ ˜ e 2 , when ϖ ˜ e 2 γ λ max ( M ) Λ 2 λ min ( Γ ) , (30) can be deduced as:
γ ˙ γ ˙ 1 ( 1 ) λ min ( Γ ) ϖ ˜ e 2 γ ˙ 1 ( 1 ) λ min ( Γ ) λ max ( M ) Λ 2 ( γ γ 1 )
Then, we have:
γ ˙ γ ˙ 1 ( 1 ) λ min ( Γ ) λ max ( M ) Λ 2 ( γ γ 1 )
And, the exponential convergence can be formulated as:
γ ( t + Δ t ) γ 1 ( t + Δ t ) ( γ ( t ) γ 1 ( t ) ) exp ( ( 1 ) λ min ( Γ ) λ max ( M ) Λ 2 Δ t )
where Δ γ = γ γ 1 achieved exponential convergence. According to (27), γ 1 achieves exponential convergence and γ = γ 1 + Δ γ also achieves exponential convergence.

3. Results

3.1. Experiments Setup

To evaluate the performance of the developed controller, three distinct motion tasks were implemented. An actual experimental setup is depicted in Figure 3. A lightweight collaborative robot manipulator was used to conduct the experiments at the SWUST Robotic Laboratory. The robotic manipulator comprises three core subsystems: a mechanically engineered framework, an integrated control unit, and a sensory feedback system. The mechanical framework was 3D printed. Table 2 provides the component masses, with a total manipulator mass of 10.8 kg. The proposed control method was validated using two active joints of the 6-DoF robotic manipulator (JAKA Robot Limit Corporation, Chengdu city, China). The first active joint operates within an approximate range of −90° to 90°, while the second joint spans roughly −150° to 150°. A DC motor actuates each joint, with an encoder measuring rotational angle data. The force sensor evaluates the change in the contact force.

3.2. Case 1: Manipulator Tracking a Circular Motion

(1)
Experiment: The circular motion tracking task in Case 1 control strategy. A circular path was defined as the reference trajectory, which the robot’s end-effector was commanded to track. However, owing to the influence of contact force, the proposed impedance method regulated the desired trajectory to the reference trajectory. Finally, the controller computed motor torques to drive the end-effector along the reference trajectory while maintaining a consistent contact force level. The combined cost function J was designed with tracking errors and the contact force change. An extremum seeking approach is introduced to optimize the composite cost function J . The suitable control parameters are selected to help the manipulator match the correct and appropriate tracking trajectory.
Figure 4 shows the manipulator tracking the reference trajectory, while the contact force was maintained at 50 N, 100 N, and 150 N. The main control parameters were set as: K z = 3.2 , K g = 1.1 , K d = 1.3 0 0 1.3 , C d = 0.75 0 0 0.75 .
(2)
Results: Figure 5 illustrates the experimental tracking performance, with subplots (a) and (b) displaying joint trajectories, (c) presenting joint tracking errors, (d) showing control torque profiles, (e) depicting contact force variations, and (f) demonstrating the evolution of cost function J. The results indicate that the proposed controller achieves smoothly varying force regulation.
Table 3 provides a comparative summary of tracking performance for multiple controllers. The proposed control method with extremum seeking significantly outperformed torque- and voltage-based controllers, reducing the mean error (MEAN) and mean square error (MSE) to 2.7° and 1.52°, respectively, indicating enhanced tracking accuracy and stability.

3.3. Case 2: Manipulator Tracking a Triangular Motion

(1)
Experiment: Case 2 evaluates how the proposed controller performs in a triangular circular path tracking task. The desired trajectory was designed as a triangular circle. The robotic end-effector was regulated to follow the predefined trajectory. However, due to the effect of contact force, the proposed impedance method aligned the desired trajectory with the reference trajectory. The controller computed the motor torques necessary to guide the robotic end-effector along the reference trajectory. In the process, the manipulator maintained the contact force at a fixed level. The combined cost function J was designed with tracking errors and the contact force change. An extremum seeking strategy is employed to optimize the aggregate cost function J. The suitable control parameters are selected to help the manipulator match the correct and appropriate tracking trajectory.
Figure 6 shows the manipulator executing its motion by tracking the reference trajectory, while the contact force was controlled at 50 N, 100 N, and 150 N. The main control parameters were set as: K z = 3.15 , k g = 1.3 , K d = 1.3 0 0 1.3 , C d = 0.75 0 0 0.75 .
(2)
Results: Figure 7 summarizes the experimental tracking performance, with subfigures (a) and (b) illustrating joint tracking results, (c) displaying joint errors, (d) presenting control torque, (e) showing contact force variations, and (f) depicting the evolution of cost function J. The results demonstrate smooth force regulation achieved by the proposed controller.
Table 4 provides a comparative summary of tracking performance for multiple controllers. The proposed control method with extremum seeking outperformed torque-based and voltage-based approaches, lowering the mean error (MEAN) to 3.4° and the mean square error (MSE) to 1.6°.

3.4. Case 3: Manipulator Tracking a Sine Circular Motion

(1)
Experiment: Case 3 evaluates the controller’s performance through a sinusoidal circular path tracking task, where the robot’s end-effector is guided along a predefined sinusoidal trajectory. However, due to the effect of contact force, the proposed impedance method regulated the desired trajectory to the reference trajectory. The controller computed the motor torques required to drive the robotic end-effector along the reference trajectory. In the process, the contact force was maintained at a fixed level. The combined cost function J was designed with tracking errors and the contact force change. An extremum seeking strategy is introduced to optimize the aggregate cost function J. The suitable control parameters are selected to help the manipulator match the correct and appropriate tracking trajectory.
Figure 8 depicts the manipulator following the reference trajectory during its motion, while the contact force was set to 50 N, 100 N, and 150 N. The main control parameters were set as: K z = 3.2 , k g = 1.2 , K d = 1.1 0 0 1.1 , C d = 0.72 0 0 0.72 .
(2)
Results: Figure 9 presents the experimental tracking performance, with subfigures (a) and (b) illustrating joint trajectories, (c) displaying tracking errors, (d) showing control torque profiles, (e) depicting contact force variations, and (f) demonstrating the evolution of cost function J. The results confirm smooth force regulation achieved by the proposed controller.
Table 5 provides a comparative summary of tracking performance for multiple controllers. The proposed control method with extremum seeking outperformed torque- and voltage-based methods, lowering the mean error (MEAN) to 3.65° and the mean square error (MSE) to 2.32°.

4. Discussion

A model-based disturbance observer was designed to estimate the contact force and use the adaptive estimates as input terms in the tracking controller. The input–output data was exploited to achieve estimates. Some delay may be occurred at the begin of the estimation.
The tracking performance of traditional adaptive control relies on the threshold value of control parameters. This threshold must be set sufficiently large to ensure accurate tracking, but it may induce undesirable chattering. To address this issue, an adaptive control law for the threshold is designed to eliminate chattering. In comparison to conventional approaches, the developed adaptive impedance control strategy achieves a more streamlined framework for adjusting the manipulator’s impedance during environmental interaction. By integrating a model-based disturbance observer and an extremum seeking method to compensate for dynamic uncertainties, this approach achieves more robust performance.

5. Conclusions

This study presents an adaptive extremum-seeking control scheme for manipulator impedance regulation. The proposed impedance vector compensates for motor uncertainties and unknown couplings. A cost function was formulated to jointly account for joint tracking errors and variations in contact forces. For adaptation to different tasks, the extremum seeking method continuously modified the control parameters based on real-time variations in the cost function. Stability analysis confirmed that the control scheme achieved uniform ultimate boundedness in a semi-global sense. Experimental validation further showed that the proposed approach adaptively tuned the manipulator’s parameters, forming a natural adaptive policy. In performance tests, the mean error (MEAN) and mean square error (MSE) decreased to 2.7° and 1.52° (Case 1), 3.4° and 1.6° (Case 2), 3.65° and 2.32° (Case 3), respectively.
Future work includes enhancing the robustness of the developed controller to more external disturbances that can affect the convergence of the adaptive estimates. In addition, the designed controller will be implemented with different patterns to examine the convergence of the estimates of the parameters and assess the performance for tracking diverse tasks.

Funding

This research was funded by the Sichuan Provincial Key R&D Program (Grant No. 2024YFFK0039) and the Doctoral Research Fund of the Southwest University of Science and Technology (Grant No. 21zx7142). The APC was funded by the same source.

Institutional Review Board Statement

This study did not involve humans or animals; therefore, ethical approval was not required.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Kommuri, S.K.; Han, S.; Lee, S. External torque estimation using higher order sliding-mode observer for robot manipulators. IEEE/ASME Trans. Mechatron. 2022, 27, 513–523. [Google Scholar] [CrossRef]
  2. Dong, W.; Ma, Z.; Sheng, X.; Zhu, X. Centimeter-level aerial assembly achieved with manipulating condition inference and compliance. IEEE/ASME Trans. Mechatron. 2022, 27, 1660–1671. [Google Scholar] [CrossRef]
  3. Xie, Z.; Jin, L.; Luo, X. Kinematics-based motion-force control for redundant manipulators with quaternion control. IEEE Trans. Autom. Sci. Eng. 2023, 20, 1815–1828. [Google Scholar] [CrossRef]
  4. Fan, J.; Jin, L.; Xie, Z.; Li, S.; Zheng, Y. Data-driven motion-force control scheme for redundant manipulators: A kinematic perspective. IEEE Trans. Ind. Inform. 2022, 18, 5338–5347. [Google Scholar] [CrossRef]
  5. Li, Z.; Kang, Y. Dynamic coupling switching control incorporating support vector machines for wheeled mobile manipulators with hybrid joints. Automatica 2010, 46, 832–842. [Google Scholar] [CrossRef]
  6. Wang, G.; Lee, J. State control of a 12 DOF mobile manipulator via centroid feedback. Int. J. Precis. Eng. Manuf. 2013, 14, 745–754. [Google Scholar] [CrossRef]
  7. Korayem, M.H.; Nazemizadeh, M.; Rahimi, H.N. Trajectory optimization of nonholonomic mobile manipulators departing to a moving target amidst moving obstacles. Acta Mech. 2013, 224, 995–1008. [Google Scholar] [CrossRef]
  8. Mai, T.; Wang, Y. Adaptive force/motion control system based on recurrent fuzzy wavelet CMAC neural networks for condenser cleaning crawler-type mobile manipulator robot. IEEE Trans. Control Syst. Technol. 2014, 22, 1973–1982. [Google Scholar] [CrossRef]
  9. Jin, X. Formation-based decentralized iterative learning cooperative impedance control for a team of robot manipulators. IEEE Trans. Syst. Man Cybern. Syst. 2023, 53, 872–881. [Google Scholar] [CrossRef]
  10. Arefinia, E.; Talebi, H.A.; Doustmohammadi, A. A robust adaptive model reference impedance control of a robotic manipulator with actuator saturation. IEEE Trans. Syst. Man Cybern. Syst. 2020, 50, 409–420. [Google Scholar] [CrossRef]
  11. Li, G.; Yu, J.; Chen, X. Adaptive fuzzy neural network command filtered impedance control of constrained robotic manipulators with disturbance observer. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 5171–5180. [Google Scholar] [CrossRef] [PubMed]
  12. He, X.; Li, X.; Song, S. Nonsingular terminal sliding-mode control of second-order systems subject to hybrid disturbances. IEEE Trans. Circuits Syst. II Express Briefs 2022, 69, 5019–5023. [Google Scholar] [CrossRef]
  13. Park, J.; Kwon, W.; Park, P. An improved adaptive sliding mode control based on time-delay control for robot manipulators. IEEE Trans. Ind. Electron. 2023, 70, 10363–10373. [Google Scholar] [CrossRef]
  14. Sun, W.; Wu, Y.; Lv, X. Adaptive neural network control for full-state constrained robotic manipulator with actuator saturation and time-varying delays. IEEE Trans. Neural Netw. Learn. Syst. 2022, 33, 3331–3342. [Google Scholar] [CrossRef]
  15. Li, F.; Hou, Z. Distributed model-free adaptive control for MIMO nonlinear multiagent systems under deception attacks. IEEE Trans. Syst. Man Cybern. Syst. 2023, 53, 2281–2291. [Google Scholar] [CrossRef]
  16. Wang, L.; Chen, S. Fully distributed observer-based leader-following consensus of linear multiagent systems by adaptive dynamic event-triggered schemes. IEEE Trans. Cybern. 2025, 55, 332–342. [Google Scholar] [CrossRef]
  17. Wang, S.; Zhang, H.; Baldi, S.; Zhong, R. Leaderless consensus of heterogeneous multiple euler–lagrange systems with unknown disturbance. IEEE Trans. Autom. Control 2023, 68, 2399–2406. [Google Scholar] [CrossRef]
  18. Meng, X.; Mei, J.; Miao, Z.; Wu, A.; Ma, G. Fully distributed consensus of multiple euler–lagrange systems under switching directed graphs using only position measurements. IEEE Trans. Autom. Control 2024, 69, 1781–1788. [Google Scholar] [CrossRef]
  19. Li, C.; Zhao, L.; Xu, Z. Finite-time adaptive event-triggered control for robot manipulators with output constraints. IEEE Trans. Circuits Syst. II Express Briefs 2022, 69, 3824–3828. [Google Scholar] [CrossRef]
  20. Zhai, J.; Li, Z. Fast-exponential sliding mode control of robotic manipulator with super-twisting method. IEEE Trans. Circuits Syst. II Express Briefs 2022, 69, 489–493. [Google Scholar] [CrossRef]
  21. Lee, S.; Shin, M.; Son, H. Robust predictor-based control for multirotor UAV with various time delays. IEEE Trans. Ind. Electron. 2023, 70, 8151–8162. [Google Scholar] [CrossRef]
  22. Zhang, X.; Liu, J.; Tong, Y.; Liu, Y.; Ju, Z. Attitude decoupling control of semifloating space robots using time-delay estimation and supertwisting control. IEEE Trans. Aerosp. Electron. Syst. 2021, 57, 4280–4295. [Google Scholar] [CrossRef]
  23. Wang, Y.; Liu, L.; Wang, D.; Ju, F.; Chen, B. Time-delay control using a novel nonlinear adaptive law for accurate trajectory tracking of cable-driven robots. IEEE Trans. Ind. Inform. 2020, 16, 5234–5243. [Google Scholar] [CrossRef]
  24. Chen, Z.; Huang, F.; Sun, W.; Gu, J.; Yao, B. RBF-neural-network-based adaptive robust control for nonlinear bilateral teleoperation manipulators with uncertainty and time delay. IEEE/ASME Trans. Mechatron. 2020, 25, 906–918. [Google Scholar] [CrossRef]
  25. Xu, J.; Niu, Y.; Lam, H. Event-triggered sliding mode control of fuzzy systems via artificial time-delay estimation. IEEE Trans. Fuzzy Syst. 2021, 29, 2467–2478. [Google Scholar] [CrossRef]
  26. Welker, C.G.; Best, T.K.; Gregg, R.D. Improving sit/stand loading symmetry and timing through unified variable impedance control of a powered knee-ankle prosthesis. IEEE Trans. Neural Syst. Rehabil. Eng. 2023, 31, 4146–4155. [Google Scholar] [CrossRef]
  27. Best, T.K.; Welker, C.G.; Rouse, E.J.; Gregg, R.D. Data-driven variable impedance control of a powered knee–ankle prosthesis for adaptive speed and incline walking. IEEE Trans. Robot. 2023, 39, 2151–2169. [Google Scholar] [CrossRef]
  28. Cao, W.; Chen, C.; Hu, H.; Fang, K.; Wu, X. Effect of hip assistance modes on metabolic cost of walking with a soft exoskeleton. IEEE Trans. Autom. Sci. Eng. 2021, 18, 426–436. [Google Scholar] [CrossRef]
  29. Gordon, D.F.N.; McGreavy, C.; Christou, A.; Vijayakumar, S. Human-in-the-loop optimization of exoskeleton assistance via online simulation of metabolic cost. IEEE Trans. Robot. 2022, 38, 1410–1429. [Google Scholar] [CrossRef]
  30. Zhang, J.; Fiers, P.; Witte, K.A.; Jackson, R.W.; Poggensee, K.L.; Atkeson, C.G.; Collins, S.H. Human-in-the-loop optimization of exoskeleton assistance during walking. Science 2017, 356, 1280–1284. [Google Scholar] [CrossRef]
  31. Killingsworth, N.J.; Krstić, M. PID tuning using extremum seeking: Online, model-free performance optimization. IEEE Control Syst. Mag. 2006, 26, 70–79. [Google Scholar]
  32. Lin, J.; Divekar, N.V.; Thomas, G.C.; Gregg, R.D. Optimally biomimetic passivity-based control of a lower-limb exoskeleton over the primary activities of daily life. IEEE Open J. Control Syst. 2022, 1, 15–28. [Google Scholar] [CrossRef] [PubMed]
  33. Kumar, S.; Zwall, M.R.; Bolívar-Nieto, E.A.; Gregg, R.D.; Gans, N. Extremum seeking control for stiffness auto-tuning of a quasi-passive ankle exoskeleton. IEEE Robot. Autom. Lett. 2020, 5, 4604–4611. [Google Scholar] [CrossRef]
  34. Paz, P.; Oliveira, T.R.; Pino, A.V.; Fontana, A.P. Model-free neuromuscular electrical stimulation by stochastic extremum seeking. IEEE Trans. Control Syst. Technol. 2020, 28, 238–253. [Google Scholar] [CrossRef]
  35. Ijspeert, A.J. Central pattern generators for locomotion control in animals and robots: A review. Neural Netw. 2008, 21, 642–653. [Google Scholar] [CrossRef]
  36. Crespi, A.; Ijspeert, A.J. Online optimization of swimming and crawling in an amphibious snake robot. IEEE Trans. Robot. 2008, 24, 75–87. [Google Scholar] [CrossRef]
  37. Endo, G.; Nakanishi, J.; Morimoto, J.; Cheng, G. Experimental studies of a neural oscillator for biped locomotion with QRIO. In Proceedings of the IEEE International Conference on Robotics and Automation, Barcelona, Spain, 18–22 April 2005. [Google Scholar]
  38. Torrealba, R.R.; Cappelletto, J.; Fermín, L.; Fernández-López, G.; Grieco, J.C. Cybernetic knee prosthesis: Application of an adaptive central pattern generator. Kybernetes 2012, 41, 192–205. [Google Scholar] [CrossRef]
  39. Liu, X.; Onal, C.D.; Fu, J. Reinforcement learning of CPG-regulated locomotion controller for a soft snake robot. IEEE Trans. Robot. 2023, 39, 3382–3401. [Google Scholar] [CrossRef]
  40. Souzanchi-K., M.; Arab, A.; Akbarzadeh-T., M.R. Robust impedance control of uncertain mobile manipulators using time-delay compensation. IEEE Trans. Control Syst. Technol. 2018, 26, 1942–1953. [Google Scholar] [CrossRef]
Figure 1. Sketch of the robot manipulator. (a) Real robot manipulator. (b) Diagram of manipulator’s parameters.
Figure 1. Sketch of the robot manipulator. (a) Real robot manipulator. (b) Diagram of manipulator’s parameters.
Sensors 25 07648 g001
Figure 2. Schematic representation of the control flow. The reference trajectory q r denotes the input trajectory and synthesized based on q d and τ e . q denotes the output trajectory. Through optimization, q r is adaptively adjusted to reduce the cost function J . Parameter estimation enables the controller to maintain accurate tracking.
Figure 2. Schematic representation of the control flow. The reference trajectory q r denotes the input trajectory and synthesized based on q d and τ e . q denotes the output trajectory. Through optimization, q r is adaptively adjusted to reduce the cost function J . Parameter estimation enables the controller to maintain accurate tracking.
Sensors 25 07648 g002
Figure 3. Experimental setup.
Figure 3. Experimental setup.
Sensors 25 07648 g003
Figure 4. Desired circular path trajectory.
Figure 4. Desired circular path trajectory.
Sensors 25 07648 g004
Figure 5. Tracking performance under Case 1. (a) Joint 1 trajectory tracking. (b) Joint 2 trajectory tracking. (c) Joint tracking errors. (d) Joint control torque. (e) Contact force response. (f) Cost function J variation.
Figure 5. Tracking performance under Case 1. (a) Joint 1 trajectory tracking. (b) Joint 2 trajectory tracking. (c) Joint tracking errors. (d) Joint control torque. (e) Contact force response. (f) Cost function J variation.
Sensors 25 07648 g005
Figure 6. Desired triangular path trajectory.
Figure 6. Desired triangular path trajectory.
Sensors 25 07648 g006
Figure 7. Tracking performance in Case 2. (a) Joint 1 trajectory tracking. (b) Joint 2 trajectory tracking. (c) Joint tracking errors. (d) Joint control torque. (e) Contact force response. (f) Cost function J variation.
Figure 7. Tracking performance in Case 2. (a) Joint 1 trajectory tracking. (b) Joint 2 trajectory tracking. (c) Joint tracking errors. (d) Joint control torque. (e) Contact force response. (f) Cost function J variation.
Sensors 25 07648 g007
Figure 8. Desired sinusoidal path trajectory.
Figure 8. Desired sinusoidal path trajectory.
Sensors 25 07648 g008
Figure 9. Tracking performance in Case 3. (a) Joint 1 trajectory tracking. (b) Joint 2 trajectory tracking. (c) Joint tracking errors. (d) Joint control torque. (e) Contact force response. (f) Cost function J variation.
Figure 9. Tracking performance in Case 3. (a) Joint 1 trajectory tracking. (b) Joint 2 trajectory tracking. (c) Joint tracking errors. (d) Joint control torque. (e) Contact force response. (f) Cost function J variation.
Sensors 25 07648 g009
Table 1. Main variable notations.
Table 1. Main variable notations.
VariableNotationVariableNotationVariableNotationVariableNotation
M inertia matrix q 1 angle for joint 1 R j amplitude l 1 positive gain
C centripetal
coriolis matrix
q 2 angle for joint 2 g 0 constant l 2 positive gain
G gravitational torque g gravity R 0 total offset c 1 positive gain
q real angle of joints q d desired trajectory q i 0 raw trajectory c 2 positive gain
τ control torque q d i the i-th joint of q d T gait cycle γ 1 candidate function
ϖ e interaction torque r 0 offset J cost function γ candidate function
q r reference trajectory r j amplitude λ weight Δ γ candidate function
P d stiffness matric ω j frequency Γ assistant variable l control gain
D d damping matric t time K z constant matrix ζ control gain
ψ m   s ,   0 initial value s j phase k g constant gain η control gain
z impedance vector g j constant B gain matrix Λ gain matrix
d ( t ) seeking signal Ω j frequency τ ^ e estimation
Table 2. Mass weight of robot manipulator.
Table 2. Mass weight of robot manipulator.
PartMass (Kg)Motion Range (°)
body structure2.06
control assembly2.38
joint 12.77−90° to 90°
joint 22.59−150° to 150°
total10.8
Table 3. Trajectory tracking in case 1.
Table 3. Trajectory tracking in case 1.
ControllersTracking Error (°)
MEAN (°)
MSE (°)Force Error (N)
MEAN (N)
MSE (N)
Torque-based
Control [10]
4.23.13.42.3
Voltage-based
control [40]
3.62.33.02.5
Proposed control2.71.522.31.6
Table 4. Trajectory tracking in case 2.
Table 4. Trajectory tracking in case 2.
ControllersTracking Error (°)
MEAN (°)
MSE (°)Force Error (N)
MEAN (N)
MSE (N)
Torque-based
Control [10]
5.13.33.42.5
Voltage-based
control [40]
4.62.53.42.35
Proposed control3.41.62.42.1
Table 5. Trajectory tracking in case 3.
Table 5. Trajectory tracking in case 3.
ControllersTracking Error (°)
MEAN (°)
MSE (°)Force Error (N)
MEAN (N)
MSE (N)
Torque-based
Control [10]
4.873.173.832.4
Voltage-based
control [40]
4.33.23.62.3
Proposed control3.652.322.81.95
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Pi, M. Collaborative Control for a Robot Manipulator via Interaction-Force-Based Impedance Method and Extremum Seeking Optimization. Sensors 2025, 25, 7648. https://doi.org/10.3390/s25247648

AMA Style

Pi M. Collaborative Control for a Robot Manipulator via Interaction-Force-Based Impedance Method and Extremum Seeking Optimization. Sensors. 2025; 25(24):7648. https://doi.org/10.3390/s25247648

Chicago/Turabian Style

Pi, Ming. 2025. "Collaborative Control for a Robot Manipulator via Interaction-Force-Based Impedance Method and Extremum Seeking Optimization" Sensors 25, no. 24: 7648. https://doi.org/10.3390/s25247648

APA Style

Pi, M. (2025). Collaborative Control for a Robot Manipulator via Interaction-Force-Based Impedance Method and Extremum Seeking Optimization. Sensors, 25(24), 7648. https://doi.org/10.3390/s25247648

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop