Active Disturbance Rejection Terminal Sliding Mode Control for Tele-Aiming Robot System Using Multiple-Model Kalman Observers

: This study proposes a tele-aiming control strategy for the ground reconnaissance robot to track the maneuvering target rapidly in the presence of dynamic uncertainties, sensory measurement noises, and time-varying external disturbances. First, the tele-aiming control trajectory generated by human–computer interaction (HCI) device is ﬁltered with a tracking differentiator and a recursive average ﬁlter. Second, the inertial impact force disturbance generated by maneuvering tele-aiming control jointly with the other uncertainties (e.g., internal friction, modeling error, etc.) is considered as a lumped disturbance, and then a novel multiple-model augmented-state extended Kalman observer (MEKO) is designed, capable of ﬁltering out the joint measurement noises and estimating the lumped disturbance simultaneously. Lastly, a nonsingular terminal sliding mode controller is applied to eliminate the lumped disturbance and control the joints to track the corresponding desired joint trajectory. To verify the tele-aiming control performance, the random trajectory tracking experiments are designed to simulate the tele-aiming tracking control of maneuvering targets. As indicated from the experimental results, the proposed control strategy is capable of signiﬁcantly suppressing the effect of inertial impact force disturbance and joint measurement noises, and achieving fast and stable tele-aiming control.


Introduction
Over the past few years, as robot technology has been leaping forward, the application of robots is booming. In addition to manufacturing, robots have also been extensively adopted in several fields (i.e., disaster rescue, military, and aerospace). In simple structured scenarios, robots have been able to work autonomously. However, for sophisticated and unknown unstructured scenarios, robots still have not achieved complete autonomy, and their operation mode is still determined by human-computer interaction (HCI) based teleoperation technology. This technology exhibits the advantage that the master operator is capable of integrating the operator's intention into the real-time control of the robot. Such a teleoperation control technology that combines human intelligence can enhance the operation ability exhibited by robots in a sophisticated and unknown unstructured environment. By exploiting this technology, robots can perform dangerous tasks under the command of humans, such as armed reconnaissance missions. Such tasks are generally performed by a mobile robot equipped with a multi-degree of freedom (DOF) manipulator [1] and one or more hand-eye cameras. The success of such tasks depends largely on the accuracy and stability of the operator's remote control of the manipulator.
The core of teleoperation is to generate the desired trajectory through an HCI device, then send the trajectory wirelessly to the robot, and make the manipulator track the desired trajectory through a trajectory tracking controller. Throughout the research over the past few years, the design of trajectory tracking controllers for robotic manipulators has been extensively conducted.
The CTC requires an accurate robot dynamic model, and its control accuracy is determined by the accuracy of the model [2]. The OC, which also needs an accurate model, aims to seek a control strategy under given constraints to maximize (or minimize) a given prescribed performance [3]. The control output of MPC is determined by online solving a finite time domain open-loop optimal control problem in the next N time steps at the respective time step [6], which will lead to large computational costs [11][12][13][14], thereby causing poor real-time performance. The control effect depends on solving the optimal solution of the error between the predicted trajectory and the reference trajectory in the next N time steps, which implies that MPC is only suitable for the setpoint tracking scenarios not for the teleoperation scenarios [14]. The FC has a sophisticated design, especially when the system complexity is noticeably high, the appropriate membership function and fuzzy rules are difficult to determine [15,16], and there are difficulties in analyzing the structural properties of FC, including stability, controllability, and robustness [8].
In practical applications, due to the uncertainty of modeling error, internal friction, and time-varying external disturbance, it is generally impossible to obtain a completely accurate robot dynamic model. Therefore, it is difficult to obtain the desired tracking effect by directly using the above methods. Therefore, some methods, such as adaptive control [17], robust control [18], and sliding mode control (SMC) [9,19], study how to make the control performance meet the requirements when the model is inaccurate. Among these methods, SMC [20][21][22][23] has aroused considerable attention in the fields of robotic control for its fast transient response and robustness against uncertainties and disturbances. As indicated from the mentioned literature, SMC exhibits the advantage that it enables the system state to efficiently approach the equilibrium state in a limited time, even when the initial state is far away from the equilibrium.
Some research focuses on designing uncertainty compensation or disturbance estimator and realizing trajectory tracking control by combining with the above control methods. To remedy modeling uncertainties and external disturbances, in [24], a fuzzy compensator is employed in the CTC to achieve the trajectory tracking of a 3-joint manipulator. In [25] an extended state observer is adopted to estimate structured and unstructured uncertainties and is effectively applied to the trajectory tracking of a 2-joint manipulator. The work of [26] proposes a novel CTC control system for a 7-DOF robot, in which a Radial Basis Function (RBF) neural network compensator is introduced to the CTC to remedy the system error attributed to the imprecision of the model. The work of [7] formulates a control strategy of the CTC plus fuzzy compensation by exploiting an adaptive fuzzy logic system to remedy the uncertain part of the mechanical model of a 7-DOF picking manipulator. The work of [27] develops a novel anti-disturbance tracking controller for a class of Multiple Input Multiple Output (MIMO) systems under unknown disturbances and nonlinear dynamics, in which T-S fuzzy models were designed to estimate the unknown nonlinear disturbances. In [28] an asynchronous fuzzy integral sliding mode control (AFISMC) is proposed to drive the trajectories of the nonlinear Markov jump systems represented by Takagi-Sugeno (T-S) models, which are subject to external noise and matched uncertainties, into the predetermined sliding mode boundary layer in finite time.
For some teleoperation control scenarios where the reference trajectory can be given in advance, such as the teleoperation control of space manipulator, the above methods can meet the requirements. For some other special teleoperation scenarios, such as the HCI-based tele-aiming system of military robots, the operator needs to quickly control the robot to aim at the maneuvering target, and the reference trajectory needs to be changed according to the movement of the maneuvering target in real-time. The trajectory of the maneuvering target cannot be predicted, which also leads to non-uniform and great randomness of this time-varying reference trajectory. Therefore, the control methods that need to give the reference trajectory in advance, such as MPC or OC et al., are generally not suitable for those tele-aiming scenarios.
When the above-mentioned control methods use the HCI device to generate the corresponding time-varying non-uniform maneuvering trajectory, it may cause a large inertial impact force disturbance to the joints of the robot, and therefore causes an instantaneous significant residual between the robot end trajectory and the tele-aiming trajectory. When converted to joint space, this instantaneous large residual dramatically changes the joint reference trajectory and acceleration reference trajectory, thereby generating a large overshoot, which is easy to cause the flutter or instability of the manipulator [29].
For the above HCI-based tele-aiming scenarios, the observer-based control method is a very promising way to solve this time-varying inertial impact disturbance. For example, the active disturbance rejection control method (ADRC) can treat the modeling uncertainty and this time-varying inertial impact force disturbance as the lumped disturbance, and then estimate and eliminate this lumped disturbance online in real-time through the extended state observer (ESO) [30][31][32][33][34][35][36].
However, since the conventional ESO is a high-gain observer, its capabilities are intrinsically limited by the presence and severity of high-frequency sensor noise [37]. In practical robot systems, measurement noise inevitably exists and cannot be ignored. The high gains of the observer amplify the measurement noise and transfer it into the control loop, which may cause a decrease in control quality and even make the control system unstable. In addition, the traditional observers such as ESO, Kalman observer, and sliding mode observer can only adapt to observe one kind of disturbance, such as constant value disturbance or slow time-varying disturbance, and their observation results are usually very accurate without measurement noise. If one tries to use a set of observer coefficients to estimate multiple types of disturbance, the effect will be very poor.
Motivated by the aforementioned problem and the above discussion, in this paper, we propose an HCI-based tele-aiming control system for the ground reconnaissance robot equipped with a multi-degree of freedom reconnaissance system. To control the reconnaissance robot to aim and track the maneuvering target quickly, we designed a multiple-model observer-based active disturbance rejection sliding mode controller for the teleoperation aiming control system by transforming multiple-model Kalman filter technology into extended state observer, which can observe multiple types of disturbance including the inertial impact force disturbance and efficiently eliminate the influence of measurement noise, and achieve stable guidance trajectory tracking when the tele-aiming system is guided by time-varying non-uniform teleoperation.
Compared to previous works, the major contributions of the present study are presented below. (1) The proposed novel multiple-model augmented-state extended Kalman observer (MEKO) can achieve inertial impact force disturbance estimation and noise filtering simultaneously. (2) The proposed observer can estimate multiple types of disturbances, which only needs to determine the parameter range of the observer without adjusting the parameters manually and accordingly. (3) An active disturbance rejection terminal sliding mode tele-aiming control strategy based on MEKO is proposed to efficiently eliminate the instantaneous large residual attributed to inertial impact disturbance and achieve stable maneuvering target tracking when the robot system is controlled by an HCI device that can generate the time-varying maneuvering trajectory.
The rest of this study is organized below. In Section 2, A brief introduction of the dynamic model of a manipulator-based tele-aiming system is given. In Section 3, the proposed control strategy is given, consisting of the design of the novel multiple-model augmented-state extended Kalman observer and nonsingular terminal sliding mode controller. In Section 4, the verification experiments are presented. In Section 5, the conclusion is drawn.

Problem Formulation and Preliminaries
In this paper, we use a multi-degree-of-freedom vehicle-mounted tele-manipulator as the control object of the tele-aiming system, in which a camera is installed at the end of the tele-manipulator, and the camera image can be sent to the monitor through wireless image transmission equipment. According to the movement of the maneuvering target displayed in the monitoring video, the operator quickly controls the HCI device to generate the corresponding time-varying trajectory to control the tele-manipulator. For a rigid n-degree-of-freedom tele-manipulator, its dynamic equation can be expressed as: where q,q,q ∈ R n×1 denotes the joint angle, velocity, and acceleration, respectively; N(q) ∈ R n×n denotes the inertia matrix; V(q,q) ∈ R n×n denotes the nonlinear Coriolis and centrifugal force; G(q) ∈ R n×1 represents gravity; S(q) ∈ R n×1 is friction; d ∈ R n×1 is external disturbance; τ ∈ R n×1 expresses the joint torque.
In practical applications, accurate robot model information is usually difficult to obtain, whereas only the nominal robot model can be acquired. The nominal model of the telemanipulator is assumed as N 0 (q), V 0 (q,q), G 0 (q). Setting N = N 0 + ∆N, V = V 0 + ∆V, G = G 0 + ∆G and combining with the kinetic equation, we can get: Accordingly, N 0 (q)q + V 0 (q,q)q + G 0 (q) = τ + τ d (q,q,q, t) where τ d (q,q,q, t) ∈ R n×1 represents the unmodeled term, friction, and unknown disturbance of the tele-manipulator, as expressed below: . If x 1 = q, x 2 =q, τ = u is defined, we can get the state-space equation by rewriting the above equation: where y is the measurement value of joint angle, and v ∈ R n×1 is the measurement In addition, we design a saturation function to prevent the joint torque u from exceeding the upper or lower limit, which is shown as follows: where τ u and τ l are the upper limit value and the lower limit value of the joint torque.
To efficiently tele-control the robot to aim at the maneuvering target, the operator should efficiently generate the time-varying non-uniform guidance trajectory through an HCI device (e.g., mouse or force feedback handle), and the guidance trajectory will be transformed into the desired joint trajectory corresponding to the joint space through the IK (inverse kinematics algorithm). Lastly, the proposed controller enables the respective joint of the tele-manipulator to track the corresponding joint trajectory, in which the desired joint trajectory includes the desired joint angle q d , speedq d and accelerationq d .
The purpose of the controller designed in this study is to make the joint angle x 1 and speed x 2 track the non-uniform trajectories q d andq d generated by the HCI device in real-time efficiently, stably, and accurately. The joint position tracking errors and velocity tracking errors are expressed as follows: Thus, the goal of this study is to design a controller that can make the error expressed by the above equation approach zero efficiently and stably in a limited time.

Tele-Aiming Robot System Overall Design
The tele-manipulator system, which is the control object of the tele-aiming system studied in this paper, is a highly coupled nonlinear system with multiple inputs and multiple outputs. Accordingly, when the time-varying non-uniform guidance trajectory is applied to the tele-manipulator, each joint of the tele-manipulator also should perform the time-varying non-uniform rapid steering movement. Due to the existence of inertia, each joint of the tele-manipulator will cause inertial impact force disturbance to other joints during rapid steering movement. Since the joints of the tele-manipulator are highly coupled, the whole tele-manipulator may not be able to achieve stable target tracking as impacted by coupling disturbance. To weaken the influence of inertial impact force disturbance and achieve fast time-varying non-uniform target tracking, this study proposes an active disturbance rejection terminal sliding mode guidance controller based on a multiple-model augmented-state extended Kalman observer. The proposed controller can efficiently eliminate the inertial impact force disturbance and achieve stable target tracking when the tele-manipulator is controlled by the tele-aiming control system. The overall structure of the tele-aiming control system is presented in Figure 1.
According to Figure 1, to eliminate the coupling inertial impact force disturbance, the multi-input multi-output nonlinear and strong coupling tele-manipulator system is first decoupled into a single-input single-output joint group, and the internal friction, parameter perturbation, modeling error, and other uncertain factors of the joint together with the inertial impact force disturbance are considered as a lumped disturbance. Subsequently, the lumped disturbance is estimated by MEKO. Lastly, the lumped disturbance is eliminated by the active disturbance rejection terminal sliding mode controller, and the respective joint can efficiently, accurately, and stably track the joint trajectory corresponding to the end guidance trajectory, to achieve the fast, stable, and high-precision end guidance control by the HCI device.
To achieve joint decoupling, we define For the multi-input multi-output nonlinear affine system expressed by the Equation (4), a "virtual control variable": U = B(x 1 )u is introduced in this study, where U = [U 1 U 2 · · · U n ] T , and then the (4) becomes: The i-th joint in the robot system (4) turns out to be: where the input of the i-th joint is U i , and its output is y i , v i is the measurement noise on the corresponding joint channel. Thus, the input U i and output y i of each joint is a single-input single-output relationship. That is, the relationship between input U i and output y i of the i-th joint is completely decoupled. D i (x i1 , x i2 , t) represents the lumped disturbance of the i-th joint, including the inertial impact force disturbance attributed to rapid maneuvering traction. The actual control torque u = [u 1 u 2 · · · u n ] T of the tele-manipulator is expressed by the virtual control variable U = [U 1 U 2 · · · U n ] T . To weaken the chattering of the control torque, we carry out a low-pass filter (LPF) on the actual joint torque control sequence, as expressed below:

MEKO-Based Active Disturbance Rejection Terminal Sliding Mode Controller
To enable the end of n-joint tele-manipulator to aim at the maneuvering target efficiently, the guidance trajectory generated by the HCI device is first converted into joint trajectories by using inverse kinematics. Then, the n-joint tele-manipulator system is decoupled and a single joint trajectory tracking controller is applied to every single joint. To achieve state estimation and disturbance estimation under measurement noises simultaneously, the single joint trajectory tracking controller is designed as an active disturbance rejection terminal sliding mode controller based on MEKO.
Its schematic diagram is presented in Figure 2. First, the tracking differentiator is adopted to pre-filter the reference joint trajectory and extract its speed information. Next, the MEKO is used to estimate the position, velocity, and disturbance of the respective joint simultaneously. Lastly, the nonsingular terminal sliding mode control algorithm is used to eliminate the disturbance term and drive the joint to track the corresponding trajectory with low chattering and minimum tracking errors. The reference acceleration processed by the recursive average filter (RAF) is used in the nonsingular terminal sliding mode control.

Tracking Differentiator
To aim at the maneuvering target, the operator generates the guidance trajectory with time-varying, non-uniform, and random abrupt motion characteristics by the HCI device. The guidance trajectory may also exist measurement noise disturbance. To ensure that the tele-manipulator system is capable of tracking the guidance trajectory with measurement noise and abrupt motion, in this study, a tracking differentiator (TD) was adopted to preprocess the reference joint trajectory. The TD is expressed as: where x i1d and x i2d denote the desired position and velocity of the i-th joint, respectively, h is the sampling period, and θ id is the reference input signal, r i0 represents the parameter of tracking speed, h i0 is the step size different from h, f han( Through Equations (10) and (11), the tracking differentiator can efficiently pre-filter the input reference signal θ id and extract the approximate differential signal simultaneously, even when θ id is mixed with random offset noise generated by rapid maneuvering.

Multiple-Model Augmented-State Extended Kalman Observer
When the tele-manipulator is quickly guided by the HCI device, the joints will inertially impact other joints, which will cause instantaneous large residuals between the actual trajectory and the guidance trajectory. After converting to the joint space, such an instantaneous large residual will cause the guidance joint trajectory to vary suddenly, resulting in a sudden increase of the instantaneous tracking error. This inertial impact force disturbance can be considered as lumped disturbance together with internal friction, parameter perturbation, modeling error, and other uncertain factors. If this lumped disturbance can be estimated, it can be eliminated before it adversely affects the tele-aiming system. As we all know, the observer-based control methods can achieve a better control effect by estimating the lumped disturbance with a disturbance observer. To speed up the elimination of instantaneous tracking error, it is usually necessary to increase the observer gain, but this amplifies the measurement noise and may cause the flutter or instability.
In this paper, we propose a novel multiple-model augmented-state extended Kalman observer (MEKO) to achieve lumped disturbance estimation and noise filtering simultaneously. According to the second-order system expressed by (8), (8) is considered as an extended state x i3 and assuming that D i (x i1 , x i2 , t) is differentiable, and its derivative is bounded, (8) can be written as: is considered as process noise, then (12) is further rewritten as: The above equation can be expressed approximately in discrete form, then we get: where Z i = z i1 z i2 z i3 T is the state vector of the discrete system, k represents the number of samples, W i and V i are the uncorrelated process noise and measurement noise.
To estimate the disturbance and filter the measurement noise simultaneously, we propose a novel MEKO based on an extended Kalman observer (EKO). IfẐ i = ẑ i1ẑi2ẑi3 T is defined, EKO can realize state estimation, that isẐ i → Z i . Its principle is to treat the system modeling error and external disturbance as process noise, then expand it into a new system state and finally estimate it. Its basic principle is the same as that of Luenberger observer and ESO. However, compared with Luenberger observer and ESO, EKO can achieve the optimal estimation under linear minimum variance. In practical application, the external disturbance of the robot system may be a constant disturbance, time-varying disturbance, or instantaneous impact disturbance, which makes it difficult for us to determine the statistical characteristics of disturbance noise.
In this paper, to make the observer adapt to the disturbance noise with different statistical characteristics, the proposed MEKO optimize the weighted value of each observer online by using the estimation of innovation variance, so as to make the MEKO output the optimal disturbance estimation value according to the statistical characteristics of disturbance noise in the preset range. As shown in Figure 3, the output of the MEKO is the weighted average of a group of EKO outputs.  The convergence proof of EKO (basic observer of MEKO) is given in references [38,39], i.e., the EKO can achieveẑ i1 → The pseudo-code of MEKO is shown in Algorithm 1 as follows: Update: Kalman gain K i,m 10: 13: Compute: Innovation IV i,m and innovation covariance CIV i,m

15:
if k ≤ N i then 16: else 18: for j = 2 : N i do 19: end for 21: end if 23: It can be seen from the above pseudo-code that the weight of each EKO in MEKO is adjusted online according to the optimal estimation of innovation covariance. That is, if the innovation variance of an EKO is smaller, the estimated value of the EKO is accurate, and the weight of the EKO in the bank of EKOs is larger. By automatically adjusting the weights, a bank of EKOs with preset process statistical characteristics can output the optimal observation value in real-time according to the innovation and its covariance.
Strictly speaking, the proposed multiple-model observer-based control method belongs to the scope of data-driven control. The control effect of common data-driven control highly depends on the training data. It learns the controlled object without constraints, and its final learning is uncertain, and its stability cannot be well guaranteed. However, the proposed method does not directly use the input and output data to learn the controlled object or adjust the parameters of the controller, as in reference [40], but uses the measured output data to adjust the weight of each observer in the multiple-model observer, and finally outputs the lumped disturbance value according to the weighted sum of all observers. Different from the common data-driven methods, our method still belongs to model-based control, but the unmodeled dynamics and external disturbance can be observed in real-time through the data-driven method and eliminated in the subsequent control stage.

Nonsingular Terminal Sliding Mode Controller
Sliding mode control exhibits the advantages of simple, fast response, and strong robustness to external disturbances, unmodeled dynamics, and parameter perturbation. Compared with linear sliding mode control, terminal sliding mode enhances the convergence characteristics exhibited by the system by adding nonlinear terms into the sliding mode surface. It is characterized by the advantages of fast dynamic response, finite-time convergence, and high steady-state tracking accuracy. To further improve the robustness of the control system, a nonsingular terminal sliding mode control law is added into the nonlinear feedback control in the active disturbance rejection control framework.
First, the trajectory tracking error of i-th joint is defined as e i1 = x i1 − x i1d , and the nonsingular sliding mode surface function is defined as: where β i > 0, p i , q i (p i > q i ) are positive odd numbers. The nonsingular terminal sliding mode controller is designed as: In practical applications, to further suppress chattering, the saturation function sats(s i ) is generally adopted to replace the sign function sign(s i ) in the above equation, and its expression is: Compared with the conventional sign function, the saturation function can make the system adopt different strategies by complying with the value of sliding mode, i.e., outside the boundary layer δ i , the switching control is adopted to make the system state efficiently tend to slide mode. At the boundary layer δ i , the feedback control is adopted to reduce the chattering attributed to the rapid switching of sliding modes and ensure the function s i to be constantly at the boundary layer.
Generally, the acceleration signal is obtained by using the second-order difference method. Since the rapid random non-uniform tele-aiming control generates a guidance trajectory with an abrupt signal, the conventional second-order difference method will introduce amplified noise when processing this signal. Accordingly, to reduce the effect of the noise signal in the desired joint acceleration on the joint control, the recursive average filter (RAF) is exploited to process the desired joint accelerationẍ i1d , as expressed below: In this study, the MEKO is adopted to achieve the real-time online estimation of system state and lumped disturbance under the measurement noise, i.e.,ẑ i1 → x i1 ,ẑ i2 → x i2 ,ẑ i3 → x i3 . Thus, the trajectory tracking error of the i-th joint is actuallyê i1 =ẑ i1 − x i1d ,ê i2 = ė i1 = z i2 − x i2d , so the nonsingular terminal sliding mode controller of the i-th joint under the MEKO is modified as: is the estimated value of nonsingular sliding mode surface function. To reduce the chatter of the controller's output signal, we perform a low-pass filter (LPF) on the controller's output signal before sending it to robot, as expressed below:

Stability Proof of Closed-Loop Control System
In this paper, ẍ i1 =ẍ i1 + ∆ i is defined, and is set as the estimation error of the expanded state. The Lyapunov function of the i-th joint is taken as V is = 0.5ŝ i 2 , then the Lyapunov derivative is as follows. where η i > 0, the above equationV is = −η i |ŝ i | ≤ 0, which indicates that the system is stable. As indicated from reference [21], the sliding mode function s i can approach zero in a finite time. It is therefore suggested that the closed-loop control system of the i-th joint is stable, and the sliding mode surface is reachable.
When the closed-loop control system of each joint can ensure stability and the sliding mode surface can be reached, the closed-loop control system of the robot is also stable, and the sliding mode surface can be reached. In other words, when t approaches infinity, s, e,ė approaches zero, i.e., t → ∞, s → 0, e → 0,ė → 0. To be specific, the proposed MEKO-based nonsingular terminal sliding mode controller can make (6) stable to zero.

Disturbance Observation Experiment
To verify the effectiveness of our proposed MEKO, we set up a comparative experiment of disturbance observation. The observed object is shown as follows: where, F(x 1 , x 2 ) = −25x 2 , b = 120 are the known parts, D(x 1 , x 2 , t) is the unknown lumped part, the input torque signal u = sin(0.2 * π * t) was in the form of the sine wave. The initial position and initial velocity of the observed object were taken as zero. The simulation step was set to 0.01, and the total number of simulation steps was 5000 time steps.
To verify the multi disturbance observation ability of the observer, the lumped disturbance used the following piecewise function: The contrast observer used extended state observer ESO [30,41], high-order sliding mode observer SMO [21], and extended state Kalman observer EKO [42], respectively. Table 1 shows the parameters of all observers used in this experiment. These parameters are the optimal parameters obtained after many experiments.

Observer
Parameter Setting To simulate the real system, we added measurement noise with a mean value of 0 and a variance of 20 to the output signal, as shown in Figure 4a. The experimental results in Figure 4a, respectively, show the results of the four observers in observing the first-order position signal, the second-order velocity signal, and the third-order lumped disturbance signal. The black solid line represents the actual signal, the blue dotted line represents the observation results of ESO, the brown dotted line represents the observation results of SMO, the green dotted line represents the observation results of EKO, and the red solid line represents the observation results of MEKO proposed in this study. The experimental results in Figure 4b show the observed mean error and variance of the four observers at the first-order position, the second-order velocity, and the third-order lumped disturbance.
As can be seen from Figure 4a, when observing the first-order position signal, the fluctuation of ESO and EKO is slightly larger in the case of measurement noise, while the filtering effect and tracking effect of SMO and our proposed observer are better. When observing the second-order velocity, the fluctuation of ESO and EKO is slightly larger, and the observation result of MEKO is relatively consistent with the actual velocity. When observing the lumped disturbance signal, the four observers have a certain degree of phase delay in the face of multiple disturbances, but generally speaking, the MEKO represented by the red curve is relatively consistent with the actual disturbance signal. It can also be seen from Figure 4b that ESO has the maximum observation mean error and variance, followed by SMO and EKO, while our proposed MEKO has the minimum error and variance in the observation of first-order position, second-order velocity and third-order lumped disturbance signal. This comparative experiment shows that our proposed MEKO has good estimation performance for multiple types of disturbance (two constant value disturbance and one sinusoidal disturbance) under measurement noise interference.

Trajectory Tracking Experimental Setup
To verify the effectiveness of the proposed control method, several simulation experiments were performed on the MATLAB and the CoppeliaSim simulation platform. The UR5 manipulator in the CoppeliaSim simulation platform was employed as the tele-aiming control experimental object. The D-H parameters and dynamics parameters of the UR5 are listed in Table 2, and the coordinate system is presented in Figure 5. The simulation step size was set to 10 ms. The maximal joint torque of all joints is limited to 150 N (i.e., τ u = 150, τ l = 150), and its initial joint positions are [90 • , −160 • , 150 • , 0 • , 90 • , 0 • ].

Uniform Trajectory Tracking Experiment
To verify the effectiveness of the MEKO-based active disturbance rejection nonsingular terminal sliding mode controller (MEKOTSMC) proposed in this study, the MEKOTSMC and other comparison controllers were applied to uniform trajectory tracking experiments.
In this experiment, the tuning parameters of the proposed MEKOTSMC were set as follows. In TD part: r i0 = 1600, h i0 = 2h; In NTSMC part: β i = 46, η i = 300, p i = 9, q i = 7, δ i = 0.2; In MEKO part: the sliding window width of innovation sequence N i = 10, the model number M i = 60, the measurement noise variance matrices R i,m = 1000, the prior error covariance matrices P i,m (0) = 10, 000, the process noise variance matrices As a contrast, for the identical trajectory tracking experiment, this study employed the computed torque controller (CTC) [43], the terminal sliding mode controller based on the computed torque controller (CTC-NTSMC) [44], the computed torque controller based on radial basis function (RBF) neural network compensation (CTC-RBF) [45], the active disturbance rejection sliding mode controller (ADRSMC) [46], as well as the proposed controller (MEKOTSMC). Table 3 shows the parameters of all controllers used in this experiment. These parameters are the optimal parameters obtained after many experiments.
In this section, to verify the performance of the controllers, two groups of experiments were set: • MEKOTSMC and comparison controllers were adopted to control the robot, respectively, to achieve uniform trajectory tracking without joint measurement noises. The experimentally obtained results are presented in Figure 6a,c,e; • MEKOTSMC and comparison controllers were exploited to control the robot, respectively, to achieve uniform trajectory tracking with joint measurement noises (the joint measurement noise of [−0.05 rad~0.05 rad] is randomly added). The experimentally obtained results are illustrated in Figure 6c,d,f.
The comparison results of the end motion trajectory are presented in Figure 6, in which Figure 6a shows the results without the addition of joint measurement noises, and Figure 6b shows the results under the addition of joint measurement noises. Figure 6c,d show the comparison of square root errors generated by five controllers in two cases, i.e., with or without joint measurement noises. Figure 6e,f show the comparison results of the mean error and variance diagram generated by the five controllers in two cases, i.e., with or without joint measurement noises. The comparison results reflect the average level of tracking error and the similarity between the tracking trajectory and the desired trajectory throughout the tracking.
As indicated in Figure 6a, for the uniform trajectory without joint measurement noises, the five controllers were all able to achieve stable tracking. From the square root error diagram in Figure 6c, it can be seen that CTC-NTSMC was able to efficiently reduce the tracking error at the initial stage of trajectory tracking of the robot. The tracking error of the proposed controller first increases slightly and then decreases rapidly at the initial stage of tracking, which also leads to a larger mean error than CTC-NTSMC. Compared with other controllers, the proposed controller performed slightly better in the middle and later stages of the tracking process. As presented in Figure 6e, CTC and CTC-RBF performed poorly, while CTC-NTSMC, ADRSMC, and the proposed controller in this study performed well. Among them, CTC-NTSMC produced the smallest mean error, i.e., for the uniform trajectory without joint measurement noises, CTC-NTSMC was able to enable the robot to achieve trajectory tracking with less error.
According to Figure 6b,d, for the uniform trajectory with joint measurement noises, CTC, CTC-RBF, and CTC-NTSMC all failed to track, and only ADRSMC and the proposed controller were able to achieve stable tracking. From the square root error diagram in Figure 6d, it can be seen that at the initial stage of trajectory tracking of the robot, the five controllers can efficiently reduce the tracking error, but CTC, CTC-RBF, and CTC-NTSMC have error divergence immediately. This shows that these three controllers have poor robustness in the presence of measurement noises, while ADRSMC and the proposed controller have good anti-noise ability in the face of joint measurement noises. As indicated in Figure 6f, the mean error and variance diagram of ADRSMC and the proposed controller are very close. To show the performance of the five controllers in more detail, we show the mean error and variance of the respective joint generated when the five controllers were adopted to control the robot to track a uniform trajectory in Figure 7. Among them, Figure 7a,b show the mean error and variance of the joint without joint measurement noises and with joint measurement noises, respectively. According to the error bar graph presented in Figure 7a, compared with CTC and CTC-RBF, the mean error and variance of each joint generated by CTC-NTSMC, ADRSMC, and the proposed controller are smaller. The error bar graph shown in Figure 7b reveals that ADRSMC and the proposed controller can achieve the minimal mean error and variance of the respective joint, while CTC, CTC-RBF, and CTC-NTSMC show noticeable obvious error divergence when controlling the 1st∼6th joint of the robot, which also accounts for the failure of tracking uniform trajectory.
In brief, Figures 6 and 7 indicate that only ADRSMC and the proposed controller were able to enable the robot to achieve stable trajectory tracking for the uniform trajectory without joint measurement noises and with joint measurement noises, and there is an insignificant difference between the two.

Non-Uniform Maneuvering Target Trajectory Tracking Experiment
To more practically simulate the non-uniform maneuvering target and the performance of the tele-aiming control system, we generate a non-uniform maneuvering target trajectory by adding [−0.01 m~0.01 m] random offset noise to the circular trajectory radius. Subsequently, the controllers were adopted to control the robot to track this non-uniform trajectory. Furthermore, to verify the performance of the controllers under joint measurement noises, the following two groups of experiments were set: • the proposed controller and the comparison controllers were adopted to control the robot to achieve non-uniform trajectory tracking without joint measurement noises. The experimentally obtained results are illustrated in Figure 8a,c,e; • the proposed controller and the comparison controllers were adopted to control the robot to achieve non-uniform trajectory tracking with joint measurement noises (the joint measurement noise of [−0.05 rad~0.05 rad] is randomly added to simulate the sensor measurement noise in the actual robot system). The experimentally obtained results are presented in Figure 8b,d,f.  Figure 8a,b illustrate the end motion trajectory comparison diagram when the end of robot was controlled to track the non-uniform trajectory (two cases, i.e., joint measurement noises not added, and joint measurement noises added) by using the proposed controller and the four comparison controllers (i.e., CTC, CTC-RBF, CTC-NTSMC, and ADRSMC), respectively. Figure 8c,d compare the square root errors generated by the five controllers in two cases, i.e., with and without joint measurement noises. Figure 8e,f present the comparison diagrams of mean error and variance diagram generated by the five controllers in two cases, i.e., with or without joint measurement noises. The comparison diagram presents the average level of tracking error and the similarity between the tracking trajectory and the desired trajectory throughout the tracking.
According to Figure 8a, for the non-uniform trajectory without joint measurement noises, the five controllers were able to achieve stable tracking. As indicated in the square root error diagram in Figure 8c, the proposed controller first increases the tracking error slightly, then reduces the error rapidly, and then maintains it at a low level. Compared with other comparison controllers, the proposed controller performed slightly better at the middle and later tracking stages. According to Figure 8e, the mean error and variance of CTC and CTC-RBF are relatively large, while the mean error and variance of CTC-NTSMC, ADRSMC, and the proposed controller are relatively small. The proposed controller was the controller with the smallest mean error throughout the tracking, i.e., for the uniform trajectory without joint measurement noises, the proposed controller was able to make the robot achieve trajectory tracking with less error.
As presented in Figure 8b,d, for the uniform trajectory with joint measurement noises, CTC, CTC-RBF, and CTC-NTSMC all failed to track. Only ADRSMC and the proposed controller were capable of achieving stable tracking. From the square root error diagram in Figure 8d, at the initial stage of trajectory tracking, it can be seen that the five controllers were able to efficiently reduce the tracking error, whereas CTC, CTC-RBF, and CTC-NTSMC exhibit error divergence immediately. Thus, CTC, CTC-RBF, and CTC-NTSMC show poor robustness in the presence of measurement noises, while ADRSMC and the proposed controller exhibit a high anti-noise ability when facing joint measurement noises. According to Figure 8f, the mean error and variance diagram of the proposed controller is slightly smaller than those of ADRSMC and other comparison controllers, which reveals that the proposed controller can achieve stable tracking with small error even if the measurement noises are added when facing non-uniform trajectory.
To more intuitively illustrate the performance of the five controllers, the mean error and variance of each joint are presented when the five controllers were adopted to control the robot to track the non-uniform trajectory in Figure 9. Figure 9a,b present the mean error and variance of the joint without joint measurement noises and with joint measurement noises, respectively. According to the error bar graph given in Figure 9a, when joint measurement noises were not added, the five controllers were able to make the respective joint of the robot stably track the desired trajectory of the corresponding joint, and the joint mean error and variance generated by CTC-NTSMC, ADRSMC, and the proposed controller were relatively small. As suggested from the error bar graph in Figure 9b, after the addition of the joint measurement noises, the tracking error divergence occurred in CTC, CTC-RBF, and CTC-NTSMC. Only ADRSMC and the proposed controller in this study produced less joint mean error and variance. However, compared with other controllers, the proposed controller was capable of keeping the tracking error of each joint of the robot at a lower level when tracking the corresponding desired trajectory.  In brief, according to Figures 8 and 9, only ADRSMC and the proposed controller were able to make the robot achieve stable trajectory tracking for the non-uniform trajectory without joint measurement noises and with joint measurement noises, while the proposed controller was able to make the robot achieve stable trajectory tracking with less error.
Finally, we set up an additional comparative experiment: using the tele-aiming system to control the robot to track the maneuvering target with rectangular motion, in which the controller still used CTC, CTC-RBF, CTC-NTSMC, ADRSMC, and our proposed controller. Similar to the previous experiments, to simulate the motion of the non-uniform maneuvering target, we added [−0.01 m~0.01 m] random offset noise to the rectangular trajectory. Furthermore, to verify the performance of the controllers under joint measurement noises, [−0.05 rad~0.05 rad] random noise is randomly added to simulate the sensor measurement noise in the actual robot system. The experiment results are shown in Figure 10. Figure 10a shows the rectangular motion trajectory with random offset noise and the tracking trajectory when the tele-aiming system used different controllers. Figure 10b shows the change curve of mean square tracking error in the process of tracking rectangular trajectory. Figure 10c shows the mean error and variance bar graph of the whole process. Figure 10d shows the mean error and variance bar graph of each joint when the tele-aiming system used five controllers to track the non-uniform rectangular motion. It can be seen from Figure 10a,b that when the tele-aiming system uses the conventional CTC method, the robot appears instability, resulting in its inability to stably track this nonuniform maneuvering target with random offset noise under the condition of measurement noise. It can be seen from Figure 10c,d that the controller with the best performance is our proposed MEKO based active disturbance rejection terminal sliding mode controller.

Conclusions
In this study, a maneuvering tele-aiming control strategy is proposed based on a novel multiple-model augmented-state extended Kalman observer (MEKO) and an active disturbance rejection terminal sliding mode controller. The control strategy enables the ground reconnaissance robot to efficiently track the maneuvering trajectory generated by the HCI device even when the robot is accompanied by joint measurement noises. To this end, a novel multiple-model observer is designed, capable of filtering the joint measurement noises and estimating the lumped disturbance simultaneously. Subsequently, a nonsingular terminal sliding mode controller is exploited to eliminate the lumped disturbance and control the robot, as an attempt to efficiently track the maneuvering trajectory. To simulate the maneuvering guidance trajectory, random offset noises are added to the guidance trajectory in the experimental verification part. To simulate the tele-aiming control system of the real robot, some random joint measurement noises are introduced when reading the joint angle. As revealed from the experiment results, the control strategy proposed in this study can endow the robot with the ability to track the maneuvering target stably under the maneuvering tele-aiming control.
Our proposed control method has a wide application background, especially in the field of robot control where the desired trajectory cannot be given in advance, such as in the field of HCI-based robot teleoperation control (ground reconnaissance robots, space manipulators, etc.). If the human traction to the robot is regarded as an external disturbance, our proposed method can also be used to the drag teaching robots.
The basic idea of MEKO is to automatically adjust the weight of each observer through the innovation covariance. Based on this idea, MEKO can output the optimal observation values of disturbance noise with different statistical characteristics in real-time, but MEKO needs to preset the process noise statistical characteristic matrix of each observer, the number of observers, and the length of the innovation sliding window. Therefore, to make up for this deficiency, our follow-up work is to explore the use of heuristic methods to automatically optimize these super parameters.