Next Article in Journal
Deep Transfer HSI Classification Method Based on Information Measure and Optimal Neighborhood Noise Reduction
Previous Article in Journal
Parallel Control of Converters with Energy Storage Equipment in a Microgrid
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Research on Adaptive Reaction Null Space Planning and Control Strategy Based on VFF–RLS and SSADE–ELM Algorithm for Free-Floating Space Robot

Space Engineering University, No.1 Bayi Road, Beijing 101416, China
*
Author to whom correspondence should be addressed.
Electronics 2019, 8(10), 1111; https://doi.org/10.3390/electronics8101111
Submission received: 17 August 2019 / Revised: 25 September 2019 / Accepted: 27 September 2019 / Published: 2 October 2019
(This article belongs to the Section Systems & Control Engineering)

Abstract

:
With the increase of on-orbit maintenance and support requirements, the application of a space manipulator is becoming more promising. In actual operation, the strong coupling of the free-floating space robot itself and the unknown disturbance of the contact target caused a major challenge to the robot base posture control. Traditional Reaction Null Space (RNS) motion planning and control methods require the construction of precise dynamic models, which is impossible in reality. In order to solve this problem, this paper proposes a new Adaptive Reaction Null Space (ARNS) path planning and control strategy for the contact of free-floating space robots with unknown targets. The ARNS path planning strategy is constructed by the Variable Forgetting Factor Recursive Least Squares (VFF–RLS) algorithm. At the same time, a robust adaptive control strategy based on the Strategy Self-Adaption Differential Evolution–Extreme Learning Machine (SSADE–ELM) algorithm is proposed to track the dynamic changes of the planned path. The algorithm enables us to intelligently learn and compensate for the unknown disturbance. Then, this paper constructs a robust controller to compensate model uncertainty. A striking feature of the proposed strategy is that it does not require an accurate system model or any information about unknown attributes. This design can dynamically implement RNS path tracking performance. Finally, through simulation and experiment, the proposed algorithm is compared with the existing methods to prove its effectiveness and superiority.

1. Introduction

Nowadays, with the increasing frequency of space activities, the impact of spacecraft being hit by space debris has increased. A large amount of space debris has already seriously threatened the safety of on-orbit spacecraft. Especially by a large number of large-scale space debris, they may change the attitude and orbit of the spacecraft and even cause the spacecraft to be completely destroyed [1].
In order to solve such problems, the development of space debris removal technology has become particularly urgent. Among the many active space removal technologies, the technology of space manipulator removal in the orbit has received extensive attention. The ETS–VII of Japan, the Orbital Express of the United States, and the SY-7 of China have successively conducted verification experiments on this technology in space, and are still intensifying their research [2,3].
The related research [4,5,6,7,8,9] shows that the space robot is in a free-floating state during the process of capturing the target, and the collision from the target contact may lead to a large pulse momentum. The strong dynamic coupling between the spacecraft base and the manipulator may lead to instability of the attitude of the space robot base, which in turn may cause problems such as space robot rollover. The minimum planning of the base attitude disturbance can minimize the disturbance of the manipulator movement on the base posture.
Nenchev [10] proposed an attitude control algorithm for free-floating space robots based on a fixed attitude constrained Jacobian matrix. The algorithm aims to plan the joint movement speed so that the joint angular velocity falls within the null space of the joint inertia matrix, thereby keeping the attitude of the body unchanged. Xie et al. [11] proposed a reactionless coordinated motion planning algorithm with a kinematically redundant manipulator for the dynamic coupling problems of free-floating space manipulators. The numerical results of the planar motion of a 4-links space robotic show that the algorithm is effective. Wei et al. [12] considered the attitude reactionless control and the vibration suppression in the meantime to reduce the risk in grasping operation. The simulation results indicate that, by using the optimal control for the vibration suppression in the attitude RNS, the vibration of the manipulator could be alleviated significantly and the base attitude could also almost be undisturbed in the meantime during the whole grasping procedure.
The space manipulator equipped with a flexible mechanism can better achieve the collision force buffering and unloading during the contact with the target. However, the contact between the manipulator and the target will stimulate the flexible manipulator vibration and cause substantial disturbance to the base attitude, due to the momentum conservation in space. The flexible manipulator vibration and the impulse to the base attitude are critical to the contacting safety and the performance. The uncertain properties from the captured unknown objects such as space debris can cause the conventional no-disturbance planning methods to be invalid. To solve this problem, the Adaptive Reaction Null Space (ARNS) method [13,14] with dynamic changing planning paths for manipulators during capturing of an unknown object was proposed and introduced to realize the minimum disturbance to the spacecraft body attitude. In this method, the unknown part of the target can be obtained online via the Recursive Least Square (RLS) algorithm, and compensation can be made in the form of a time-variable matrix for the path planning equation of the manipulator. Xu et al. [15] proposed an adaptive control algorithm based on Reaction Null Space (RNS) for free-floating robots with uncertain kinematics and dynamics, which realizes base attitude adjustment and continuous path tracking of end effectors. Similar schemes should acquire accurate information of the captured object and bring about high sensor requirements and a delay control issue. Lu and Jia [16] made the spacecraft attitude regulation error and the end–effector pose tracking error meeting the respective prescribed performance requirements, converging to zero by the ARNS and the prescribed performance functions. Thus, the coupling effects of the free-floating robots are overcome. Zhang [17] studied the target dynamics parameter identification based on ARNS planning and used the RLS algorithm to update the system in real-time. This method does not need the iteration of unknown dynamic parameters of non-cooperative targets, and the base parameter identification is realized. The attitude disturbance is minimal and the adaptive control can be separated from the dynamic parameter identification. Jiao et al. [18] derived a dual-arm space robot model and proposed an ARNS motion control approach to satisfy the principal objective of maintaining a minimum disturbance to the base in the post-capture of a non-cooperative target. They developed a new ARNS control for a dual-arm space robot system. The adaptive algorithm is developed based on the momentum conservation of the system and the RLS algorithm is employed for parameter adaptation. The simulation results are presented to demonstrate the effectiveness of the proposed approach.
The traditional RLS algorithm has data saturation phenomenon [19]. When the space robot contacts an unknown target, the recursive algorithm cannot be directly used because the system parameters change with time, thereby reducing the performance of the ARNS algorithm path planning. In order to adapt to the case of time-varying parameters, the forgetting factor can be added to reduce the weight of the old data and increase the weight of the new data, thereby increasing the adaptability of the ARNS algorithm to the time-varying system [17]. In addition, the sliding-window RLS (SW–RLS) algorithm eliminates the saturation of the RLS algorithm by retaining a finite data length [20]. To realize the reactionless motion for a space robot while capturing an unknown object, not only the ARNS path planning algorithm based on dynamic characteristics, but also the space robot attitude tracking control algorithm, capable of effectively tracking the planned path, is needed. However, when the space robot itself has uncertainty and the target state is unknown, in order to ensure the stability of the system attitude and reduce the influence of tracking error, the tracking control strategy of the system needs to learn the unknown disturbance and compensate through the adaptive strategy, while robust strategy culls model uncertainty, such as in Reference [21]. However, traditional adaptive strategies, such as Back-Propagation neural network [22], radial basis function neural network [23], and fuzzy algorithms [24,25], are not processing fast enough to handle disturbance with bursty properties. We need a faster learning and compensating method for the abrupt unknown properties.
The Extreme Learning Machine (ELM) [26,27] algorithm was developed for its extremely fast system uncertainty learning and cognizing performance [28,29,30]. However, the random input feature settings (input weight, hidden layer bias) of conventional ELM networks could weaken the network performance to some extent, because the pre-setting input feature owns the possibility of staying away from its optimal value.
Given the problem, some ameliorated methods by optimizing the input feature of the ELM algorithm have been proposed. Figueiredo et al. [31] studied the effects of eight different topologies on the performance of Particle Swarm Optimization-ELM (PSO–ELM). The results show that there is no optimal topology suitable for all problems, but according to the root mean square error from the experimental results, the global topologies perform better. Zhang et al. [32] proposed a Memetic Algorithm Based-ELM (M–ELM) algorithm. M–ELM integrates individual heuristic searches into the global optimization framework of the population to automatically learn the optimal network parameters, and effectively overcomes the problem of premature convergence. Cao et al. [33] proposed a Self-adaptive Evolutionary (SaE–ELM) algorithm that not only adaptively adjusts the crossover probability and scaling factor, but also automatically selects the best mutation strategy from the four-seed generation strategy. The convergence of the algorithm is improved. Tang et al. [34] proposed a Self-adaptive Differential Evolutionary–Weighted ELM (SDE–WELM) algorithm, which uses an adaptive differential evolution algorithm to optimize the parameters of hidden layer neurons and the weight of training samples, improving the accuracy of unbalanced data classification. Chu et al. [20] proposed a robust adaptive control strategy, using PSO–ELM algorithm to dynamically learn the reactionless path of space robot planning, and compensate for the unknown characteristics in the form of constructive adaptive control. Using the ELM algorithm to learn fast, the mutation characteristics of the space robot system are processed, and the input characteristics of the ELM are optimized by the PSO algorithm, which improves the ELM performance.
Considering the uncertainty of the space robot system and unknown external disturbance, and in order to meet the requirements of space robot for ARNS planning and manipulator tracking control with faster adaptive performance and higher tracking accuracy, the strategy uses the Variable Forgetting Factor Recursive Least Squares (VFF–RLS) algorithm to identify and compensate for the unknown characteristics of the system, and avoids the saturation phenomenon when dealing with time-varying elements in the planning process. At the same time, the ELM algorithm is improved by the Strategy Self-Adaptation Differential Evolution (SSADE) algorithm to effectively compensate for the unknown interference in the tracking control process. Based on this, this paper proposes an ARNS path planning strategy based on the VFF–RLS algorithm, and a tracking control algorithm based on SSADE–ELM algorithm.
The work carried out in this paper is as follows: In Section 2, the dynamic model of the space robot is established. In Section 3, the planning Strategy and Adaptive robust control algorithm constructed by the ARNS Planning Strategy based on VFF–RLS, adaptive robust control algorithm based on SSADE–ELM, and robust control algorithm are elaborated. The simulation results are shown in Section 4. The experimental results are shown in Section 5. The discussion is shown in Section 5. Finally, the conclusions are provided in Section 6.

2. Dynamic Model of the Space Robot

The simplified model of the space robot designed in this paper is shown in Figure 1. Where Σ I is the inertial frame; Σ i ( i 1 , 2 , , n ) is the ith body frame and Σ 0 is the base frame, where b 0 is a vector pointing from center of the base to the centroid of the first joint. r 0 , v 0 , ω 0 are the position vector of the centroid of the base, the velocity and angular velocity of the base, respectively. a i is a vector pointing from center of the ith joint to the centroid of the ith body and b i is a vector pointing from the centroid of the ith body to the center of the (i + 1)th joint. r 0 g is the vector from origin of the mass center of the base to the mass center of the system. r g is the position vector of the whole system with respect to inertial frame.
Considering that the space robot is in a free-floating state during the contact with the target and the post-contact stabilization process, the complex system composed of the space robot and the target is not subjected to any external force, and the complex system maintains the angular momentum and the linear momentum conservation. The only disturbance in the system is due to internal factors such as joint friction, measurement noise, and uncertainty of the target parameters. The Lagrange equation can be used to construct the dynamic model of a space robot as in Equation (1):
M t q ¨ + C t q ˙ + τ k = τ e ; Δ M = M t M , Δ C = C t C
Equation (1) represents the complex dynamics model in which the dynamic parameters of the system are changed based on the dynamic parameters of the original space robot after the space robot captures the unknown target to form the complex. q is the generalized state variable of joint angular displacement. M and C , respectively, represent the system equivalent moment of inertia and equivalent friction coefficient of the joint of the space manipulator. M t and C t represent the actual moment of inertia and the actual frictional damping coefficient of the joint of the space manipulator joint, respectively. Δ represents an unknown system model parameter perturbation. τ k is the joint actuation torque. τ e is an unknown bounded external disturbance. The model parameter uncertainty in the system dynamics model in Equation (1), the unknown friction torque, and the unknown external disturbance can be expressed as a form of composite interference, as shown in Equation (2).
M q ¨ + C q ˙ + τ k = d d = τ e Δ M q ¨ Δ C q ˙
where M = d i a g { M 11 , M i i , M n n } , Δ M = d i a g { Δ M 11 , Δ M i i , Δ M n n } are constant coefficient diagonal matrices. So, Equation (2) can be rewritten as
q ¨ = M 1 ( d C q ˙ τ k ) = M 1 ( d C q ˙ ) M 1 τ k
Setting d = M 1 ( d C q ˙ ) , where d = [ d 1 , d 2 , , d n ] . Since M is a diagonal reversible matrix, it can be proved that Equation (3) is decoupable. The angular displacement of the above equation after decoupling has the following form.
q ¨ i = d i M i i 1 τ k i
Let q d be the generalized angular displacement of the joint desired. Then there are joint angle error and angular velocity error, as shown in the following equation.
e = q q d ; e ˙ = q ˙ q ˙ d
The control variable of the joint angle error system of the space robot can be expressed as:
r = e ˙ + Λ e
where Λ is a positive diagonal matrix. Since r tends to zero, the reference output of the joint angular displacement can be expressed as:
q ˙ r = q ˙ d + Λ e
Bringing Equation (7) into Equation (1) gives:
M ( q ¨ d + Λ e ˙ ) + C ( q ˙ d + Λ e ) + τ k d = M ( q ¨ d + Λ e ˙ ) + C ( q ˙ d + Λ e ) ( M q ¨ + C q ˙ d ) d = M ( q ¨ d + Λ e ˙ q ¨ ) + C ( q ˙ d + Λ e q ˙ ) = M r ˙ + C r

3. Adaptive Reaction Null Space (ARNS) Planning Strategy and Adaptive Robust Control Algorithm

3.1. Adaptive Reaction Null Space (ARNS) Planning Strategy Based on Variable Forgetting Factor Recursive Least Squares (VFF–RLS)

After the space robot captures the target, the motion of the robot arm may cause the base posture to be greatly disturbed. This section uses the ARNS planning to solve the problem that the complex system has the least disturbance to the base during the motion. The minimum perturbation to the base attitude is achieved by planning the reaction null space joint angular velocity with unknown target kinetic parameters. This section uses the least squares method to complete the adaptive update based on the smallest error prediction, since the conventional least squares method has saturation and is not suitable for the case of time-varying system parameters. Therefore, the variable forgetting factor is used to improve the least squares method, and the VFF–RLS algorithm is constructed to achieve ARNS planning.
The space robot in this paper is in a free-floating state, and its manipulator has redundancy. The centroid position vector of each arm of the space robot complex system is as follows.
r i = r 0 + b 0 + k = 1 i 1 ( a k + b k ) + a i
Setting J R i = [ Z 1 , Z 2 , , Z i , 0 , , 0 ] and J T i = [ Z 1 × ( r i p 1 ) , Z 2 × ( r i p 2 ) , , Z i × ( r i p i ) , 0 , , 0 ] . The linear velocity and angular velocity at the centroid of each boom are as follows.
v i = r ˙ i = v 0 + ω 0 × ( r i r 0 ) + J T i q ˙
ω i = ω 0 + J R i q ˙
The linear momentum equation of the system is:
P = i = 0 n ( m i r ˙ i ) = i = 0 n ( m i r ˙ i )
Substituting Equation (10) into Equation (12) can obtained that:
P = M v 0 + ( M r ˜ T 0 g ) ω 0 + J T ω q ˙ = [ M E M r ˜ T 0 g ] [ v 0 ω 0 ] + J T ω q ˙
where r 0 g = r g r 0 , J T ω = i = 1 n ( m i J T i ) .
Similarly, the system satisfies the following angular momentum conservation equation:
L = i = 0 n ( I i ω i + m i r i × r ˙ i )
Substituting Equations (10) and (11) into Equation (14) yields the following equation:
L = I 0 ω 0 + i = 1 n I i ( ω 0 + J R i q ˙ ) + i = 1 n m i r ˜ i ( v 0 r ˜ T 0 i ω 0 + J T i q ˙ ) = [ M r ˜ 0 g I ω ] [ v 0 ω 0 ] + I ω ϕ q ˙
where I ω = i = 1 n ( I i m i r ˜ i T r ˜ 0 i ) + I 0 , I ω ϕ = i = 1 n ( I i J R i + m i r ˜ i J T i ) .
The angular momentum of the multi-body system set as L g is calculated around the centroid of the whole space robot. The angular momentum conservation equation can be rewritten as:
L = L g + r g × P
The angular momentum and linear momentum equations of the space robot after contact with the target can be obtained. In conjunction with Equations (13) and (15), the momentum conservation equation can be expressed as in Equation (17), where P is the complex system linear momentum, and L is the angular momentum of the composite system.
[ P L ] = [ M E M r ˜ T 0 g 0 H ω ] [ v 0 ω 0 ] + [ J T ω H ω ϕ ] q ˙ + [ 0 r g × P ]
where H ω = i = 1 n ( I i + m i r ˜ g i T r ˜ 0 i ) + I 0 , H ω ϕ = i = 1 n ( I i J R i + m i r ˜ g i J T i ) , M is the total mass of the space robot target complex system.
The angular momentum equation of the space robot complex system obtained by Equation (17) is as follows.
L = H ω ω 0 + H ω ϕ q ˙ + r g × P
When the parameters of the complex system are known, the RNS-based composite system manipulator joint angular motion plan is:
q ˙ d | R N S = H ω ϕ + ( L r g × P ) + ( E H ω ϕ + H ω ϕ ) ξ ˙
where E H ω ϕ + H ω ϕ is the null space projection of the coupling matrix H ω ϕ . H ω ϕ + is the generalized inverse matrix of H ω ϕ . ξ ˙ R n is any non-zero vector. However, since the target angular momentum is an unknown variable, H ω and H ω ϕ in Equation (19) cannot be accurately obtained, and the estimated values are used, which are denoted as H ˜ ω and H ˜ ω ϕ . Therefore, using the ARNS motion planning algorithm to adjust the coefficient matrix online, the accurate performance of the complex system can be obtained.
q ˙ = H ˜ ω ϕ + ( L r g × P ) H ˜ ω ϕ + H ˜ ω ω 0 + ( E H ˜ ω ϕ + H ˜ ω ϕ ) ξ ˙
Setting y = q ˙ d | R N S q ˙ , then bringing in Equation (20) can obtain Equation (21) as follows.
y = q ˙ d | R N S q ˙ = ( H ω ϕ + H ^ ω ϕ + ) ( L r g × P ) + H ^ ω ϕ + H ^ ω ω 0 + ( H ^ ω ϕ + H ^ ω ϕ H ω ϕ + H ω ϕ ) ξ ˙ = K 1 + K 2 ω 0 + K 3 ξ ˙ = [ K 1 K 2 K 3 ] [ 1 ω 0 ξ ˙ ]
where K 1 = ( H ω ϕ + H ^ ω ϕ + ) ( L r g × P ) ; K 2 = H ^ ω ϕ + H ^ ω ; K 3 = ( H ^ ω ϕ + H ^ ω ϕ H ω ϕ + H ω ϕ ) .
Define the error function ε is as follows.
ε = y ( K ^ 1 + K ^ 2 ω 0 + K ^ 3 ξ ˙ )
Since the parameters K 1 , K 2 , and K 3 contain uncertain parameters of non-cooperative targets, in order to ensure that the angular velocity of the pedestal converges to zero, an algorithm is needed to continuously update the values of parameters K 1 , K 2 , and K 3 , so that the error function ε is minimal.
In order to ensure that the unknown target has the least disturbance to the base attitude, the parameters K 1 , K 2 , and K 3 are automatically updated online according to Equation (15) using the VFF–RLS algorithm to achieve the effect of adaptively updating the RNS velocity of the joint. Equation (15) can be written as a standard regression equation as follows.
y T = Φ W
where Φ = [ 1 ω 0 ξ ˙ ] T , W = [ K 1 K 2 K 3 ] T . According to the VFF–RLS algorithm, Equation (20) is the update equation of W .
W ^ ( k ) = W ^ ( k 1 ) + N ( k ) ε 1
where
ε 1 = y ( k ) T Φ ( k ) W ^ ( k ) N ( k ) = Q ( k 1 ) Φ ( k ) λ ( k ) E + Φ ( k ) T Q ( k 1 ) Φ ( k ) Q ( k ) = λ ( k ) 1 E [ E N ( k ) Φ ( k ) T ] Q ( k 1 )
If λ ( k + 1 ) is a constant, it cannot achieve good stability and tracking capabilities at the same time. In fact, when λ ( k + 1 ) is small, the algorithm has fast convergence rate but poor stationarity, and when λ ( k + 1 ) is large, the algorithm has converse performance. It is not an easy work to find an appropriate constant. Therefore, a new form of changing rule is introduced, where the convergence rate is fast in the beginning phase and has good stability in the subsequent phase. In this paper, the following changing rule is adopted.
λ ( k ) = λ m a x σ 1 e σ 2 k
where σ 1 and σ 2 are two coefficients that control the changing manner of forgetting factor. In the above formula, W ^ ( k ) is an estimate of the kth generation of W ( k ) . ε 1 represents the a priori residual. Q ( k ) is the inverse of the autocorrelation matrix. Q ( 0 ) = δ E , δ > 1 . Here δ = 1.4. The forgetting factor is close to 1 to ensure that the ARNS planning algorithm is stable during the post-capture motion. So, take σ 1 = 0.1 , σ 2 = 0.04 , λ max = 0.99 . When W ^ is updated, the expected RNS joint angular velocity can be expressed as:
q ˙ d | R N S = q ˙ + W ^ T [ 1 ω 0 ξ ˙ ]
Figure 2 shows the ARNS path planning algorithm flow.

3.2. Adaptive Control Algorithm Based on Strategy Self-Adaptation Differential Evolution–Extreme Learning Machine (SSADE–ELM)

The space robot dynamics model constructed in this paper is a nonlinear system with unknown disturbances. The nonlinear system has parameter perturbation, while adaptive control and robust control are the main means to cancel the parameter perturbation of nonlinear systems. This section presents an adaptive robust control algorithm based on SSADE–ELM. The diagram of adaptive control algorithm based on SSADE–ELM is shown in Figure 3. The control law is as follows:
τ k i = M i i ( τ a i + τ c i + τ P D i )
τ a i represents an adaptive controller for compensating for uncertain or unknown disturbance present in the system. τ c i represents a robust controller that compensates for model uncertainty. τ P D i represents the feedback controller, which is the PD controller. According to Equations (4) and (28), Equation (29) can be obtained:
d i = q ¨ i ( τ a i + τ c i + τ P D i )
Since the ELM algorithm has a strong approximation ability, it can be applied to a controller that compensates for unknown disturbance of a space robot without requiring prior knowledge. This method enables real-time control of the robot and accurately tracks the planned trajectory. The input vector of the ELM network is x i , which is defined as shown in the following equation.
x i = [ q ¨ d i q ˙ d i q d i e i e ˙ i ]
Therefore, the output of the hidden layer node can be defined as:
h i = g i ( A i x i + b i )
where g i is the activation function. Here it uses the sigmoid function. A i is the input weight indicating the connection between the hidden layer neurons and the input layer neurons, and b i is the hidden layer bias. The adaptive compensation controller τ a is constructed using the ELM network as follows:
τ a i = h i β i
where β i = [ β i 1 , β i 2 , , β i j ] ( j = 1 , 2 , , l ) is a weight matrix between the output layer and the hidden layer, which can be solved by the RLS algorithm.
β i = h i + τ a i
where h i + is the Moore–Penrose generalized inverse of h i . The adaptive law of β ^ i can be expressed as:
β ^ ˙ i = μ i h i r i
where β ^ ˙ i = [ β ^ ˙ i 1 , β ^ ˙ i 2 , , β ^ ˙ i j ] . μ i = [ μ i 1 , μ i 2 , , μ i j ] , ( j = 1 , 2 , , l ) is the step adjustment factor. μ i j = 1 . l is the number of hidden layer neurons. According to Equation (32), Equation (35) can be obtained:
τ ^ a i = h i β ^ i
The improved SSADE algorithm is introduced into the ELM algorithm. The differential variation and crossover operator of the algorithm can be used to search for the optimal input weight A i and hidden layer bias b i according to the dynamic adjustment of the whole population, thus obtaining a more compact network structure. This can avoid excessive random neurons and cause the hidden layer to have no sparsity and adjustment ability, which affects the generalization ability and stability of the network.
Compared with the traditional differential evolution algorithm [35], the algorithm adopts different mutation strategies in different periods of population evolution, which improves the convergence speed and convergence precision of the algorithm. The implementation steps shown in Figure 4 are as follows:
Step 1: Initialization. Set the ELM hidden layer unit number l and the excitation function g(x). The population T = [ a r , g 11 , a r , g 12 , , a r , g 1 n , , a r , g l 1 , a r , g l 2 , , a r , g l n , d r , g 1 , d r , g 2 , , d r , g l ] is constructed from the input weight matrix and the implicit layer bias matrix of the ELM, and g represents the number of iterations. Initialize Np vectors with dimension of D (l × (n + 1)). For each individual population, the range of each dimension is [−1, 1]. The hidden layer output matrix h i is calculated according to Equation (31), and the output weight β i is calculated according to Equation (33).
Step 2: Mutation. Aiming at the problem of mutation strategy selection based on the DE algorithm, based on the staged idea, this paper proposes a phased strategy adaptive differential evolution algorithm. The proposed algorithm includes two parts: the evolutionary stage estimation based on the population congestion degree and the adaptive selection of the mutation strategy.
1. The evolutionary stage estimation based on the population congestion degree.
Referring to the idea of crowded density estimation in multi-objective differential evolution algorithm, the judgment basis of the evolutionary stage is set as the ratio of the average distance d g , a v e to the maximum distance d max between each individual and the optimal individual (the minimum objective function value) in the current population. This average distance is as follows:
d g , a v e = r = 1 N p j = 1 D ( t r , g j t b e s t , g j ) 2 N p
where t r , g j is the current individual in the gth generation population. t b e s t , g j is the best individual in the gth generation population. Compare the average distance of each generation with d max . If the average distance of the current generation is greater than d max , then replace d max . Since the population eventually converges to a point, the minimum value of the average distance is d min = 0 . Then, the average distance of each generation is normalized according to the maximum and minimum values of the average distance, i.e.,
d ¯ g , a v e = d g , a v e d min d max d min = d g , a v e d max
Finally, based on the d ¯ g , a v e of each generation, estimate the stage of the current population.
{ S 1 , i f d ¯ g , a v e > c S 2 , i f   1 c d ¯ g , a v e < c S 3 , o t h e r w i s e
where S 1 , S 2 , and S 3 denote the first, second, and third stages, respectively, and c is the stage control factor.
2. The adaptive selection of the mutation strategy.
In order to improve the search efficiency of the algorithm, while maintaining the diversity of the population, avoiding the algorithm falling into local optimum and premature convergence, different strategies are designed to mutate according to the characteristics of different stages.
In the first stage S 1 , the whole population is relatively dispersed, and all individuals are committed to searching for promising sub-regions. At this time, the population diversity should be maintained to ensure that as many regions as possible are searched. Therefore, in the first phase, the DE/rand/1mutation strategy is adopted.
V i , g = T r 1 , g ( t ) + F ( T r 2 , g T r 3 , g )
where V i , g indicates a variant individual. T r 1 , g , T r 2 , g , and T r 3 , g are randomly selected individuals. r 1 r 2 r 3 [ 1 , N p ] . F is a contraction factor, which obeys Cauchy distribution C (0.5, 0.1).
In the second stage, S 2 , the algorithm begins to detect promising regions that have been searched and searches further for more promising regions. At this point, the algorithm needs to perform both global detection and local search. Therefore, in order to balance the relationship between population diversity and convergence rate, the DE/pbest/1 mutation strategy is adopted.
V i , g = T p b e s t , g + F ( T r 1 , g T r 2 , g )
where T p b e s t , g is the optimal individual among the historical optimal solutions before the g-generation of the target individual T i , g . T r 1 , g , and T r 2 , g are randomly selected individuals. F is a contraction factor. Through the collaboration of historically optimal individuals and other individuals to guide the mutation, not only can the population diversity be maintained, but also the algorithm can search for as many regions as possible, and the local search ability of the algorithm can be improved, thereby accelerating the convergence speed.
In the third stage, S 3 , all individuals may be clustered into a certain area. The algorithm is dedicated to searching for the optimal solution in the area. Therefore, in order to speed up the convergence, the DE/current-to-best/1 mutation strategy is adopted.
V i , g = T i , g + F ( T b e s t , g T i , g ) + λ ( T r 1 . g T r 2 , g )
where T b e s t , g is the best individual in the entire population. T r 1 , g and T r 2 , g are two different randomly selected individuals in the population other than T b e s t , g . By using the information of the current optimal individual to guide the mutation, the algorithm converges quickly.
Boundary operation: The mutation individuals generated by the mutation operator may exceed the allowable range of the boundary. If V i , g is out of bounds, the individual is randomly generated within its range of values as follows:
V r , g = r a n d ( 0 , 1 ) ( t r U t r L ) + t r L
where t r U and t r L are the upper and lower bounds of t r , respectively. Rand (0, 1) represents a random fraction between (0, 1) and obeys a uniformly distributed.
Step 3: Crossover. The crossover operation increases the diversity of the group is as follows.
T t r i a l , r , g = { V r , g , i f R C R T r , g ,   o t h e r w i s e
where C R obeys a normal distribution N (0.5, 0.3). R ( 0 , 1 ) obeys uniformly distributed.
Step 4: Selection. According to the planning of the minimum base disturbance of the free-floating space robot, the dynamic equation of the space robot is selected as the fitness function. f   =   e is the objective function. That is, the attitude parameters of the space robot need to be close to the desired attitude parameters obtained through planning. When the value of f decreases, the value of T approaches the optimal. Calculate the objective function values f ( t t r i a l , r , g ) and f ( t r , g ) of the mutated individual and the target individual, respectively, and select the one with the smaller objective function to retain. At the same time, consider the party with the smaller ELM weight because it has the better generalization ability.
T r , g + 1 = { T t r i a l , r , g , i f f ( T t r i a l , r , g ) f ( T r , g ) a n d β r i a l , r , g < β r , g T r , g , o t h e r w i s e
Step 5: Determine if the objective function reaches the optimal value, or if the maximum number of iterations is reached. When the optimal value or the maximum number of iterations is reached, the optimal individual is output. According to the SSADE algorithm above, the optimal input weight A i and hidden layer bias b i are searched to obtain a more compact network structure, optimize the ELM algorithm, and improve the performance of the adaptive control item.

3.3. Robust Control Algorithm

The main purpose of the robust control term proposed in this paper is to suppress the uncertainty of the model. The residual error is generated by an adaptive control process, and the error E τ a i of Equation (35) is:
E τ a i = τ ^ a i τ a i = h i β ^ i h i β i
where τ a i * is the optimal approximation of τ a i . Suppose E τ a i has an upper bound, E τ a i max , | E τ a i | E τ a i max . The robust controller designed in this paper is as follows.
τ ^ R i = E τ a i m a x s i g n ( r i )
where τ ^ R i represents the estimated output of the robust control term. r i represents the ith term of the systematic error r . It can be obtained that the estimated output of the space robot control algorithm is:
τ ^ k = [ τ ^ k 1 , τ ^ k 2 , , τ ^ k i , , τ ^ k n ] τ ^ k i = M i i ( τ ^ P D i + τ ^ a i + τ ^ R i ) = M i i ( K r i + h i β ^ i + E τ a i m a x s i g n ( r i ) )
where K represents the gain of the PD control algorithm.

3.4. Stability Analysis

Take the Lyapunov function:
V = 1 2 r T M r + 1 2 i = 1 n j = 1 l 1 μ i j β ˜ i j β ˜ i j
Find the first derivative of Equation (48):
V ˙ = r T M r ˙ + 1 2 r T M ˙ r + i = 1 n j = 1 l 1 μ i j β ˜ i j β ^ ˙ i j = r T ( M ( q ¨ d + Λ e ˙ ) + C ( q ˙ d + Λ e ) + d τ k C r ) + 1 2 r T M ˙ r + i = 1 n j = 1 l 1 μ i j β ˜ i j β ^ ˙ i j = r T ( M ( q ¨ d + Λ e ˙ ) + C ( q ˙ d + Λ e ) + d K r i = 1 n h i β ^ i E τ a i max s i g n ( r ) C r ) + 1 2 r T M ˙ r + i = 1 n j = 1 l 1 μ i j β ˜ i j β ^ ˙ i j = r T ( i = 1 n h i β i i = 1 n h i β ^ i K r E τ a i max s i g n ( r ) C r ) + 1 2 r T M ˙ r + i = 1 n j = 1 l 1 μ i j β ˜ i j β ^ ˙ i j = r T ( i = 1 n h i β i i = 1 n h i β ^ i K r E τ a i max s i g n ( r ) C r ) + r T i = 1 n E τ a i 1 2 r T ( 2 C M ) r + i = 1 n j = 1 l 1 μ i j β ˜ i j β ^ ˙ i j = i = 1 n h i β ˜ i r i r T K r E τ a i max r T s i g n ( r ) + r T i = 1 n E τ a i + i = 1 n j = 1 l 1 μ i j β ˜ i j β ^ ˙ i j = r T K r E τ a i max r T s i g n ( r ) + r T i = 1 n E τ a i r T K r i = 1 n ( E τ a i max E τ a i ) | r i | 0
It can be proved that the control algorithm constructed in this paper is stable.

4. Simulation

4.1. Parameter Settings

The space manipulator structure model designed by the modeling software NX in this paper is shown in Figure 5. From the model, the inertia parameters shown in Table 1 can be obtained by choosing similar materials and structures to the experimental system. Other system parameters shown in Table 1 are set according to the physical parameters of the hardware of the experimental system below.
In order to verify the validity of the method, this paper uses the model of a planar three-DOFs free-floating space robot as established by the Matlab/Simulink software. The target initial angular velocity is 0.5 rad/s. The interior disturbance term d in the robotic system is set by the function d = 0.5 + r a n d ( 0 , 1 ) × s i g n ( q ˙ ) . Table 2 shows the control algorithm parameters of the space robot system.

4.2. Simulation Results

Figure 6 shows the total angular momentum the space robot with target.
Figure 7 shows the space robot attitude parameters by simulation. Figure 7a shows the base angle. Figure 7b shows the base angular velocity. Figure 7c,d show the joints angle via VFF–RLS and RLS. Figure 7e,f show the joints angular velocity via VFF–RLS and RLS.
Figure 8 shows the tracking error of each joint trajectory of control algorithms by simulation. Table 3 shows the average error of joint tracking of control algorithms by simulation.

5. Experiment

5.1. Experimental Setting

Based on the parameters in Table 1, a semi-physical experiment system of a space robot principle prototype is constructed. The main hardware architecture includes: a computer for data processing, an FPGA control card as the core control module, and a space robot prototype. The communication module of the joint control system adopts CAN bus. The prototype is placed on an air floating platform supported by air feet and has three yaw freedoms. It is constructed mainly by the joint motors (Kollmorgen TBM(S)–12955-X) and the six-dimensional force sensors (ATI–Nano17) are used for measuring the axial forces and moments. The encoders (EAC58P) are used for measuring the angular displacement and speed. The motor drives (HAR–5/60) are used for controlling motor motion. There is another stiffness motor which we closed for a constant stiffness in this experiment. The diagram of the experimental system is shown in Figure 9. The workflow of the semi-physical experiment system is as follows: the operator operates on the PC and issues commands to the joint control system. The joint control system then sends the command to the motor drive, which rotates to change the joint angle. At the same time, the joint control system uses the motor encoder to monitor the movement of the motor, and the position and speed information of the feedback motor are transmitted to the joint control module and the driver, respectively, and the motor movement is adjusted according to the error, thereby adjusting the movement of the joint to achieve the purpose of movement and precision. Requirements to make the joints have better positioning. The experiment system uses a single pendulum to strike the end of the joint to produce an external disturbance torque τ e . In order to be able to generate the force in the yaw direction, the experiment was designed to strike the end of the joint at a certain angle to the pitch axis of the joint. The method of calculating the end impact force is as follows:
τ e = Fsin θ × a t = m × ( v 0 v t ) t sin θ × a t
where F is the total collision force, f y a w is the collision force in the yaw direction, m is the mass of the pendulum block, t is the collision duration, v 0 is the initial velocity of the pendulum block during the collision, and v t is the velocity of the pendulum block after the collision, and we have:
v 0 = 2 g l ( 1 cos α 0 ) ; v t = 2 g l ( 1 cos α t )
where g is the acceleration of gravity, l is the distance from the fixed end of the inelastic light rope to the centroid of the pendulum block, α 0 is the angle between the rope and the vertical direction at the starting position of the pendulum block, and α t is the angle with the vertical direction when the pendulum block reaches the highest position after the collision. In this experiment, the parameters setting are as follows: θ   =   30 , α 0   =   10 , l   =   0.2   m , g = 9.8   m × s 2 , m   = 5   kg , t   =   0.01   s , the experimental measurements obtain that α t 3.8 , so F     10 N and τ e     0.5   N m .
The control algorithms used in the experiment system are written by MATLAB. The control algorithm parameter settings are shown in Table 2.

5.2. Experimental Results

Figure 10 shows the space robot attitude parameters by experiment. Figure 10a shows the base angle. Figure 10b shows the base angular velocity. Figure 10c,d show the joints angle via VFF–RLS and RLS. Figure 10e,f show the joints angular velocity via VFF–RLS and RLS.
Figure 11 shows the tracking error of each joint trajectory of control algorithms by experiment. Table 4 shows the average error of joint tracking of control algorithms by experiment.

6. Discussion

It can be seen from Figure 6 that when the space robot is in contact with the target, the entire complex system maintains angular momentum conservation. In the simulation, the adaptive reactionless path planning achieves adaptability by using the RLS algorithm and the VFF–RLS algorithm, respectively. It can be seen from Figure 7a,b that the VFF–RLS algorithm can achieve the minimum disturbance planning of the base attitude, but the RLS algorithm cannot. It can be seen from Figure 7d,f that for the system with time-varying characteristics, the RLS algorithm does have a saturation phenomenon, and the time-varying parameters of the tracking system cannot be realized to plan the manipulator attitude parameters. Meanwhile, the VFF–RLS algorithm exhibits good parameter tracking performance for time-varying system, and can realize the time-varying parameters of the tracking system to plan the attitude parameters of the manipulator, thus ensuring the stability of the base attitude. The same results can be obtained in the experiment. It can be seen from Figure 10a,b that the VFF–RLS algorithm can achieve the minimum disturbance planning of the base attitude, but the RLS algorithm cannot, and the offset of the base angle is much bigger than the simulation caused by the unknown disturbance of the semi-physical experiment system. It can be seen from Figure 10d,f that saturation phenomenon of the RLS algorithm makes it unable to plan the manipulator attitude parameters in the time-varying system, while the VFF–RLS algorithm achieves this goal very well. The differences between the manipulator attitude parameters between the simulation and the experiment are also caused by the unknown disturbance of the semi-physical experiment system. The SSADE–ELM algorithm proposed in this paper is compared with the adaptive control algorithm based on ELM and the POS–ELM control algorithm proposed in Reference [21]. As can be seen from Figure 8a–c and Table 3 by simulation, since the adaptive control algorithm based on ELM does not need to perform control parameter optimization, it is superior to the SSADE–ELM control algorithm and the POS–ELM control algorithm based on the trajectory tracking speed. However, in terms of control precision, the SSADE–ELM control algorithm is the best, with the POS–ELM control algorithm following closely. The ELM-based adaptive control algorithm is the worst. This is due to the use of the group intelligence algorithm to optimize the input weight of the ELM algorithm. We can get the same results in the experiment. As can be seen from Figure 11a–c and Table 4, the SSADE–ELM control algorithm has the best trajectory tracking ability, and due to the time overhead of control parameter optimization, the trajectory tracking speeds of the SSADE–ELM control algorithm and the POS–ELM control algorithm are a little slower than the ELM control. Through the above analysis of simulation and experiment, the effectiveness and superiority of the ARNS planning and control algorithm based on SSADE–ELM proposed in this paper are proven.

7. Conclusions

An ARNS path planning and control algorithm is proposed for free-floating space robots. The RNS planning strategy is adopted to plan the movement of the robot arm to ensure the stability of the space robot base. At the same time, in order to contact the unknown target, the reactionless motion of the manipulator is realized to ensure the stability of the basic attitude of the space robot, in order to adapt the requirement of time-varying disturbance of the system. The VFF–RLS algorithm is introduced to construct an ARNS path planning algorithm to avoid the saturation phenomenon of the RLS algorithm. This paper proposes a stability control algorithm for space robot contact via the SSADE–ELM algorithm, which can track the dynamic change planning path and improve the speed and accuracy of tracking performance. The proposed algorithm does not require accurate system dynamics modeling, nor does it require information about unknown time-varying disturbance, enabling dynamic reactionless path tracking control. The simulation results verify the effectiveness and superiority of the proposed algorithm, which makes it important for future on-orbit operation. The algorithm proposed in this paper still has some shortcomings, such as the slow execution speed of the algorithm, which will be the direction of further research.

Author Contributions

Project administration, Z.-H.D.; Supervision, J.-C.H.; Writing—original draft, X.Y.

Funding

This research was funded by Scientific and technological innovation projects of General Armament Department, grant number ZYX12010001.

Acknowledgments

This work was supported by Scientific and technological innovation projects of General Armament Department (grant number ZYX12010001).

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Ellery, A. Tutorial Review on Space Manipulators for Space Debris Mitigation. Robotics 2019, 8, 34. [Google Scholar] [CrossRef]
  2. Li, W.-J.; Cheng, D.-Y.; Liu, X.-G.; Wang, Y.-B.; Shi, W.-H.; Tang, Z.-X.; Gao, F.; Zeng, F.-M.; Chai, H.-Y.; Luo, W.-B.; et al. On-orbit service (OOS) of spacecraft: A review of engineering developments. Prog. Aerosp. Sci. 2019, 108, 32–120. [Google Scholar] [CrossRef]
  3. Liang, B.; Xu, W.F. Space Robot: Modeling, Planning, and Control, 1st ed.; Tsinghua University Press: Beijing, China, 2017; pp. 26–27. [Google Scholar]
  4. Flores-Abad, A.; Crain, A.; Nandayapa, M.; Garcia-Teran, M.A.; Ulrich, S. Disturbance Observer-Based Impedance Control for a Compliance Capture of an Object in Space. In Proceedings of the 2018 AIAA Guidance, Navigation, and Control Conference, Kissimmee, FL, USA, 8–12 January 2018. [Google Scholar] [CrossRef]
  5. Yoon, H.; Chung, S.; Kang, H.; Hwang, M. Trapezoidal Motion Profile to Suppress Residual Vibration of Flexible Object Moved by Robot. Electronics 2019, 8, 30. [Google Scholar] [CrossRef]
  6. Wang, Y.; Liang, X.; Gong, K.; Liao, Y. Kinematical Research of Free-Floating Space-Robot System at Position Level Based on Screw Theory. Int. J. Aerosp. Eng. 2019, 2019, 1–13. [Google Scholar] [CrossRef]
  7. Crain, A.D. Optimal Trajectory Planning and Compliant Spacecraft Capture Using a Space Robot. Ph.D. Thesis, Carleton University, Ottawa, ON, Canada, 2018. [Google Scholar] [Green Version]
  8. Gasbarri, P.; Pisculli, A. Dynamic/control interactions between flexible orbiting space-robot during grasping, docking and post-docking manoeuvres. Acta Astronaut. 2015, 110, 225–238. [Google Scholar] [CrossRef]
  9. Yu, X.-Y.; Chen, L. Modeling and observer-based augmented adaptive control of flexible-joint free-floating space manipulators. Acta Astronaut. 2015, 108, 146–155. [Google Scholar] [CrossRef]
  10. Nenchev, D.N. Reaction null space of a multibody system with applications in robotics. Mech. Sci. 2013, 4, 97–112. [Google Scholar] [CrossRef]
  11. Xie, R.; Shi, P.; Zhao, Y. Zero Reaction Coordinated Motion Planning for Free-Floating Space Manipulators. In Proceedings of the 34th Chinese Control Conference (CCC), Hangzhou, China, 28–30 July 2015; pp. 5830–5834. [Google Scholar]
  12. Wei, C.; Gu, H.; Liu, Y.; Zhao, Y. Attitude reactionless and vibration control in space flexible robot grasping operation. Int. J. Adv. Robot. Syst. 2018, 15. [Google Scholar] [CrossRef]
  13. Nguyen-Huynh, T.C.; Sharf, I. Adaptive reactionless motion and parameter identification in postcapture of space debris. J. Guid. Control Dyn. 2013, 36, 404–414. [Google Scholar] [CrossRef]
  14. Yang, C.; Wu, H.; Li, Z.; He, W.; Wang, N.; Su, C.-Y. Mind control of a robotic arm with visual fusion technology. IEEE Trans. Ind. Inform. 2018, 14, 3822–3830. [Google Scholar] [CrossRef]
  15. Xu, S.; Wang, H.; Zhang, D.; Yang, B. Adaptive zero reaction motion control for free-floating space manipulators. IEEE Trans. Aerosp. Electron. Syst. 2016, 52, 1067–1076. [Google Scholar] [CrossRef]
  16. Lu, X.; Jia, Y. Adaptive coordinated control of uncertain free-floating space manipulators with prescribed control performance. Nonlinear Dyn. 2019, 97, 1541–1566. [Google Scholar] [CrossRef]
  17. Zhang, B. Research on Trajectory Planning and Control of Space Robot After Autonomously Taking Over a Noncooperative Target. Ph.D. Thesis, Harbin Institute of Technology, Harbin, China, 2017. [Google Scholar]
  18. Jiao, C.; Liang, B.; Wang, X. Adaptive Reaction Null-Space Control of Dual-Arm Space Robot for Post-Capture of Non-Cooperative Target. In Proceedings of the 2017 29th Chinese Control and Decision Conference (CCDC), Chongqing, China, 28–30 May 2017; pp. 531–537. [Google Scholar]
  19. Nguyen Huynh, T.C. Adaptive Reactionless Control of a Space Manipulator for Post-Capture of an Uncooperative Tumbling Target. Ph.D. Thesis, McGill University, Montreal, QC, Canada, 2013. [Google Scholar]
  20. Chu, Z.; Ma, Y.; Cui, J. Adaptive reactionless control strategy via the PSO-ELM algorithm for free-floating space robots during manipulation of unknown objects. Nonlinear Dyn. 2018, 91, 1321–1335. [Google Scholar] [CrossRef]
  21. Yang, C.; Jiang, Y.; He, W.; Na, J.; Li, Z.; Xu, B. Adaptive parameter estimation and control design for robot manipulators with finite-time convergence. IEEE Trans. Ind. Electron. 2018, 65, 8112–8123. [Google Scholar] [CrossRef]
  22. Wang, L.; Zeng, Y.; Chen, T. Back propagation neural network with adaptive differential evolution algorithm for time series forecasting. Expert Syst. Appl. 2015, 42, 855–863. [Google Scholar] [CrossRef]
  23. Daachi, M.E.; Madani, T.; Daachi, B.; Djouani, K. A radial basis function neural network adaptive controller to drive a powered lower limb knee joint orthosis. Appl. Soft Comput. 2015, 34, 324–336. [Google Scholar] [CrossRef]
  24. Smith, A.M.C.; Yang, C.; Ma, H.; Culverhouse, P.; Cangelosi, A.; Burdet, E. Novel hybrid adaptive controller for manipulation in complex perturbation environments. PLoS ONE 2015, 10, e0129281. [Google Scholar] [CrossRef]
  25. Chu, Z.; Cui, J.; Sun, F. Fuzzy adaptive disturbance-observer-based robust tracking control of electrically driven free-floating space manipulator. IEEE Syst. J. 2014, 8, 343–352. [Google Scholar] [CrossRef]
  26. Huang, G.-B.; Zhu, Q.-Y.; Siew, C.-K. Extreme learning machine: A new learning scheme of feedforward neural networks. Neural Netw. 2004, 2, 985–990. [Google Scholar]
  27. Zhao, J.T. Extreme learning machine control algorithm for control saturated systems. Sci. Technol. Eng. 2019, 19, 173–177. [Google Scholar]
  28. Rong, H.-J.; Wei, J.-T.; Bai, J.-M.; Zhao, G.-S.; Liang, Y.-Q. Adaptive neural control for a class of MIMO nonlinear systems with extreme learning machine. Neurocomputing 2015, 149, 405–414. [Google Scholar] [CrossRef]
  29. Petković, D.; Danesh, A.S.; Dadkhah, M.; Misaghian, N.; Shamshirband, S.; Zalnezhad, E.; Pavlović, N.D. Adaptive control algorithm of flexible robotic gripper by extreme learning machine. Robot. Comput. Integr. Manuf. 2016, 37, 170–178. [Google Scholar] [CrossRef]
  30. Yang, C.; Huang, K.; Cheng, H.; Li, Y.; Su, C.-Y. Haptic identification by ELM-controlled uncertain manipulator. IEEE Trans. Syst. Man Cybern. Syst. 2017, 47, 2398–2409. [Google Scholar] [CrossRef]
  31. Figueiredo, E.M.N.; Ludermir, T.B. Investigating the use of alternative topologies on performance of the PSO-ELM. Neurocomputing 2014, 127, 4–12. [Google Scholar] [CrossRef]
  32. Zhang, Y.; Wu, J.; Cai, Z.; Zhang, P.; Chen, L. Memetic extreme learning machine. Pattern Recognit. 2016, 58, 135–148. [Google Scholar] [CrossRef]
  33. Cao, J.; Lin, Z.; Huang, G.-B. Self-adaptive evolutionary extreme learning machine. Neural Process. Lett. 2012, 36, 285–305. [Google Scholar] [CrossRef]
  34. Tang, X.; Chen, L. A self-adaptive evolutionary weighted extreme learning machine for binary imbalance learning. Prog. Artif. Intell. 2018, 7, 95–118. [Google Scholar] [CrossRef]
  35. Das, S.; Mullick, S.S.; Suganthan, P.N. Recent advances in differential evolution—An updated survey. Swarm Evol. Comput. 2016, 27, 1–30. [Google Scholar] [CrossRef]
Figure 1. The simplified model of space manipulator.
Figure 1. The simplified model of space manipulator.
Electronics 08 01111 g001
Figure 2. The Adaptive Reaction Null Space (ARNS) path planning algorithm flow diagram.
Figure 2. The Adaptive Reaction Null Space (ARNS) path planning algorithm flow diagram.
Electronics 08 01111 g002
Figure 3. Diagram of adaptive control algorithm based on Strategy Self-Adaptation Differential Evolution–Extreme Learning Machine (SSADE–ELM).
Figure 3. Diagram of adaptive control algorithm based on Strategy Self-Adaptation Differential Evolution–Extreme Learning Machine (SSADE–ELM).
Electronics 08 01111 g003
Figure 4. The improved Strategy Self-Adaptation Differential Evolution (SSADE) algorithm flow chart.
Figure 4. The improved Strategy Self-Adaptation Differential Evolution (SSADE) algorithm flow chart.
Electronics 08 01111 g004
Figure 5. The space manipulator structure model.
Figure 5. The space manipulator structure model.
Electronics 08 01111 g005
Figure 6. Total angular momentum.
Figure 6. Total angular momentum.
Electronics 08 01111 g006
Figure 7. Space robot attitude parameters by simulation. (a) The base angle; (b) The base angular velocity; (c) The joints angle via Variable Forgetting Factor Recursive Least Squares (VFF–RLS); (d) The joints angle via RLS; (e) The joints angular velocity via VFF–RLS; (f) The joints angular velocity via RLS.
Figure 7. Space robot attitude parameters by simulation. (a) The base angle; (b) The base angular velocity; (c) The joints angle via Variable Forgetting Factor Recursive Least Squares (VFF–RLS); (d) The joints angle via RLS; (e) The joints angular velocity via VFF–RLS; (f) The joints angular velocity via RLS.
Electronics 08 01111 g007aElectronics 08 01111 g007b
Figure 8. The tracking error of each joint trajectory of control algorithms by simulation. (a) The tracking error of Joint1 trajectory of control algorithms; (b) The tracking error of Joint2 trajectory of control algorithms; (c) The tracking error of Joint3 trajectory of control algorithms.
Figure 8. The tracking error of each joint trajectory of control algorithms by simulation. (a) The tracking error of Joint1 trajectory of control algorithms; (b) The tracking error of Joint2 trajectory of control algorithms; (c) The tracking error of Joint3 trajectory of control algorithms.
Electronics 08 01111 g008aElectronics 08 01111 g008b
Figure 9. The diagram of the experimental system.
Figure 9. The diagram of the experimental system.
Electronics 08 01111 g009
Figure 10. Space robot attitude parameters by experiment. (a) The base angle; (b) The base angular velocity; (c) The joints angle via Variable Forgetting Factor Recursive Least Squares (VFF–RLS); (d) The joints angle via RLS; (e) The joints angular velocity via VFF–RLS; (f) The joints angular velocity via RLS.
Figure 10. Space robot attitude parameters by experiment. (a) The base angle; (b) The base angular velocity; (c) The joints angle via Variable Forgetting Factor Recursive Least Squares (VFF–RLS); (d) The joints angle via RLS; (e) The joints angular velocity via VFF–RLS; (f) The joints angular velocity via RLS.
Electronics 08 01111 g010aElectronics 08 01111 g010b
Figure 11. The tracking error of each joint trajectory of control algorithms by experiment. (a) The tracking error of Joint1 trajectory of control algorithms; (b) The tracking error of Joint2 trajectory of control algorithms; (c) The tracking error of Joint3 trajectory of control algorithms.
Figure 11. The tracking error of each joint trajectory of control algorithms by experiment. (a) The tracking error of Joint1 trajectory of control algorithms; (b) The tracking error of Joint2 trajectory of control algorithms; (c) The tracking error of Joint3 trajectory of control algorithms.
Electronics 08 01111 g011aElectronics 08 01111 g011b
Table 1. The dynamic parameters of the space robot system.
Table 1. The dynamic parameters of the space robot system.
DescriptionSymbolValueUnit
Base mass m 0 100kg
Joints mass m 1 , m 2 , m 3 5kg
Base inertia J 0 6.67kg·m2
Joints inertia J 1 , J 2 , J 3 0.2kg·m2
Target mass m t 5kg
Target inertia J t 2.5kg·m2
Vector from center of the base to the centroid of the first joint b 0 0.5m
Vectors from center of the ith joint to the centroid of the ith body a 1 , a 2 , a 3 0.5m
Vectors from the centroid of the ith body to the center of the (i + 1)th joint b 1 , b 2 , b 3 0.5m
Vectors from center of the Target to the centroid of the end a t 0.1m
Initial base angle θ 0 0rad
Initial base angular velocity ω 0 0rad/s
Initial base linear velocity v 0 0m/s
Initial joint angle q i , i = 1 , 2 , 3 [0, 0, 0]rad
Initial joint angular velocity q ˙ i , i = 1 , 2 , 3 [0, 0, 0]rad/s
Table 2. The parameters of the adaptive control algorithm.
Table 2. The parameters of the adaptive control algorithm.
AlgorithmDescriptionSymbolValue
Strategy Self-Adaptation Differential Evolution–Extreme Learning Machine (SSADE–ELM)Amount of hidden layer nodes of ELM network l 60
Max iteration t max 1000
Population sizeNP30
Stage control factorc0.85
Arbitrary non-zero vector ξ ˙ [1, 1, 1]
Upper bound of error E τ a i max 1
The gain of the PD control algorithmKDiag (100, 50)
Positive diagonal coefficient matrix Λ Diag (2, 1)
Extreme Learning Machine (ELM)Amount of hidden layer nodes of ELM network l 60
Max iteration t max 1000
Population sizeNP30
Input weight A i rand (0, 1)
Hidden layer bias b i rand (0, 1)
Arbitrary non-zero vector ξ ˙ [1, 1, 1]
Upper bound of error E τ a i max 1
The gain of the PD control algorithmKDiag (100, 50)
Positive diagonal coefficient matrix Λ Diag (2, 1)
Particle Swarm Optimization–ELM (PSO–ELM)Amount of hidden layer nodes of ELM network l 60
Max iteration t max 1000
Population sizeNP30
The weights of the stochastic acceleration terms c 1 = c 2 0.2
The inertial weight serving as a tradeoff between the global and local exploration capabilities of the swarm w 2
Arbitrary non-zero vector ξ ˙ [1, 1, 1]
Upper bound of error E τ a i max 1
The gain of the PD control algorithmKDiag (100, 50)
Positive diagonal coefficient matrix Λ Diag (2, 1)
Table 3. The average error of joint tracking of control algorithms by simulation. (SSADE–ELM = Strategy Self-Adaptation Differential Evolution–Extreme Learning Machine, PSO–ELM = Particle Swarm Optimization–ELM).
Table 3. The average error of joint tracking of control algorithms by simulation. (SSADE–ELM = Strategy Self-Adaptation Differential Evolution–Extreme Learning Machine, PSO–ELM = Particle Swarm Optimization–ELM).
AlgorithmSSADE–ELMELMPSO–ELM
Average error−3.0 × 10−7−4.5 × 10−7−4.2 × 10−7
Table 4. The average error of joint tracking of control algorithms by experiment. (SSADE–ELM = Strategy Self-Adaptation Differential Evolution–Extreme Learning Machine, PSO–ELM = Particle Swarm Optimization–ELM).
Table 4. The average error of joint tracking of control algorithms by experiment. (SSADE–ELM = Strategy Self-Adaptation Differential Evolution–Extreme Learning Machine, PSO–ELM = Particle Swarm Optimization–ELM).
AlgorithmSSADE–ELMELMPSO–ELM
Average error−5.3 × 10−6−7.6 × 10−6−6.9 × 10−6

Share and Cite

MDPI and ACS Style

Ye, X.; Dong, Z.-H.; Hong, J.-C. Research on Adaptive Reaction Null Space Planning and Control Strategy Based on VFF–RLS and SSADE–ELM Algorithm for Free-Floating Space Robot. Electronics 2019, 8, 1111. https://doi.org/10.3390/electronics8101111

AMA Style

Ye X, Dong Z-H, Hong J-C. Research on Adaptive Reaction Null Space Planning and Control Strategy Based on VFF–RLS and SSADE–ELM Algorithm for Free-Floating Space Robot. Electronics. 2019; 8(10):1111. https://doi.org/10.3390/electronics8101111

Chicago/Turabian Style

Ye, Xin, Zheng-Hong Dong, and Jia-Cai Hong. 2019. "Research on Adaptive Reaction Null Space Planning and Control Strategy Based on VFF–RLS and SSADE–ELM Algorithm for Free-Floating Space Robot" Electronics 8, no. 10: 1111. https://doi.org/10.3390/electronics8101111

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop