Article

A Framework for IBVS Using Virtual Work

Qiuda Yu, Wu Wei, Dongliang Wang, Yanjie Li and Yong Gao
1 School of Automation Science and Engineering, South China University of Technology, Guangzhou 510641, China
2 The Key Laboratory of Autonomous Systems and Networked Control, Ministry of Education, Guangzhou 510640, China
3 Unmanned Aerial Vehicle Systems Engineering Technology Research Center of Guangdong, South China University of Technology, Guangzhou 510641, China
4 Department of Electronic and Information Engineering, Shantou University, Shantou 515063, China
5 School of Automation, Jiangsu University of Science and Technology, Zhenjiang 212100, China
* Author to whom correspondence should be addressed.
Actuators 2024, 13(5), 181; https://doi.org/10.3390/act13050181
Submission received: 26 March 2024 / Revised: 24 April 2024 / Accepted: 6 May 2024 / Published: 10 May 2024

Abstract

The visual servoing of manipulators is challenged by two main problems: the singularity of the inverse Jacobian and the physical constraints of the manipulator. To overcome the singularity issue, this paper presents a novel approach for image-based visual servoing (IBVS) that converts the propagation of errors in the image plane into the conduction of virtual forces using the principle of virtual work. This approach eliminates the need to compute the Jacobian inverse and thus prevents matrix-inversion singularity. To tackle the physical constraints, reverse thinking is adopted to derive the upper and lower bounds of the joint velocity as functions of the joint angle, which enables the physical constraints of the robot to be configured in a more intuitive manner. To validate the effectiveness of the proposed method, an eye-in-hand system based on a UR5 was established both in VREP and on a physical robot.

1. Introduction

Visual servoing is a crucial technology in intelligent robot systems, as it greatly enhances the ability of robots to perceive the environment. Various applications of visual servoing have been developed, including visual-based formation control [1,2], visual grasping [3,4], object tracking [5], and human–robot collaboration [6]. Visual servoing employs image features extracted from visual sensors, such as RGB cameras, as feedback for the controller. This approach makes the controller more flexible, reliable, and efficient when dealing with complex scenes.
Researchers have carried out a great deal of work on visual servoing, and these works are generally thought to fall into three categories. The first kind is image-based visual servoing (IBVS) [7,8], which uses only the pixel coordinates of feature points as feedback and is therefore insensitive to calibration errors. The second kind is position-based visual servoing (PBVS) [9,10], which uses the 3D positions of the corresponding feature points as feedback for the controller. The 3D positions can be obtained through an RGB-D camera (e.g., RealSense) or a stereo camera (e.g., ZED), which means that PBVS requires an accurate camera model. The last kind is hybrid visual servoing (HVS) [11,12], which combines 2D and 3D servoing techniques and aims to improve precision and robustness in robotic tasks by integrating the advantages of visual information of different dimensions. Compared with the other techniques, IBVS produces better motion in the image plane; however, it is not optimal for 3D motion because it lacks 3D information. By contrast, PBVS has better control over the movement path in 3D space but cannot obtain the optimal route in the image plane. The hybrid method merges the information of the pixel plane and the 3D space. Both camera calibration and hand-eye calibration are indispensable for attaining highly accurate parameters, yet these parameters are easily altered by the aging of the equipment or by a change in the relative displacement between the camera and the manipulator, and IBVS is the least sensitive to such errors. Therefore, this research focuses on IBVS.
Researchers have extensively studied IBVS and made notable contributions. IBVS has been applied to the formation control of mobile robots [1,2] and to a quadcopter that mimics bird predation [4]. Keshmiri et al. [13] transformed pixel error propagation into the expected acceleration of the camera and used the computed torque method (CTM) to obtain the joint angular acceleration. Some researchers have used light field cameras for feature extraction [14], while others have utilized image feature extraction methods such as Bézier curves [15]. Incremental control laws have been used to avoid the multiple solutions yielded by some inverse kinematics algorithms [16]. In addition to the common serial manipulators, algorithmic studies have also been conducted on IBVS for parallel manipulators [17].
However, one common challenge faced by these algorithms is computing the inverse Jacobian matrix. For kinematic control near singularity, the joint movements of the manipulator may no longer satisfy the end-effector motion requirements, resulting in increases in joint movement velocity and acceleration. This can lead to significant force and torque demands, potentially causing the manipulator to be overloaded or generate abnormal accelerations, increasing energy consumption and mechanical component wear.
To solve this problem, Wang et al. [18] designed a virtual-goal-guided RRT algorithm for trajectory planning that accommodates the field-of-view and other physical constraints. Kazemi et al. [19] built a cyber–physical system that alternates between exploring the state space of the camera and the configuration space of the robot to obtain feasible camera/robot paths, and thereby feasible feature trajectories in the image space. The main idea of these methods is to plan a trajectory that complies with the constraints and avoids the singularities of the manipulator [20]. These methods attempt to avoid singular points, but they do not actually solve the problem of abnormal motion near them.
In recent years, visual servoing methods based on the optimization algorithm have been developed for a redundant manipulator. These methods consider the visual servoing as a linear parameter-varying (LPV) model. A convex objective function of joint velocity has been built, and several constraints have been applied to the optimization algorithm. These constraints involve the angle, velocity, and acceleration of a joint, as well as the mapping between the error derivative and angular velocity. The servo task is then transformed into a quadratic programming (QP) problem. Afterward, a neural network is used to solve this QP problem. Hajiloo et al. [21] designed a robust model of a predictive controller to avoid the inverse of the Jacobian matrix. Jin et al. [22] built a dynamic recurrent neural network for redundant robots. Zhang et al. [23] changed the network mentioned in [22] and created a single-layer neural network for image-based visual servoing. Although these optimization algorithms merge the limitation of joint velocity and angle into one constraint, they approximate the curve portion of the merged constraint function using a line [24], which substantially wastes the feasible region and is not very intuitive.
Even though the above-mentioned controllers avoid inverting the Jacobian matrix, the optimization-based derivation process is complex, and once derived, the structure of the controller is fixed. Inspired by the principle of virtual work, this article proposes a new visual servoing framework that converts errors into virtual forces using an impedance model and drives joint displacement through an admittance model via backward force propagation. This framework likewise avoids computing the Jacobian inverse and allows different impedance/admittance controllers, both linear and nonlinear, to be designed for different environments.
The main contributions of this paper can be summarized by the following three aspects:
  • This study transforms the propagation of errors into the transmission of virtual forces and provides an intuitive understanding at the physical level. This transformation eliminates the inverse of the Jacobian matrix, thus avoiding the risks caused by the abnormal movements of certain joints near the singularities of a manipulator.
  • This article provides a function that expresses the limits of the joint angular velocity as a function of the angle. It integrates angle constraints, angular velocity constraints, and angular acceleration constraints. These limits can be obtained by directly setting the maximum and minimum values for the angle, angular velocity, and angular acceleration, without the need to calculate additional parameters.
  • This article demonstrates the design process of a controller, and its effectiveness was validated through simulation experiments and physical experiments.
The rest of this paper is structured as follows. Section 2 provides an overview of the model of the eye-in-hand system and presents fundamental definitions. In Section 3, the proposed image-based visual servoing framework is described in detail, along with a design example based on this framework. In Section 4, the experiments conducted using both the VREP simulation platform and real robots are discussed, and the results are analyzed. The main findings of this study are then summarized, and prospects for future research are discussed in Section 5.

2. System Modeling

In this section, the construction processes of both the camera and robot models are described in detail.

2.1. UR5 Robot Model

According to the D-H modeling [25], the coordinate definitions for each joint of the robot are shown in Figure 1.
Here, O_0 represents the world coordinate system, and O_1 through O_6 are the origins of the D-H coordinate systems of the six joints; O_1 coincides with O_0. In defining the D-H coordinate system, first, the positive direction of the z-axis is directed along the axis of rotation toward the next joint; if the previous and current joint axes are parallel, the z-axis direction is kept consistent with the previous one. Then, the x-axis is chosen to be perpendicular to and intersect with the z-axis of the previous coordinate system. Next, the origin of the coordinate system is selected: if z_i and z_{i-1} are not coplanar, the origin is uniquely determined; when z_i and z_{i-1} are coplanar, the origin can be determined by letting x_i pass through O_{i-1}. Finally, the right-hand rule is used to determine the y-axis, completing the definition of the coordinate system.
Let v_r ∈ R^6 be a column vector representing the motion of the end-effector defined in the base coordinate system. The upper three dimensions denote translation along the x-, y-, and z-axes, while the lower three dimensions represent rotation around the x-, y-, and z-axes. Subsequently, the pose of the end-effector and the Jacobian matrix between the end-effector motion v_r and the joint velocity q̇ can be obtained:
v_r = J_r \dot{q}    (1)
The Jacobian matrix J_r is a 6 × N matrix, where N is the number of degrees of freedom of the robot.
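As a concrete illustration, the following is a minimal numerical sketch of how a 6 × N geometric Jacobian of this kind can be assembled, assuming the standard D-H convention and revolute joints; the function names and the tabular input format are ours, not taken from the paper.

```python
import numpy as np

def dh_transform(alpha, a, d, theta):
    """Standard D-H homogeneous transform for one joint."""
    ca, sa = np.cos(alpha), np.sin(alpha)
    ct, st = np.cos(theta), np.sin(theta)
    return np.array([
        [ct, -st * ca,  st * sa, a * ct],
        [st,  ct * ca, -ct * sa, a * st],
        [0.0,      sa,       ca,      d],
        [0.0,     0.0,      0.0,    1.0],
    ])

def geometric_jacobian(dh_table, q):
    """6xN Jacobian J_r mapping joint rates to the end-effector twist (Eq. (1)).

    dh_table: list of (alpha_i, a_i, d_i) tuples; q: joint angles (revolute joints).
    """
    T = np.eye(4)
    origins = [T[:3, 3].copy()]   # o_0
    z_axes = [T[:3, 2].copy()]    # z_0
    for (alpha, a, d), theta in zip(dh_table, q):
        T = T @ dh_transform(alpha, a, d, theta)
        origins.append(T[:3, 3].copy())
        z_axes.append(T[:3, 2].copy())
    o_n = origins[-1]
    J = np.zeros((6, len(q)))
    for i in range(len(q)):
        J[:3, i] = np.cross(z_axes[i], o_n - origins[i])  # linear velocity part
        J[3:, i] = z_axes[i]                              # angular velocity part
    return J
```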

2.2. Camera Pin-Hole Model

The key step in modeling the camera as a pin-hole model is to define three coordinate systems: the camera coordinate system, the image coordinate system, and the pixel coordinate system. In this paper, these coordinate systems are defined as illustrated in Figure 2.
O_c-X_cY_cZ_c defines the camera coordinate system; O_I-X_IY_I defines the image coordinate system; and O_pix-uv represents the pixel coordinate system. Let ^c p = [X, Y, Z]^T, ^I p = [x, y]^T, and s = [u, v]^T denote the coordinates of a point described in the camera, image, and pixel coordinate systems, respectively. The following relation can be obtained:
\begin{bmatrix} u \\ v \\ 1 \end{bmatrix} = \frac{1}{Z} \begin{bmatrix} f a_x & 0 & u_0 \\ 0 & f a_y & v_0 \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} X \\ Y \\ Z \end{bmatrix}    (2)
where f is the focal length of the camera, u_0 and v_0 are the pixel coordinates of the image center, and a_x and a_y are factors that describe the pixel density.
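As a small usage sketch of Eq. (2), the following hypothetical helper projects a camera-frame point to pixel coordinates; the function name and argument names are ours.

```python
import numpy as np

def project_point(p_cam, f, ax, ay, u0, v0):
    """Project a camera-frame point [X, Y, Z] to pixel coordinates s = [u, v] (Eq. (2))."""
    X, Y, Z = p_cam
    u = f * ax * X / Z + u0   # u = (f * a_x * X) / Z + u_0
    v = f * ay * Y / Z + v0   # v = (f * a_y * Y) / Z + v_0
    return np.array([u, v])
```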
Let v_c and ω_c denote the translational and rotational velocities, respectively, along the x-, y-, and z-axes of the camera coordinate system. Therefore, the following relationship exists between a static point ^c p and the motion of the camera:
{}^{c}\dot{p} = -v_c - \omega_c \times {}^{c}p    (3)
Differentiating Equation (2) and combining it with Equation (3), the image Jacobian is derived as Equation (4):
\dot{s} = J_{img} V_c    (4)
where
J_{img} = \begin{bmatrix} a_x & 0 \\ 0 & a_y \end{bmatrix} \begin{bmatrix} -\frac{f}{Z} & 0 & \frac{x}{Z} & \frac{xy}{f} & -\frac{f^2 + x^2}{f} & y \\ 0 & -\frac{f}{Z} & \frac{y}{Z} & \frac{f^2 + y^2}{f} & -\frac{xy}{f} & -x \end{bmatrix}
is the image Jacobian, and V_c ∈ R^6 is a six-dimensional column vector representing the motion of the camera in the camera coordinate system: the first three dimensions are the linear velocity v_c, and the last three are the angular velocity ω_c.
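A minimal sketch of this interaction matrix for one point feature, written to match Eq. (4) as reconstructed above (the sign convention follows the standard point-feature interaction matrix); the function name is ours.

```python
import numpy as np

def image_jacobian(x, y, Z, f, ax, ay):
    """2x6 image Jacobian J_img for one point feature (Eq. (4)).

    x, y: image-plane coordinates; Z: point depth; f: focal length;
    ax, ay: pixel density factors.
    """
    L = np.array([
        [-f / Z, 0.0,    x / Z, x * y / f,         -(f**2 + x**2) / f,  y],
        [0.0,    -f / Z, y / Z, (f**2 + y**2) / f, -x * y / f,         -x],
    ])
    return np.diag([ax, ay]) @ L
```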

2.3. Eye-in-Hand System Model and Kinematics

An eye-in-hand structure is illustrated in Figure 3.
As mentioned earlier, v_r is defined in the robot's base coordinate system, while V_c is defined in the camera coordinate system. This study used the camera's pose as the pose of the end-effector of the manipulator. Following the symbol conventions commonly used in [26] for rigid body motion, let ^cR_0 represent the rotation matrix that maps vectors expressed in the base coordinate system into the camera coordinate system. Consequently, the following relationship can be established:
V_c = \Gamma v_r    (5)
where
\Gamma = \begin{bmatrix} {}^{c}R_0 & 0 \\ 0 & {}^{c}R_0 \end{bmatrix}
Therefore, combining Equations (1), (4), and (5), the new Jacobian matrix relating the change in pixel position ṡ to the joint motion q̇ can be obtained:
\dot{s} = J_{img} \Gamma J_r \dot{q}    (6)
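The composition in Eq. (6) is straightforward to assemble numerically; the following sketch (our naming) builds Γ from the 3 × 3 rotation ^cR_0 and chains the three matrices.

```python
import numpy as np

def full_jacobian(J_img, R_c0, J_r):
    """Compose Eq. (6): s_dot = J_img @ Gamma @ J_r @ q_dot.

    R_c0: 3x3 rotation mapping base-frame vectors into the camera frame.
    """
    Gamma = np.block([
        [R_c0, np.zeros((3, 3))],
        [np.zeros((3, 3)), R_c0],
    ])
    return J_img @ Gamma @ J_r
```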

3. Proposed Method

3.1. Proposed Framework

The general framework of an IBVS controller is shown in Figure 4. Here, s * represents the desired pixel position of a feature point in the image plane, while s represents the actual pixel position. The difference between the two is the pixel error. The visual servoing controller calculates the control outputs based on this error to control the motion of the manipulator. Additionally, in the hand-eye system, a camera captures images and updates the actual pixel position s of the feature points through image processing, providing feedback information to the visual servoing controller.
The error in the pixel coordinate system is defined as e = s* − s, and an impedance controller is then employed to convert the error into a virtual elastic force f_v ∈ R^2:
f_v = I(e)    (7)
where I(·) is the impedance controller, whose specific form needs to be defined according to the scenario. According to the principle of virtual work, if a particle is in equilibrium, the total virtual work of the forces acting on it is zero for any virtual displacement. Let the virtual torques applied to each joint after the propagation of f_v into the joint space be denoted as τ_v, and consider the object and manipulator as a whole with no external forces acting on it. To keep the system in equilibrium, the manipulator needs to generate a torque opposite to τ_v to balance it while ensuring that the sum of the power is zero:
-\tau_v^{T}\dot{q} + f_v^{T}\dot{s} = 0    (8)
Substituting Equation (6) into Equation (8) yields
\tau_v^{T}\dot{q} = f_v^{T} J_{img}\Gamma J_r \dot{q}    (9)
Since this must hold for any q̇, it satisfies
(\tau_v^{T} - f_v^{T} J_{img}\Gamma J_r)\dot{q} = 0    (10)
Then, Equation (10) can be simplified as follows:
\tau_v = J^{T} f_v    (11)
where J = J_{img} Γ J_r.
Subsequently, an admittance controller A(τ_v) was designed in the joint space to transform the virtual torques into joint angles. The proposed framework is illustrated in Figure 5. The controller can be represented by Equation (12):
q = A(J^{T} I(e, \dot{e}))    (12)
where I(·) is the impedance controller and A(·) is the admittance controller. The specific forms of these two functions need to be designed according to the task requirements; a demonstration is provided in Section 3.3.
Compared with the general structure in Figure 4, the controller in Figure 5 is divided into two parts: an impedance controller, which generates the virtual force f_v from the error e, and an admittance controller, which generates the joint angles q from the transmitted virtual torque τ_v. The main challenge therefore lies in designing both the impedance controller and the admittance controller.
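A schematic sketch of one control cycle under this framework; the impedance and admittance controllers are passed in as callables, since their concrete forms are task-dependent (see Section 3.3), and the names are illustrative.

```python
import numpy as np

def ibvs_step(s_des, s, J, impedance, admittance):
    """One cycle of the proposed framework (Eq. (12)):
    pixel error -> virtual force -> virtual joint torque -> joint command."""
    e = s_des - s              # pixel error e = s* - s
    f_v = impedance(e)         # impedance controller I(.) -> virtual force
    tau_v = J.T @ f_v          # propagate by virtual work, Eq. (11)
    return admittance(tau_v)   # admittance controller A(.) -> joint angles q
```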

3.2. Physical Constraints

The most common constraints of a manipulator include limitations on the position, velocity, and acceleration of each joint. These constraints can be described as follows:
q^{-} \le q \le q^{+}, \quad \dot{q}^{-} \le \dot{q} \le \dot{q}^{+}, \quad \ddot{q}^{-} \le \ddot{q} \le \ddot{q}^{+}    (13)
Wang et al. [27] used a two-stage controller to solve the constraint problem. However, the upper and lower bounds of the joint velocity are not constant with respect to the joint position: if a joint approaches its position limit at maximum velocity, its velocity cannot suddenly drop to zero when the limit is reached. Therefore, it is essential to derive the upper and lower bounds of the joint velocity at different joint positions. The limit of q̇ at position q is deduced using reverse thinking: starting from zero velocity at the lower bound of the joint angle and accelerating with the maximum absolute angular acceleration |q̈^+|, one obtains the velocity as a function of the joint angle, which represents the lower limit of the velocity (Figure 6).
\frac{1}{2}\ddot{q}^{+} t^{2} = q - q^{-}, \qquad \dot{q} \ge \max\{\dot{q}^{-},\ -\ddot{q}^{+} t\}    (14)
Eliminating the time variable t from the above yields the following expression:
\dot{q} \ge \max\{\dot{q}^{-},\ -\sqrt{2\,|\ddot{q}^{+}| \odot |q - q^{-}|}\}    (15)
where ⊙ denotes element-wise multiplication.
The upper bound expression for q̇ can be derived in a similar manner and is presented as Equation (16):
\dot{q} \le \min\{\dot{q}^{+},\ \sqrt{2\,|\ddot{q}^{-}| \odot |q^{+} - q|}\}    (16)
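A minimal sketch of these position-dependent velocity bounds, assuming all quantities are per-joint NumPy arrays with q̈^+ > 0 and q̈^- < 0; the function and argument names are ours.

```python
import numpy as np

def joint_velocity_bounds(q, q_min, q_max, qd_min, qd_max, qdd_min, qdd_max):
    """Position-dependent joint velocity bounds from Eqs. (15) and (16)."""
    # Eq. (15): braking from q down to q_min with acceleration magnitude |qdd_max|
    lower = np.maximum(qd_min, -np.sqrt(2.0 * np.abs(qdd_max) * np.abs(q - q_min)))
    # Eq. (16): braking from q up to q_max with deceleration magnitude |qdd_min|
    upper = np.minimum(qd_max, np.sqrt(2.0 * np.abs(qdd_min) * np.abs(q_max - q)))
    return lower, upper
```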

3.3. Demonstration of the IBVS Controller Design Using the Proposed Framework

The impedance controller was defined as shown in Equation (7). However, as the error e becomes small, the value of f v also becomes too small, resulting in a slow convergence rate in the final stages. Conversely, when the error e is large, f v becomes overly large, leading to divergence in the discrete controller. To tackle this challenge, a sigmoid-style function is employed to map the error into a suitable range.
First, the error is mapped to the range [0, 10] using Equation (17):
e_{map} = \frac{10\, e}{picture\_size}    (17)
where picture_size denotes the number of pixels on the longer side of the image, representing the maximum error, and e_map stands for the mapped error. The upper value 10 of the range is a hyperparameter: as can be seen from the formula, it serves as a scaling factor for the sigmoid function. The larger this value, the stronger the amplification of the virtual force for small errors, which may also lead to oscillation near zero error; conversely, the smaller this value, the less sensitive the controller is to small errors. Subsequently, the sigmoid-style function (Equation (18)) is used to generate the virtual forces:
f_v = \left(\frac{2}{1 + \exp(-e_{map})} - 1\right) picture\_size    (18)
Figure 7 illustrates the differences between the linear-style model and the sigmoid-style model. As the error becomes very small, the sigmoid-style model can amplify the virtual force and exhibits saturation when the error is large.
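A minimal sketch of this sigmoid-style impedance module (Eqs. (17) and (18)); the function name and the `scale` argument (the [0, 10] mapping hyperparameter) are our naming.

```python
import numpy as np

def sigmoid_impedance(e, picture_size, scale=10.0):
    """Sigmoid-style impedance module I(e) from Eqs. (17)-(18).

    e: pixel error vector; picture_size: pixel count of the longer image side.
    """
    e_map = scale * e / picture_size                            # Eq. (17)
    return (2.0 / (1.0 + np.exp(-e_map)) - 1.0) * picture_size  # Eq. (18)
```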
In the admittance control module, the external force is conceptualized as the virtual joint torque τ_v, i.e., the virtual force generated by the impedance controller after it has been propagated into the joint space. We eliminated the spring from the mass–spring–damper system, and the dynamic description is as follows:
M\ddot{q} + C\dot{q} = \tau_v    (19)
where M and C are diagonal matrices.
Let ξ = q̇ and rewrite Equation (19) as a state equation:
\dot{\xi} = -M^{-1}C\,\xi + M^{-1}\tau_v    (20)
Let A = -M^{-1}C and B = M^{-1}; both A and B are also diagonal matrices.
Equation (20) can be discretized using the forward Euler method:
\xi(k+1) = \xi(k) + (A\,\xi(k) + B\,\tau_v(k))\,dt    (21)
where dt is the control period. Then, the total controller is
\dot{q}(k) = \Omega_v\!\left(\dot{q}(k-1) + \Omega_a\!\left(A\,\dot{q}(k-1) + B\,\tau_v(k-1)\right)dt\right), \qquad q(k+1) = q(k) + \dot{q}(k)\,dt    (22)
where q(k) denotes the current joint angles, obtained from the robot's sensors or from the simulation environment; Ω_v(·) limits the joint velocity using Equations (15) and (16); and Ω_a(·) limits the joint acceleration.
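A minimal sketch of one forward-Euler admittance update (Eq. (22)), assuming A = −M⁻¹C and B = M⁻¹ are diagonal matrices and that the acceleration and velocity clamps (Ω_a, Ω_v) are supplied as callables, e.g. built from Eqs. (15) and (16); names are ours.

```python
import numpy as np

def admittance_step(q, qd_prev, tau_v, A, B, dt, clamp_acc, clamp_vel):
    """One discrete admittance update per Eq. (22)."""
    qdd = clamp_acc(A @ qd_prev + B @ tau_v)   # Omega_a: joint acceleration limit
    qd = clamp_vel(qd_prev + qdd * dt)         # Omega_v: joint velocity limit (Eqs. (15)-(16))
    q_next = q + qd * dt                       # integrate to the next joint angle
    return q_next, qd
```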

4. Results and Discussion

In this study, an experiment was conducted in the VREP simulation environment to validate the proposed method. A UR5 robot was used for this experiment. The D-H parameters of the UR5 are provided in Table 1.
The physical constraints of the manipulator are shown in Table 2.
The improved physical constraint boundary function is depicted in Figure 8. If the joint velocity exceeds this boundary, at least one physical constraint will inevitably be violated at some point in time.
A visual sensor was mounted at the end-effector of the UR5. The parameters of the camera are shown in Table 3.
The neural network control algorithm proposed in [23] was reproduced for a comparison. This method does not include the inverse of the Jacobian matrix and does not have a singularity problem. The controller in [23] is described as follows:
\dot{q} = P_{\Omega}\!\left(-\kappa_1 J^{T}(s - s_d) - \kappa_2 J^{T}\!\int_{0}^{t}(s - s_d)\,d\tau\right)    (23)
where q̇ is the joint velocity vector, and P_Ω(·) projects the output onto the set defined by the physical constraints. κ_1 and κ_2 are parameters that need to be adjusted. The physical constraints in [23] are described by Equation (24); therefore, there is an additional parameter k in the constraints.
\max\{\dot{q}^{-},\ k(q^{-} - q)\} \le \dot{q} \le \min\{\dot{q}^{+},\ k(q^{+} - q)\}    (24)
Hence, to meet the physical constraints, we drew a line between ( 3.137 , 3.142 ) and ( 6.287 , 0 ) in Figure 8 and obtained the parameter k = 1 . Table 4 describes the adjusted parameters in the neural network method.
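For reference, a sketch of how this comparison controller can be reproduced in discrete time, with the sign convention of Eq. (23) as reconstructed above and a simple running sum for the integral term; the function and argument names are ours.

```python
import numpy as np

def nn_controller_step(s, s_des, e_int, J, k1, k2, k, q, q_min, q_max, qd_min, qd_max, dt):
    """One step of the comparison controller, Eqs. (23)-(24)."""
    e = s - s_des
    e_int = e_int + e * dt                        # running integral of the pixel error
    u = -k1 * (J.T @ e) - k2 * (J.T @ e_int)      # Eq. (23) before projection
    lower = np.maximum(qd_min, k * (q_min - q))   # Eq. (24), lower bound
    upper = np.minimum(qd_max, k * (q_max - q))   # Eq. (24), upper bound
    qd = np.clip(u, lower, upper)                 # projection P_Omega(.)
    return qd, e_int
```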
The final parameters applied to the proposed algorithm are shown in Table 5.
The control period was dt = 0.05 s in order to fit the real environment, as this satisfies most real-time image frame rates. To verify the performance of the algorithm under different tasks, the parameters, once properly adjusted, were kept constant across all experiments.

4.1. Convergence Performance Simulation

Convergence performance is one of the most important indexes used to evaluate a visual servoing algorithm. We set a static object and designated the target points as (80, 80) and (320, 240). The former point is near the corner of the image, where the object is easily lost from view; the latter is at the center of the image, which is close to the typical target point of visual servoing tasks. The experimental environment is shown in Figure 3.
The neural network method needs a long time to converge because it contains an integral term: when the error becomes small, the controller needs a long time to overcome the accumulated error. The proposed sigmoid-style impedance module instead enhances the influence of the error on the controller when the error is small. Without the integration of errors, the proposed algorithm converges close to zero. The final pixel error of the proposed algorithm is (0.01, 0.2), and that of the neural network method is (2.6, 0.8), as shown in Figure 9.
Figure 10 displays the joint velocity curves. Both algorithms speed up with the maximum joint acceleration at the beginning and then decelerate to zero. At approximately 0.4 s, the acceleration of the neural network method decreased, as shown in Figure 10b–d, mainly because the error was already very small; hence, the velocity was mainly determined by the integral portion of Equation (23). The proposed algorithm maintained a rapid response even when the error was small.
In many cases, the target point of the visual servoing task is located near the center of the image; thus, the center of the image ( 320 , 240 ) is representative.
Figure 11 shows that the proposed algorithm again converges quickly, at a rate comparable to that of the servo toward (80, 80). As mentioned before, the neural network method can converge to a narrow range; however, it needs a long time to converge to zero.
Figure 12 shows the velocity curve of each joint. The velocities in (b), (c), and (d) are near zero, which means that these joints are already near their target angles. The velocity curves in (a), (e), and (f) show that the proposed algorithm consistently exhibits high acceleration. Thus, the proposed algorithm enhances the performance of the manipulator.

4.2. Object Tracking Task Simulation

Object tracking is another widely used application in dynamic object grasping and photography. We set up the environment as shown in Figure 13.
We built a curved path and let the blue ball move along it at a speed of 0.1 m/s. This movement can reflect the response performance when the error changes in different directions.
Figure 14a shows the pixel error on the x-axis, and (b) shows the pixel error on the y-axis. Both systems can track the target within a margin of error of 0.35 s. Compared with the method in [23], the proposed algorithm shows a smaller pixel error both on the x- and y-axes.
Figure 15 shows that the proposed algorithm can speed up to a higher velocity, which means that it can catch the target faster. The proposed algorithm also has a faster response to velocity, which is especially obvious in Figure 15a.

4.3. Physical Experiment

To validate the feasibility of the proposed algorithm in practical applications, we replicated the simulation environment at a 1:1 scale. However, because it is difficult to keep a small ball moving at a constant speed along a fixed trajectory in a real-world environment, we conducted only static servo experiments. We used a UR5 with a body manufactured in 2015 (Universal Robots USA, Inc., 27175 Haggerty Road, Suite 160, Novi, MI 48377, USA; the robot was acquired in 2017 as part of a third-party mobile robot equipped with a UR5, purchased in Shenzhen, China) and a control system upgraded to version CB3.1 (Figure 16). The UR5 was controlled by sending velocity commands over TCP/IP from a Python (version 3.10.12) script running on a laptop. During actual operation, the maximum control frequency of the UR5 in this mode was 10 Hz, because the robot reports its state at 10 Hz. Therefore, we adjusted the control cycle of the controller to 0.1 s.
The laptop used in this study was configured with an Intel 9th-generation Core i7 CPU, 16 GB of DDR4 RAM, and an NVIDIA GeForce RTX 2060 GPU. With this configuration, the controller's computation time ranged from 3 ms to 5 ms, which is significantly shorter than the 100 ms control cycle.
The parameters of the real camera can be obtained from the Intel RealSense API. Table 6 shows the detailed parameters.
In this study, the traditional PID control was also incorporated into the comparison. The design of the PID controller is depicted in the following figure (Figure 17).
Here, K_p, K_i, and K_d represent the proportional, integral, and derivative coefficients, respectively, and J^+ denotes the pseudoinverse of the Jacobian matrix.
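For completeness, a sketch of our reading of this pseudoinverse-based PID baseline in Figure 17, mapping the PID output in pixel space to joint velocities through J^+; the discrete integral/derivative handling and the names are our assumptions.

```python
import numpy as np

def pid_pinv_step(e, e_int, e_prev, J, Kp, Ki, Kd, dt):
    """One step of the pseudoinverse PID baseline (Figure 17, our reading)."""
    e_int = e_int + e * dt                    # accumulated pixel error
    e_dot = (e - e_prev) / dt                 # finite-difference error derivative
    v_pix = Kp * e + Ki * e_int + Kd * e_dot  # PID output in pixel space
    qd = np.linalg.pinv(J) @ v_pix            # map to joint velocities via J^+
    return qd, e_int
```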
The parameters of the controller used in the physical experiment are shown in Table 7.
Our proposed algorithm shows superiority over the compared methods, converging approximately 2 s earlier than the others (Figure 18). From the error curve, it can be observed that the convergence speed of the proposed algorithm is initially not the fastest; however, over the final 10 pixels, it surpasses those of the other comparative algorithms.
The first column in Figure 19 shows the joint velocity curves. It can be seen that the PID controller using the pseudoinverse tends to concentrate the joint motion on a single joint, causing that joint's acceleration to quickly reach its upper limit and thereby slowing the convergence of the error. In contrast, with the transposition method, all joints initially reach the maximum acceleration; in the subsequent stage, no joint exceeds its acceleration limit, and the velocity is distributed relatively evenly across the joints. In comparison, this behavior is more conducive to the performance of the robotic arm.
The third column in Figure 19 shows the trajectory of the feature point s at each sampling time. It can be observed that the transposition method and the inverse method exhibit different directions of fastest convergence. According to our understanding of the proposed framework, when the virtual power is transformed into the joint space, the inverse method guides the joint motion by calculating the velocity using kinematics, whereas the transposition method guides the joint motion by generating the target velocity through admittance control based on the virtual torque. Assuming that the propagated virtual power in both algorithms is the same, there is an inverse relationship between the joint velocity obtained by the inverse method and the virtual torque derived from the transposition method. Thus, it can be inferred that in the direction where the inverse method converges rapidly, the transposition method converges more slowly, whereas in the direction where the inverse method converges slowly, the transposition method exhibits faster convergence.
Comparing the two transpose-based methods, it can be seen that the proposed method has a more aggressive speed adjustment and a faster convergence rate when the error is small, which indicates the effectiveness of the proposed architecture for visual servo controller design. Moreover, from the acceleration curve, it can be seen that in the first 0.1 s the reference method experiences maximum acceleration due to the excessive output caused by a large error, reaching the physical acceleration limit of the joint. Thanks to the design of the impedance controller, the algorithm based on the proposed framework can limit the output to prevent divergence of the discrete system when the error is large, while amplifying the impact of the error on the joints when the error is small, thereby achieving faster convergence for small errors. Furthermore, from the speed curve, it can be seen that while the fastest joint reaches almost the same speed under our method as under the reference method, the other joints rotate faster with our method. As shown in the trajectory image on the right, our solution has a faster convergence speed while maintaining an overshoot equivalent to that of the reference method.

5. Conclusions

This paper proposes a new image-based visual servoing framework for redundant manipulators that uses the Jacobian transpose instead of its inverse. The framework can effectively avoid the singularities of the Jacobian matrix. For the physical constraints, we elucidated the relationship, derived from the joint acceleration limits, between the joint velocity bounds and the joint angles, which allows the acceleration limits to be set directly. The dynamic and static experiments used the same configuration, which ensured that the system could move to any point in the image. The results show that our method performs well in dynamic tracking and exhibits lower-amplitude fluctuations when the target is reached. The physical experiment verified the feasibility of our algorithm. To meet the velocity requirements of the end-effector, the inverse Jacobian method may drive joints near singular configurations at high speeds, seriously affecting the safe operation of the manipulator. In the proposed method, however, because virtual force propagation is used, joints near singular configurations are subjected to little or no force traction, so their speeds approach zero. Based on this feature, we will further use this method as a downstream algorithm for reinforcement learning to operate a manipulator for grasping.

Author Contributions

Conceptualization, Q.Y. and W.W.; methodology, Q.Y.; software, Q.Y.; validation, W.W., D.W. and Y.G.; formal analysis, Y.L.; resources, W.W.; writing—original draft preparation, Q.Y.; writing—review and editing, Y.L.; visualization, Y.G.; supervision, W.W.; project administration, W.W.; funding acquisition, W.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Science and Technology Planning Project of Guangdong under grants 2015B010919007, 2016A04040312, 2017B090901043, 2019A050520001, 2022A1515110566, and 2023A1515011574.

Data Availability Statement

Data presented in this study are available upon request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
IBVS  Image-Based Visual Servo;
PBVS  Position-Based Visual Servo;
HVS   Hybrid Visual Servo;
CTM   Computed Torque Method;
RRT   Rapidly Exploring Random Tree;
LPV   Linear Parameter-Varying;
QP    Quadratic Programming.

References

  1. Lin, J.; Miao, Z.; Zhong, H.; Peng, W.; Wang, Y.; Fierro, R. Adaptive image-based leader–follower formation control of mobile robots with visibility constraints. IEEE Trans. Ind. Electron. 2020, 68, 6010–6019. [Google Scholar] [CrossRef]
  2. Miao, Z.; Zhong, H.; Lin, J.; Wang, Y.; Chen, Y.; Fierro, R. Vision-based formation control of mobile robots with FOV constraints and unknown feature depth. IEEE Trans. Control Syst. Technol. 2020, 29, 2231–2238. [Google Scholar] [CrossRef]
  3. Qi, R.; Tang, Y.; Zhang, K. An optimal visual servo trajectory planning method for manipulators based on system nondeterministic model. Robotica 2022, 40, 1665–1681. [Google Scholar] [CrossRef]
  4. Thomas, J.; Loianno, G.; Sreenath, K.; Kumar, V. Toward image based visual servoing for aerial grasping and perching. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Hong Kong, China, 31 May–5 June 2014; pp. 2113–2118. [Google Scholar] [CrossRef]
  5. Song, J.B. Object tracking and visual servoing using features computed from local feature descriptor. In Proceedings of the IEEE international Conference on Control, Automation and Systems (ICCAS), Gyeonggi-do, Republic of Korea, 27–30 October 2010; pp. 1044–1048. [Google Scholar] [CrossRef]
  6. Rosenberger, P.; Cosgun, A.; Newbury, R.; Kwan, J.; Ortenzi, V.; Corke, P.; Grafinger, M. Object-independent human-to-robot handovers using real time robotic vision. IEEE Robot. Autom. Lett. 2020, 6, 17–23. [Google Scholar] [CrossRef]
  7. Pan, W.; Lyu, M.; Hwang, K.S.; Ju, M.Y.; Shi, H. A neuro-fuzzy visual servoing controller for an articulated manipulator. IEEE Access 2018, 6, 3346–3357. [Google Scholar] [CrossRef]
  8. Siradjuddin, I.; Behera, L.; McGinnity, T.M.; Coleman, S. Image-based visual servoing of a 7-DOF robot manipulator using an adaptive distributed fuzzy PD controller. IEEE-ASME Trans. Mechatron. 2013, 19, 512–523. [Google Scholar] [CrossRef]
  9. Shi, H.; Chen, J.; Pan, W.; Hwang, K.S.; Cho, Y.Y. Collision avoidance for redundant robots in position-based visual servoing. IEEE Syst. J. 2018, 13, 3479–3489. [Google Scholar] [CrossRef]
  10. Noh, S.; Park, C.; Park, J. Position-Based Visual Servoing of Multiple Robotic Manipulators: Verification in Gazebo Simulator. In Proceedings of the IEEE international Conference on Information and Communication Technology Convergence (ICTC), Jeju, Republic of Korea, 21–23 October 2020; pp. 843–846. [Google Scholar] [CrossRef]
  11. He, Z.; Wu, C.; Zhang, S.; Zhao, X. Moment-based 2.5-D visual servoing for textureless planar part grasping. IEEE Trans. Ind. Electron. 2018, 66, 7821–7830. [Google Scholar] [CrossRef]
  12. Malis, E.; Chaumette, F.; Boudet, S. 2 1/2 D visual servoing. IEEE Trans. Robot. Autom. 1999, 15, 238–250. [Google Scholar] [CrossRef]
  13. Keshmiri, M.; Xie, W.F.; Mohebbi, A. Augmented image-based visual servoing of a manipulator using acceleration command. IEEE Trans. Ind. Electron. 2014, 61, 5444–5452. [Google Scholar] [CrossRef]
  14. Tsai, D.; Dansereau, D.G.; Peynot, T.; Corke, P. Image-based visual servoing with light field cameras. IEEE Robot. Autom. Lett. 2017, 2, 912–919. [Google Scholar] [CrossRef]
  15. Wang, H.; Yang, B.; Wang, J.; Liang, X.; Chen, W.; Liu, Y.H. Adaptive visual servoing of contour features. IEEE-ASME Trans. Mechatron. 2018, 23, 811–822. [Google Scholar] [CrossRef]
  16. Dong, G.; Zhu, Z.H. Kinematics-based incremental visual servo for robotic capture of non-cooperative target. Robot. Auton. Syst. 2019, 112, 221–228. [Google Scholar] [CrossRef]
  17. Chaudhuri, S.; Saha, R.; Chatterjee, A.; Mookherjee, S.; Sanyal, D. Development of a motion sensing system based on visual servoing of an eye-in-hand electrohydraulic parallel manipulator. IEEE Syst. J. 2020, 20, 8108–8116. [Google Scholar] [CrossRef]
  18. Wang, R.; Zhang, X.; Fang, Y.; Li, B. Virtual-Goal-Guided RRT for Visual Servoing of Mobile Robots With FOV Constraint. IEEE Trans. Syst. Man Cybern. Syst. 2022, 52, 2073–2083. [Google Scholar] [CrossRef]
  19. Kazemi, M.; Gupta, K.K.; Mehrandezh, M. Randomized Kinodynamic Planning for Robust Visual Servoing. IEEE Trans. Robot. 2013, 29, 1197–1211. [Google Scholar] [CrossRef]
  20. Gong, Z.; Tao, B.; Qiu, C.; Yin, Z.; Ding, H. Trajectory Planning With Shortest Path for Modified Uncalibrated Visual Servoing Based on Projective Homography. IEEE Trans. Autom. Sci. Eng. 2020, 17, 1076–1083. [Google Scholar] [CrossRef]
  21. Hajiloo, A.; Keshmiri, M.; Xie, W.F.; Wang, T.T. Robust online model predictive control for a constrained image-based visual servoing. IEEE Trans. Ind. Electron. 2015, 63, 2242–2250. [Google Scholar] [CrossRef]
  22. Jin, L.; Li, S.; La, H.M.; Luo, X. Manipulability optimization of redundant manipulators using dynamic neural networks. IEEE Trans. Ind. Electron. 2017, 64, 4710–4720. [Google Scholar] [CrossRef]
  23. Zhang, Y.; Li, S. A neural controller for image-based visual servoing of manipulators with physical constraints. IEEE Trans. Neural Netw. Learn. Syst. 2018, 29, 5419–5429. [Google Scholar] [CrossRef]
  24. Zhang, Y.; Wang, J.; Xia, Y. A dual neural network for redundancy resolution of kinematically redundant manipulators subject to joint limits and joint velocity limits. IEEE Trans. Neural Netw. 2003, 14, 658–667. [Google Scholar] [CrossRef] [PubMed]
  25. Kebria, P.M.; Al-Wais, S.; Abdi, H.; Nahavandi, S. Kinematic and dynamic modelling of UR5 manipulator. In Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics (SMC), Budapest, Hungary, 9–12 October 2016; pp. 4229–4234. [Google Scholar] [CrossRef]
  26. Corke, P.I.; Khatib, O. Robotics, Vision and Control: Fundamental Algorithms in MATLAB, 2nd ed.; Springer: Cham, Switzerland, 2011; pp. 63–68. [Google Scholar]
  27. Wang, R.; Zhang, X.; Fang, Y. Visual tracking of mobile robots with both velocity and acceleration saturation constraints. Mech. Syst. Signal Process. 2021, 150, 107274. [Google Scholar] [CrossRef]
  28. Universal Robots-DH Parameters for Calculations of Kinematics and Dynamics. 2022. Available online: https://www.universal-robots.com/articles/ur/application-installation/dh-parameters-for-calculations-of-kinematics-and-dynamics/ (accessed on 9 March 2022).
Figure 1. D-H coordinate definition.
Figure 2. The definition of the coordinate systems for modeling a camera.
Figure 3. Eye-in-hand system structure.
Figure 4. The general framework.
Figure 5. Proposed IBVS framework.
Figure 6. Description of the reasoning process of reverse thinking.
Figure 7. Schematic of linear-style and sigmoid-style impedance modules.
Figure 8. The proposed joint velocity constraint on the joint angle.
Figure 9. The curve of the pixel error when the feature point s moves to (80, 80): (a) error along the x-axis; (b) error along the y-axis.
Figure 10. Joint speed in the move to (80, 80): (a) base joint; (b) shoulder joint; (c) elbow joint; (d) wrist_1 joint; (e) wrist_2 joint; (f) wrist_3 joint.
Figure 11. The curve of the pixel error when the servo moves to (320, 240): (a) error along the x-axis; (b) error along the y-axis.
Figure 12. Joint speed in the move to (320, 240): (a) angular velocity of the base joint; (b) angular velocity of the shoulder joint; (c) angular velocity of the elbow joint; (d) angular velocity of the wrist_1 joint; (e) angular velocity of the wrist_2 joint; (f) angular velocity of the wrist_3 joint.
Figure 13. Circle path of the visual servoing environment.
Figure 14. The curve of the pixel error when the feature point s moves to (320, 240) with a dynamic object: (a) error along the x-axis; (b) error along the y-axis.
Figure 15. The curve of the joint speed when the feature point s moves to (320, 240) with a dynamic object: (a) base joint; (b) shoulder joint; (c) elbow joint; (d) wrist_1 joint; (e) wrist_2 joint; (f) wrist_3 joint.
Figure 16. The environment of the real experiment.
Figure 17. The traditional PID controller.
Figure 18. The curve of the pixel error when the feature point s moves to (80, 80).
Figure 19. (First row): neural network method. (Second row): PID controller. (Third row): proposed algorithm.
Table 1. UR5 D-H parameters [28].
Joint Number | Joint Name | α_i (rad) | a_i (m) | d_i (m) | θ_i (rad)
1 | base | π/2 | 0 | 0.089159 | θ_1
2 | shoulder | 0 | −0.425 | 0 | θ_2
3 | elbow | 0 | −0.39225 | 0 | θ_3
4 | wrist_1 | π/2 | 0 | 0.10915 | θ_4
5 | wrist_2 | −π/2 | 0 | 0.09465 | θ_5
6 | wrist_3 | 0 | 0 | 0.0823 | θ_6
Table 2. Limitation of UR5.
UR5 Parameter | Limitation | Unit
Joint angle | [−2π, 2π] | rad
Joint velocity | [−π, π] | rad/s
Joint acceleration | [−π/2, π/2] | rad/s²
Table 3. The configuration of the camera in VREP.
Camera Parameter | Value | Unit
Width | 640 | pixel
Height | 480 | pixel
f | 0.01 | meter
u_0 | 319.4716 | pixel
v_0 | 239.2442 | pixel
a_x | 45,668.2625 | pixels per meter
a_y | 45,669.5772 | pixels per meter
Z | 10 | meter
Table 4. Adjusted parameters of the neural network controller.
Parameter | Value
κ_1 | 0.000009
κ_2 | 0.0000001
k | 1
Table 5. Tested parameters of the proposed algorithm.
Parameter | Value
M | diag(16,000, 16,000, 16,000, 16,000, 16,000, 16,000)
C | diag(368,000, 368,000, 368,000, 368,000, 368,000, 368,000)
Table 6. The parameters of the RealSense D435i.
Parameter | Value | Unit
Width | 640 | pixel
Height | 480 | pixel
f | 0.604 | meter
u_0 | 321.294 | pixel
v_0 | 244.492 | pixel
a_x | 406.743 | pixels per meter
a_y | 883.042 | pixels per meter
Z | 10 | meter
Table 7. Adjusted parameters of the physical experiment.
Parameter | Value | Belongs to
κ_1 | 0.000002 | neural network controller
κ_2 | 0.000000004 | neural network controller
k | 1 | neural network controller
K_p | 0.7 | PID controller
K_i | 0 | PID controller
K_d | 0.0001 | PID controller
M | 55,000 | proposed algorithm
C | 790,000 | proposed algorithm