Application of Support Vector Machine in Designing Theo Jansen Linkages

Hwang, Min-Chan; Huang, Chiou-Jye; Liu, Feifei

doi:10.3390/app9030371

Open AccessArticle

Application of Support Vector Machine in Designing Theo Jansen Linkages

by

Min-Chan Hwang

^*,

Chiou-Jye Huang

and

Feifei Liu

School of Electrical Engineering and Automation, Jiangxi University of Science and Technology, Ganzhou 341000, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2019, 9(3), 371; https://doi.org/10.3390/app9030371

Submission received: 24 December 2018 / Revised: 13 January 2019 / Accepted: 15 January 2019 / Published: 22 January 2019

(This article belongs to the Special Issue Selected Papers from TIKI-ICICE 2018)

Download

Browse Figures

Versions Notes

Abstract

Theo Jansen linkage is an appealing mechanism to implement a bio-inspired motion for a legged robot. The oval orbit that is generated by the Theo Jansen linkage, possessing a transversal axis longer than a lateral axis, achieves energy efficient walking comparing to the circular orbit that is generated by the four-bar linkage. However, the ensemble of its links can produce different patterns of orbits other than oval orbits, some of which are not qualified to be the foot trajectories. It is vital to give a guideline, to which one can refer, to ensure the design of a Theo Jansen leg always possessing its eligibility. In this paper, the machine learning technique, called SVM (Support Vector Machine) along with machine vision serving as a classifier to distinguish desired trajectories from undesired ones, is employed and two databases gathering all eligible data concerned with properties of orbits and dimensions of Theo Jansen linkages are established. Based upon SVM to delimit the eligible designs, one can seek the improvement of a Theo Jansen linkage by resizing its links without rendering an ineligible design. The ensemble dimensions of Theo Jansen linkage can be determined by searching the orbits in compliance with the specification of obliqueness and slenderness from the database of properties and using their correspondent identity numbers to list all candidates of TJLs from the database of dimensions. With the aid of this proposed method, the TJLs have been successfully designed and implemented on a legged robot.

Keywords:

Theo Jansen linkage; Support Vector Machine; machine vision

1. Introduction

The strandbeest elegantly manifested a bio-inspired locomotion by means of mechanism was introduced by Theo Jansen in 1990 [1]. The innovative mechanism to implement legged motion is called a Theo Jansen Linkage (TJL), which consisted of eight links. Once the ensemble of its links is determined, it can generate a deterministic orbit. The oval orbit produced by the TJL, possessing a transversal axis longer than a lateral axis, achieves energy efficient walking as comparing to the circular orbit that is generated by the four-bar linkage [2].

It soon drew attention from some robotics researchers. In the attempt of achieving motions, such as climbing or skipping, other than walking, Komoda [3] changed a static point of a TJL to be movable and studied its singularity [4]. Mohsenizadeh [5] employed the loop closure equations method to analyze the kinematics of TJLs and validated it with the simulation in SolidWorks. Nansai [6] employed the projection method, a technique dealing with constraint forces to analyze the dynamics of TJLs. Later on, he also investigated the position and velocity control of TJLs [7,8]. So far, all of academic works that are relevant to TJLs focus on the analysis of kinematics, dynamics, and controls. None has ever addressed the issue of design. It is of interest to note that the analysis is mainly to study the forward problems while the design has to rely on the knowledge of inverse problems. For instance, the kinematics of the TJL can be derived according to the Denavit–Hartenberg convention along with some trigonometric constraints. It well defines a forward problem, i.e., given the dimension of the TJL to generate the orbit. Nevertheless, how to design a TJL to achieve locomotion is in fact a sort of inverse problems, i.e., given a specification of orbit to determine the dimension of the TJL.

The primary goal of this paper is to develop a technique to design the TJLs, i.e., solving its inverse problem. Conventionally, the standard numerical scheme called LM (Levenberg–Marquardt) method [9,10] is an effective tool to solve the inverse problems. However, the patterns of orbits produced by the TJL may vary according to its ensemble dimension and can be casted into four groups, such as bell curves, ovals, sharp-pointed ovals, and lemniscates. Some orbits can be used as the foot trajectories for legged robots but some cannot. Overall speaking, the ovals or bell orbits are legitimate and the sharp-pointed ovals are partly legitimate, nevertheless the lemniscates are illegitimate. Although the algorithm of LM always leads to a (locally optimal) solution, it does not guarantee the solution falling into the legitimate class because it is unable to tell the patterns of orbits.

Besides, the way of given a prescribed orbit to determine the dimension of a TJL is too strict to be practical. The pattern of foot trajectories is only one factor in designing a legged robot. In general, it demands abundant feasible designs in which a designer can make a trade off when other factors are taken into account. Hence, it is vital to give a guideline, to which one can refer, to ensure that the design of a TJL always possesses its eligibility that is generating a legitimate orbit for a legged robot. In the attempt of gathering feasible designs to establish a database, difficulty arises as to how to distinguish the feasible data from the infeasible ones among a vast amount of data. We propose employing the ML (Machine Learning) technique [11,12,13] called SVM (Support Vector Machine) along with machine vision [14,15] to tackle this problem. There are two merits of using SVM. First, it gives a geometric perspective that is finding the equation for a hyperplane to best separate the different classes in the feature space. Second, it solves the convex optimization problem analytically and hence always returns the same optimal hyperplane.

The proposed method is divided into two steps. First, a preliminary model is built with a small portion of data to conduct a pilot study on discovering the knowledge of data. Second, an advanced model is built to deal with large scale data and to fulfill the inverse problem.

In the preliminary model, the training data will be classified by our human vision and based upon intuitive judgement that the major axis of an orbit is closely aligned with the horizontal and the shape of an orbit possessing no self-intersection should be neither too slim nor too thick. Subsequently, the image processing is applied to extract two properties from the image of each orbit, i.e., obliqueness

α

and slenderness

β

. When remapping the training data under these two properties, it unveils that the legitimate data tend to cluster within a region that is consistent with our preference in classification. It means that both obliqueness and slenderness are sufficient to characterize the orbits, based upon which our vague preference can be turned into definite mathematical criteria and the machine vision can be applied to label the orbits.

In the advanced model, two SVMs are trained and two databases concerned with properties and dimensions of all eligible designs are established. The models of SVM are used to quickly identify the eligibility of a design and the databases provide abundant of feasible data to which a designer can refer as designing the TJLs. Moreover, a program can be developed to fulfill the inverse problem. It searches the orbits in compliance with the specification of

α

and

β

from the database of properties and uses their correspondent identity numbers to list all candidates of TJLs from the database of dimensions.

2. Theo Jansen Linkage and Foot Trajectories

The TJL can be built either using eight links or twelve links. It depends on the way of fabricating two triangluar links of the TJL. If each triangular link is machined into a whole piece, then the TLJ is an eight-bar linkage. If each of them is assembled by three straight bars, the TLJ will be a twelve-bar linkage. Regardless of eight or twelve, the degrees of freedom of the TJLs turn out to be the same, i.e., one. The TJL in this paper is the eight-bar linkage, as shown in Figure 1a, where the TJL is normalized with respect to its link-1, so that link-1, 2, 3, 6, 7 are straight bars with lengths 1, a, b, d, d, respectively; link-4, 5 are isosceles right triangles with sides of length c; link-8 is a rectangular fixture with dimensions S_x and S_y. In Figure 1b, a robot is mounted with TJLs to achieve the legged motion.

The kinematics of the TJL can be derived according to the Denavit-Hartenberg [16,17] convention along with some trigonometric constraints. Hence, the foot trajectory at the point E is a function of the input variable

θ

defined, as follows.

x = - c \sin θ_{1} + d \cos (θ_{1} + θ_{2})

(1)

y = - [c \cos θ_{1} + d \sin (θ_{1} + θ_{2})]

(2)

where

θ_{1} = β_{1} + β_{3} + β_{4} - 90^{°}

(3)

θ_{2} = 270^{°} - (β_{4} + β_{6})

(4)

β_{1} = \tan^{- 1} (\frac{S_{y}}{S_{x}})

(5)

β_{2} = β_{1} + θ

(6)

β_{3} = \tan^{- 1} (\frac{\sin β_{2}}{S_{0} - \cos β_{2}})

(7)

β_{4} = \cos^{- 1} (\frac{S_{1}^{2} + c^{2} - a^{2}}{2 S_{1} c})

(8)

β_{6} = \cos^{- 1} (\frac{S_{1}^{2} + d^{2} - b^{2}}{2 S_{1} d})

(9)

S_{0} = \sqrt{S_{x}^{2} + S_{y}^{2}}

(10)

S_{1} = \sqrt{S_{0}^{2} + 1 - 2 S_{0} \cos β_{2}}

(11)

With the aid of kinematic Equations (1)–(11), the orbit of a TJL can be determined provided that the six parameters, i.e., a, b, c, d, S_x, S_y, are given. This is so called the forward problem. However, designing a TJL to generate a legitimate orbit is an inverse problem, i.e., given the specification of an orbit to determine six parameters, a, b, c, d, S_x, S_y. It is hopeless to derive any analytical function to deal with this inverse problem. Hence, the numerical schemes have to be applied.

The LM (Levenberg–Marquardt) method [9,10] is a standard numerical scheme in solving the inverse problems. Supposedly, the six unknown parameters can be obtained by minimizing an objective function, which is the square of residual between the desired and actual orbits. However, let us take a glimpse of Figure 2, where the orbits are generated by randomly resizing TJLs. The patterns of orbits can be roughly casted into four patterns, i.e., bell curves, ovals, sharp-pointed ovals, and lemniscates, as shown in Figure 3.

Some orbits, named as legitimate class, can be used as the foot trajectories for legged robots, but some, named as illegitimate class, cannot. Overall speaking, the ovals or bell orbits are legitimate and the sharp-pointed ovals are partly legitimate, nevertheless the lemniscates are illegitimate. Unfortunately, the LM method is unable to tell the patterns of orbits. Hence, its algorithm does not guarantee that its solution always stays within the legitimate class. In other words, the LM method is a numerical scheme good for the case of homogeneity, i.e., all orbits belonging to the legitimate class. As encountered with the case of heterogeneity, i.e., legitimate orbits mingled with illegitimate ones, it becomes so incompetent.

Since neither the analytical method nor the standard numerical scheme can prevail, we resort to leverage the ML (Machine Learning) in the attempt of drawing out the knowledge or model from those heterogeneous data, i.e., orbits. Many different types of ML techniques have been developed and they can be categorized into three types, i.e., supervised learning, unsupervised learning, and reinforcement learning, depending on the training method. Among them, a supervised learning technique, called SVM, will be employed here to achieve the binary classification. There are two merits of using SVM. First, it gives a geometric perspective that is finding the equation for a hyperplane to best separate the different classes in the feature space. Second, it solves the convex optimization problem analytically and hence always returns the same optimal hyperplane. Due to the importance of SVM, a brief of SVM is presented in the following section.

3. Support Vector Machine

The SVM [18,19] was officially introduced by Boser, Guyon, and Vapnik during the Fifth Annual Association for Computing Machinery Workshop on Computational Learning Theory in 1992. SVMs are probably the most popular machine learning approach for supervised learning. The underlying idea of SVMs with kernels is to apply a nonlinear mapping,

φ

, which maps the input into some feature space and then find a hyperplane that best separates the different patterns in the feature space. The problem of finding a hyperplane that separates two patterns with a maximum margin can boil down to an optimization problem, as follows.

\begin{matrix} \min_{w, b} & \frac{1}{2} ‖ w^{2} ‖ \end{matrix}

(12)

subject to the constraints:

y_{i} (< w, φ (x_{i}) > + b) \geq 1 for \forall i

(13)

where

w

is a vector normal to the hyperplane;

b

is a bias;

φ

stands for a predefined function; and,

y_{i}

is a sign function defined as

y_{i} = s i g n (< w, φ (x_{i}) > + b)

.

The kernel or Gram matrix on all pairs of data points is defined, as follows.

K (x, u) = \sum_{i} φ_{i} (x) φ_{i} (u)

(14)

The selection of a kernel function is heavily dependent on the data specifics. In general, the Gaussian kernels, also called Radial Basis Functions (RBFs), are mostly applied in the absence of prior knowledge. Afterwards, the solution to the primal problem in (12) can be found efficiently by solving a dual problem, as follows.

\begin{matrix} \max_{λ_{j}} & \sum_{j} λ_{j} \end{matrix} - 0.5 \sum_{j} \sum_{k} λ_{j} λ_{k} y_{j} y_{k} K (x_{j}, x_{k})

(15)

subject to the constraints:

\sum_{j} y_{j} λ_{j} = 0

(16)

0 \leq λ_{j}

(17)

where the index j can be restricted to a set of support vectors instead of the set of whole training data. Subsequently, the score function of a SVM used to classify the data is given, as follows.

f (x) = < w^{*}, φ (x) > + b^{*} = \sum_{j} y_{j} λ_{j}^{*} K (x_{j}, x) + b^{*}

(18)

where

w^{*}, b^{*}

, and

λ_{j}^{*}

stand for the optimal solutions of the primal problem (12) and dual problem (15).

4. Preliminary Model of SVM

The preliminary model of SVM is configured, as shown in Figure 4, where it consists of two phases, i.e., using a training algorithm (red background) and employing the model to classify new data (yellow background). This model is used to conduct a small scale study prior to a full-scale model. It is helpful to discover the knowledge of our preference in selecting the legitimate orbits and to observe how well the SVM might classify them.

It all starts with a small portion of examples. We arbitrarily choose the dimensional parameters a and b to form a mesh of grids. Each of them possesses eight points uniformly distributed over [3,5], i.e., [3, 3.2857, 3.5714, 3.8571, 4.1429, 4.4286, 4.7143, 5]. Then a mesh of grids formed by [a, b] consists of 64 points, each of which associates with an orbit as shown in Figure 5a where the rest of parameters are set to be c = 3, d = 3.1111, S_x = 3, S_y = 0.6667.

At this preliminary stage, we could boldly categorize the orbits based upon our intuitiveness. Consequently, the orbits in Figure 5a labeled with 1, 2, 3, 9, 10, 11, 17, 18, 19, 25, 26, 27, 33, 34, 35, 41, 42, 43, 49, 53, 54, 55, 56, 60, 61, 62, 63, 64 are attributed to the legitimate class, while the others are defined as the illegitimate class. In Figure 5b, the blue and red dots stand for the legitimate and illegitimate classes, respectively.

Obviously, this set of training data cannot be separated linearly. Hence, a Gaussian kernel is applied to achieve the nonlinear separation. The model of SVM is obtained by using fitcsvm, a command of Matlab, with KernelFunction, KernelScale, Sigma, and Standardize, setting to be rbf, auto, 0.8, and true, accordingly. The decision boundary results in two black curves, as shown in Figure 5b.

To get a grasp of how the kernel trick works, we evaluate the Equation (18) with respect to

(a, b)

and plot the family of its contours accordingly. In Figure 6, the surface that is generated by the score function contains all of the data where the points of the legitimate class are distributed over the uphill, while the points of the illegitimate class are distributed over the downhill. The decision boundary is exactly the contour at the level of zero, as shown in Figure 6. Hence, the nature of the kernel trick is to embed the input space into a higher-dimensional space, referred to as feature space or kernel space, in which the data can be linearly separable by a plane lying horizontally at level zero. The scatter diagram in Figure 5b is in fact the top view of the stereoscopic model, as shown in Figure 6.

So far, the SVM has manifested its great strength in classification. Taking a close look of the orbits attributed to the legitimate class, it unveils that our preference in selecting the orbits possesses three characteristics, i.e., the obliqueness of the orbit being shallow, avoiding the orbit from being too slim or too thick, and no self-intersection.

In this pilot study, the training data were prepared manually because they were inspected by our vision and judged by our intuitiveness. The judgement based on the qualitative descriptions is more or less involved with subjectivity rather than objectivity. It leads that the training data might be contaminated with some amounts of unconscious bias. How to well prepare the training data is always a vital step in the supervised machine learning techniques. Hence, the following two sections are devoted to turn those discriminant criteria from qualitative descriptions into quantitative definitions, like mathematical functions or rules, based on which the machine vision can take the place of human vision to label a large scale of orbits.

5. Obliqueness and Slenderness

Let us define two properties of the orbit, i.e., obliqueness

α

and slenderness

β

, as shown in Figure 7a and apply the image processing to extract these properties from it. The flowchart of extracting these properties from an orbit, as shown in Figure 7b, includes computing the forward kinematics of the TJL using Equations (1)–(11); plotting an orbit; capturing the image of the orbit [20,21] from the screen; and, measuring the obliqueness

α

of the orbit, the length of its major axis, the length of its minor axis from its image. Subsequently, the slenderness ratio,

β

, of an orbit is calculated by taking the length of its minor axis divided by the length of its major axis.

After computation, the properties of the 64 orbits presented in the previous section are obtained and listed in the Table 1. Regarding the extraction of properties, it is involved with several standard techniques of image processing [22,23], i.e., calculating the centroid of the image, obtaining the centroid-shifted image, computing the eigenvectors and eigenvalues of the centroid-shifted image, retaining the eigenvectors with the maximum and minimum eigenvalues, calculating the Euclidean distances and orientations or simply applying the Matlab command, regionprops.

The question remains as to whether we can deduce the criteria in terms of mathematical functions that a computer can reason about. The answer is positive, as the sequel will show. When we plot the scatter diagram according to the Table 1, the Figure 8a unveils that the data of the legitimate class cluster within a region that is tightly related to our preference in selecting the desired orbits. As observing the location of the clustering region, it tells that our preference is indeed consistent with the requirements of shallow obliqueness and moderated slenderness. It also confirms our skepticism of the training data contaminated with some amounts of bias because few dots of blue are aberrated from the clustering region.

In Figure 8a, the contours are plotted according to the score function of the SVM. The contour at level 1.2 encircles a region containing a family of orbits in consistency with our preference. The mathematical criterion obtained by imposing an inequality on the score function of the SVM is called the model-based criterion. However, this set of training data are only sampled from two-dimensional parameters, i.e., a and b, is not sufficient to stand for the overall data and the orbits classified by our human vision is not objective. Hence, this model-based criterion will not be used. Instead, imposing the upper and lower bounds on

α

and

β

, i.e.,

α \in [- 10^{°}, + 10^{°}]

and

β \in [0.25, 0.45]

, as shown in Figure 8b to delimit the desired region, called the rule-based criteria, will be used by machine vision to reclassify the orbits in the sequel.

6. Self-Intersection

Except the obliqueness and slenderness, there is an additional factor that has to be taken into account, i.e., self-intersection. An investigation is conducted here to evaluate the feasibility of image processing to detect the self-intersection. It shows that the image processing can correctly distinguish the majority of orbits regarding self-intersection but there does exit few adverse events.

Using our human vision to inspect Figure 5a, the orbits with identity numbers 7, 8, 15, 16, 23, 24, 30, 31, 44, 50, 51, 52, 57, 58, and 59 are lemniscates possessing self-intersection, not acceptable for legged motion. When using the image processing to identify the self-intersection, it is achieved by the morphological erosion operation that is filling the area enclosed by the orbit, eroding the filled area with a three-by-three square structuring element, and counting the number of connected objects, as shown in Figure 9. The orbit will be judged as no self-intersection provided that the number of objects is equal to 1. Otherwise, it is self-intersected. It turns out that the orbits with identity numbers 8, 15, 16, 22, 23, 24, 30, 31, 37, 38, 44, 45, 49, 50, 51, 52, 57, 58, 59 recognized by machine vision are self-intersected.

All of the lemniscates recognized by us have been successfully captured by machine vision, except one discrepancy on the orbit with identity number 7. The reason why it escapes from the inspection of machine vision is because it resembles a sharp-point oval before erosion and it becomes a sharp-point oval after erosion, as shown in Figure 10.

Figure 11 shows that the non- self-intersected orbits are misjudged as self-intersection by the machine vision. Get a glimpse of Figure 10 and Figure 11, it seems that the image erosion encountering with the slimness undermines its identification on self-intersection.

Let us examine the lemniscates that are recognized by human vision and machine vision, as shown in Figure 12a,b, where the lemniscates are marked with the cross-X and the discrepancies are marked with the circle-O. It is clear that the lemniscates distribute over the region with a low slenderness ratio. Be reminded that, when the orbits are too slim, they will be precluded from the legitimate class and then it is meaningless to care about their self-intersection. As imposing the rule-based criteria

α \in [- 10^{°}, + 10^{°}]

and

β \in [0.25, 0.45]

to redefine the legitimate region, all of the lemniscates are far away from that region, as shown in Figure 12. It is sufficient to use the rule-based criteria on

α

and

β

to categorize the orbits without detecting the self-intersection.

7. Advanced Model of SVM

Obtaining a sufficient amount of training data that adequately reflects the characteristics of the field data is always the bottleneck of supervised learning. The breakthrough we made is to combine the image processing with the rule-based criteria, as shown in Figure 13, so that the machine vision can label a vast amount of training data as well as a full-scale of data.

Note that Xp consists of properties

α

and

β

; Xd consists of six dimensional parameters, i.e.,

a, b, c, d \in [3, 5]

,

S_{x} \in [3, 4]

,

S_{y} \in [0, 2]

; and, Y is the label set. Xd and Y carry the inputs and correct outputs of the training data for SVM2, so do Xp and Y for SVM1. If each dimensional parameter is uniformly sampled into

n

points, the number of training data associated with 6 parameters will be

N = n^{6}

. Hence, we prepare four sets of training data with different numbers, i.e., 64(

= 2^{6}

), 4096(

= 4^{6}

), 46656(

= 6^{6}

), 262144(

= 8^{6}

), to train SVM1 and SVM2.

The cross-validation is the method to evaluate the performance of the trained model. In Table 2, there is a list of losses that were obtained by 10-fold cross-validation. It shows that the SVM1 has a performance better than SVM2. As the number of training data is increased, the rate of loss is decreased.

Since SVM1 is the model drawn from the training data [Xp, Y] and the input data Xp possess only two variables, its model can be visualized. However, it is not the case for SVM2 as its input data Xd possessing dimension higher than two. The scatter diagrams of SVM1 with four sets of training data are shown in Figure 14 and Figure 15 where the blue and red dots stand for legitimate and illegitimate classes, respectively; the black cross-X stands for the lemniscates that were identified by machine vision. Note that the diagram with 64 data is insufficient to manifest its geometrical manifold, hence some contours are added for the sake of visualization.

Recall the self-intersection addressed previously. Observing the Figure 14 and Figure 15, most of lemniscates marked with X indeed aggregate at the region with a slenderness ratio below 0.25. Interestingly, the lemniscates whose slenderness ratio exceeds 0.25 will also be precluded from the legitimate class by the reason of their steepness. Evidently, both slenderness and obliqueness criteria are sufficient to categorize the data.

As refining the sampling points on each dimensional parameter, either the number of training data or the computational time in property extraction prodigiously grows, as shown in Table 3. Comparatively, each dimension sampled into six points is more pragmatic than when it is sampled into eight points, because it can provide the SVM with sufficient amount of unbias training data at the cost of less computational time.

In the framework of machine vision and SVM, as shown in Figure 13, two databases gathering all eligible data concerned with properties of orbits and dimensions of TJLs are established. With the aid of these two databases, a program can be developed to fulfill the inverse problem. It searches the orbits in compliance with the specification of

α

and

β

from the database of properties and uses their correspondent identity numbers to list all candidates of TJLs from the database of dimensions. Undoubtedly, the database of properties has collected all the orbits with legitimate properties, i.e.,

α \in [- 10^{°}, + 10^{°}]

and

β \in [0.25, 0.45]

. However, what would these legitimate orbits look like? Let us randomly select the data from the database of dimensions and plot their orbits accordingly. In Figure 16, it shows that all the orbits possessing swallow obliqueness, neither too slim nor too thick, and no self-intersection are absolutely qualified to be the foot trajectories for the legged robots. Hence, the designer can refer to the database to determine the dimensions of TJLs without a risk of rendering an illegible design.

In Table 3, the number of training data is 262144, as each dimension is sampled into eight points. The orbits in Figure 2 are randomly selected from that 262144 training data while the orbits in Figure 16 are randomly selected from the legitimate class, which is only 10.63% of the training data. As comparing Figure 16 with Figure 2, it evidently shows that the machine vision has effectively classified a vast amount of orbits and successfully gathered all legitimate ones in the database.

8. Conclusions

Theo Jansen linkage is an appealing mechanism to implement a bio-inspired motion for a legged robot. The oval orbit generated by the Theo Jansen linkage, possessing a transversal axis longer than a lateral axis, achieves energy efficient walking as comparing to the circular orbit generated by the four-bar linkage. However, the patterns of orbits that are produced by the TJL may vary according to its ensemble dimension and can be casted into four groups, such as bell curves, ovals, sharp-pointed ovals, and lemniscates. Some orbits can be used as the foot trajectories for legged robots but some cannot. Overall speaking, the ovals or bell orbits are legitimate and the sharp-pointed ovals are partly legitimate, nevertheless the lemniscates are illegitimate.

All of academic works relevant to TJLs mainly focus on the analysis of forward problems, like kinematics, dynamics, and controls. None have ever addressed the issue of design that has to rely on the knowledge of inverse problems. To design a Theo Jansen linkage, which can generate a foot trajectory to achieve an energy efficient walking, relies on successfully sizing its eight links. It is equivalent to determine its six dimensional parameters according to the specification of orbits. Due to the complexity of TJLs, it is impossible to derive any inverse function analytically. As resorting to the numerical scheme, the heterogeneous patterns of orbits hinder the Levenberg–Marquardt method from prevailing because it is lacking capability to tell the patterns of orbits.

In order to discriminate the orbits, the machine learning technique, called SVM, is employed. The SVM gives a geometric perspective that is finding the equation for a hyperplane to best separate the different classes in the feature space, solves the convex optimization problem analytically, and hence always returns the same optimal hyperplane. Obtaining a sufficient amount of training data that adequately reflects the characteristics of the field data is always the bottleneck of supervised learning. The breakthrough that we made is to apply the image processing to label a vast amount of training data without bias.

The proposed method is divided into two steps. First, a preliminary model is built with a small portion of data to conduct a pilot study on discovering the knowledge of data. Second, an advanced model is built to deal with the large scale data. Consequently, two models of SVM are trained and the two databases concerned with properties and dimensions of all eligible designs can be established. The models of SVM are used to quickly identify the eligibility of a design data and the databases provide abundant of feasible data to which a designer can refer as designing the TJLs. The inverse problem concerned with design is fulfilled by searching the orbits in compliance with the specification of obliqueness and slenderness from the database of properties and using their correspondent identity numbers to list all the candidates of TJLs from the database of dimensions. With the aid of this proposed method, the TJLs have been successfully designed and implemented on a legged robot.

There are two topics of interest for future research. First, SVM is one of discriminant technique among learning machines. In contrast to ANN (Artificial Neural Network), hyperplanes are highly dependent on the initialization and termination criteria. SVM distinguishes itself from ANN, in that it always returns the same optimal hyperplane with a large margin. Due to its successful application, we shall advance to other ML techniques, like K-Means clustering, K-Nearest Neighbors, Naive Bayes, and Random Forest, etc., and evaluate their merits and demerits. Second, the structure of a TJL can be improved by replacing two isosceles right triangles with two right triangles. The image processing shall extract more properties from the orbits so as to discriminate the bell curves, ovals, sharp-pointed ovals, and lemniscates. The significance of this research is to tackle the mechanism problem by means of the machine learning and machine vision.

Author Contributions

M.-C.H. and C.-J.H. co-organized the work, conceived and designed the methodology and coded programs. M.-C.H. wrote the manuscript. F.L. and M.-C.H. co-worked to prepare the final manuscript and co-supervised the research.

Funding

This research was funded by Jiangxi University of Science and Technology, People’s Republic of China (Grant No. JXXJBS18018).

Conflicts of Interest

The authors declare no conflicts of interest.

References

Strandbeest Evolution 2018. Available online: http://www.strandbeest.com/ (accessed on 18 October 2018).
Erdman, A.G.; Sandor, G.N. Mechanism Design: Analysis and Synthesis; Prentice-Hall, Inc.: Englewood Cliffs, NJ, USA, 1984; Volume 2, pp. 2–5. ISBN 0135723965. [Google Scholar]
Komoda, K.; Wagatsuma, H. A Proposal of the extended mechanism for Theo Jansen linkage to modify the walking elliptic orbit and a study of cyclic bas function. In Proceedings of the Dynamic Walking Conference, Pensacola, FL, USA, 21–24 May 2012; pp. 2–4. [Google Scholar]
Komoda, K.; Wagatsuma, H. Singular Configurations Analyses of the Modifiable Theo Jansen-like Mechanism by Focusing on the Jacobian determinant-A Finding Limitations to Exceed Normal Joint Range of Motion. In Proceedings of the 2014 IEEE/ASME International Conference on Advanced Intelligent Mechatronics, Besançon, France, 8–11 July 2014; pp. 76–81. [Google Scholar]
Mohsenizadeh, M.; Zhou, J. Kinematic Analysis and Simulation of Theo Jansen Mechanism. In Proceedings of the Fifteenth Annual Early Career Technical Conference, Tuscaloosa, AL, USA, 7–8 November 2015; pp. 97–104. [Google Scholar]
Nansai, S.; Rajesh, M.; Iwase, M. Dynamic Analysis and Modeling of Jansen Mechanism. Procedia Eng. 2013, 64, 1562–1571. [Google Scholar] [CrossRef]
Nansai, S.; Mohan, R.E.; Tan, N.; Rojas, N.; Iwase, M. Dynamic Modeling and Nonlinear Position Control of a Quadruped Robot with Theo Jansen Linkage Mechanisms and a Single Actuator. J. Robot. 2015, 2015, 1–15. [Google Scholar] [CrossRef]
Nansai, S.; Rajesh Elara, M.; Iwase, M. Speed Control of Jansen Linkage Mechanism for Exquisite Tasks. J. Adv. Simul. Sci. Eng. 2016, 3, 47–57. [Google Scholar] [CrossRef]
Levenberg, K. A Method for the Solution of Certain Non-linear Problems in Least Squares. Quart. Appl. Math. 1944, 2, 164–168. [Google Scholar] [CrossRef]
Marquardt, D.W. An Algorithm for Least-Squares Estimation of Nonlinear Parameters. J. Soc. Ind. Appl. Math. 1963, 11, 431–441. [Google Scholar] [CrossRef]
Vapnik, V.N. An Overviews of Statistical Learning Theory. IEEE Trans. Neural Netw. 1999, 10, 988–999. [Google Scholar] [CrossRef] [PubMed]
Cortes, C.; Vapnik, V. Support-Vector Networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
Shai, S.-S.; Shai, B.-D. Understanding Machine Learning from Theory to Algorithms; Cambridge University Press: New York, NY, USA, 2014; Volume 2, pp. 202–226. ISBN 9781107057135. [Google Scholar]
Sundararajan, D. Digital Image Processing, 3rd ed.; Springer Nature: Singapore, 2017; pp. 281–362. ISBN 9789811061134. [Google Scholar]
Jähne, B. Digital Image Processing, 6th ed.; Springer: Berlin/Heidelberg, Germany, 2005; pp. 449–462. ISBN 9783540240358. [Google Scholar]
Paul, R.P. Robot Manipulators, Mathematics, Programming, and Control; MIT Press: Cambridge, MA, USA, 1981; pp. 41–63. ISBN 026216082X. [Google Scholar]
Denavit, J.; Hartenberg, R.S. A Kinematic Notation for Lower-Pair Mechanisms Based on Matrices. ASME J. Appl. Mech. 1955, 22, 215–221. [Google Scholar]
Boser, B.E.; Guyon, I.M.; Vapnik, V.N. A Training Algorithm for Optimal Margin Classifiers. In Proceedings of the Fifth Annual Workshop on Computational Learning Theory—COLT’92, Pittsburgh, PA, USA, 27–29 July 1992; ACM Press: New York, NY, USA, 1992; pp. 144–152. [Google Scholar]
Vapnik, V.N. The Nature of Statistical Learning Theory; Springer: New York, NY, USA, 1995; pp. 119–156. ISBN 9781475724400. [Google Scholar]
Fu, K.S.; Mui, J.K. A Survey on Image Segmentation. Pattern Recognit. 1981, 13, 3–16. [Google Scholar] [CrossRef]
Canny, J. A Computational Approach to Edge Detection. IEEE Trans. Pattern Anal. Mach. Intell. 1986, PAMI-8, 679–698. [Google Scholar] [CrossRef]
Marques, O. Practical Image and Video Processing Using MATLAB; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2013; pp. 365–386. ISBN 9780470048153. [Google Scholar]
Svoboda, T.; Kybic, J.; Hlavac, V. Image Processing, Analysis, and Machine Vision: A MATLAB Companion; Thomson Learning: Toronto, ON, Canada, 2008; ISBN 9780495295952. [Google Scholar]

Figure 1. Theo Jansen Linkages (a) Dimensions (b) Implemented on a robot.

Figure 2. Orbits generated by randomly resizing TJLs.

Figure 3. Four patterns of orbits: (a) Bell curve, (b) Oval, (c) Sharp-pointed oval, and (d) Lemniscate.

Figure 4. Preliminary Model of Support Vector Machine (SVM).

Figure 5. (a) Sixty four orbits as a set of training data (b) Scatter diagram in two-dimensions (2D) with decision boundary.

Figure 6. Scatter diagram in three-dimensions (3D).

Figure 7. (a) Obliqueness

α

and slenderness

β

; and, (b) Flowchart of property extraction.

Figure 7. (a) Obliqueness

α

and slenderness

β

; and, (b) Flowchart of property extraction.

Figure 8. Region of interest encircled by (a) the contour of SVM, and (b) the upper and lower bounds of

α

and

β

.

Figure 8. Region of interest encircled by (a) the contour of SVM, and (b) the upper and lower bounds of

α

and

β

.

Figure 9. Detection of self-intersection by image processing.

Figure 10. Failure of detecting the self-intersection.

Figure 11. Some cases of non-self-intersection misjudged as self-intersection.

Figure 12. Lemniscates recognized by (a) human vision and (b) machine vision.

Figure 13. Advanced model of SVM.

Figure 14. Scatter diagrams of SVM1 in cases of N = 64 and N = 4096.

Figure 15. Scatter diagrams of SVM1 in cases of N = 46656 and N = 262144.

Figure 16. Orbits generated by Theo Jansen Linkage (TJLs) randomly selected from the database of dimensions.

Table 1. Obliqueness

α

and slenderness

β

.

Table 1. Obliqueness

α

and slenderness

β

.

N ¹	$α$ ²	$β$ ³	C ⁴	N	$α$	$β$	C	N	$α$	$β$	C	N	$α$	$β$	C
1	−13.50	0.35	1	17	−8.35	0.30	1	33	−2.13	0.24	−1	49	6.18	0.15	−1
2	−15.95	0.33	1	18	−10.80	0.27	1	34	−4.51	0.20	−1	50	3.91	0.12	−1
3	−18.61	0.30	1	19	−13.48	0.24	1	35	−7.15	0.16	−1	51	2.32	0.04	−1
4	−21.56	0.27	1	20	−16.47	0.20	1	36	−10.11	0.10	−1	52	−0.58	0.10	−1
5	−24.91	0.23	1	21	−19.93	0.15	1	37	−13.63	0.04	−1	53	−3.88	0.16	−1
6	−28.86	0.19	1	22	−24.03	0.10	1	38	−17.92	0.06	−1	54	−8.09	0.26	−1
7	−33.31	0.15	1	23	−30.17	0.04	−1	39	−24.20	0.18	1	55	−14.58	0.42	1
8	−43.67	0.09	−1	24	−49.91	0.26	−1	40	−36.92	0.38	1	56	−32.46	0.67	1
9	−11.01	0.33	1	25	−5.42	0.27	−1	41	1.69	0.19	−1	57	14.07	0.08	−1
10	−13.47	0.30	1	26	−7.85	0.24	−1	42	−0.61	0.15	−1	58	12.38	0.08	−1
11	−16.15	0.27	1	27	−10.51	0.20	−1	43	−3.14	0.11	−1	59	10.93	0.18	−1
12	−19.12	0.24	1	28	−13.52	0.16	−1	44	−6.01	0.05	−1	60	8.48	0.24	−1
13	−22.53	0.19	1	29	−17.02	0.10	−1	45	−9.51	0.05	−1	61	5.65	0.32	−1
14	−26.56	0.15	1	30	−21.36	0.05	−1	46	−13.84	0.14	−1	62	2.47	0.44	−1
15	−31.91	0.10	−1	31	−27.44	0.12	−1	47	−20.27	0.27	1	63	−2.12	0.63	1
16	−42.13	0.13	−1	32	−39.48	0.31	1	48	−34.59	0.49	1	64	−41.18	0.95	1

¹ N: identity number; ²

α

: obliqueness (degree); ³

β

: ratio of slenderness; ⁴ C: classes (1: legitimate, −1: illegitimate).

Table 2. Loss obtained by 10-fold cross-validation.

$N$	64	4096	46656	262144
SVM1	0.0156	0.0103	0.0077	0.0074
SVM2	0.0625	0.0513	0.0221	0.0107

Table 3. Rate of eligibility.

Number of sampling points on each dimension	2	4	6	8
Number of training data (orbits)	64	4096	46,656	262,144
Number of the orbits in the legitimate class	4	383	4803	27845
Rate of eligibility	6.25%	9.35%	10.29%	10.62%
Computational time in property extraction (sec)	2.591	158.6692	1756.5	12136

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hwang, M.-C.; Huang, C.-J.; Liu, F. Application of Support Vector Machine in Designing Theo Jansen Linkages. Appl. Sci. 2019, 9, 371. https://doi.org/10.3390/app9030371

AMA Style

Hwang M-C, Huang C-J, Liu F. Application of Support Vector Machine in Designing Theo Jansen Linkages. Applied Sciences. 2019; 9(3):371. https://doi.org/10.3390/app9030371

Chicago/Turabian Style

Hwang, Min-Chan, Chiou-Jye Huang, and Feifei Liu. 2019. "Application of Support Vector Machine in Designing Theo Jansen Linkages" Applied Sciences 9, no. 3: 371. https://doi.org/10.3390/app9030371

APA Style

Hwang, M.-C., Huang, C.-J., & Liu, F. (2019). Application of Support Vector Machine in Designing Theo Jansen Linkages. Applied Sciences, 9(3), 371. https://doi.org/10.3390/app9030371

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Application of Support Vector Machine in Designing Theo Jansen Linkages

Abstract

1. Introduction

2. Theo Jansen Linkage and Foot Trajectories

3. Support Vector Machine

4. Preliminary Model of SVM

5. Obliqueness and Slenderness

6. Self-Intersection

7. Advanced Model of SVM

8. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI