Active Prior Tactile Knowledge Transfer for Learning Tactual Properties of New Objects

Feng, Di; Kaboli, Mohsen; Cheng, Gordon

doi:10.3390/s18020634

Open AccessArticle

Active Prior Tactile Knowledge Transfer for Learning Tactual Properties of New Objects

by

Di Feng

^†,

Mohsen Kaboli

^*,†

and

Gordon Cheng

Institute for Cognitive Systems (ICS), Technische Universität München, Arcisstrasse 21, 80333 München, Germany

^*

Author to whom correspondence should be addressed.

^†

Mohsen Kaboli and Di Feng contributed equally to this work.

Sensors 2018, 18(2), 634; https://doi.org/10.3390/s18020634

Submission received: 1 November 2017 / Revised: 14 February 2018 / Accepted: 16 February 2018 / Published: 21 February 2018

(This article belongs to the Special Issue Tactile Sensors and Sensing)

Download

Browse Figures

Versions Notes

Abstract

:

Reusing the tactile knowledge of some previously-explored objects (prior objects) helps us to easily recognize the tactual properties of new objects. In this paper, we enable a robotic arm equipped with multi-modal artificial skin, like humans, to actively transfer the prior tactile exploratory action experiences when it learns the detailed physical properties of new objects. These experiences, or prior tactile knowledge, are built by the feature observations that the robot perceives from multiple sensory modalities, when it applies the pressing, sliding, and static contact movements on objects with different action parameters. We call our method Active Prior Tactile Knowledge Transfer (APTKT), and systematically evaluated its performance by several experiments. Results show that the robot improved the discrimination accuracy by around

10 %

when it used only one training sample with the feature observations of prior objects. By further incorporating the predictions from the observation models of prior objects as auxiliary features, our method improved the discrimination accuracy by over

20 %

. The results also show that the proposed method is robust against transferring irrelevant prior tactile knowledge (negative knowledge transfer).

Keywords:

tactile sensing; artificial robotic skin; active tactile object perception; active tactile object learning; active tactile transfer learning

1. Introduction

1.1. Motivation

We humans perceive tactual properties of an object (e.g., stiffness, texture, temperature, weight) by applying exploratory actions (e.g., pressing, sliding, static contact, lifting) [1]. After applying different exploratory actions on an object, we can obtain its different tactile information. Conversely, making the same exploratory action on different objects produces different tactile observations. Therefore, when we learn about an object, we always link its physical properties with the exploratory actions that we apply on it.

Besides different kinds of exploratory actions, the tactile information we perceive from an object is also dependent on how we apply an action. Consider an example of pressing on two objects. Object 1 is made of soft sponge, and object 2 is made by covering a solid metal with a soft sponge surface. When pressing our fingertips on both objects with a small normal force, we can recognize similar object deformations. However, if we press with a larger normal force, object 1 deforms much more than object 2, since we have reached the metal part of object 2. A similar situation occurs when we apply the sliding movement on object surfaces with different forces and velocities. As a result, by applying different exploratory actions in different ways, we can build a detailed knowledge of the object’s tactual properties which we call tactile exploratory action experiences.

We humans learn about new objects in an active and incremental way. We actively select the most informative exploratory actions to interact with them [2,3]. More importantly, we relate these new objects with the experiences of exploring objects that we have previously encountered. By transferring the prior tactile knowledge, or prior tactile exploratory action experiences, we can largely reduce the amount of exploratory actions required to discriminate among new objects. In this way, we humans save a lot of time and energy, and recognize new objects with high accuracy [4,5,6,7,8,9,10].

Can robotic systems with a sense of touch also perform like humans to actively transfer the past tactile exploratory action experiences when learning about new objects (transfer learning)?

1.2. Background

Over the past decades, researchers have developed various tactile sensors and mounted them on robotic systems (e.g., [11,12,13,14,15,16,17]). In this way, the robots with a sense of touch can perceive different objects’ tactual properties by applying exploratory actions. For example, a robot can slide its sensory parts on objects to sense their textural properties [18,19,20,21], establish a static contact to estimate the temperature [22], or lift objects to measure their center of mass [23]. Bhattacharjee et al. [24] developed algorithms to classify objects into four categories: (1) Hard-Unmoved; (2) Hard-Moved; (3) Soft-Unmoved; and (4) Soft-Moved using One Nearest Neighbor Classifier, Hidden Markov Models and Long Short Term Memory networks based on features of time-varying tactile sensor data (maximum force, contact area, and contact motion). Furthermore, several methods have been proposed for the active object exploration problem, in which the robot actively applies multiple exploratory actions to recognize objects (e.g., [25,26,27,28,29,30,31,32]).

However, the problem of transferring the robotic tactile knowledge has been rarely investigated. Even though many transfer learning techniques have been successfully applied to several areas (e.g., Natural Language Processing: [33]; WiFi-based localization: [34]; Computer Vision: [35,36,37,38]; Bio-informatics: [39]), it is our works that introduced tactile transfer learning. Previously, Kaboli et al. [20,21] developed a novel textural descriptor. Using the descriptor, a ShadowHand dexterous robotic hand equipped with BioTac sensors on its fingertips could efficiently discriminate among object surface textures. Later, we designed a transfer learning method [40,41,42] so that the robotic hand could reuse the prior texture models from 12 objects to learn about the surface textures of 10 new objects. However, since only the sliding movement was applied, the robot could only transfer the object textural properties.

In our previous works [43,44], we proposed an active touch learning method in which an UR10 robotic arm with an artificial skin on its end-effector or fingertips could apply sliding, pressing, and static contact movements to learn about objects’ surface texture, stiffness, and thermal conductivity, respectively. Even though our active learning method enables the robot to efficiently learn about objects, the robot still needs to learn from scratch given a new set of objects. In this regard, recently, for the first time in robotics and tactile domains, we proposed an algorithm called Active Tactile Transfer Learning (ATTL) [45] to actively transfer multiple physical properties of prior objects. Using ATTL, the UR10 robotic arm could actively select prior knowledge to transfer (surface texture, stiffness, and thermal conductivity by applying sliding, pressing, and static contact movements). As a result, the robot could use fewer training samples (even one sample) to achieve higher recognition rate, when it learns about new objects.

The robotic systems in the above-mentioned works only applied exploratory actions with fixed action parameters, e.g., sliding with a fixed velocity to perceive surface textures. In order to learn their detailed physical properties (e.g., the vibro-tactile feedbacks by sliding at different speeds) so as to better discriminate among them, the robots should be able to apply exploratory actions with different action parameters.

1.3. Contribution

In this paper, we focus on actively transferring the prior tactile exploratory action experiences to learn more details about the physical properties of new objects (see Figure 1). Our contributions are two-fold:

We enable a robot to apply exploratory actions with multiple action parameters. In this way, the robot gains more detailed tactile information.
We propose an active tactile transfer learning algorithm so that the robot leverages the previously obtained detailed tactile knowledge (prior tactile exploratory action experiences) while learning about a new set of objects.

In the sequel, we first introduce the robotic system (Section 2). Then, we illustrate how the robot applies exploratory actions and obtains the physical properties of objects (Section 3). Afterwards, we illustrate our proposed tactile transfer learning in detail (Section 4), followed by a systematic evaluation of the method (Section 5). We finalize this paper with a conclusion and a discussion about future works (Section 6).

2. System Description

2.1. Multi-modal Artificial Skin

To enable the robot to perform more human-like behaviours with multiple tactile sensing modalities, we designed and manufactured multi-modal artificial skin (Figure 2a made by seven active tactile modules (Figure 2b [12]. Each module is a small hexagonal printed circuit board equipped with off-the-shelf sensors (one temperature sensor, one accelerometer, three normal force sensors, and one proximity sensor). In this way, robots are equipped with such an artificial skin that contains seven temperature sensors, seven accelerometers, 21 normal force sensors, and seven proximity sensors. They can emulate the human tactile sensing about temperature, vibrations, force, and light touch. Their technical information is summarized in Table 1.

2.2. UR10 Robotic Arm

We mounted the multi-modal artificial skin on the end-effector of an Universal Robotic Arm (UR10) with six DoFs (Figure 2a). The UR10 was controlled in collaboration with the aritificial skin in order to apply different exploratory actions on objects.

3. Exploratory Actions and Perception

3.1. Exploratory Actions Definition

By applying exploratory actions on objects with different action parameters, the robot can attain different feature observations. In this work, we consider three types of exploratory actions: pressing (denoted as P), sliding (denoted as S), and static contact (denoted as C). Formally, we define

N_{α}

number of exploratory actions as

A = {α_{n}^{θ_{n}}}_{n = 1}^{N_{α}}

, where

θ_{n}

is the action parameters that define "how" the robot can apply the exploratory action. We further define

θ_{n} \in {θ_{P}, θ_{S}, θ_{C}}

, where

θ_{P}, θ_{S}

, and

θ_{C}

represent the action parameters for the pressing, sliding, and static contact movements respectively.

3.1.1. Pressing

The robotic system presses its sensory part on the object surfaces in order to perceive its stiffness (see Figure 3a). The pressing movement consists of pressing until a depth of

d_{P}

and holding the artificial skin for

t_{P}

seconds, i.e.,

θ_{P} = [d_{P}, t_{P}]

. During the pressing, the multi-modal artificial skin can record the normal force feedbacks from each normal force sensor:

F_{n_{f}, n_{s}} = {F_{n_{f}, n_{s}}^{m}}_{m = 1}^{t_{P} \cdot f_{s}}

in order to measure the object stiffness.

n_{f}

is the index of a normal force sensor in one skincell (

n_{f} = 1, \dots, N_{f}

, in our case

N_{f} = 3

), and

n_{s}

is the index of skincells in the artificial skin (

n_{s} = 1, \dots, N_{s}

, in our case

N_{s} = 7

).

f_{s}

is the sampling rate of the artificial skin, and m the sampling time step. In addition to the normal force feedbacks, the robot can also record the temperature feedbacks from each temperature sensor in order to measure the object thermal conductivity:

T_{n_{t}, n_{s}} = {T_{n_{t}, n_{s}}^{m}}_{m = 1}^{t_{P} \cdot f_{s}}, n_{t} = 1, \dots, N_{t}

, with

N_{t}

being the number of temperature sensors in one skincell (in our case

N_{t} = 1

).

3.1.2. Sliding

The robot slides the artificial skin on the object surface and perceives its textural properties [18,21] (see Figure 3b). To do this, the robot first builds a contact with objects with the normal force of

F_{S}

, then it linearly slides on the objects with a speed of

v_{S}

for

t_{S}

seconds,

θ_{S} = [F_{S}, v_{S}, t_{S}]

. During sliding, the robot collects the outputs of accelerometers (in three axes:

x, y, z

):

a_{n_{a}, n_{s}}^{(x)} = {a_{n_{a}, n_{s}}^{(x), m}}_{m = 1}^{t_{S} \cdot f_{s}}

,

a_{n_{a}, n_{s}}^{(y)} = {a_{n_{a}, n_{s}}^{(y), m}}_{m = 1}^{t_{S} \cdot f_{s}}

,

a_{n_{a}, n_{s}}^{(z)} = {a_{n_{a}, n_{s}}^{(z), m}}_{m = 1}^{t_{S} \cdot f_{s}}

. Then the robot combines these signals together:

a = {a_{n_{a}, n_{s}}}_{n_{a} = 1, n_{s} = 1}^{N_{a}, N_{s}}

;

a_{n_{a}, n_{s}} = [a_{n_{a}, n_{s}}^{(x)}, a_{n_{a}, n_{s}}^{(y)}, a_{n_{a}, n_{s}}^{(z)}], n_{a} = 1, \dots, N_{a}

, where

N_{a}

is the number of accelerometers in one skincell (in our case

N_{a} = 1

). Besides, the change of temperature during sliding is also collected as an extra information

T_{n_{t}, n_{s}} = {T_{n_{t}, n_{s}}^{m}}_{m = 1}^{t_{S} \cdot f_{s}}

.

3.1.3. Static Contact

The object thermal cues can be attained by the robotic system by applying static contact movement: the robot presses its sensory part against the object surface until a depth of

d_{C}

and maintains the contact for

t_{C}

seconds, i.e.,

θ_{C} = [d_{C}, t_{C}]

(see Figure 3c). The normal force feedbacks and temperature feedbacks are recorded:

F_{n_{f}, n_{s}} = {F_{n_{f}, n_{s}}^{m}}_{m = 1}^{t_{C} \cdot f_{s}}

,

T_{n_{t}, n_{s}} = {T_{n_{t}, n_{s}}^{m}}_{m = 1}^{t_{C} \cdot f_{s}}

.

3.2. Object Physical Properties Perception

3.2.1. Stiffness

We use the normal force averaged over all normal force sensors and time steps as an indicator for the object stiffness. For the pressing movement with pressing time steps

t_{P} \cdot f_{s}

, object stiffness can be estimated by

\bar{F} = \frac{1}{t_{P} \cdot f_{s}} \frac{1}{N_{f}} \frac{1}{N_{s}} \sum_{m = 1}^{t_{P} \cdot f_{s}} \sum_{n_{f} = 1}^{N_{f}} \sum_{n_{s} = 1}^{N_{s}} F_{n_{f}, n_{s}}^{m}

.

3.2.2. Textural Property

In this work, we use the same textural feature extraction method in [43]: The vibration signals

a

in the artificial skin are used to calculate the activity, mobility and complexity features, denoted as

A (a)

,

M (a)

,

C (a)

. These features represent the object tactile properties in the time domains. We also computed the linear correlation of accelerometer signals between different directions (

x y

,

y z

,

x z

) denoted as

L (a)

, as these accelerometer components are correlated with each other during the sliding movement. The final descriptor of textural features combines activity, mobility, complexity and linear correlation together [43]:

T D = [A (a), M (a), C (a), L (a)]

.

3.2.3. Thermal Conductivity

To extract the features that describe the object thermal cues, we first calculate the average temperature sequence from all the temperature sensors:

\bar{T} = \sum_{n_{t} = 1}^{N_{t}} \sum_{n_{s} = 1}^{N_{s}} \frac{T_{n_{t}, n_{s}}}{N_{t} \cdot N_{s}}

. We then calculate its gradient at each time step as:

\nabla \bar{T}

, and combine it with the average temperature sequence:

[\bar{T}, \nabla \bar{T}]

. To avoid the curse of dimensionality, we further reduce this combination to 10 dimensions via Principle Component Analysis (PCA) method and use it as the final feature to describe the object thermal conductivity.

Table 2 summarizes the exploratory actions, the sensory feedbacks and the corresponding tactile features.

4. Transferring Prior Tactile Exploratory Action Experiences

This section describes our proposed active prior tactile knowledge transfer algorithm (APTKT) in detail. First, we formulate our problem in Section 4.1. Then, we illustrate our transfer learning method, including its process (Section 4.3) and the problems of what to transfer (Section 4.4), how to transfer (Section 4.5), from where to transfer, and how much to transfer (Section 4.6). The motivation of our method is demonstrated in Figure 1.

4.1. Problem Formulation

Assume that a robotic system has gained prior tactile knowledge of some old objects, on which the robot has previously applied different exploratory actions with different action parameters. These prior exploratory action experiences consist of the feature observations perceived by the multiple sensors and observation models from the old objects. Now, the robot is tasked to learn about a set of new objects. Since the old objects might share some similar physical properties with the new objects, by leveraging the related tactile exploratory action experiences, the robot can learn about new objects more efficiently.

We define

N_{new}

number of new objects (

C^{new} = {c_{j}^{new}}_{j = 1}^{N_{new}}

) the robot is tasked to learn about through different exploratory actions

A = {α_{n}^{θ_{n}}}_{n = 1}^{N_{α}}

(For simplicity, we will denote

α

as an exploratory action in the rest of the paper). In other words, the robot should actively attain object feature observations (

V_{α}^{new} = {V_{c_{1}}^{new}, V_{c_{2}}^{new}, \dots, V_{c_{N_{new}}}^{new}}

) for each exploratory action

α

and construct reliable observation models

V_{α}^{new} \overset{f_{α}^{new}}{\to} C^{new}

. We further define the robot prior tactile experience for an exploratory action

α

for

N_{old}

number of prior objects (

C^{old} = {c_{i}^{old}}_{i = 1}^{N_{old}}

) as the prior object feature observations (

V_{α}^{old} = {v_{c_{1}}^{old}, V_{c_{2}}^{old}, \dots, V_{c_{N_{old}}}^{old}}

) and the observation models of old objects

V_{α}^{old} \overset{f_{α}^{old}}{\to} C^{old}

. These feature observations are collected by the multiple tactile sensors from the artificial robotic skin.

We formulate our problem as the transfer learning in the Gaussian Process Classification (GPC) framework [46], where each object is regarded as a class, and for each exploratory action, a GPC model is built as the observation model. The robot iteratively applies the exploratory actions and leverages prior tactile knowledge to improve the GPC observation models of new objects.

4.2. Gaussian Process Classification

The Gaussian Process Classification (GPC) model describes the mapping between the observation set X and the output set Y by:

X \overset{f}{\to} Y

. The latent function

g (x)

in the GPC model is assumed to be sampled from a high-dimensional gaussian distribution called GP prior [46]:

g (x) \sim GP (m (x), K (x, x^{'}))

, where each sample

g (x)

is a random variable. In this work, we use one-vs-all multi-class classification. For each object class, a binary GPC whose output label is converted to

{- 1, + 1}

is trained for each of the N labels:

f_{n} (\cdot)

. Given a new sample

x^{*}

, each binary classifier predicts the observation probability of its label

p (y_{n} | x^{*})

. The sample is assigned to the class with the largest prediction probability

y^{*} = \arg \max_{y_{n} \in Y} p (y_{n} | x^{*})

.

4.3. Process

The robot following our proposed method first applies each exploratory action one time on each new object, in order to collect a small number of feature observations

V^{new} = {V_{α_{n}}^{new}}_{n = 1}^{N_{α}}

(Initial data collection). Then, the robot reuses its prior tactile exploratory action experiences to improve the observation models for each new object (Initial prior knowledge transfer). During this process, the robot compares the relatedness between its prior tactile exploratory action experiences and the new objects (Section 4.6), and chooses the most related one to transfer the old object feature observations

V^{old}

(Section 4.5). Afterwards, the robot begins to iteratively collect and combine the feature observations and update the prior tactile knowledge in order to improve the observation models. At each iteration of prior tactile knowledge updating, the robot (1) actively selects the next object and the next exploratory action in order to attain a new feature observation; and (2) updates the prior tactile knowledge for the selected exploratory action. The iteration terminates when there is no improvement in the observation models of new objects. Our algorithm is demonstrated by Figure 4.

4.4. What to Transfer

When the robotic system applies an exploratory action on objects, it perceives multiple feature observations (e.g., by the pressing movement, the robot can perceive the object stiffness and thermal conductivity). The prior tactile exploratory action experiences are built using the feature observations of prior objects from multiple sensory modalities that are combined together and the corresponding GPC observation models of prior objects.

In order to combine the observations perceived from different tactile sensors, we first define

v_{α}

as the feature observation of an exploratory action

α

. It is comprised of multiple observations:

v_{α} = [v_{α}^{(1)}, \dots, v_{α}^{(m_{α})}, \dots, v_{α}^{(M_{α})}]

, where

v_{α}^{(m_{α})}

is an observation from the sensor modality

m_{α}

,

M_{α}

is the number of sensing modalities. For the pressing and static contact movements, we use the normal force and temperature sensing, for the sliding movement the accelerometer and temperature sensing (Table 2). Then, we assume that for a sensor modality

m_{α}

, a kernel function

K^{(m_{α})}

is given. To combine multiple feature observations so as to exploit the information from all sensors after applying the exploratory action

α

, we linearly combine the kernels:

K_{α}^{'} = γ_{α}^{(1)} K^{(1)} + \dots + γ_{α}^{(m_{α})} K^{(m_{α})} + \dots + γ_{α}^{(M_{α})} K^{(M_{α})},

(1)

where

γ_{α}^{(m_{α})} \geq 0

. This hyper-parameter controls how much the robot can rely on the sensor modality

m_{α}

. It ranges between 0 and 1, with

γ_{α}^{(m_{α})} = 0

indicating that the sensor feedback is not informative, and

γ_{α}^{(m_{α})} = 1

highly useful. We further constrain these hyper-parameters with

L_{1}

norm:

| \sum_{m_{α = 1}}^{M_{α}} γ_{α}^{(m_{α})} | = 1 .

For each exploratory action, a GPC observation model is built using

K^{'}

. The hyper-parameters of

γ

and kernels are selected by maximizing the log marginal likelihood [46]. Figure 5 illustrates our multiple feature observations combination method. It is also demonstrated by Algorithm 1.

Algorithm 1 Multiple Feature Observations Combination

4.5. How to Transfer

Taking advantage of our previously proposed method [45], the robotic system transfers the feature observations of a prior object

c_{i}^{old}

to learn the GPC observation model of a new object

c_{j}^{new}

, based on an exploratory action

α

. For simplicity, we hereby refer to i and j as

c_{i}^{old}

and

c_{j}^{new}

, respectively. We define

g_{i}^{old}

as the Gaussian Process latent function values [46] for the old object

c_{i}^{old}

and

g_{j}^{new}

for the new object

c_{j}^{new}

. We assume that these two function values are not independent of each other, but are sampled together over a dependent Gaussian Prior (GP). This dependent GP is then used to construct the GPC observation model of the new object. The latent function can be modified accordingly:

g_{j}^{{new}^{'}} \leftarrow [g_{i}^{old}, g_{j}^{new}]

[45]. We further incorporate the relatedness between prior object and new object into the dependent GP model by introducing the following dependent kernel function:

\begin{matrix} K^{'} = [\begin{matrix} K (V_{i}^{old}, V_{i}^{old}) & λ K (V_{i}^{old}, V_{j}^{new}) \\ λ K (V_{j}^{new}, V_{i}^{old}) & K (V_{j}^{new}, V_{j}^{new}) \end{matrix}] . \end{matrix}

(2)

K (V_{i}^{old}, V_{i}^{old})

and

K (V_{j}^{new}, V_{j}^{new})

serve as the kernel matrix that measures the similarity among all feature observations of the old object and the new object, respectively. Each element in the kernel matrix measures the similarity between two feature observations, which is calculated by the radial basis function (RBF).

λ K (V_{j}^{new}, V_{i}^{old})

and

λ K (V_{i}^{old}, V_{j}^{new})

are the kernel matrix between the old object and the new object.

λ

controls the relatedness, or similarity, between

c_{i}^{old}

and

c_{j}^{new}

. We constrain its range within

[0, 1]

. As Chai et al. [47] evaluated,

λ = 0

indicates that the old object and the new object are totally different, while

λ = 1

indicates that the two objects are the same.

4.6. From Where and How Much to Transfer

Section 4.5 describes how to transfer the prior tactile knowledge to learn about new objects. This section illustrates how the robotic system selects the most related old object (from where to transfer) and how to determine the relatedness (

λ

) between two objects (how much to transfer).

To do this, we use our previously proposed method [45]. Let

p (c_{i}^{old} | v_{j}^{new})

be the prediction probability that a feature observation from the new object

v_{j}^{new}

is assigned to the old object

c_{i}^{old}

. We measure the average prediction to all the observations

v_{j}^{new} \in V_{j}^{new}

that belong to the new object:

\bar{p} (c_{i}^{old} | V_{j}^{new}) = \frac{1}{N_{j}^{new}} \sum p (c_{i}^{old} | v_{j}^{new})

, with

N_{j}^{new}

being the number of new object feature observations. This average prediction value indicates the similarity between the old object

c_{i}^{old}

and the new object

c_{j}^{new}

. A larger value indicates that these two objects are highly similar. Therefore, we can use it to select the most related old object (denoted as

c^{{old}^{*}}

) for a new object regarding the exploratory action

α

. Furthermore, to avoid transferring irrelevant tactile information, we add a threshold

ϵ_{n e g}

which prevents the robot from selecting any old object when the prediction value is smaller than

ϵ_{n e g}

. The final old object selection criterion is:

\begin{matrix} c^{{old}^{*}} = \{\begin{matrix} \underset{c_{i}^{old} \in C^{old}}{\arg \max} \bar{p} (c_{i}^{old} | V_{j}^{new}), & if \bar{p} (c^{{old}^{*}} | V_{j}^{new}) \geq ϵ_{n e g} \\ None, & otherwise . \end{matrix} \end{matrix}

(3)

Once we select

c^{{old}^{*}}

, we further use the predictions from the observation model of old objects to determine the object relatedness

λ^{*}

:

λ^{*} = \bar{p} (c^{{old}^{*}} | V^{new})

.

4.7. Prior Exploratory Action Experiences Update

When the robot updates its prior exploratory action experiences, it needs to iteratively collect a new feature observation by applying an exploratory action on an object. We use our previously proposed active tactile learning algorithm [43] called Active Touch for Learning Physical Properties (AT-LPP). Using our AT-LPP approach, the robot actively decides which new feature on the object to explore next (denoted as

c^{{new}^{*}}

) and which physical property to learn next (which exploratory action to apply next). It is denoted as

α^{*})

. In the following, we briefly summarize the AT-LPP algorithm (Algorithm 2) [43].

The robot first calculates the Shannon entropy of the object posterior for a new feature observation

v^{new}

with the equation:

H (c^{new} | v^{new}) = - \sum_{c_{j}^{new} \in C^{new}} p (c_{j}^{new} | v^{new}) \log (p (c_{j}^{new} | v^{new}))

. Then the robot estimates the uncertainty in the GPC model with regard to each exploratory action and new object by the mean value of the Shannon entropy:

UNC (α_{n}, c_{j}) = \frac{1}{N_{α_{n}, j}^{new}} \sum_{v_{α_{n}, j}^{new} \in V_{α_{n}, c_{j}^{new}}^{new}} H (c_{j}^{new} | v_{α_{n}, j}^{new})

, where

v_{α_{n}, j}^{new}

refers to the a feature observation the robot has collected for the new object

c_{j}^{new}

and exploratory action

α_{n}

;

N_{α_{n}, j}^{new}

is the number of feature observations. A large

UNC (α_{n}, c_{j})

indicates that the robot is uncertain about the object feature observations from the exploratory action

α_{n}

. As discussed in [43], an efficient next object and the next action selection process should be considered to greedily reduce such uncertainty while at the same time allowing the robot to explore (exploration-exploitation trade-off). In this regard, the next exploratory action

α^{*}

and the next object

c^{{new}^{*}}

are determined by:

\begin{matrix} \{\begin{matrix} c^{{new}^{*}}, α^{*} = \underset{α_{n} \in A; c_{j}^{new} \in C^{new}}{\arg \max} UNC (α_{n}, c_{j}^{new}) & if p_{rand} \geq ϵ_{explor} \\ c^{{new}^{*}} = U {c_{1}^{new}, c_{2}^{new}, \dots, c_{N_{new}}^{new}}, α^{*} = U {α_{1}, α_{2}, \dots α_{N_{α}}} & otherwise, \end{matrix} \end{matrix}

(4)

where

ϵ_{explor}

is the exploration rate, and

p_{rand}

is randomly generated following the uniform distribution

U (0, 1)

.

Algorithm 2 Active Touch for Learning Physical Properties

Once the robot collects a new feature observation, it updates the prior tactile exploratory action experiences only from action

α^{*}

. This process includes updating the feature observation combination, updating the object relatedness

λ

, and transferring these prior feature observations to the observation models of new objects.

5. Experimental Results

5.1. Experimental Objects

In order to evaluate the performance of the proposed active prior tactile knowledge transfer algorithm (APTKT), we deliberately selected 10 daily objects with different physical properties which served to build the robotic prior to tactile exploratory action experiences (see Figure 1 Prior objects). Furthermore, we selected five new objects about which the robotic system was tasked to learn (Figure 1 New objects). For each new object, there existed one or more old objects that shared similar physical properties. For example, both rough sponge and smooth sponge are soft; paper box and hard box have similar surface textures; metal toolbox and biscuit box have high thermal conductivity. In this way, when learning about new objects based on their physical properties, the robot can leverage the related prior tactile knowledge.

5.2. Exploratory Action Determination and Test Data Collection

In our experiment, we defined seven exploratory actions from the pressing, sliding, and static contact movements with various action parameters (Pressing:

P 1

,

d_{P} = 1

mm,

t_{P} = 3

s;

P 2

,

d_{P} = 2

mm,

t_{P} = 3

s. Sliding:

S 1

,

F_{S} = 0.1

N,

t_{S} = 5

s

v_{S} = 1

cm/s;

S 2

,

F_{S} = 0.1

N,

t_{S} = 1

s,

v_{S} = 5

cm/s;

S 3

,

F_{S} = 0.2

N,

t_{S} = 5

s,

v_{S} = 1

cm/s;

S 4

,

F_{S} = 0.2

N,

t_{S} = 1

s,

v_{S} = 5

cm/s. Static Contact:

C 1

,

d_{C} = 2

mm,

t_{C} = 15

s). Before applying any of the seven exploratory actions, the robot established light contact with the objects which were detected once the total normal force on the artificial skin increased above

0.05

N. Furthermore, after applying an exploratory action, the robot was controlled to raise its end-effector for 30 s such that the temperature sensors could be restored to the ambient temperature.

We evaluated the performance of our proposed method based on a test dataset built by the robot by applying each actions 20 times on each object. During this process, objects were manually shifted and rotated so that the data was robust against the variations in the object contact locations with the artificial skin.

5.3. Evaluation of Multiple Feature Observations Combination Method

We first evaluated the performance of our proposed robotic multiple feature observation combination algorithm. To do this, the robot selected 10 groups of objects (shown in Figure 1) to construct the GPC observation models for each of the seven exploratory actions. Each group contained five objects that were selected randomly both from the old and new object lists, following a uniform distribution. The algorithm performance was evaluated by the discrimination accuracy of the test dataset predicted by the GPC models with the growing number of feature observations. We compared our method with the baseline methods that built the GPC models using only a single sensor modality.

The experiments were conducted 10 times for each object group. For a fair comparison, we used RBF kernel [46] for each sensor modality. Results are plotted in Figure 6. For all seven exploratory actions, our proposed algorithm either took advantage of combining different sensor modalities to reach the best discrimination accuracy (

P 1

,

P 2

,

C 1

,

S 4

in Figure 6), or performed the same as the best single-sensor result (

S 1

,

S 2

,

S 3

in Figure 6), indicating that the robot actively selected the most informative sensory feedback to learn about objects.

5.4. Evaluation of the Transfer Learning Method with Different Groups of Prior Objects

In this experiment, we evaluated the performance of our proposed transfer learning method (APTKT) to learn the five new objects (see new objects in Figure 1) with different groups of prior objects (see prior objects in Figure 1). To start the learning process, the robot applied each of the seven actions once on each new object. When the robot iteratively learned the new objects’ physical properties, it updated both the multiple feature observations combination and the prior tactile knowledge built by the dependent GPC models with all the feature observations collected so far. At each learning iteration, we measured the object discrimination accuracy of the test dataset. The transfer learning performance was compared with the baseline learning method that combined multiple feature observations without transferring any prior tactile knowledge.

We randomly shuffled the prior objects into ten groups following a uniform distribution. Each group consisted of the feature observations and the observation models from three prior objects. We conducted the experiment with five trials for each group. In each trial, the robot followed the transfer learning approach and no-transfer approach to collect 40 feature observations in total, allowing a fair comparison between different learning strategies to be made. Figure 7 illustrates that with the help of prior knowledge, the robot consistently outperformed the learning process without prior knowledge with a discrimination accuracy of

10 %

.

In order to further evaluate the robustness of APTKT, the robot was then tasked to learn about objects via applying only one of the exploratory actions. The experimental procedure was the same as the one described above. As the results in Figure 8 show, The robot had a larger improvement by actions P1, P2 and C1 than actions S1, S2, S3 and S4. For example, the robot increased the discrimination accuracy by

25 %

, when it reused the prior tactile instance knowledge from the movement P2. However, when learning about objects by actions S1 and S4, little improvement was seen. This was due to the fact that different exploratory actions produced different object feature observations. For action P2, there existed higher related prior tactile knowledge than S1 and S4, and the robot could benefit more on it.

In all scenarios, using our proposed transfer learning algorithm, the robot could achieve a higher discrimination accuracy than the baseline method with the same number of feature observations. Therefore, we can conclude that APTKT helps the robot build reliable observation models of new objects with fewer training samples, even when only one kind of exploratory action is applied.

5.5. Increasing the Number of Prior Objects

We further evaluated the performance of our proposed method with an increasing number of prior tactile experiences. Intuitively, as the number of old objects grows, it is more likely that the robot can find highly-related prior tactile knowledge, so that the learning performance can continue to be improved. In this regard, the robot was asked to learn about new objects via all seven exploratory actions, with the number of old objects increasing from 5, 7 to 10. We followed the same experimental procedure described above, and conducted each experiment with five trials. Unexpectedly, as Figure 9b–d show, the growing number of prior tactile knowledge reduced the transfer learning improvement. This was because as the number of prior objects grow, it was more difficult for the robot to classify them. As a result, the object relatedness

λ

predicted by the old object GPC models was lower than the threshold

ϵ_{neg}

, making the robot stop transferring prior knowledge.

To compensate for this, we use our previously proposed feature augmentation trick [45]. We defined

p (c_{i}^{old} | v)

as the prediction probability that a feature observation from the new object

v

is assigned to the old object

c_{i}^{old}

. Then we augmented a feature observation

v

from a new object as:

\begin{matrix} v^{'} = [\underset{original features}{\underset{︸}{v,}} \underset{predictions from old objects^{'} observation models}{\underset{︸}{p (c_{1}^{old} | v), \dots, p (c_{i}^{old} | v), \dots, p (c_{N_{old}}^{old} | x)}}] . \end{matrix}

(5)

The auxiliary features

[p (c_{1}^{old} | v), \dots, p (c_{N_{old}}^{old} | v)]

encode the knowledge of all prior objects. They represent the relatedness between prior objects and the new object, and thus can help the robotic system to distinguish among new objects. Furthermore, since the auxiliary features can be regarded to be perceived from an auxiliary sensor, we directly employed our proposed multiple feature observation combination method to the augmented feature observations by casting a weight

γ

to its kernel. The augmented feature observations were then used to build the new object dependent GPC models.

We tested our proposed feature augmentation technique when the robot leveraged the tactile knowledge of 3, 5, 7, and 10 prior objects to learn about new objects via all seven actions. The learning performance is shown by the green curves in Figure 9a–d. Clearly, by introducing the probability predictions as auxiliary features, the robot was able to reuse the prior tactile knowledge again, and it achieved similar improvement of discrimination accuracy for 3 prior objects, and higher improvement for 5, 7, and 10 prior objects compared to the other methods. Specifically, when resuing 10 prior objects, the robot achieved

20 %

higher discrimination accuracy than the baseline method, when only one new feature observation was collected, showing the one-shot learning behaviour. This experiment also indicates that with a further growing number of prior objects, a further improvement of discrimination accuracy is achievable.

5.6. Negative Prior Tactile Knowledge Transfer Testing

When the constructed prior tactile exploratory action experiences are not relevant to the new objects, a brutal-force transfer may degrade the learning performance, resulting in the negative knowledge transfer phenomena. In this case, the transfer learning algorithm should stop leveraging irrelevant prior knowledge.

In order to evaluate our proposed transfer learning method (APTKT) against the negative tactile knowledge transfer, we deliberately selected irrelevant prior objects and compared the transfer learning performance with the baseline method, following the same experimental process described in Section 5.4. When finding which objects were relevant (or irrelevant) to each other, we built object confusion matrices to roughly evaluate the object similarity. For each of the seven exploratory actions, we trained a Gaussian Mixture Model (GMM) and calculated the object confusion matrix. To do this, we first used GMM to cluster all the samples from the dataset with the hyper-parameters optimized by the Expectation-Maximization (EM) algorithm. The number of clusters was set to be the same as the number of objects (in our case, 15), and each cluster centroid was initialized as the mean value of all data samples that belonged to an object. The maximum EM iterations was set to be 100, with convergence threshold being

0.001

. We further calculated the confusion matrix averaged over all exploratory actions. These matrices indicated the averaged similarity between objects. We rescaled their values to be within 0-1, with 0 meaning that two objects are totally dissimilar, and 1 the same. The objects which had low similarity values with target objects were selected as irrelevant objects. The results are shown in Figure 10. According to Figure 10, prior objects {1, 5, 7} (objects {1–10}) were dissimilar to the new objects (objects {11–15}) regarding the exploratory movement P1, objects {1, 4, 7} for P2, objects {4, 7, 10} for C1, objects {1, 6, 9} for S1, objects {1, 7, 10} for S2, objects {1, 3, 9} for S3, and objects {1, 3, 8} for S4. We thus used these objects as prior objects to test the transfer learning performance via the single exploratory action. We further selected objects {1, 5, 10} to test the learning process via all exploratory actions, since these three objects shared relative small similarity to the new objects.

The results in Figure 11 illustrate that the discrimination accuracy achieved by APTKT was similar to the baseline method, when the robot applied either one or all seven exploratory actions. The results indicate that our proposed algorithm stopped transferring negative prior tactile instance knowledge.

6. Conclusions

In this work, we proposed a transfer learning method for a robot equipped with multi-modal artificial skin to actively reuse the prior tactile exploratory action experiences when learning about the detailed physical properties of new objects. These prior action experiences are built by the feature observations, when the robotic arm applies the pressing, sliding and static contact movements with different action parameters on the previous-explored objects (prior objects). The feature observations are perceived from multiple sensory modalities. Using our proposed tactile transfer learning method, the robot has a "warm start" of the learning process. It applies fewer exploratory actions and gains a detailed tactile knowledge of new objects (e.g., normal force feedback at different pressing depths).

One limitation of our work is that performing static contact movement took 15 s, which prevented the rapid transfer learning. Furthermore, due to the limitations of our artificial skin, the robot can only interact with objects with flat surfaces. In the future, we will extend our method to more exploratory actions (such as tapping and lifting), so that the robot can transfer more exploratory action experiences to learn more physical properties of an object, such as auditory feedback and center of mass. Furthermore, an interesting topic would be how to transfer the prior tactile knowledge across different exploratory actions, e.g., transferring the tactile knowledge from pressing to static contact movement.

Acknowledgments

This work was supported by the German Research Foundation (DFG) and the Technical University of Munich within the Open Access Publishing Funding Programme.

Author Contributions

M.K. and G.C have developed the idea of tactile transfer learning. D.F. and M.K. conceived and designed the experiments; D.F. performed the experiments; D.F., M.K., and G.C analyzed the data and evaluated the experimental results; G.C. has supervised this research as the final responsible of its supporting projects. D.F., M.K., and G.C wrote the paper. M.K. and D.F. contributed equally to this work.

Conflicts of Interest

The authors declare no conflict of interest. The founding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, and in the decision to publish the results.

References

Lederman, S.J.; Klatzky, R.L. Hand movements: A window into haptic object recognition. Cogn. Psychol. 1987, 19, 342–368. [Google Scholar] [CrossRef]
Lederman, S.J.; Klatzky, R.L. Haptic classification of common objects: Knowledge-driven exploration. Cogn. Psychol. 1990, 22, 421–459. [Google Scholar] [CrossRef]
Kaboli, M.; Long, A.; Cheng, G. Humanoids learn touch modalities identification via multi-modal robotic skin and robust tactile descriptors. Adv. Rob. 2015, 29, 1411–1425. [Google Scholar] [CrossRef]
Pugh, K.J.; Bergin, D.A. Motivational influences on transfer. Educational Psychologist 2006, 41, 147–160. [Google Scholar] [CrossRef]
Schunk, D. Learning Theories: An Educational Perspective, 4th ed.; Pearson: Upper Saddle River, NJ, USA, 2004; p. 22. ISBN 0130384968. [Google Scholar]
Cree, V. Transfer of Learning in Professional and Vocational Education; Routledge: Abingdon, UK, 2000; ISBN 0415204186. [Google Scholar]
Ormrod, J.E. Human Learning, 6th ed.; Pearson: Upper Saddle River, NJ, USA, 2012; ISBN 9780132595186. [Google Scholar]
Hung, W. Problem-based learning: A learning environment for enhancing learning transfer. New Directions Adult Continuing Educ. 2004, 137, 27–38. [Google Scholar] [CrossRef]
Choi, S.; Meeuwsen, H.; French, R.; Sherrill, C.; McCabe, R. Motor Skill Acquition, Rentention, and Transfer in Adults with Profound Mental Retardation. Adapted Phys. Act. Q. 2001, 18, 257–272. [Google Scholar] [CrossRef]
Canini, K.R.; Shashkov, M.M.; Griffiths, T.L. Modeling Transfer Learning in Human Categorization with the Hierarchical Dirichlet Process. In Proceedings of the 27th International Conference on Machine Learning (ICML-10), Haifa, Israel, 21–24 June 2010; pp. 151–158. [Google Scholar]
Yogeswaran, N.; Dang, W.; Navaraj, W.T.; Shakthivel, D.; Khan, S.; Polat, E.O.; Gupta, S.; Heidari, H.; Kaboli, M.; Lorenzelli, L.; et al. New materials and advances in making electronic skin for interactive robots. Adv. Rob. 2015, 29, 1359–1373. [Google Scholar] [CrossRef]
Mittendorfer, P.; Cheng, G. Humanoid multimodal tactile-sensing modules. IEEE Trans. Rob. 2011, 27, 401–410. [Google Scholar] [CrossRef]
Jamali, N.; Sammut, C. Material classification by tactile sensing using surface textures. In Proceedings of the 2010 IEEE International Conference Robotics and Automation (ICRA), Anchorage, AK, USA, 3–7 May 2010; pp. 2336–2341. [Google Scholar]
Chu, V.; McMahon, I.; Riano, L.; McDonald, C.G.; He, Q.; Perez-Tejada, J.M.; Arrigo, M.; Fitter, N.; Nappo, J.C.; Darrell, T.; et al. Using robotic exploratory procedures to learn the meaning of haptic adjectives. In Proceedings of the 2013 IEEE International Conference on Robotics and Automation (ICRA), Karlsruhe, Germany, 6–10 May 2013; pp. 3048–3055. [Google Scholar]
Papakostas, T.V.; Lima, J.; Lowe, M. A large area force sensor for smart skin applications. Proc. IEEE. 2002, 2, 1620–1624. [Google Scholar]
Song, A.; Han, Y.; Hu, H.; Li, J. A novel texture sensor for fabric texture measurement and classification. IEEE Trans. Instrum. Meas. 2014, 63, 1739–1747. [Google Scholar] [CrossRef]
Watanabe, K.; Sohgawa, M.; Kanashima, T.; Okuyama, M.; Norna, H. Identification of various kinds of papers using multi-axial tactile sensor with micro-cantilevers. In Proceedings of the World Haptics Conference (WHC), Daejeon, Korea, 14–18 April 2013; pp. 139–144. [Google Scholar]
Kaboli, M.; Mittendorfer, P.; Hügel, V.; Cheng, G. Humanoids learn object properties from robust tactile feature descriptors via multi-modal artificial skin. In Proceedings of the 14th IEEE International Conference on Humanoid Robots (Humanoids), Madrid, Spain, 18–20 November 2014; pp. 187–192. [Google Scholar]
Friedl, K.E.; Voelker, A.R.; Peer, A.; Eliasmith, C. Human-inspired neurorobotic system for classifying surface textures by touch. IEEE Rob. Autom. Lett. 2016, 1, 516–523. [Google Scholar] [CrossRef]
Kaboli, M.; Cheng, G. Robust Tactile Descriptors for Discriminating Objects from Textural Properties via Artificial Robotic Skin. IEEE Trans. Rob. 2018, 9, 1–19. [Google Scholar]
Kaboli, M.; Rosa, A.D.L.T.; Walker, R.; Cheng, G. In-hand object recognition via texture properties with robotic hands, artificial skin, and novel tactile descriptors. In Proceedings of the IEEE International Conference on Humanoid Robots (Humanoids), Seoul, South Korea, 3–5 November 2015; pp. 1155–1160. [Google Scholar]
Bhattacharjee, T.; Wade, J.; Kemp, C. Material Recognition from Heat Transfer given Varying Initial Conditions and Short-Duration Contact. In Proceedings of the Robotics: Science and Systems, Rome, Italy, 13–17 July 2015; pp. 1–6. [Google Scholar]
Yao, K.; Kaboli, M.; Cheng, G. Tactile-based Object Center of Mass Exploration and Discrimination. In Proceedings of the IEEE International Conference on Humanoid Robots (Humanoids), Birmingham, UK, 15–17 November 2017; pp. 1–6. [Google Scholar]
Bhattacharjee, T.; Rehg, M.J.; Kemp, C. Inferring Object Properties with a Tactile Sensing Array Given Varying Joint Stiffness and Velocity. Int. J. Humanoid Rob. 2017, 14, 1–32. [Google Scholar] [CrossRef]
Zhang, M.M.; Atanasov, N.; Daniilidis, K. Active end-effector pose selection for tactile object recognition through monte carlo tree search. In Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 24–28 September 2017; pp. 3258–3265. [Google Scholar]
Martinez-Hernandez, U.; Dodd, T.J.; Prescott, T.J. Feeling the shape: Active exploration behaviors for object recognition with a robotic hand. IEEE Trans. Syst. Man Cybern. Syst. 2017, 99, 1–10. [Google Scholar] [CrossRef]
Xu, D.; Loeb, G.E.; Fishel, J.A. Tactile identification of objects using Bayesian exploration. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Karlsruhe, Germany, 6–10 May 2013; pp. 3056–3061. [Google Scholar]
Schneider, A.; Sturm, J.; Stachniss, C.; Reisert, M.; Burkhardt, H.; Burgard, W. Object identification with tactile sensors using bag-of-features. In Proceedings of the IEEE RSJ International Conference on Intelligent Robots and Systems, St. Louis, MO, USA, 10–15 October 2009; pp. 243–248. [Google Scholar]
Lepora, N.F.; Martinez-Hernandez, U.; Prescott, T.J. Active touch for robust perception under position uncertainty. In Proceedings of the 2013 IEEE International Conference on Robotics and Automation (ICRA), Karlsruhe, Germany, 6–10 May 2013; pp. 3020–3025. [Google Scholar]
Fishel, J.A.; Loeb, G.E. Bayesian exploration for intelligent identification of textures. Front. Neurorobotics 2012, 6, 1–20. [Google Scholar]
Saal, H.; Ting, J.A.; Vijayakumar, S. Active sequential learning with tactile feedback. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Taipei, Taiwan, 18–22 October 2010; pp. 677–684. [Google Scholar]
Tanaka, D.; Matsubara, T.; Sugimoto, K. An optimal control approach for exploratory actions in active tactile object recognition. In Proceedings of the 2014 IEEE-RAS International Conference on Humanoid Robots, Madrid, Spain, 18–20 November 2014; pp. 787–793. [Google Scholar]
Guo, H.L.; Zhang, L.; Su, Z. Empirical study on the performance stability of named entity recognition model across domains. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, 22–23 July 2006; pp. 509–516. [Google Scholar]
Yang, Q. Activity Recognition: Linking Low-level Sensors to High-level Intelligence. IJCAI 2009, 9, 20–25. [Google Scholar]
Tommasi, T.; Orabona, F.; Caputo, B. Safety in numbers: Learning categories from few examples with multi model knowledge transfer. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA, 13–18 June 2010; pp. 3081–3088. [Google Scholar]
Tommasi, T.; Orabona, F.; Kaboli, M.; Caputo, B.; Martigny, C. Leveraging over prior knowledge for online learning of visual. In Proceedings of the British Machine Vision Conference categories, Guildford, UK, 3–7 September 2012; pp. 1–8. [Google Scholar]
Kaboli, M. Leveraging over Prior Knowledge for Online Learning of Visual Categories across Robots. Thesis Dissertation, The Royal Institute of Technology (KTH), Stockholm, Sweden, 2012. [Google Scholar]
Rodner, E.; Denzler, J. One-shot learning of object categories using dependent gaussian processes. In Joint Pattern Recognition Symposium; Springer: Berlin/Heidelberg, Germany, 2010; Volume 637, pp. 232–241. [Google Scholar]
Yang, X.; Kim, S.; Xing, E.P. Heterogeneous multitask learning with joint sparsity constraints. In Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada, 7–10 December 2009; pp. 2151–2159. [Google Scholar]
Kaboli, M.; Walker, R.; Cheng, G. Re-using prior tactile experience by robotic hands to discriminate in-hand objects via texture properties. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Stockholm, Sweden, 16–21 May 2016; pp. 2242–2247. [Google Scholar]
Kaboli, M.; Cheng, G. Novel Tactile Descriptors and a Tactile Transfer Learning Technique for Active In-Hand Object Recognition via Texture Properties. In Proceedings of the IEEE International Conference on Humanoid Robots (Humanoids), Cancun, Mexico, 15 November 2016; pp. 1–6. [Google Scholar]
Kaboli, M.; Cheng, G. Dexterous hands learn to re-use the past experience to discriminate in-hand objects from the surface texture. In Proceedings of the 33rd Annual Conference of the Robotics Society of Japan, Tokyo, Japan, 3–5 September 2015; pp. 1–6. [Google Scholar]
Kaboli, M.; Di, F.; Kunpeng, Y.; Pablo, L.; Cheng, G. A Tactile-based Framework for Active Object Learning and Discrimination using Multi-modal Robotic Skin. IEEE Rob. Autom. Lett. 2017, 2, 2143–2150. [Google Scholar] [CrossRef]
Kaboli, M.; Kunpeng, Y.; Di, F.; Cheng, G. Tactile-based active object discrimination and target object search in an unknown workspace. Autonom. Rob. 2018, 2, 1–35. [Google Scholar] [CrossRef]
Kaboli, M.; Di, F.; Cheng, G. Active Tactile Transfer Learning for Object Discrimination in an Unstructured Environment using Multimodal Robotic Skin. Int. J. Humanoid Rob. 2017, 15, 1–27. [Google Scholar] [CrossRef]
Rasmussen, C.E.; Williams, C.K.I. Gaussian processes for machine learning; MIT Press: Cambridge/London, UK, 2006. [Google Scholar]
Chai, K.M. Generalization errors and learning curves for regression with multi-task Gaussian processes. In Advances in neural information processing systems; MIT Press: Cambridge/London, UK, 2009; pp. 279–287. [Google Scholar]

Figure 1. The robot leverages the prior tactile exploratory action experiences built by applying the pressing, sliding, and static contact movements with different action parameters on the prior objects (with index #1–#10) to learn about new objects’ (with index #1–#5) physical properties. The feature observations of prior objects (prior tactile instance knowledge) were used to transfer the action experiences.

Figure 2. (a) The robotic arm equipped with a multi-modal artificial skin; (b) The multi modal artificial skin.

Figure 3. The figure visualizes multiple exploratory actions. (a) The pressing movement defined by the action parameters

d_{P}

and

t_{P}

; (b) The sliding movement with action parameters

v_{S}

,

F_{S}

, and

t_{S}

; (c) The static contact movement defined by

d_{C}

and

t_{C}

.

Figure 3. The figure visualizes multiple exploratory actions. (a) The pressing movement defined by the action parameters

d_{P}

and

t_{P}

; (b) The sliding movement with action parameters

v_{S}

,

F_{S}

, and

t_{S}

; (c) The static contact movement defined by

d_{C}

and

t_{C}

.

Figure 4. Flowchart of the Active Prior Tactile Knowledge Transfer algorithm.

Figure 5. Illustration of multiple feature observations combination method. (a) The robotic system combines the normal force sensing and temperature sensing to learn about objects by applying pressing and static contact movements; (b) The robot slides on the object surface to sense its textural property and thermal conductivity.

Figure 6. Multiple feature observations combination results for exploratory actions

P 1

,

P 2

,

C 1

,

S 1

,

S 2

,

S 3

,

S 4

and the averaged result. STIF: building the GPC observation model based on object stiffness; Thermal-C: thermal conductivity; Texture: object surface textural properties; Multi: combined feature observations. The horizontal axis represents the number of feature observations. The vertical axis represents the discrimination accuracy of the test dataset.

Figure 6. Multiple feature observations combination results for exploratory actions

P 1

,

P 2

,

C 1

,

S 1

,

S 2

,

S 3

,

S 4

and the averaged result. STIF: building the GPC observation model based on object stiffness; Thermal-C: thermal conductivity; Texture: object surface textural properties; Multi: combined feature observations. The horizontal axis represents the number of feature observations. The vertical axis represents the discrimination accuracy of the test dataset.

Figure 7. Transferring the exploratory actions experiences from three prior objects. The small plots show the learning process from 10 groups of old objects. The large plot on the right shows the averaged results. Horizontal axis: the growing number of feature observations the robot collected. Vertical axis: the discrimination accuracy of the test dataset.

Figure 8. Transfer learning using only one exploratory action.

Figure 9. Increasing the number of prior objects from 3, 5, 7 to 10, and comparing the performance of different learning methods. Red: baseline method; Blue: the proposed active prior tactile knowledge transfer method (APTKT) without auxiliary features; Green: APTKT with auxiliary features.

Figure 10. object confusion matrices (value normalized between 0 and 1) for each exploratory action and the average. The blue indices represent the old objects. The red indices represent the new objects, with #11–#15 indicating new objects #1–#5. Best viewed in magnification.

Figure 11. Negative prior tactile knowledge transfer testing. The prior objects that were unrelated to the new objects were deliberately selected.

Table 1. Technical information of sensors in the artificial skin ([12]).

Type	Sensor	Range	Accuracy	Resolution
Proximity	VCNL4010	$200 mm$	N.A.	$0.25$ lx
Acceleration	BMA250	$\pm 2 g$	$256 LSB / g$	$3.91$ mg
Temperature	LM71	−40–150 °C	$\pm 1.5^{\circ} C$	$31.25 m^{\circ} C$
Normal force	customized	> $10 N$	$0.05 N$	N.A.

Table 2. Exploratory actions and perception.

Exploratory actions	Action Parameters ( $θ$ )	Sensory feedbacks	Features
Pressing	$d_{P}$ , $t_{P}$	$F$ , $T$	$\bar{F}$ , $[\bar{T}, \nabla \bar{T}]$
Sliding	$F_{S}$ , $t_{S}$ , $v_{S}$	$a$ , $T$	$T D$ , $[\bar{T}, \nabla \bar{T}]$
Static contact	$d_{C}$ , $t_{C}$	$F$ , $T$	$\bar{F}$ , $[\bar{T}, \nabla \bar{T}]$

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Feng, D.; Kaboli, M.; Cheng, G. Active Prior Tactile Knowledge Transfer for Learning Tactual Properties of New Objects. Sensors 2018, 18, 634. https://doi.org/10.3390/s18020634

AMA Style

Feng D, Kaboli M, Cheng G. Active Prior Tactile Knowledge Transfer for Learning Tactual Properties of New Objects. Sensors. 2018; 18(2):634. https://doi.org/10.3390/s18020634

Chicago/Turabian Style

Feng, Di, Mohsen Kaboli, and Gordon Cheng. 2018. "Active Prior Tactile Knowledge Transfer for Learning Tactual Properties of New Objects" Sensors 18, no. 2: 634. https://doi.org/10.3390/s18020634

APA Style

Feng, D., Kaboli, M., & Cheng, G. (2018). Active Prior Tactile Knowledge Transfer for Learning Tactual Properties of New Objects. Sensors, 18(2), 634. https://doi.org/10.3390/s18020634

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Active Prior Tactile Knowledge Transfer for Learning Tactual Properties of New Objects

Abstract

1. Introduction

1.1. Motivation

1.2. Background

1.3. Contribution

2. System Description

2.1. Multi-modal Artificial Skin

2.2. UR10 Robotic Arm

3. Exploratory Actions and Perception

3.1. Exploratory Actions Definition

3.1.1. Pressing

3.1.2. Sliding

3.1.3. Static Contact

3.2. Object Physical Properties Perception

3.2.1. Stiffness

3.2.2. Textural Property

3.2.3. Thermal Conductivity

4. Transferring Prior Tactile Exploratory Action Experiences

4.1. Problem Formulation

4.2. Gaussian Process Classification

4.3. Process

4.4. What to Transfer

4.5. How to Transfer

4.6. From Where and How Much to Transfer

4.7. Prior Exploratory Action Experiences Update

5. Experimental Results

5.1. Experimental Objects

5.2. Exploratory Action Determination and Test Data Collection

5.3. Evaluation of Multiple Feature Observations Combination Method

5.4. Evaluation of the Transfer Learning Method with Different Groups of Prior Objects

5.5. Increasing the Number of Prior Objects

5.6. Negative Prior Tactile Knowledge Transfer Testing

6. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI