Article

Neuromorphic Tactile Edge Orientation Classification in an Unsupervised Spiking Neural Network

by Fraser L. A. Macdonald 1,2,*, Nathan F. Lepora 1,2, Jörg Conradt 3 and Benjamin Ward-Cherrier 1,2

1 Department of Engineering Mathematics, University of Bristol, Bristol BS8 1TW, UK
2 Bristol Robotics Laboratory, University of the West of England, Bristol BS34 8QZ, UK
3 School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, 114 28 Stockholm, Sweden
* Author to whom correspondence should be addressed.
Sensors 2022, 22(18), 6998; https://doi.org/10.3390/s22186998
Submission received: 31 July 2022 / Revised: 24 August 2022 / Accepted: 9 September 2022 / Published: 15 September 2022
(This article belongs to the Special Issue Advances in Bio-Inspired Skin-Like Sensor Technologies)

Abstract

Dexterous manipulation in robotic hands relies on an accurate sense of artificial touch. Here we investigate neuromorphic tactile sensing with an event-based optical tactile sensor combined with spiking neural networks for edge orientation detection. The sensor incorporates an event-based vision system (mini-eDVS) into a low form factor artificial fingertip (the NeuroTac). Tactile information is processed through a Spiking Neural Network with unsupervised Spike-Timing-Dependent Plasticity (STDP) learning, and the resultant output is classified with a 3-nearest neighbours classifier. Edge orientations were classified in 10° increments while tapping vertically downward onto and sliding horizontally across the edge. In both cases, we demonstrate that the system reliably detects edge orientation, and could lead to accurate, bio-inspired tactile processing in robotics and prosthetics applications.

1. Introduction

Research in artificial tactile sensing is crucial to enable dexterous manipulation with robotic hands. The use of bio-inspired sensors and methods can also lead to advances in our understanding of biological touch and to the development of systems that can interact more seamlessly with biological systems, an important consideration for prosthetic devices.
This research utilises the NeuroTac, a neuromorphic optical tactile sensor comprising an event-based camera recording events from a compliant skin (adapted from the original TacTip [1]). The biomimetic nature of the sensor's skin structure and its spike-based data output render it particularly well suited to the detection of edges. We used the NeuroTac to interact with an edge of 30° sharpness by tapping vertically onto it and sliding horizontally across it at different sensor roll angles. The data were processed in a 3-layered spiking neural network with unsupervised learning, with the network output fed to a 3-nearest neighbours classifier to ascertain the angle of the edge relative to the sensor. This research seeks to develop biomimetic algorithms and sensors for artificial touch, leading to advances in robotic manipulation and prosthetics.

1.1. Neuromorphic Tactile Sensors

Tactile sensors come in a wide variety of shapes and sizes, with underlying technologies that vary widely and are often tailored to specific tasks. Examples of existing tactile sensor technologies include capacitive sensors [2], piezoelectric sensors [3], piezoresistive sensors [4], quantum tunnelling composites [5], and optical sensors [1,6].
Camera-based optical tactile sensors, as we use here, have the advantages of being high-resolution and able to exploit computer vision algorithms for data processing. Our sensor is based on the TacTip [1], a well-established tactile sensor which has been applied to a wide range of tactile tasks [7,8].
Rather than use a traditional frame-based camera, the version of the sensor used in this study is neuromorphic. Neuromorphic sensors output spike-based data, which more closely mimics biological signals, and have even been used to interface directly with the human nervous system to create prosthetic eyes [9]. One of our long-term research goals is to replicate that success in artificial vision for artificial touch. Neuromorphic systems also have the advantages of low latency, power efficiency, and high temporal resolution (events are timestamped with microsecond precision, i.e., in the MHz range) [10].
Although other neuromorphic tactile sensors do exist, they tend to have relatively low spatial resolution [11,12], whereas the NeuroTac (Figure 1) exploits recent developments in event-based cameras [13] to produce a high spatial and temporal resolution spike-based output.

1.2. Tactile Sensing and Edge Detection

In order to interact with the environment through touch, an agent requires continuous information, immediate feedback and limited noise [14]. In humans, fine motor interaction with tactile stimuli, mediated by the mechanoreceptors in our fingers, occurs almost seamlessly and without conscious cognitive effort [15]. However, current robotic systems lack the level of feedback and control required for human-like dexterous manipulation.
To achieve this level of feedback, tactile sensing technologies have been applied to tasks such as texture detection [16,17], slip detection [18,19], and object recognition [20], all useful parts of the tactile interactions necessary for dexterous manipulation.
In humans, edges and vertices are the most salient local features of a 3D shape, and are key to the identification and in-hand manipulation of complex objects [21]. Edge detection is, therefore, an important feature for artificial tactile systems within robot hands or prosthetics [22]. The current study aims to tackle this facet of tactile sensing and focuses on the discrimination of edge orientations with a neuromorphic artificial tactile sensor, the NeuroTac.
Other studies investigating edge orientation with neuromorphic tactile sensors have used different approaches, such as spatiotemporal correlation matching of spike outputs [23] or delays in a spiking neural network for coincidence detection [16]. Another recent study created a three-layer network emulating peripheral biological circuits [24]. This study distinguishes itself from those articles through our use of a high-resolution optical tactile sensor emulating fast-adapting mechanoreceptor output and our focus on a shallow feed-forward spiking neural network to process spikes. We focus on a simple network to enable its future integration on lightweight embedded systems for prosthetics, and take inspiration from biology, with evidence suggesting that edge orientation is processed in first-order tactile neurons at the periphery of the nervous system [25].

1.3. Spiking Neural Networks

The human brain processes information as trains of spikes. Although the most biologically plausible and efficient method to encode information as spike trains is still an open question in neuroscience [26], there is a general consensus that spiking neural networks (SNNs) are currently the most biologically accurate neural network models of the brain.
SNNs have been applied to a number of problems in robotics, including robot control [27] and navigation [28], and numerous applications in robotic vision, including convolutional versions of SNNs [29,30]. SNNs efficiently process sparse data, which leads to low computational and power requirements, as demonstrated by recent benchmarking efforts [31]. Here we aim to exploit these properties of SNNs and focus on a simple feed-forward network structure for the eventual integration of neuromorphic embedded systems in prosthetic and robotic hands.
A number of strategies have been proposed to implement learning in SNNs through supervised methods, which replace traditional back-propagation, such as ReSuMe [32], SpikeProp [33], or SLAYER [34]. Here, our aim is to focus on low complexity and biological plausibility. As such, we apply a synaptic weight update rule known as spike-timing-dependent plasticity (STDP). STDP is a method of unsupervised learning in which synaptic weights are strengthened or weakened based on the temporal correlation of input and output spikes in a neuron [35].

2. Materials and Methods

2.1. Hardware—The NeuroTac Sensor

The NeuroTac sensor design is based on the TacTip sensor [1], a small rubber dome encompassing pins submerged in transparent silicone gel (Techsil, RTV27905), in which pin displacements convey tactile information. In contrast with the original TacTip sensor, the NeuroTac [36] integrates an event-based camera (Figure 1), which fundamentally changes the nature of the sensor's output and its associated algorithms for perception. The use of Dynamic Vision Sensors in the NeuroTac leads to an event-based output analogous to biological spikes.
The version of the NeuroTac developed here features a lower form factor than its predecessor [36] (≈30 mm diameter, ≈20 mm height), enabling easier integration into robotic or prosthetic hands. This miniaturisation was enabled, in part, by the use of a lower form-factor event-based Dynamic Vision Sensor (mini-eDVS) camera [37]. The mini-eDVS records local changes in luminance at the pixel level, with each pixel outputting a microsecond-precision event once the change crosses a threshold. In the NeuroTac, events are produced when the biomimetic pins within the compliant skin are displaced through contact.

2.2. Experimental Setup and Procedure

The NeuroTac sensor is attached to the end of a 4-axis robot arm (Dobot Magician, M0651) with the edge stimulus fixed to an optical breadboard beneath the robot (Figure 2). The Dobot Magician has a repeatability of 0.2 mm, so we may expect some minor noise during data collection. The Dobot robot arm is controlled through a Python wrapper on the 'common robot interface' framework developed at Bristol Robotics Laboratory (github.com/jlloyd237/cri (accessed on 30 July 2022)). Data are gathered from the sensor through iniVation's DV graphical user interface (inivation.com/dvp/dvsoftware/ (accessed on 30 July 2022)), which streams events over TCP/IP to a Python script. We explored two different methods of interaction between the NeuroTac and the edge stimulus: the first is a vertical tapping motion onto the edge (referred to as Tap), and the second a horizontal movement across the stimulus (referred to as Slide).
Data collection proceeds in a similar manner for both conditions. Prior to the commencement of each trial, the tip rests 2 mm above the stimulus. In Tap trials, the NeuroTac is centred in the XY plane over the centre point of the stimulus. In Slide trials, the NeuroTac is centred in X but offset by 8 mm in Y. In the Tap condition, the NeuroTac descends to a point 2 mm below the apex of the stimulus in Z before returning to its resting position above, concluding the trial. In the Slide condition, the Dobot descends to a point 2 mm below the apex of the stimulus in Z, then moves 20 mm in the Y direction, sliding the NeuroTac across the stimulus, before being raised to its initial Z coordinate and returned to its resting position. Orientations are gathered from 0–170° in steps of 10°, with 10 trials at each orientation.
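For concreteness, the two collection loops could be scripted roughly as below. This is a minimal sketch, not the authors' script: the (x, y, z, roll) pose format, the move_linear-style call, and the start_recording/stop_recording helpers are assumptions made for illustration.

```python
# Illustrative sketch of the Tap and Slide trial procedures (hypothetical API).
REST_Z = 2.0        # tip rests 2 mm above the stimulus apex
CONTACT_Z = -2.0    # contact depth: 2 mm below the apex
ANGLES = range(0, 180, 10)   # 0-170 degrees in steps of 10
TRIALS_PER_ANGLE = 10

def tap_trial(robot, sensor, angle):
    """One Tap trial: press 2 mm onto the edge, record events, retract."""
    robot.move_linear((0.0, 0.0, REST_Z, angle))       # centred over the stimulus
    sensor.start_recording()
    robot.move_linear((0.0, 0.0, CONTACT_Z, angle))    # descend onto the edge
    robot.move_linear((0.0, 0.0, REST_Z, angle))       # retract
    return sensor.stop_recording()                     # list of (x, y, t_us) events

def slide_trial(robot, sensor, angle):
    """One Slide trial: offset 8 mm in Y, descend, sweep 20 mm across the edge."""
    robot.move_linear((0.0, -8.0, REST_Z, angle))
    sensor.start_recording()
    robot.move_linear((0.0, -8.0, CONTACT_Z, angle))
    robot.move_linear((0.0, 12.0, CONTACT_Z, angle))   # 20 mm sweep in Y
    robot.move_linear((0.0, 12.0, REST_Z, angle))
    return sensor.stop_recording()
```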

2.3. Spiking Neural Network Architecture

Data output from the NeuroTac sensor is processed through a spiking neural network. The network consists of three layers: an input layer, a hidden layer, and an output layer (Figure 3). The input layer is a direct representation of the data output by the NeuroTac; no conversion to spikes is needed thanks to the asynchronous event-based output of the sensor. The hidden layer implements a spike-based version of pooling, with its neurons having circular receptive fields of a set diameter in the input layer (pooling factor pf = 16 in the optimised network). The output layer in the optimised network consists of 100 neurons whose firing rates are used to classify edge orientation.

2.3.1. Input Layer

The first layer of the network consists of 128 × 128 neurons, totalling 16,384 neurons, one for each pixel of the NeuroTac sensor's output. The neurons are arranged in a 128 × 128 cm grid, with 1 cm spacing between neurons. The centimetre unit here is nominal and serves only to express the relative positions of neurons across grid-spaced layers.
The neural network operates in the millisecond range, which leads to duplicate timestamps for some of the microsecond-range events output by the sensor. Implementing a refractory period ensures that these duplicates are removed from the input layer. This layer's output consists of spike event information transmitted as lists of varying lengths denoting the activation times of individual neurons.
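A minimal sketch of this conversion step is shown below. The (x, y, timestamp) event tuple format and the 1 ms refractory period are assumptions for illustration.

```python
def events_to_spike_trains(events, side=128, refrac_ms=1.0):
    """Convert microsecond AER events into per-neuron millisecond spike time
    lists, dropping spikes that fall within a refractory period of the last."""
    spike_trains = [[] for _ in range(side * side)]
    for x, y, t_us in events:
        n = y * side + x                 # pixel -> input neuron index
        t_ms = t_us / 1000.0             # microseconds -> milliseconds
        if not spike_trains[n] or t_ms - spike_trains[n][-1] >= refrac_ms:
            spike_trains[n].append(t_ms)
    return spike_trains
```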

2.3.2. Hidden Layer

The hidden layer operates as a spike-based pooling layer. Neurons in this layer are spatially distributed to cover the grid of neurons in the input layer. For instance, 64 neurons in this layer would be arranged in an 8 × 8 grid with 16 cm spacing. Each neuron is then assigned a circular receptive field with diameter equal to the downsampling factor between input layer neurons and pooling layer neurons (for 64 neurons in the hidden layer, the receptive field diameter would be 16 cm). Note that this entails some overlap between the receptive fields of neighbouring hidden layer neurons.
The weight of synaptic connections between the input and hidden layer is fixed at w = 0.2.
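This connectivity can be generated as in the sketch below, which follows the worked example above (an input grid downsampled so that grid spacing and receptive field diameter both equal pf); the function name and the exact centring convention are our own assumptions.

```python
import numpy as np

def pooling_connections(input_side=128, pf=16, weight=0.2):
    """Build (pre, post, weight) connections from an input grid to a pooled
    hidden grid, with circular receptive fields of diameter pf."""
    hidden_side = input_side // pf                 # e.g. 8 per side for pf = 16
    radius = pf / 2.0
    ys, xs = np.mgrid[0:input_side, 0:input_side]  # input neuron coordinates
    conns = []
    for hy in range(hidden_side):
        for hx in range(hidden_side):
            cx, cy = (hx + 0.5) * pf, (hy + 0.5) * pf     # hidden neuron centre
            mask = (xs - cx) ** 2 + (ys - cy) ** 2 <= radius ** 2
            post = hy * hidden_side + hx
            for y, x in zip(*np.nonzero(mask)):
                conns.append((y * input_side + x, post, weight))
    return conns
```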

2.3.3. Output Layer

The output layer contains a vector of neurons (n_output = 100 in the optimised network), each receiving input from 16–32 random neurons in the hidden layer. We randomise the initial weights of synaptic connections between the hidden and output layers in the range w = 0–0.7 and apply STDP (see Section 2.5) to these weights during learning.
The spike rates of the output neurons are then fed to a 3-nearest neighbours classifier to determine edge orientation.
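As an illustration of this final step, the sketch below classifies per-trial spike-count vectors with scikit-learn's k-nearest neighbours classifier (k = 3); the Poisson-distributed data merely stands in for real trials.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

# X: one row per trial, columns are output-neuron spike counts (100-dim here);
# y: edge orientation labels in degrees (18 orientations x 10 trials).
rng = np.random.default_rng(0)
X = rng.poisson(5.0, size=(180, 100))           # placeholder spike counts
y = np.repeat(np.arange(0, 180, 10), 10)

knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X, y)
print(knn.predict(X[:3]))                        # predicted orientations (degrees)
```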

2.4. Neuron Model

The neuron model used in our SNN is the adaptive exponential integrate-and-fire (IAF) model described by Brette and Gerstner [38], built using the PyNN frontend [39] to the NEST simulator [40]. This model strikes a balance between computational efficiency and the complexity of neuron behaviour.
The model is defined by the following equations:
$$C \frac{dV}{dt} = -g_L (V - E_L) + g_L \Delta_T \exp\!\left(\frac{V - V_T}{\Delta_T}\right) - g_e(t)(V - E_e) - g_i(t)(V - E_i) - w + I_e$$

where V is the membrane potential, C is the membrane capacitance, g_L is the leak conductance, E_L is the leak reversal potential, V_T is the spike threshold, and Δ_T is the slope factor; g_e(t) and g_i(t) are the excitatory and inhibitory synaptic conductances, with reversal potentials E_e and E_i. The spike-adaptation current w is defined by

$$\tau_w \frac{dw}{dt} = a (V - E_L) - w$$

where a is the subthreshold adaptation conductance.
For more details on the neuron model, we refer the reader to the original work defining it [38]. Table 1 lists the basic parameters of the model as used here, which determine its spiking behaviour.
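The Table 1 values map onto PyNN's adaptive exponential integrate-and-fire cell type roughly as in the sketch below. This is a minimal illustration: the adaptation parameters not listed in Table 1 (a, b, tau_w) are left at PyNN defaults here, which may differ from the configuration used in this work.

```python
import pyNN.nest as sim

sim.setup(timestep=1.0)   # millisecond-scale simulation

# Table 1 values expressed with PyNN's EIF_cond_exp_isfa_ista parameter names.
cell_params = {
    "v_rest": -60.0,      # resting membrane potential (mV)
    "cm": 1.0,            # membrane capacitance (nF)
    "tau_m": 10.0,        # membrane time constant (ms)
    "tau_syn_E": 5.0,     # decay time of excitatory synaptic current (ms)
    "v_reset": -60.0,     # reset potential (mV)
    "v_thresh": -50.0,    # spike threshold (mV)
    "tau_refrac": 5.0,    # refractory period (ms)
}
output_layer = sim.Population(100, sim.EIF_cond_exp_isfa_ista(**cell_params))
```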

2.5. Unsupervised Learning

As mentioned in the previous section, STDP is implemented between the hidden and output layers of the network—a method of optimising learning efficiency by learning patterns of correlation between pre- and post-synaptic spike events [41]. The STDP function here is denoted as
$$\Delta w_j = \sum_{f=1}^{N} \sum_{n=1}^{N} W(t_i^n - t_j^f)$$
in which Δw_j is the weight change for the synapse from pre-synaptic neuron j, and t_j^f and t_i^n represent the spike times of the pre- and post-synaptic neurons, respectively. The STDP learning window W is defined as follows:
$$W(x) = \begin{cases} A_+ \, \exp(-x/\tau_+), & \text{for } x > 0 \\ -A_- \, \exp(x/\tau_-), & \text{for } x < 0 \end{cases}$$
where A_+ determines the learning rate for weight strengthening when pre-synaptic spikes precede post-synaptic spikes, and A_− determines the learning rate for weight weakening in the opposite case. τ_+ and τ_− are the time constants of the positive and negative portions of the STDP curve, both set to 20 ms in our network. We also set hard bounds on the weights within the network: w_max (which we optimise) and w_min = 0.
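A direct numerical reading of these equations is sketched below; the default rates are taken from the optimised values in Table 2, and the pairwise accumulation helper is our own illustrative wrapper.

```python
import numpy as np

def stdp_dw(dt_ms, A_plus=0.036, A_minus=0.073, tau_plus=20.0, tau_minus=20.0):
    """Weight change for one pre/post spike pair, dt_ms = t_post - t_pre,
    following the window W(x) above."""
    if dt_ms > 0:      # pre before post: potentiation
        return A_plus * np.exp(-dt_ms / tau_plus)
    elif dt_ms < 0:    # post before pre: depression
        return -A_minus * np.exp(dt_ms / tau_minus)
    return 0.0

def apply_stdp(w, pre_times, post_times, w_min=0.0, w_max=4.75):
    """Accumulate pairwise updates and clip to the hard weight bounds."""
    dw = sum(stdp_dw(tp - tf) for tf in pre_times for tp in post_times)
    return float(np.clip(w + dw, w_min, w_max))

print(apply_stdp(0.5, pre_times=[10.0, 30.0], post_times=[12.0, 29.0]))
```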

3. Results

3.1. Inspection of Data

The NeuroTac sensor outputs raw data in the form of events analogous to biological spikes. In this section, we inspect examples of raw data for the Tap and Slide experiments and visualise the distribution of each dataset using t-distributed stochastic neighbour embedding (t-SNE).

3.1.1. Raw Data

As mentioned in Section 2.1, the NeuroTac sensor outputs event-based data. The data are in the Address-Event Representation (AER) format, which contains the pixel location and microsecond timestamp of each recorded event. Converting these timestamps to milliseconds and treating each pixel as a neuron allows us to represent the sensor output as a spike raster plot akin to neurological data (Figure 4).
From these data, we can observe that tapping contact occurs approximately between 200 and 600 ms and sliding contact between 1000 and 1400 ms. The sliding spike raster plot (Figure 4b) also presents a distinct pattern, with a decrease in overall spike activity towards the midpoint of contact. This pattern is likely due to the way the NeuroTac's internal pins are deflected as the sensor moves across the edge stimulus.

3.1.2. Data Clustering

Here we perform a rapid visualisation of the clustering of our raw data for the tapping (Figure 5a) and sliding (Figure 5b) datasets using t-SNE [42]. t-SNE enables high-dimensional datasets to be reduced into low-dimensional graphical representations of the data, presenting clusters of data points that share similarities. For both datasets, the input space for t-SNE is the number of events per pixel; further parameters were set as perplexity = 20, learning rate = 400, and max iterations = 1000.
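With scikit-learn, this visualisation step looks roughly as follows; the Poisson data is a placeholder for the real events-per-pixel matrix, and note that the iteration argument is named n_iter in older scikit-learn releases and max_iter in newer ones.

```python
import numpy as np
from sklearn.manifold import TSNE

# X: one row per trial, one column per pixel (events-per-pixel counts).
rng = np.random.default_rng(0)
X = rng.poisson(2.0, size=(180, 128 * 128)).astype(float)   # placeholder data

embedding = TSNE(
    n_components=2, perplexity=20, learning_rate=400, n_iter=1000, init="random"
).fit_transform(X)
print(embedding.shape)   # (180, 2): one 2-D point per trial
```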
The clustering of Tap data (Figure 5a) suggests there are four distinct clusters within the data, corresponding to orientations of (i) 20–40°, (ii) 50–90°, (iii) 160–180° and 10°, and (iv) 90–150°. The orientation groupings could be a consequence of the morphology of the NeuroTac skin and its relatively low internal pin resolution.
In contrast, the sliding data (Figure 5b) appear to present clusters separable by orientation angle, which we would expect to lead to more robust classification performance overall.

3.2. Network Optimisation

Our initial optimisation concerns the architectural structure of the SNN. We aim to maximise classification performance whilst retaining a simple network structure. We target two parameters that determine the sizes of the hidden and output layers:
  • Pooling factor (pf): this variable sets both the degree of reduction in population size from the input to the hidden layer and the receptive field diameter of neurons in the hidden layer (we found from initial runs that setting these values equal led to the best performance). The range of values tested for the pf parameter is shown in Figure 6a.
  • Output neurons (n_output): this parameter simply determines the number of neurons in the output layer. We vary it over the range 50–450 neurons in steps of 50.
Since the parameter ranges are limited, we apply a brute-force approach and test classification performance for every combination of the pf and n_output variables (Figure 6a). We optimise over a subset of our datasets comprising 9 orientations with 10 trials each, to allow additional optimisation iterations.
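The brute-force search amounts to a nested loop over both ranges, as in the sketch below; build_network and evaluate are placeholders standing in for network construction, STDP training, and 3-NN scoring.

```python
import itertools
import random

def build_network(pf, n_output):
    """Placeholder for SNN construction with the given layer sizes."""
    return {"pf": pf, "n_output": n_output}

def evaluate(net):
    """Placeholder for training the SNN + 3-NN and returning test accuracy."""
    random.seed(net["pf"] * 1000 + net["n_output"])
    return random.random()     # stands in for real classification accuracy

results = {
    (pf, n_out): evaluate(build_network(pf, n_out))
    for pf, n_out in itertools.product([4, 8, 16, 32], range(50, 500, 50))
}
best = max(results, key=results.get)
print("best (pf, n_output):", best, "accuracy: %.3f" % results[best])
```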
Classification performance peaks for pf = 16 (Figure 6a), which creates a hidden layer comprising 256 neurons. The number of output neurons has less of an effect on classification performance, but we set n_output = 100 neurons as this is the smallest number that achieves maximal performance (Figure 6a).
Learning is then optimised by varying three parameters of the STDP model:
  • A_+, A_−: these variables set the rates at which potentiation and depression of synaptic weights occur, respectively. We vary both independently over the range [0.001–0.1].
  • w_max: this parameter is an upper bound on synaptic weights. We optimise it over the range [1–6].
We optimise the STDP parameters through Bayesian optimisation using the Python hyperopt package. The final optimised parameter values of the network are displayed in Table 2.
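With hyperopt, the search over the three STDP parameters can be set up as below. This is a sketch under stated assumptions: train_and_score is a toy placeholder for the real train-and-classify pipeline, and the uniform priors are our reading of the search ranges above.

```python
from hyperopt import fmin, tpe, hp

def train_and_score(A_plus, A_minus, w_max):
    """Placeholder: run the SNN with these STDP parameters, return accuracy."""
    return 1.0 - abs(A_plus - 0.05) - abs(A_minus - 0.05) - abs(w_max - 3.0) / 10.0

def objective(params):
    # hyperopt minimises, so return the negative classification accuracy
    return -train_and_score(params["A_plus"], params["A_minus"], params["w_max"])

space = {
    "A_plus": hp.uniform("A_plus", 0.001, 0.1),
    "A_minus": hp.uniform("A_minus", 0.001, 0.1),
    "w_max": hp.uniform("w_max", 1.0, 6.0),
}
best = fmin(fn=objective, space=space, algo=tpe.suggest, max_evals=100)
print(best)   # best parameter values found by the TPE search
```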

3.3. Network Dynamics

Here we explore the dynamics of the SNN as it processes the data. We present spike raster plots at three stages of the network: its input, after dimensionality reduction, and the output layer. We present similar data in spike count format through heat maps of the layer activity.

3.3.1. Spike Raster Plots

Input layer—In the input layer (Figure 7), the greatest activity is noticeable within the central neurons. This is due to the dome-like structure of the NeuroTac: the central region of the sensor incurs the most displacement and, therefore, the greatest activity. In the temporal domain, the input layer shows its greatest activity between ≈200–600 ms for tapping and ≈600–1200 ms for sliding, the periods of contact with the edge stimulus. For tapping, there is a noticeable drop-off in neuron activity after the halfway point of 400 ms. This could be due to the internal dynamics of the NeuroTac sensor, with pins returning to their original positions at a slower rate than they were deflected when pressing down on the edge stimulus. For the sliding condition, there is a pattern of circular gaps in activity, which are likely a trace of the NeuroTac's internal pins. The pins and the gaps between them are large relative to the mini-eDVS's pixel resolution; as such, at certain sliding angles, there are groups of pixels not covered by the pin displacements, which leads to gaps in activity during the sliding interaction.
Hidden layer—For the hidden layer, horizontal gaps appear in the spike raster plot. This is due to neuron numbers being allocated sequentially across the grid of topological hidden layer neurons, with neurons connected to the outer regions of the sensor displaying low activity. The hidden layer contains significantly fewer neurons than the input layer but displays a similar overall shape of spike activity for both tapping and sliding. This is due to the input and hidden layers being topographically related, with neurons connected based on their location in space. We also note that setting the synaptic weights between input and hidden layers to a fixed value of 0.2 significantly reduces some of the noise present in the input layer (for instance, spike activity before 200 ms).
Output layer—Spiking activity in the output layer is significantly lower than in the input and hidden layers and no longer follows their pattern of activity. This is due to the randomised connections between hidden layer neurons and their corresponding output layer neurons. In the sliding condition, there appears to be more activity in the output layer than in the tapping condition, reflecting the fact that the sliding interaction creates more spikes and activates more pixels in the NeuroTac's output.

3.3.2. Heatmaps

Input layer—The input layer heatmap for tapping visually displays an edge orientation of 120° (Figure 8). Pin displacement occurs mostly along the stimulus edge, and the areas of high activity reflect this. The circular features within the heatmaps showing a lack of spikes correspond to the NeuroTac's internal pins, whose white markers only produce events at their edges as they deflect. For the sliding condition, edge orientation is less visually evident than for the tapping condition; however, the input layer heatmap shows that a larger area of the NeuroTac skin is engaged throughout the data-gathering procedure. Small clusters of increased activity are also noticeable in the upper-right quadrants of the pin depressions. These could indicate a preferred direction of deflection of the internal pins, most likely correlated with edge orientation.
Hidden layer—The hidden layer for both conditions demonstrates dimensionality and noise reduction while maintaining the same overall spatial pattern of spike activity as the input layer. Pin contours are less obvious in this layer, but the edge orientation is still clearly displayed for the tapping condition.
Output layer—The output layer is less visually informative than the other two layers, as the connections from the hidden to the output layer are no longer topological. The 10 × 10 grid shown here is, therefore, not representative of any spatial structure within the layer and simply represents the 100 output neurons, each connected to 16 random neurons in the hidden layer. In each condition, we can identify several output layer neurons that demonstrate high activity and are likely responsive to the particular edge orientations displayed here.
3.3.3. Weight Updates through STDP

In order to learn the features of the raw data, the weights between the hidden and output layers are subject to STDP learning (see Section 2.5). Here we illustrate the progression of the synaptic weights between the hidden and output layers at the 1st, 10th, and 140th iterations of training (Figure 9). By 140 training cycles, a small number of connections that reach w_max dominate (Figure 9, right panel). The optimisation of the STDP parameters A_+, A_−, and w_max balances potentiation against depression to create specialised feature detectors in the output layer.

3.4. Edge Orientation Classification Performance

Once the network is optimised, we split our data using an 80% train, 20% test ratio. We test performance by feeding the spike rates of the output neurons to a 3-nearest neighbours classifier. The input to the classifier is thus a 100-dimensional vector of output neuron spike counts.
The classification accuracy for the tapping condition reaches 62.5%, and for the sliding condition, 54.2%.
As evidenced by the confusion matrices (Figure 10), errors in orientation prediction tend to be within one to two classes of the correct orientation. Average orientation errors thus give a clearer sense of the sensor and network's performance, with an average orientation error of 4.44° for tapping and 5.69° for sliding.
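For reference, a mean orientation error can be computed as below. Whether errors should wrap around at 180° (so that 170° and 0° are only 10° apart) is our assumption about the metric, since edge orientation is periodic.

```python
import numpy as np

def mean_orientation_error(y_true_deg, y_pred_deg, period=180):
    """Mean absolute angular error for edge orientations that wrap at 180 deg."""
    diff = np.abs(np.asarray(y_true_deg) - np.asarray(y_pred_deg)) % period
    return np.mean(np.minimum(diff, period - diff))

print(mean_orientation_error([0, 170, 90], [10, 0, 90]))   # -> 6.67 (approx.)
```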
As a method of comparison, we also performed a 3-nearest neighbours classification on the raw input data. The Tap condition reached 54% accuracy with a 6.11° average orientation error, demonstrating improved performance with the SNN. The Slide condition performed better on the raw data, with 83% accuracy and an average orientation error of 1.67°. Note that the dimensionality of the data before and after processing by the network is not equivalent (16,384 for raw data, 100 for network-processed data), but these results illustrate a clear difference in how the network affects the tapping and sliding tactile interactions, which we discuss in more detail in Section 4.2.

4. Discussion

Our aim was to develop an efficient, bio-inspired system for tactile edge orientation classification. To achieve this, we developed a neuromorphic tactile sensor and connected it to a minimally complex 3-layer feed-forward spiking neural network with STDP learning applied to its final layer and its output fed to a 3-nearest neighbours classifier. Our system was able to accurately detect edge orientation while tapping on (average orientation error: 4.44°) and sliding across (average orientation error: 5.69°) an edge stimulus.

4.1. Hardware

The internal pins within the NeuroTac are large relative to the pixel resolution of the sensor's output, leading to large areas of the data not producing spikes (Figure 8). Modifying the length of the pins would also affect their deflection during contact. Optimising the size and length of these internal pins through a hardware optimisation procedure could therefore lead to increased performance on an edge orientation task. Human fingertips have an FA1 mechanoreceptor density of approximately 141 units/cm² [43], leaving a large margin for improvement in our sensor resolution.

4.2. Network

The architecture of our SNN is purposefully minimal to enable efficient processing that could be run on embedded neuromorphic systems. Future work aims to implement this system on a SpiNNaker board [44] to fully exploit the spike-based nature of the processing and low computational requirements.
Downstream of the SNN, we employed a KNN classifier to assess task performance on the network output. This return to spike counts and classical machine learning for classification allows us to test the network's feature transformation capabilities. In future work, however, we will extend the spike-based nature of our algorithm to include classification through Hebbian supervised learning [45] and a winner-takes-all classification approach.
While maintaining simplicity within the network, additional architectures could be explored, such as liquid state machines [46] or randomly clustered receptive field shapes in the hidden layer, as observed in biological systems [25]. As demonstrated by the improvement in classification our network brings for tapping data and the loss of classification accuracy for sliding data, SNNs may need to be designed for specific tactile interactions. Beyond network structure, future work will also explore alternative time-based encoding methods (rank-order coding, first-spike coding, and interspike intervals) for dynamic tactile task classification.

4.3. Tactile Task

While the task under investigation was relatively simple in construction, edge orientation detection is both an important area of investigation in tactile robotics [22] and crucial to the long-term goal of in-hand tactile manipulation of complex objects [21]. The results from this study serve as a foundation for biologically plausible edge orientation detection in neuromorphic tactile systems. A planned extension of the existing framework will include a spike-based control algorithm for a robot arm to produce neuromorphic tactile servoing, a step towards neuromorphic tactile in-hand manipulation.

5. Conclusions

We demonstrated a neuromorphic tactile sensor and associated spiking neural network able to discriminate edge orientation through two tactile interactions: vertical tapping and horizontal sliding. Average orientation errors over a range of 0–170° were 4.44° for tapping and 5.69° for sliding. This study lays a foundation for bio-inspired edge orientation detection on embedded neuromorphic systems for robotic hands and prosthetics.

Author Contributions

Conceptualisation, F.L.A.M.; methodology, F.L.A.M., J.C. and B.W.-C.; software, F.L.A.M.; validation, F.L.A.M. and B.W.-C.; formal analysis, F.L.A.M. and B.W.-C.; investigation, F.L.A.M.; resources, N.F.L. and J.C.; writing—original draft preparation, F.L.A.M.; writing—review and editing, B.W.-C.; supervision, N.F.L. and B.W.-C.; project administration, B.W.-C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Royal Academy of Engineering grant number RF02021071 and EPSRC studentship number EP/R513179/1. The APC was funded by EPSRC.

Data Availability Statement

Data are available at the University of Bristol data repository, data.bris, at https://doi.org/10.5523/bris.2r2olkmnek54f2233zgufotruo.

Acknowledgments

The authors would like to thank the technician team at Bristol Robotics Laboratory for their help with hardware development and members of the Tactile Robotics team at Bristol Robotics Laboratory for their insights and feedback.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
KNN	K-Nearest Neighbours
SNN	Spiking Neural Network
STDP	Spike-Timing-Dependent Plasticity
t-SNE	t-distributed Stochastic Neighbour Embedding

References

  1. Ward-Cherrier, B.; Pestell, N.; Cramphorn, L.; Winstone, B.; Giannaccini, M.E.; Rossiter, J.; Lepora, N.F. The TacTip Family: Soft Optical Tactile Sensors with 3D-Printed Biomimetic Morphologies. Soft Robot. 2018, 5, 216–227. [Google Scholar] [CrossRef] [PubMed]
  2. Muhammad, H.; Recchiuto, C.; Oddo, C.M.; Beccai, L.; Anthony, C.; Adams, M.; Carrozza, M.C.; Ward, M. A capacitive tactile sensor array for surface texture discrimination. Microelectron. Eng. 2011, 88, 1811–1813. [Google Scholar] [CrossRef]
  3. Goger, D.; Gorges, N.; Worn, H. Tactile sensing for an anthropomorphic robotic hand: Hardware and signal processing. In Proceedings of the 2009 IEEE International Conference on Robotics and Automation, Kobe, Japan, 12–17 May 2009; IEEE: Piscataway, NJ, USA, 2009; pp. 895–901. [Google Scholar]
  4. Stassi, S.; Cauda, V.; Canavese, G.; Pirri, C.F. Flexible tactile sensing based on piezoresistive composites: A review. Sensors 2014, 14, 5296–5332. [Google Scholar] [CrossRef] [PubMed]
  5. Zhang, T.; Liu, H.; Jiang, L.; Fan, S.; Yang, J. Development of a flexible 3-D tactile sensor system for anthropomorphic artificial hand. IEEE Sens. J. 2012, 13, 510–518. [Google Scholar] [CrossRef]
  6. Yuan, W.; Dong, S.; Adelson, E.H. Gelsight: High-resolution robot tactile sensors for estimating geometry and force. Sensors 2017, 17, 2762. [Google Scholar] [CrossRef]
  7. Lepora, N.F.; Church, A.; De Kerckhove, C.; Hadsell, R.; Lloyd, J. From pixels to percepts: Highly robust edge perception and contour following using deep learning and an optical biomimetic tactile sensor. IEEE Robot. Autom. Lett. 2019, 4, 2101–2107. [Google Scholar] [CrossRef]
  8. Aquilina, K.; Barton, D.A.; Lepora, N.F. Shear-invariant Sliding Contact Perception with a Soft Tactile Sensor. In Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 20–24 May 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 4283–4289. [Google Scholar]
  9. Zaghloul, K.A.; Boahen, K. A silicon retina that reproduces signals in the optic nerve. J. Neural Eng. 2006, 3, 257–267. [Google Scholar] [CrossRef]
  10. Lee, W.W.; Kukreja, S.L.; Thakor, N.V. Discrimination of dynamic tactile contact by temporally precise event sensing in spiking neuromorphic networks. Front. Neurosci. 2017, 11, 5. [Google Scholar] [CrossRef]
  11. Birkoben, T.; Winterfeld, H.; Fichtner, S.; Petraru, A.; Kohlstedt, H. A spiking and adapting tactile sensor for neuromorphic applications. Sci. Rep. 2020, 10. [Google Scholar] [CrossRef]
  12. Oddo, C.M.; Controzzi, M.; Beccai, L.; Cipriani, C.; Carrozza, M.C. Roughness encoding for discrimination of surfaces in artificial active-touch. IEEE Trans. Robot. 2011, 27, 522–533. [Google Scholar] [CrossRef]
  13. Delbruck, T. Frame-free dynamic digital vision. In Proceedings of the International Symposium on Secure-Life Electronics, Tokyo, Japan, 6–7 March 2008; pp. 21–26. [Google Scholar] [CrossRef]
  14. Johansson, R.S.; Flanagan, J.R. Coding and use of tactile signals from the fingertips in object manipulation tasks. Nat. Rev. Neurosci. 2009, 10, 345–359. [Google Scholar] [CrossRef]
  15. Johnson, K.O. The roles and functions of cutaneous mechanoreceptors. Curr. Opin. Neurobiol. 2001, 11, 455–461. [Google Scholar] [CrossRef]
  16. Rongala, U.B.; Mazzoni, A.; Chiurazzi, M.; Camboni, D.; Milazzo, M.; Massari, L.; Ciuti, G.; Roccella, S.; Dario, P.; Oddo, C.M. Tactile decoding of edge orientation with artificial cuneate neurons in dynamic conditions. Front. Neurorobot. 2019, 13, 44. [Google Scholar] [CrossRef]
  17. Friedl, K.E.; Voelker, A.R.; Peer, A.; Eliasmith, C. Human-Inspired Neurorobotic System for Classifying Surface Textures by Touch. IEEE Robot. Autom. Lett. 2016, 1, 516–523. [Google Scholar] [CrossRef]
  18. Yuan, W.; Li, R.; Srinivasan, M.A.; Adelson, E.H. Measurement of shear and slip with a GelSight tactile sensor. In Proceedings of the 2015 IEEE International Conference on Robotics and Automation (ICRA), Seattle, WA, USA, 26–30 May 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 304–311. [Google Scholar]
  19. James, J.W.; Pestell, N.; Lepora, N.F. Slip detection with a biomimetic tactile sensor. IEEE Robot. Autom. Lett. 2018, 3, 3340–3346. [Google Scholar] [CrossRef]
  20. Luo, S.; Mou, W.; Li, M.; Althoefer, K.; Liu, H. Rotation and translation invariant object recognition with a tactile sensor. In Proceedings of the SENSORS, 2014 IEEE, Valencia, Spain, 2–5 November 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 1030–1033. [Google Scholar]
  21. Lederman, S.J.; Klatzky, R.L. Relative Availability of Surface and Object Properties during Early Haptic Processing. J. Exp. Psychol. Hum. Percept. Perform. 1997, 23, 1680–1707. [Google Scholar] [CrossRef]
  22. Ponce Wong, R.D.; Hellman, R.B.; Santos, V.J. Spatial asymmetry in tactile sensor skin deformation aids perception of edge orientation during haptic exploration. IEEE Trans. Haptics 2014, 7, 191–202. [Google Scholar] [CrossRef]
  23. Kumar, D.; Ghosh, R.; Nakagawa-Silva, A.; Soares, A.B.; Thakor, N.V. Neuromorphic approach to tactile edge orientation estimation using spatiotemporal similarity. Neurocomputing 2020, 407, 246–258. [Google Scholar] [CrossRef]
  24. Parvizi-Fard, A.; Amiri, M.; Kumar, D.; Iskarous, M.M.; Thakor, N.V. A functional spiking neuronal network for tactile sensing pathway to process edge orientation. Sci. Rep. 2021, 11, 1320. [Google Scholar] [CrossRef]
  25. Pruszynski, J.A.; Johansson, R.S. Edge-orientation processing in first-order tactile neurons. Nat. Neurosci. 2014, 17, 1404–1409. [Google Scholar] [CrossRef]
  26. Thorpe, S.; Delorme, A.; Van Rullen, R. Spike-based strategies for rapid processing. Neural Netw. 2001, 14, 715–725. [Google Scholar] [CrossRef]
  27. Bing, Z.; Meschede, C.; Röhrbein, F.; Huang, K.; Knoll, A.C. A survey of robotics control based on learning-inspired spiking neural networks. Front. Neurorobot. 2018, 12, 35. [Google Scholar] [CrossRef] [PubMed]
  28. Nichols, E.; McDaid, L.; Siddique, N. Case study on a self-organizing spiking neural network for robot navigation. Int. J. Neural Syst. 2010, 20, 501–508. [Google Scholar] [CrossRef] [PubMed]
  29. Kheradpisheh, S.R.; Ganjtabesh, M.; Thorpe, S.J.; Masquelier, T. STDP-based spiking deep convolutional neural networks for object recognition. Neural Netw. 2018, 99, 56–67. [Google Scholar] [CrossRef]
  30. Li, J.; Liu, I.B.; Gao, W.; Huang, X. Spiking neural network with synaptic plasticity for recognition. In Proceedings of the 2018 IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference, IAEAC 2018, Chongqing, China, 12–14 October 2018; pp. 1728–1732. [Google Scholar] [CrossRef]
  31. Davies, M.; Wild, A.; Orchard, G.; Sandamirskaya, Y.; Guerra, G.A.F.; Joshi, P.; Plank, P.; Risbud, S.R. Advancing neuromorphic computing with loihi: A survey of results and outlook. Proc. IEEE 2021, 109, 911–934. [Google Scholar] [CrossRef]
  32. Ponulak, F. Supervised Learning in Spiking Neural Networks with ReSuMe Method. Ph.D. Thesis, Poznań University of Technology, Poznań, Poland, 2006. [Google Scholar]
  33. Bohte, S.M.; Kok, J.N.; La Poutré, J.A. SpikeProp: Backpropagation for networks of spiking neurons. In Proceedings of the European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, Bruges, Belgium, 2–4 October 2000; Volume 48, pp. 419–424. [Google Scholar]
  34. Shrestha, S.B.; Orchard, G. Slayer: Spike layer error reassignment in time. Adv. Neural Inf. Process. Syst. 2018, 31. [Google Scholar]
  35. Markram, H.; Gerstner, W.; Sjöström, P.J. Spike-timing-dependent plasticity: A comprehensive overview. Front. Synaptic Neurosci. 2012, 4, 2. [Google Scholar] [CrossRef]
  36. Ward-Cherrier, B.; Pestell, N.; Lepora, N. NeuroTac: A Neuromorphic Optical Tactile Sensor applied to Texture Recognition. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020. [Google Scholar]
  37. Müller, G.R.; Conradt, J. A miniature low-power sensor system for real time 2D visual tracking of LED markers. In Proceedings of the 2011 IEEE International Conference on Robotics and Biomimetics, ROBIO 2011, Phuket, Thailand, 7–11 December 2011; pp. 2429–2434. [Google Scholar] [CrossRef]
  38. Brette, R.; Gerstner, W. Adaptive exponential integrate-and-fire model as an effective description of neuronal activity. J. Neurophysiol. 2005, 94, 3637–3642. [Google Scholar] [CrossRef]
  39. Davison, A.P.; Brüderle, D.; Eppler, J.; Kremkow, J.; Muller, E.; Pecevski, D.; Perrinet, L.; Yger, P. PyNN: A common interface for neuronal network simulators. Front. Neuroinform. 2009, 2. [Google Scholar] [CrossRef]
  40. Fardet, T.; Vennemo, S.B.; Mitchell, J.; Mørk, H.; Graber, S.; Hahne, J.; Spreizer, S.; Deepu, R.; Trensch, G.; Weidel, P.; et al. NEST 2.20.0. 2020. Available online: https://zenodo.org/record/3605514#.YyQcQbRBxPY (accessed on 30 July 2022). [CrossRef]
  41. Gütig, R.; Aharonov, R.; Rotter, S.; Sompolinsky, H. Learning Input Correlations through Nonlinear Temporally Asymmetric Hebbian Plasticity. J. Neurosci. 2003, 23, 3697–3714. [Google Scholar] [CrossRef]
  42. Van der Maaten, L.; Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 2008, 9, 2579–2605. [Google Scholar]
  43. Johansson, R.S.; Vallbo, A.B. Tactile sensibility in the human hand: Relative and absolute densities of four types of mechanoreceptive units in glabrous skin. J. Physiol. 1979, 286, 283–300. [Google Scholar] [CrossRef]
  44. Furber, S.; Bogdan, P. SpiNNaker: A Spiking Neural Network Architecture; Now Publishers: Boston, MA, USA, 2020. [Google Scholar] [CrossRef]
  45. Legenstein, R.; Naeger, C.; Maass, W. What can a neuron learn with spike-timing-dependent plasticity? Neural Comput. 2005, 17, 2337–2382. [Google Scholar] [CrossRef]
  46. Xue, F.; Hou, Z.; Li, X. Computational capability of liquid state machines with spike-timing-dependent plasticity. Neurocomputing 2013, 122, 324–329. [Google Scholar] [CrossRef]
Figure 1. Left panel: The NeuroTac—a neuromorphic version of the TacTip sensor. A lid and casing enclose the event-based camera (mini-eDVS), which, in turn, is held above a 3D-printed silicone tip with small rubber-like pins internally protruding. Right panel: Top and side views of the NeuroTac compliant skin, displaying the internally protruding pins that mimic the structure of human skin.
Figure 2. Experimental setup. Left panel: The Dobot Magician robotic arm with the attached NeuroTac sensor above the edge stimulus at the start of a trial. Middle panel: During the tapping experiment, the NeuroTac is brought vertically downward onto the edge stimulus to a depth of 2 mm before returning to its starting position and rotating 10°. Right panel: During the sliding experiment, the NeuroTac is brought horizontally across the edge stimulus at a constant height of 2 mm below the apex of the edge before being returned to its start position and rotated 10° for the next trial.
Figure 3. Architecture of the Spiking Neural Network. The input layer represents the captured data of the NeuroTac. Topographical pixel areas of 16 × 16 were used as inputs for the hidden layer. Each neuron in the output layer then received input from 16 random neurons in the hidden layer. The output layer consisted of 100 neurons whose spike rates were fed into a K-nearest neighbours classifier. Note that the network structure presented here was obtained through optimisation, as described in Section 3.2.
Figure 4. Spike raster plots of the NeuroTac output data for Tap (a) and Slide (b) at 10°.
Figure 5. t-SNE of the spatial distribution of spikes in the raw data for tapping (a) and sliding (b) over the range of edge orientations (0–170° in steps of 10°).
Figure 6. Optimisation of the SNN. (a) Distribution of classification performance based on pooling factor (pf), which determines the size of the hidden layer, and the number of output neurons (n_output). (b) Change in classification performance over iterations during the Bayesian optimisation of the STDP learning parameters: A_+, A_− and w_max.
Figure 7. Spike raster plots representing the spiking data as they pass through each layer of the SNN. The left panel displays spikes in the input layer, the middle panel in the hidden layer, and the right panel in the output layer. The top row shows data for the tapping experiment and the bottom row for the sliding experiment.
Figure 8. Heatmaps representing the spatial distribution of activity (heat intensity represents spike count per neuron) as data pass through each layer of the SNN. The left panel displays the spike activity in the input layer with a 128 × 128 layout, the middle panel represents the hidden layer with an optimal 16 × 16 layout, and the right panel the output layer with optimal n_output = 100. The top row shows data for the tapping experiment and the bottom row for the sliding experiment.
Figure 9. Progression of weight connections between the hidden layer and output layer as STDP operates during the training period. The Y-axes display the weights of synaptic connections, and the X-axes display synapse indexes. In this example, w_max = 2.0.
Figure 10. Confusion matrices for edge orientation classification performance for Tap (a) and Slide (b) over the range of edge orientations (0–170° in steps of 10°).
Table 1. Parameters of the IAF neuron model.

Parameter                        Value
Resting membrane potential       −60 mV
Membrane capacity                1.0 nF
Membrane time constant           10 ms
Decay time of synaptic current   5 ms
Reset potential                  −60 mV
Spike threshold                  −50 mV
Refractory period                5 ms
Table 2. Final optimised parameter values for the SNN.

Parameter   Search Space              Optimised Value
pf          4, 8, 16, 32              16
n_output    50–450 in steps of 50     100
A_+         0.001–0.1                 0.036
A_−         0.001–0.1                 0.073
w_max       1–6                       4.75
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
