Abstract
To date, parallel simulation algorithms for spiking neural P (SNP) systems are based on a matrix representation. This way, the simulation is implemented with linear algebra operations, which can be easily parallelized on high performance computing platforms such as GPUs. Although it has been convenient for the first generation of GPU-based simulators, such as CuSNP, there are some bottlenecks to sort out. For example, the proposed matrix representations of SNP systems lead to very sparse matrices, where the majority of values are zero. It is known that sparse matrices can compromise the performance of algorithms since they involve a waste of memory and time. This problem has been extensively studied in the literature of parallel computing. In this paper, we analyze some of these ideas and apply them to represent some variants of SNP systems. We also provide a new simulation algorithm based on a novel compressed representation for sparse matrices. We also conclude which SNP system variant better suits our new compressed matrix representation.
1. Introduction
Membrane computing [,] is an interdisciplinary research area at the intersection of computer science and, mainly, cellular biology [], although it also connects with many other fields such as engineering, neuroscience, systems biology, and chemistry. The aim is to study computational devices called P systems, taking inspiration from how living cells process information. Spiking neural P (SNP) systems [] are a type of P system composed of a directed graph inspired by how neurons are interconnected by axons and synapses in the brain. Neurons communicate through spikes, and the time difference between them plays an important role in the computation. Therefore, this model belongs to the so-called third generation of artificial neural networks, i.e., those based on spikes.
Aside from computing numbers, SNP systems can also compute strings and, hence, languages. More general ways to provide the input or receive the output include the use of spike trains, i.e., a stream or sequence of spikes entering or leaving the system. Further results and details on computability, complexity, and applications of spiking neural P systems are given in [,,], in a dedicated chapter of the Handbook [], and in an extensive bibliography covering work until February 2016 []. Moreover, there is a wide range of SNP system variants: with delays, with weights [], with astrocytes [], with anti-spikes [], with dendrites [], with rules on synapses [], with scheduled synapses [], with stochastic firing [], numerical [], etc.
The research on applications and variants of SNP systems has required the development of simulators. The simulation of SNP systems was initially carried out through sequential simulators such as pLinguaCore []. In 2010, a matrix representation of SNP systems was introduced []. Since then, most simulation algorithms have been based on matrix and vector representations, and consist of a set of linear algebra operations. This way, parallel simulators can be efficiently implemented, since matrix-vector multiplications are easy to parallelize. Moreover, there are efficient algebra libraries that can be used out of the box, although they have not been explored yet for this purpose. For instance, GPUs are parallel devices optimized for certain matrix operations [] and can handle them efficiently. We can say that these matrix representations of SNP systems fit well with the highly parallel architecture of these devices. This has already been harnessed by introducing CuSNP, a set of simulators for SNP systems implemented with CUDA [,,,]. Simulators for specific solutions have also been defined in the literature [,]. Moreover, this is not unique to SNP systems; many simulators for other P system variants have been accelerated on GPUs [,,].
However, this matrix representation can be sparse (i.e., have a majority of zero values) because the directed graph of an SNP system is not usually fully connected. A first approach to tackle this problem was presented in [], where some of the ideas developed in this work were first outlined. Following these ideas, in [], the transition matrix was split to reduce the memory footprint of the SNP representation. In many disciplines, sparse vector-matrix operations are very common, and hence, many solutions based on compressed implementations have been proposed in the literature [].
In this paper, we introduce compressed representations for the simulation of SNP systems based on sparse vector-matrix operations. First, we provide two approaches to compress the transition matrix for the simulation of SNP systems with a static graph. Second, we extend these algorithms and data structures to SNP systems with dynamic graphs (division, budding, and plasticity). Finally, we carry out a complexity analysis and comparison of the algorithms to draw some conclusions.
The paper is structured as follows: Section 2 provides required concepts for the methods and algorithms here defined; Section 3 defines the designs of the representations; Section 4 contains the detailed algorithms based on the compressed representations; Section 5 shows the results on complexity analyses of the algorithms; Section 6 provides final conclusions, remarks, and plans of future work.
2. Preliminaries
In this section we briefly introduce the concepts employed in this work. First, we define the standard model of spiking neural P systems and three variants. Second, a matrix-based simulation algorithm for this model is revisited. Third, the fundamentals of compressed formats for sparse matrix-vector operations are given.
2.1. Spiking Neural P Systems
Let us first formally introduce the definition of spiking neural P system. This model was first introduced in [].
Definition 1.
A spiking neural P system of degree $q \geq 1$ is a tuple
$\Pi = (O, \sigma_1, \ldots, \sigma_q, syn, out)$
where:
- $O = \{a\}$ is the singleton alphabet ($a$ is called spike);
- $syn \subseteq \{1, \ldots, q\} \times \{1, \ldots, q\}$ represents the arcs of a directed graph whose nodes are $\{1, \ldots, q\}$;
- $\sigma_1, \ldots, \sigma_q$ are neurons of the form $\sigma_i = (n_i, R_i)$, $1 \leq i \leq q$, where:
  - $n_i \geq 0$ is the initial number of spikes within the neuron labeled by $i$; and
  - $R_i$ is a finite set of rules associated to the neuron labeled by $i$, of one of the following forms:
    - (1) $E/a^c \rightarrow a^p$, with $c \geq 1$ and $p \geq 1$, being $E$ a regular expression over $O$ (firing rules);
    - (2) $a^s \rightarrow \lambda$, for some $s \geq 1$, with the restriction that for each rule $E/a^c \rightarrow a^p$ of type (1) from $R_i$, we have $a^s \notin L(E)$ (forgetting rules);
- $out \in \{1, \ldots, q\}$ is the label of the output neuron, such that $(out, j) \notin syn$, for any $j \in \{1, \ldots, q\}$.
A spiking neural P system of degree $q$ can be viewed as a set of $q$ neurons interconnected by the arcs of a directed graph $syn$, called the synapse graph. There is a distinguished neuron label $out$, called the output neuron ($\sigma_{out}$), which communicates with the environment.
If a neuron $\sigma_i$ contains $k$ spikes at an instant $t$, $a^k \in L(E)$, and $k \geq c$, then the rule $E/a^c \rightarrow a^p$ can be applied. By the application of that rule, $c$ spikes are removed from neuron $\sigma_i$, and the neuron fires, producing $p$ spikes immediately. Thus, each neuron $\sigma_j$ such that $(i, j) \in syn$ receives $p$ spikes. For $i = out$, that is, the output neuron $\sigma_{out}$, the spikes are sent to the environment.
The rules of type (2) are forgetting rules, and they are applied as follows: if a neuron $\sigma_i$ contains exactly $s$ spikes, then the rule $a^s \rightarrow \lambda$ from $R_i$ can be applied. By the application of this rule, all $s$ spikes are removed from $\sigma_i$.
In spiking neural P systems, a global clock is assumed, marking the time for the whole system. Only one rule can be executed in each neuron at step $t$. As models of computation, spiking neural P systems are Turing complete, i.e., as powerful as Turing machines. On the one hand, a common way to introduce the input (the instance of the problem to solve) to the system is to encode it into some or all of the initial spikes $n_i$ (inside each neuron $i$). On the other hand, a common way to obtain the output is by observing the output neuron: either by taking the interval $n = t_2 - t_1$ between the times $t_1$ and $t_2$ at which it sent its first two spikes (we say $n$ is computed or generated by the system), or by counting all the spikes sent to the environment until the system halts.
For the rest of the paper, we call this model spiking neural P systems with static structure, or just static SNP systems, given that the graph associated with them does not change along the computation. Next, we briefly introduce and focus on three variants with a dynamic graph: division, budding, and plasticity. A broader explanation of them and of more variants is provided in [,,,].
Finally, let us introduce some notations and definitions:
- $pres(i)$: for a neuron $\sigma_i$, the presynapses of this neuron are $pres(i) = \{j \mid (i, j) \in syn\}$.
- $outdeg(i)$: for a neuron $\sigma_i$, the out degree of this neuron is $outdeg(i) = |pres(i)|$.
- $ins(i)$: for a neuron $\sigma_i$, the insynapses of this neuron are $ins(i) = \{j \mid (j, i) \in syn\}$.
- $indeg(i)$: for a neuron $\sigma_i$, the in degree of this neuron is $indeg(i) = |ins(i)|$.
2.1.1. Spiking Neural P Systems with Budding Rules
Based on the idea of neuronal budding, where a cell divides to produce a new cell, we can abstract this process into budding rules. In this process, the new neurons can differ in some aspects: their connections, contents, and shape. A budding rule has the following form:
$[E]_i \rightarrow [\,]_i / [\,]_j$
where $E$ is a regular expression and $i$, $j$ are neuron labels.
If a neuron $\sigma_i$ contains $s$ spikes, $a^s \in L(E)$, and there is no neuron $\sigma_j$ such that the synapse $(i, j)$ exists in the system, then the rule is enabled and it can be executed. A new neuron $\sigma_j$ is created, and both neurons $\sigma_i$ and $\sigma_j$ are empty after the execution of the rule. Neuron $\sigma_i$ keeps all the synapses that were going into it, while $\sigma_j$ inherits all the synapses that were going out of $\sigma_i$ in the previous configuration. There is also a synapse between neurons $\sigma_i$ and $\sigma_j$, and the rest of the synapses of $\sigma_j$ are given depending on the synapses defined in $syn$.
2.1.2. Spiking Neural P Systems with Division Rules
Inspired by the process of mitosis, division rules have been widely used within the field of membrane computing. In SNP systems, a division rule can be defined as follows:
$[E]_i \rightarrow [\,]_j \,\|\, [\,]_k$
where $E$ is a regular expression and $i$, $j$, $k$ are neuron labels.
If a neuron $\sigma_i$ contains $s$ spikes, $a^s \in L(E)$, and there is no neuron $\sigma_g$, with $g \in \{j, k\}$, such that the synapse $(i, g)$ or $(g, i)$ exists in the system, then the rule is enabled and it can be executed. Neuron $\sigma_i$ is then divided into two new cells, $\sigma_j$ and $\sigma_k$. The new cells are empty at the time of their creation. The new neurons keep the synapses previously associated to the original neuron $\sigma_i$; that is, if there was a synapse from some $\sigma_g$ to $\sigma_i$, then a new synapse from $\sigma_g$ to $\sigma_j$ and a new one from $\sigma_g$ to $\sigma_k$ are created, and if there was a synapse from $\sigma_i$ to some $\sigma_g$, then a new synapse from $\sigma_j$ to $\sigma_g$ and a new one from $\sigma_k$ to $\sigma_g$ are created. The rest of the synapses of $\sigma_j$ and $\sigma_k$ are given by the ones defined in $syn$.
2.1.3. Spiking Neural P Systems with Plasticity Rules
It is known that new synapses can be created in the brain thanks to the process of synaptogenesis. We can recreate this process in the framework of spiking neural P systems by defining plasticity rules of the following form:
$E/a^c \rightarrow \alpha k(i, N)$
where $E$ is a regular expression, $c \geq 1$, $\alpha \in \{+, -, \pm, \mp\}$, $k \geq 1$, and $N \subseteq \{1, \ldots, q\}$ (a.k.a. the neuron set). Recall that $pres(i)$ is the set of presynapses of neuron $\sigma_i$.
If a neuron $\sigma_i$ contains $s$ spikes and $a^s \in L(E)$, then the rule is enabled and can be executed. The rule consumes $c$ spikes and, depending on the value of $\alpha$, it performs one of the following:
- If $\alpha = +$ and $N \setminus pres(i) = \emptyset$, or if $\alpha = -$ and $pres(i) \cap N = \emptyset$, then there is nothing more to do.
- If $\alpha = +$ and $|N \setminus pres(i)| \leq k$, deterministically create a synapse to every $\sigma_j$, $j \in N \setminus pres(i)$. Otherwise, if $|N \setminus pres(i)| > k$, then non-deterministically select $k$ neurons in $N \setminus pres(i)$ and create one synapse to each selected neuron.
- If $\alpha = -$ and $|pres(i) \cap N| \leq k$, deterministically delete all synapses in $pres(i) \cap N$. Otherwise, if $|pres(i) \cap N| > k$, then non-deterministically select $k$ neurons in $pres(i) \cap N$ and delete each synapse to the selected neurons.
- If $\alpha \in \{\pm, \mp\}$, create (respectively, delete) synapses at time $t$ and then delete (resp., create) synapses at time $t + 1$. Even when this rule is applied, neurons are still open, that is, they can continue receiving spikes.
Let us notice that if, for some $\sigma_i$, we apply a plasticity rule with $\alpha \in \{+, \pm, \mp\}$, then, when a synapse $(i, j)$ is created, a spike is sent from $\sigma_i$ to the neuron $\sigma_j$ that has just been connected. That is, when $\sigma_i$ attaches to $\sigma_j$ through this method, we have $\sigma_i$ immediately transferring one spike to $\sigma_j$.
2.2. Matrix Representation for SNP Systems
Usually, parallel P system simulators make use of ad hoc representations tailored for a certain variant [,,]. In order to ease the simulation of static SNP systems and their deployment on parallel environments, a matrix representation was introduced []. By using a set of algebraic operations, it is possible to reproduce the transitions of a computation. Although the baseline representation only involves SNP systems without delays and with static structure, many extensions have followed, such as enabling delays [,], handling non-determinism [], plasticity rules [], rules on synapses [], and dendrite P systems []. In this section we briefly introduce the definitions for the matrix representation of the basic model of spiking neural P systems without delays, as defined above. We also provide the pseudocode to simulate just one computation of any SNP system of this variant using the matrix representation. In our notation, we use capital letters for vectors and matrices; for accessing values, $V(i)$ is the value at position $i$ of the vector $V$, and $M(i, j)$ is the value at row $i$ and column $j$ of matrix $M$.
For an SNP system with $q$ neurons and $m$ rules, we define the following vectors and matrices:
Configuration vector: $C_k$ is the vector containing the number of spikes in every neuron at the $k$-th computation step/time, where $C_0$ denotes the initial configuration; i.e., $C_0(i) = n_i$, for each neuron $\sigma_i$. It contains $q$ elements.
Spiking vector: $Sp_k$ shows whether a rule is going to fire at the transition step $k$ (having value 1) or not (having value 0). Given the non-deterministic nature of SNP systems, it is possible to have a set of valid spiking vectors at a given step. However, for the computation of the next configuration vector, only one spiking vector is used. It contains $m$ elements.
Transition matrix: $M_\Pi$ is an $m \times q$ matrix whose elements are given as
$M_\Pi(j, i) = \begin{cases} -c & \text{if rule } r_j \in R_i \text{ consumes } c \text{ spikes;}\\ p & \text{if rule } r_j \in R_s \text{ produces } p \text{ spikes and } (s, i) \in syn;\\ 0 & \text{otherwise.} \end{cases}$
In this representation, rows represent rules and columns represent neurons in the spiking transition matrix. Note also that a negative entry corresponds to the consumption of spikes. Thus, it is easy to observe that each row has exactly one negative entry, and each column has at least one negative entry [].
Hence, to compute the transition at step $k$, it is enough to select a spiking vector $Sp_k$ from all the valid possibilities and calculate $C_{k+1} = C_k + Sp_k \cdot M_\Pi$.
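As an illustration of this update, the following minimal numpy sketch performs one transition for a small hypothetical system (the neurons, rules, synapses, and matrix values below are made up for the example and are not taken from the paper):

```python
import numpy as np

# Hypothetical toy system: 3 neurons and 4 rules (rules 0-1 in neuron 0,
# rule 2 in neuron 1, rule 3 in neuron 2); synapses (0,1), (0,2), (1,2).
# M[j][i] = -c if rule j consumes c spikes in neuron i,
#            p if rule j sends p spikes to neuron i through a synapse,
#            0 otherwise.
M = np.array([
    [-1,  1,  1],   # rule 0 (neuron 0): fires with 1 spike
    [-2,  1,  1],   # rule 1 (neuron 0): fires with 2 spikes
    [ 0, -1,  1],   # rule 2 (neuron 1): fires with 1 spike
    [ 0,  0, -1],   # rule 3 (neuron 2): forgetting rule
])

C_k = np.array([2, 1, 0])        # configuration at step k
Sp_k = np.array([0, 1, 1, 0])    # one valid spiking vector: rules 1 and 2 fire

C_next = C_k + Sp_k @ M          # C_{k+1} = C_k + Sp_k * M
print(C_next)                    # -> [0 1 2]
```

Note how each row of the matrix has exactly one negative entry (the consumption) and each column at least one, as stated above.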
The pseudocode to simulate a computation of an SNP system is as described in Algorithm 1. The selection of valid spiking vectors can be done in different ways, as in [,]. This returns a set of valid spiking vectors. In this work, we focus on just one computation, but non-determinism can be tackled by maintaining a queue of generated configurations [].
| Algorithm 1 MAIN PROCEDURE: simulating one computation for static spiking neural P systems. |
|
In this work we focus on compressing the representation, specifically the transition matrix, so the determination of the spiking vector does not affect these designs. Therefore, we use a straightforward approach and select just one valid spiking vector at random. The representations depicted here only affect how the computation of the next configuration is done (the matrix-vector multiplication at line 6 in Algorithm 1).
2.3. Sparse Matrix-Vector Operations
Algebraic operations have been studied deeply in parallel computing solutions. Specifically, GPU computing provides large speedups when accelerating such kind of operations. This technology allows us to run scientific computations in parallel on the GPU, given that a GPU device typically contains thousands of cores and high memory bandwidth []. However, parallel computing on a GPU has more constraints than on a CPU: threads have to run in an SPMD fashion while accessing data in a coalesced way; that is, best performance is achieved when the execution of threads is synchronized and accessing contiguous data from memory. In fact, GPUs have been employed for P system simulations since the introduction of CUDA.
Matrix computation is a highly optimized operation in CUDA [], and there are many efficient libraries for algebra computations, like cuBLAS. It is usual that, when working with large matrices, these are almost “empty”, i.e., with a majority of zero values. Such a matrix is known as a sparse matrix, and it downgrades the runtime in two ways: a lot of memory is wasted, and a lot of operations are redundant.
Given the importance of linear algebra in many computational disciplines, sparse matrix-vector operations (SpMV) have been a subject of study in parallel computing (and so, on GPUs). Today there exist many approaches to tackle this problem []. Let us focus on two formats to represent sparse matrices in a compressed way, assuming that threads access rows in parallel (a short code sketch of both formats is given after Figure 2):
- CSR format. Only non-null values are represented, by using three arrays: row pointers, non-zero values, and columns (see Figure 1 for an example). First, the row-pointers array is accessed, which contains a position per row of the original matrix. Each position gives the index where the row starts in the non-zero values and columns arrays. The non-zero values and the columns arrays can be seen as a single array of pairs, since every entry has to be accessed at the same time. Once a row is indexed, a loop over the values in that row has to be performed, so that the corresponding column is found, and therefore, the value. If the column is not present, then the value is assumed to be zero, since this data structure contains all non-zero values. The main advantage is that it is a fully compressed format whenever the number of non-zero values is much smaller than the number of zero values in the original matrix. However, the drawback is that the search for elements in the non-zero values and columns arrays is not coalesced when using parallelism per row. Moreover, since it is a fully compressed format, there is no room for modifying the values, such as introducing new non-zero values;
Figure 1. CSR format example. The non-zero values array stores the non-null values, the columns array stores the column indexes, and the row pointers are the positions where each row starts in the previous arrays.
- ELL format. This representation aims at increasing the memory coalescing of thread accesses in CUDA. This is achieved by using a matrix of pairs containing a trimmed, transposed version of the original matrix (see Figure 2 for an example). Each column of the ELL matrix is devoted to one row of the original matrix, even if that row is empty (all elements are zero). Every element is a pair, where the first position denotes the column and the second the value, of only the non-zero elements in the corresponding row. The size of the ELL matrix is fixed: the number of columns equals the number of rows of the original matrix, while the number of rows is the maximum length of an original row in terms of non-zero values, in other words, the maximum amount of non-zero elements in a row of the original matrix. Rows containing fewer elements pad the difference with null elements. The main advantage of this format is that threads always access the elements of all rows in a coalesced way, and the null elements padded by short rows can be utilized to incorporate new data. However, there is a waste of memory, which is worse when the rows are unbalanced in terms of number of zeros.
Figure 2. ELL format example. Note that it represents the transpose of the original matrix to increase the coalesced access on GPU devices. It includes pairs of column and value for every row. The compressed matrix has a number of columns equal to the number of original rows, and a number of rows equal to the maximum amount of non-null values in an original row.
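To make the two formats concrete, here is a minimal Python sketch (the 4×4 matrix is a made-up example) that builds the CSR arrays and the ELL matrix of pairs described above, and checks a CSR-based matrix-vector product against numpy:

```python
import numpy as np

# Hypothetical 4x4 sparse matrix used only for illustration.
A = np.array([
    [5, 0, 0, 1],
    [0, 0, 3, 0],
    [0, 0, 0, 0],
    [2, 4, 0, 0],
])

# --- CSR: row pointers, non-zero values, and column indexes ---
row_ptr, values, cols = [0], [], []
for row in A:
    for j, v in enumerate(row):
        if v != 0:
            values.append(int(v))
            cols.append(j)
    row_ptr.append(len(values))
print(row_ptr, values, cols)
# [0, 2, 3, 3, 5] [5, 1, 3, 2, 4] [0, 3, 2, 0, 1]

# CSR matrix-vector product, one (sequential) "thread" per row.
x = np.array([1, 2, 3, 4])
y = np.zeros(len(A))
for i in range(len(A)):
    for k in range(row_ptr[i], row_ptr[i + 1]):
        y[i] += values[k] * x[cols[k]]
assert np.allclose(y, A @ x)

# --- ELL: transposed matrix of (column, value) pairs, padded with (0, 0) ---
max_nnz = max(int((row != 0).sum()) for row in A)    # longest row has 2 non-zeros
ell = [[(0, 0)] * len(A) for _ in range(max_nnz)]    # one ELL column per original row
for i, row in enumerate(A):
    for slot, j in enumerate(np.nonzero(row)[0]):
        ell[slot][i] = (int(j), int(row[j]))
for r in ell:
    print(r)
# [(0, 5), (2, 3), (0, 0), (0, 2)]
# [(3, 1), (0, 0), (0, 0), (1, 4)]
```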
3. Methods
SNP systems in the literature typically do not contain fully connected graphs. This means that the transition matrix gets very sparse and, therefore, both computing time and memory are wasted. However, further optimizations based on SpMV can be applied. In the following subsections we discuss some approaches. Of course, if the graph inherent to an SNP system leads to a dense transition matrix, then the normal representation (called sparse representation in this paper) can be employed, because using compressed formats would increase the memory footprint.
In this work, we focus on the basic model of spiking neural P systems without delays, as defined above, as well as on three variants with a dynamic network: budding, division, and plasticity. The set of algorithms defined next is designed to take advantage of data parallelism, which is convenient for GPUs and vector co-processors. Their pseudocodes are detailed in Section 4 of this paper.
In Algorithm 2 we generalize the pseudocode presented in Algorithm 1 to handle both static and dynamic networks. This way, each variant only requires re-defining some of the functions defined for the static SNP system variant using vectors and matrices without compression (i.e., the sparse representation). In order to understand the algorithms, we present in this section the main new data structures and their behavior. The detailed pseudocodes are available in Section 4.
| Algorithm 2 MAIN PROCEDURE: simulating one computation for spiking neural P systems. |
|
As a convention, those vectors and matrices using subindex k are dynamic and can change during the simulation time, while the rest are constructed at the beginning and are invariant. Capital letters refer to vectors and matrices, and small letters to scalar numbers. A total order over the rules defined in the system is assumed, grouping together the rules of each neuron. For the sake of simplicity, we represent each rule as a tuple of the form (i, E, c, p), where i is the subindex of the set $R_i$ where the rule belongs (i.e., the neuron where it is contained). Specifically, forgetting rules just have $p = 0$.
For static SNP systems using sparse representation, we use the following vectors and matrices:
- Preconditions vector is a vector storing the preconditions of the rules; that is, both the regular expression and the number of consumed spikes. Initially, it contains the pair (E, c) of each rule $r_j$, $1 \leq j \leq m$.
- Neuron-rule map vector is a vector that maps each neuron index to the indexes of its rules. Specifically, its i-th position is the index of the first rule in neuron i. Given that rules have been ordered as mentioned above, rules belonging to the same neuron have contiguous indexes; thus, it is enough to store just the first index. In this sense, the first rule in neuron i is the one at the position stored for neuron i, and the last one is the one right before the position stored for neuron i + 1. It is initialized from the total order of the rules by counting, for each neuron, the number of rules of the preceding neurons.
- The transition tuple groups the preconditions vector, the neuron-rule map vector, and the transition matrix. If the variant has a dynamic network, the transition matrix needs to be modified along the computation; therefore, we start with its initial version at step 0. The following algorithms show how these structures are constructed.
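As a small illustration of these structures, the following Python sketch builds the preconditions vector and the neuron-rule map for a toy system whose rules are already ordered by neuron; the rule contents and the names PR and NR are assumptions for the example, not the paper's notation:

```python
# Hypothetical toy system with 3 neurons and 4 rules; each rule is a tuple
# (neuron, E, c, p), already ordered so that the rules of a neuron are contiguous.
rules = [
    (0, "a",  1, 1),   # rules of neuron 0
    (0, "aa", 2, 1),
    (1, "a+", 1, 1),   # rule of neuron 1
    (2, "a",  1, 0),   # forgetting rule of neuron 2 (p = 0)
]
q = 3

# Preconditions vector: (regular expression, consumed spikes) per rule.
PR = [(E, c) for (_, E, c, _) in rules]

# Neuron-rule map: NR[i] is the index of the first rule of neuron i, so the
# rules of neuron i are those with indexes NR[i] .. NR[i + 1] - 1.
NR = [0] * (q + 1)
for neuron, *_ in rules:
    NR[neuron + 1] += 1
for i in range(q):
    NR[i + 1] += NR[i]

print(PR)   # [('a', 1), ('aa', 2), ('a+', 1), ('a', 1)]
print(NR)   # [0, 2, 3, 4]
```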
Algorithm 2 can be easily transformed into Algorithm 1 by defining the INIT and COMPUTE_NEXT functions as in Algorithm 3. They work exactly as already specified in Section 2.2; that is, using the usual vector-matrix multiplication operation to calculate the next configuration vector. We will also detail how the selection of spiking vectors can be done. This is defined in Algorithm 4, and it is based on previous ideas already presented in [,]. First, the SPIKING_VECTORS function calculates the set of all possible spiking vectors by using a recursive function over the neuron index i. It gathers all spiking vectors that can be generated for the previous neurons and then, if neuron i contains applicable rules, it populates a spiking vector for each of these rules combined with each of the spiking vectors generated from the previous neurons. Finally, neuron i propagates these spiking vectors to the next neuron.
| Algorithm 3 Functions for static SNP systems with sparse matrix representation. | |
|
|
| Algorithm 4 Spiking vectors selection with static SNP systems and sparse representation. | |
|
|
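To complement the description of the spiking vector selection, the following minimal Python sketch implements the simple strategy used in this work: picking one applicable rule per neuron at random. The vector names PR and NR and the toy rules are the same hypothetical ones as in the earlier sketch, and a Python regular expression stands in for checking whether the spike count belongs to L(E):

```python
import random
import re

def random_spiking_vector(C, NR, PR):
    """Pick one valid spiking vector at random: one applicable rule per neuron.

    C is the configuration vector, NR the neuron-rule map, and PR the
    preconditions vector of (E, c) pairs. Returns a 0/1 vector over the rules.
    """
    Sp = [0] * NR[-1]
    for i in range(len(C)):
        applicable = [j for j in range(NR[i], NR[i + 1])
                      if re.fullmatch(PR[j][0], 'a' * C[i]) and C[i] >= PR[j][1]]
        if applicable:
            Sp[random.choice(applicable)] = 1
    return Sp

# Hypothetical toy run: neuron 0 holds 2 spikes, neuron 1 holds 1, neuron 2 is empty.
NR = [0, 2, 3, 4]
PR = [('a', 1), ('aa', 2), ('a+', 1), ('a', 1)]
print(random_spiking_vector([2, 1, 0], NR, PR))   # -> [0, 1, 1, 0]
```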
3.1. Approach with ELL Format
Our first approach to compress the representation of the transition matrix is to use the ELL format (see Figure 3 for an example). The reason for using ELL and not other compressed formats for sparse matrices (CSR, COO, BSR, …) is to enable extensions for dynamic networks, as seen later. ELL gives some room for modifications without much memory re-allocation, while CSR requires us to modify the whole matrix to add new elements.
Figure 3.
Illustration of the compressed representation based on the ELL format of a static spiking neural P (SNP) system. Light cells are empty values (0,0). The first column is a forgetting rule (there is no need to use p = 0). The rows below the spiking vector illustrate threads, showing the level of parallelism that can be achieved; i.e., each column and each position of the spiking vector can be processed in parallel.
The ELL format leads to a new compressed transition matrix. The following aspects have been taken into consideration:
- The ELL format represents the transpose of the original matrix, so now rows correspond to neurons and columns to rules. This is convenient for SIMD processors such as GPUs.
- The number of rows of the compressed matrix equals the maximum amount of non-zero values in a row of the original transition matrix. It can be shown that this number is $z + 1$, where $z$ is the maximum output degree found in the neurons of the SNP system; specifically, $z = \max_i outdeg(i)$ (see the definition in Section 2.1). This can be derived from the composition of the transition matrix, where the row devoted to a rule contains the p values for every neuron i (columns) connected through an output synapse with the neuron where the rule belongs, and a $-c$ value for consuming the spikes in the neuron the rule belongs to.
- The values inside the columns can be sorted, so that the consumption of spikes (the $-c$ values) is placed at the first row. In this way, if implemented in parallel, all threads can start by doing the same task: consuming spikes.
- Every position of the compressed matrix is a pair (as illustrated in Figure 3), where the first element is a neuron label, and the second is the number of spikes produced or consumed.
A parallel code can be implemented with this design by assigning a thread to each rule, and so, one per position of the spiking vector and one per column of the compressed matrix (rows of the original transition matrix). For the vector-matrix multiplication, it is enough to have a loop of at most $z + 1$ steps through the rows of each column. In the loop of each column j, the corresponding value in the spiking vector (either 0 or 1) is multiplied by the value in the pair, and the result is added to the position of the corresponding neuron id in the configuration vector. In case the SNP network contains hubs (nodes with a high amount of input synapses), we can opt for a parallel reduction per column. Since some threads might write to the same positions in the configuration vector at the same time, a solution would be to use atomic add operations, which are available on devices such as GPUs.
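The following Python sketch emulates this computation sequentially; on a GPU, each column j would be handled by one thread and the updates on the configuration vector would be atomic. The toy system is the same hypothetical one used in the earlier numpy sketch:

```python
import numpy as np

def compute_next_ell(C, Sp, ell):
    """One transition with the ELL-compressed transition matrix.

    ell[r][j] is the (neuron, value) pair stored at row r of the column devoted
    to rule j: value < 0 consumes spikes, value > 0 produces spikes, and (0, 0)
    is padding. Here the loop over columns is sequential.
    """
    C = C.copy()
    for j in range(len(ell[0])):        # one "thread" per rule / ELL column
        if Sp[j] == 0:
            continue
        for r in range(len(ell)):       # at most z + 1 coalesced steps
            neuron, value = ell[r][j]
            if value == 0:
                break                   # padding reached
            C[neuron] += value
    return C

# Hypothetical toy run: the first row holds the consumption (-c) of each rule,
# and the remaining rows hold the produced spikes per synapse.
ell = [
    [(0, -1), (0, -2), (1, -1), (2, -1)],
    [(1,  1), (1,  1), (2,  1), (0,  0)],
    [(2,  1), (2,  1), (0,  0), (0,  0)],
]
print(compute_next_ell(np.array([2, 1, 0]), [0, 1, 1, 0], ell))   # -> [0 1 2]
```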
In order to use this representation in Algorithm 2, we only need to re-define functions INIT_RULE_MATRICES and COMPUTE_NEXT from Algorithm 3 (for sparse representation) as shown in Algorithm 5. The rest of functions remain unchanged.
| Algorithm 5 Functions for static SNP systems with ELL-based matrix representation. | |
|
|
3.2. Optimized Approach for Static Networks
If, in general, more than one rule is associated with each neuron, many of the iterations in the main loop of the COMPUTE_NEXT function are wasted. Indeed, if the loop is parallelized and each iteration is assigned to a thread, then many of them will be inactive (having a 0 in the spiking vector), causing performance drops such as branch divergence and non-coalesced memory access on GPUs. Moreover, note in Figure 3 that columns corresponding to rules belonging to the same neuron contain redundant information: the generation of spikes (the p values) is replicated for all synapses.
Therefore, a more efficient compressed matrix representation can be obtained by keeping the synapses separate from the rule information. We call this the optimized matrix representation, and it can be built with the following data structures:
- Rule vector. By using a CSR-like format (see Figure 4 for an example), rules of the form $E/a^c \rightarrow a^p$ (forgetting rules are also included, assuming $p = 0$) can be represented by an array storing the values c and p in a pair. We can use the already defined neuron-rule map vector to relate the subset of rules associated to each neuron.
Figure 4. Illustration of the optimized compressed matrix representation. Light cells in the synapse matrix are empty values (0), dark cells are positions with values greater than 0 (i.e., with neuron labels). The rows below illustrate threads, showing the level of parallelism that can be achieved (each column/neuron in parallel). The first column in the rule vector is a forgetting rule, where $p = 0$.
- Synapse matrix. It is a transposed matrix, as with the ELL representation (to better fit SIMD architectures such as GPU devices), but it has a column per neuron i and a row for every neuron j such that $(i, j) \in syn$ (there is a synapse). That is, every element of the matrix corresponds to a synapse (the neuron id) or a null value otherwise. Null values are employed for padding the columns, since the number of rows equals z (the maximum output degree in the neurons of the SNP system). See Figure 4 for an example.
- Spiking vector. It is modified to contain only q positions instead of m (i.e., one per neuron), and it states which rule is selected in each neuron.
Note that we replace the transition matrix by the pair formed by the rule vector and the synapse matrix. In order to compute the next configuration, it is enough to loop over the neurons. Then, for each neuron i, we check which rule j is selected, according to position i of the spiking vector. This is used to grab the pair (c, p) from the rule vector, and therefore consume c spikes in neuron i and add p spikes to the neurons found in column i of the synapse matrix. The loop over the column can end prematurely if the out degree of neuron i is less than z (that is, when encountering a null value). This operation can be easily parallelized by assigning a thread to each column of the synapse matrix (requiring q threads, one per neuron).
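A sequential Python sketch of this procedure follows, using the same hypothetical toy system as in the previous sketches; on a GPU, each neuron/column would be one thread:

```python
import numpy as np

def compute_next_optimized(C, Sp, RV, Syn):
    """One transition with the optimized (rule vector + synapse matrix) format.

    Sp[i] is the index of the rule selected in neuron i (-1 if none),
    RV[j] = (c, p) for rule j, and Syn[:, i] lists the neurons that neuron i
    has synapses to, padded with -1.
    """
    C = C.copy()
    for i in range(len(C)):              # one "thread" per neuron / column
        j = Sp[i]
        if j < 0:
            continue
        c, p = RV[j]
        C[i] -= c                        # consume spikes in neuron i
        if p == 0:
            continue                     # forgetting rule: nothing to send
        for target in Syn[:, i]:         # at most z rows
            if target < 0:
                break                    # padding: out degree of i reached
            C[target] += p
    return C

# Hypothetical toy run: synapses (0,1), (0,2), (1,2); neuron 0 selects rule 1,
# neuron 1 selects rule 2, and neuron 2 stays silent.
RV  = [(1, 1), (2, 1), (1, 1), (1, 0)]
Syn = np.array([[1,  2, -1],
                [2, -1, -1]])
print(compute_next_optimized(np.array([2, 1, 0]), [1, 2, -1], RV, Syn))   # -> [0 1 2]
```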
In order to use this optimized representation in Algorithm 2, we need to re-define the spiking selection function, since this vector works differently. To do this, it is enough to modify two lines in the definition of the COMBINATIONS function in Algorithm 4, in order to keep the spiking vector with size q and to store the rule id instead of just 1 or 0 (see Section 4 for more detail). Moreover, we need to define tailored INIT_RULE_MATRICES and COMPUTE_NEXT functions as shown in Algorithm 6, replacing those from Algorithm 3.
| Algorithm 6 Functions for static SNP systems with optimized compressed matrix representation. | |
|
|
3.3. Optimized Approach for Dynamic Networks
The optimized compressed matrix representation discussed in Section 3.2 can be further extended to support rules that modify the network, such as budding, division, or plasticity.
3.3.1. Budding and Division Rules
We start by analyzing how to simulate dynamic SNP systems with budding and division rules. They are supported at the same time in order to unify the pseudocode, and also because both kinds of rules usually appear together in the model.
First of all, the synapse matrix has to be flexible enough to host new neurons. This can be accomplished by allocating a matrix large enough to populate new neurons (possibly up to filling the whole available memory). We denote by $q_{max}$ the maximum amount of neurons that the simulator is able to support, and by $q_k$ the amount of neurons at a given step k. The formula to bound $q_{max}$ is given in Section 5. It is important to point out that the simulator needs to differentiate between neuron label and neuron id []. The reason for this separation is that we can have more than one neuron (with different ids) with the same label (and hence, the same rules).
In order to achieve this separation, it is enough to have a vector that maps each neuron id to its label. We will call this new vector the neuron-id map vector. The neuron-id map vector, the configuration vector, and the spiking vector have a size of $q_{max}$ as well. Once the label of a neuron is obtained, the information of its corresponding rules can be accessed as usual, through the neuron-rule map vector. For simplicity, we attach the neuron-id map vector to the transition tuple. Moreover, the synapse matrix becomes dynamic, thus using the k sub-index; hence, the transition matrix is also dynamic. Let us now introduce this new notation for the transition tuple and the transition matrix:
- The transition matrix is now a dynamic pair formed by the rule vector and the synapse matrix at step k.
- The transition tuple is extended with the neuron-id map vector at step k.
We use the following encoding for each type of rule. Spiking and forgetting rules remain unchanged:
- For a budding rule creating a neuron with label l: given that all pairs in the rule vector are of the form (c, p), and c is always greater than or equal to 1, we can encode a budding rule as the pair (0, l).
- For a division rule creating neurons with labels j and k: given that all pairs in the rule vector are of the form (c, p), and c is always greater than or equal to 1, we can encode a division rule as the pair (−j, k), using a negative first component.
The execution of a budding rule requires the following operations (see Figure 5 for an illustration):
- Let i be the neuron id executing this rule.
- Allocate a new column of the synapse matrix for the new neuron, and use its index as the new neuron id.
- Add an entry to the neuron-id map vector at the new position with the label l.
- Copy column i to the new column of the synapse matrix.
- Delete the content of column i and add a single element at the first row with the id of the new neuron.
Figure 5.
Illustration of the application of a budding rule in the synapse matrix with compressed representation. Light blue cells in the synapse matrix are empty values (0), dark cells are positions with values greater than 0 (i.e., with neuron ids), and light cells are empty columns allocated in memory (a total of $q_{max}$ columns are allocated). Neuron 1 is applying budding, and its content is copied to an empty column (5) and replaced by a single synapse to the created neuron.
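The following Python sketch (helper name and toy system are hypothetical) reproduces the budding steps listed above on a padded synapse matrix with pre-allocated empty columns:

```python
import numpy as np

def apply_budding(Syn, NI, n_k, i, new_label):
    """Apply a budding rule fired in the neuron with id i.

    Syn is the padded synapse matrix (one column per neuron id, -1 is padding),
    NI is the neuron-id map vector, and n_k the current number of neurons.
    Column n_k must already be allocated (empty). Returns the new neuron count.
    """
    new_id = n_k                     # the next free column becomes the new neuron
    NI[new_id] = new_label           # map the new id to its label
    Syn[:, new_id] = Syn[:, i]       # the new neuron inherits i's out-synapses
    Syn[:, i] = -1                   # i keeps a single synapse: to the new neuron
    Syn[0, i] = new_id
    return n_k + 1

# Hypothetical toy run: 4 initial neurons, 6 pre-allocated columns, z = 2.
Syn = -np.ones((2, 6), dtype=int)
Syn[:, 0] = [1, 3]                   # synapses (0,1), (0,3)
Syn[0, 1] = 2                        # synapse  (1,2)
Syn[0, 3] = 1                        # synapse  (3,1)
NI = np.array([0, 1, 2, 3, -1, -1])  # labels of neuron ids 0..3
n = apply_budding(Syn, NI, 4, 1, new_label=5)
print(Syn)                           # column 4 holds 1's old synapses; column 1 points to id 4
print(NI, n)                         # [ 0  1  2  3  5 -1] 5
```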
For a division rule creating labels j and k, the following operations have to be performed (see Figure 6 for an example):
- Let i be the neuron id executing this rule.
- Allocate a new column in the synapse matrix for the created neuron associated with label k.
- Modify the neuron-id map vector as follows: replace the value at position i with label j, and add a new entry for the new neuron id to associate it with label k.
- Copy column i to the new column of the synapse matrix (the generated neuron gets the out-synapses of the parent).
- Find all occurrences of i in the synapse matrix, and add the new neuron id to the columns where it is found.
Figure 6.
Illustration of the application of a division rule in the synapse matrix with compressed representation. Light blue cells in the synapse matrix are empty values (0), dark cells are positions with values greater than 0 (i.e., with neuron ids), and light cells are empty columns allocated in memory (a total of $q_{max}$ columns are allocated). Neuron 1 is being divided, and its content is copied to an empty column (5). Columns 0 and 3 represent neurons with a synapse to the neuron being divided (1), so we need to update them as well with the synapse to the created neuron (5). Neuron 3 has reached its limit of maximum out degree; therefore, we need to expand the matrix with a new row, or use a COO-like scheme to store these exceeded elements.
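Similarly, a sketch of the division steps, including the row expansion mentioned in the caption when a column has already reached the maximum out degree z, could look as follows (again, the helper name and toy system are assumptions for illustration):

```python
import numpy as np

def apply_division(Syn, NI, n_k, i, label_j, label_k):
    """Apply a division rule fired in the neuron with id i.

    The dividing neuron keeps its column (relabelled label_j), the new neuron
    gets a fresh column (label_k) with a copy of i's out-synapses, and every
    column with a synapse to i also gains a synapse to the new neuron,
    expanding the matrix by one row if a column is already full.
    Returns the (possibly re-allocated) matrix and the new neuron count.
    """
    new_id = n_k
    NI[i] = label_j                          # relabel the dividing neuron
    NI[new_id] = label_k                     # label of the new neuron
    Syn[:, new_id] = Syn[:, i]               # copy out-synapses to the new column
    for col in range(n_k):                   # columns with an in-synapse to i ...
        if i in Syn[:, col]:
            free = np.where(Syn[:, col] < 0)[0]
            if len(free) == 0:               # out degree already at z: EXPAND_MATRIX
                Syn = np.vstack([Syn, -np.ones((1, Syn.shape[1]), dtype=int)])
                free = [Syn.shape[0] - 1]
            Syn[free[0], col] = new_id       # ... also point to the new neuron
    return Syn, n_k + 1

# Hypothetical toy run: neurons 0 and 3 have a synapse to neuron 1, which divides.
Syn = -np.ones((2, 6), dtype=int)
Syn[0, 0], Syn[1, 0] = 1, 3                  # (0,1), (0,3): out degree of 0 is already z
Syn[0, 1] = 2                                # (1,2)
Syn[0, 3], Syn[1, 3] = 1, 2                  # (3,1), (3,2)
NI = np.array([0, 1, 2, 3, -1, -1])
Syn, n = apply_division(Syn, NI, 4, 1, label_j=4, label_k=5)
print(Syn)
print(NI, n)                                 # [ 0  4  2  3  5 -1] 5
```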
The last operation can be very expensive if the amount of neurons is large, since it requires looping over the whole synapse matrix. Moreover, when adding the new neuron id to all the columns containing i, it would be possible to exceed the predetermined size z. For this situation, a special array of overflows is needed, like in the ELL + COO format for SpMV []. For simplicity, we will assume this situation is rare, and the algorithm will allocate a new row for the synapse matrix.
Some functions in the pseudocode are re-defined to support dynamic networks with division and budding:
- INIT functions, as in Algorithm 7. They now take into account the initialization of the structures at their maximum size $q_{max}$, including the new neuron-id map vector.
- SPIKING_VECTORS function, as defined in Algorithm 4 and modified in Section 3.2 for optimized matrix representation, is slightly modified (just two lines) to support the neuron-id map vector.
- APPLICABLE function as in Algorithm 8. This function, when dealing with division rules, has to search if there are existing synapses for the neurons involved. If they exist, the division rule does not apply.
- COMPUTE_NEXT function, as in Algorithm 9, to include the operations described above. It now needs to expand the synapse matrix either by columns (when new neurons are created) or by rows when there is a neuron from which we need to create a synapse to the new neuron and it already has the maximum out degree z. In this case, we need to re-allocate the synapse matrix in order to extend it by one row (this is written in the pseudocode with the function EXPAND_MATRIX). Finally, let us remark that we can easily detect the type of a rule from its associated value: if it is 0, it is a budding rule; if it is a positive number, a spiking rule; otherwise (a negative value), it is a division rule.
| Algorithm 7 Initialization functions for dynamic SNP systems with budding and division rules over optimized compressed matrix representation. | |
|
|
| Algorithm 8 Applicable functions for dynamic SNP systems with budding rules over optimized compressed matrix representation. | |
|
|
| Algorithm 9 Compute next function for dynamic SNP systems with budding and division rules using optimized compressed matrix representation. | |
|
|
3.3.2. Plasticity Rules
For dynamic SNP systems with plasticity rules, the synapse matrix can be allocated in advance with the exact size q, since no new neurons are created. Thus, there is no need to use a neuron-id map vector as before. However, enough rows in the synapse matrix have to be pre-established to support the maximum amount of synapses; we denote this number of rows by $z'$. Fortunately, it can be pre-computed by looking at the initial out degrees of the neurons and the size of the neuron sets in the plasticity rules adding synapses. We consider plasticity rules of the form $E/a^c \rightarrow \alpha k(i, N)$, with $\alpha \in \{+, -, \pm, \mp\}$, as defined in Section 2.1.3. The value $z'$ is the maximum, over the neurons, of the initial out degree plus the number of new connections that can be created by the plasticity rules inside that neuron. This bound can be refined for plasticity rules having a bounded k, because we know that up to k new synapses can be created at a time. However, for simplicity, we will use the bound above.
First, we need to represent plasticity rules as vectors. We assume that the total amount of plasticity rules in the system is known and that there is a total order between these rules. Given a plasticity rule, we can initialize the neuron-rule map and the preconditions vector as with spiking rules. But in this case, we need a couple of new vectors, and to modify existing ones, in order to represent all plasticity rules (following the imposed total order):
- Rule vector stores the following pair for a plasticity rule with index j: (c, −j), that is, the consumed spikes and the unique index of the plasticity rule. This index is used to access the following vector, and it is stored as a negative value in order to detect that this is a plasticity rule.
- Plasticity rule vector contains a tuple for each plasticity rule, storing $\alpha$, k, and two indexes that delimit the start and the end of its neuron set in the following vector.
- Plasticity neuron vector represents all the neuron sets of the plasticity rules. Thus, the elements of the neuron set N of each plasticity rule are stored, in an ordered way, between the start and end indexes given by the plasticity rule vector.
- Time vector is used to prevent neurons from applying rules during one step when the plasticity rule applied was of type $\pm$ or $\mp$. It contains binary (0 or 1) values.
- The transition matrix is therefore replaced by these vectors together with the synapse matrix. Note that the synapse matrix can be modified at each step, so we use the sub-index k.
The following operations have to be performed to reproduce the behavior of plasticity rules (see Figure 7 for an illustration):
- For each column i of the synapse matrix executing a plasticity rule deleting x synapses:
- (a)
- If the intersection of the rule's neuron set and the current synapses in column i is larger than x, then randomly select x synapses.
- (b)
- Loop through the rows (up to $z'$ iterations) to search the selected neurons and set them to null. Given that holes might appear in the column, its values can be sorted (or compacted).
- For each column i of the synapse matrix executing a plasticity rule adding x synapses:
- (a)
- If the difference of the rule's neuron set and the current synapses in column i is larger than x, then randomly select x neurons.
- (b)
- Loop through the rows (up to $z'$ iterations) to insert the selected new synapses while keeping the order.
Figure 7.
Illustration of the application of a plasticity rule in the synapse matrix with compressed representation. Light blue cells in the synapse matrix are empty values (0), dark cells are positions with values greater than 0 (i.e., with neuron labels). Two examples are given: adding new synapses (top) and deleting synapses (bottom). We sort the synapses per column for more efficiency.
Checking the applicability of plasticity rules is much simpler than for division rules, given that the preconditions only affect the local neuron, and we do not need to know whether certain synapses already exist. However, for a plasticity rule r in a neuron i, in order to create new synapses or delete existing ones, we need to check which neurons declared in r are already in column i of the synapse matrix. A naive search takes time proportional to the product of the column size and the length of the neuron set of r. Nevertheless, by always maintaining the order in the column, this search can be done easily in linear time.
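A minimal Python sketch of the per-column plasticity update follows; the helper name and toy values are assumptions, the sorted column plays the role of pres(i), and the spike sent through each newly created synapse (see Section 2.1.3) is omitted for brevity:

```python
import random

def apply_plasticity(column, neuron_set, alpha, k):
    """Update one sorted, -1-padded synapse-matrix column for a plasticity rule.

    alpha is '+' (create synapses) or '-' (delete synapses) and k is the number
    of synapses to create or delete; neuron_set plays the role of N.
    On a GPU, each column would be handled by one thread; this is sequential.
    """
    current = sorted(x for x in column if x >= 0)
    if alpha == '+':
        candidates = sorted(set(neuron_set) - set(current))       # DIFF
        chosen = candidates if len(candidates) <= k else random.sample(candidates, k)
        current = sorted(current + chosen)
    else:
        candidates = sorted(set(neuron_set) & set(current))       # INTERSEC
        chosen = candidates if len(candidates) <= k else random.sample(candidates, k)
        current = sorted(x for x in current if x not in chosen)
    return current + [-1] * (len(column) - len(current))          # keep it sorted and padded

# Hypothetical toy run with z' = 4: the column has synapses to neurons 2 and 5.
col = [2, 5, -1, -1]
print(apply_plasticity(col, neuron_set=[1, 3, 5], alpha='+', k=2))   # -> [1, 2, 3, 5]
print(apply_plasticity(col, neuron_set=[1, 3, 5], alpha='-', k=1))   # -> [2, -1, -1, -1]
```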
Given that it is not usual to have budding and division rules together with plasticity rules, the pseudocode is based on the optimized matrix representation for static SNP systems (and not on the one for division and budding) in Section 3.2. Algorithm 10 shows the re-definition of the INIT_RULE_MATRICES and COMPUTE_NEXT functions, replacing those from Algorithm 3. For COMPUTE_NEXT, the implementation is very similar to the original one, but it just calls a new function, PLASTICITY, which actually modifies the synapses of the neuron (by modifying its corresponding column in the synapse matrix). This function and its auxiliaries are defined in Algorithms 11 and 12, respectively.
| Algorithm 10 Functions for dynamic SNP systems with plasticity rules using optimized compressed matrix representation. | |
|
|
4. Algorithms
In this section we define the algorithms implementing the methods described in Section 3.
Let us first define a generic function to create a new, empty (all values set to 0) vector of size s as follows: EMPTY_VECTOR(s). In order to create an empty matrix with f rows and c columns, we will use the following function: EMPTY_MATRIX(f, c). Next, the pseudocodes for simulating static SNP systems with the sparse representation are given. Algorithm 3 shows the INIT and COMPUTE_NEXT functions, while Algorithm 4 shows the selection of spiking vectors.
For ELL-based matrix representation for static SNP systems, we need to re-define only two functions (INIT_RULE_MATRICES and COMPUTE_NEXT) from Algorithm 3 (static SNP systems with sparse representation) as shown in Algorithm 5.
For our optimized matrix representation for static SNP systems, we need to re-define only two functions (INIT_RULE_MATRICES and COMPUTE_NEXT) from Algorithm 3 (static SNP systems with sparse representation), as shown in Algorithm 6. Moreover, the following two lines in the definition of the COMBINATIONS function in Algorithm 4 must be modified, in order to support a spiking vector of size q:
- Line 13 at Algorithm 4: EMPTY_VECTOR(q)
- Line 22 at Algorithm 4:
For dynamic SNP systems with budding and division rules, the following functions are redefined: INIT functions as in Algorithm 7, APPLICABLE function as in Algorithm 8, and COMPUTE_NEXT function as in Algorithm 9. The SPIKING_VECTORS function, as defined in Algorithm 4 and modified in Section 3.2 for optimized matrix representation, is slightly modified (just two lines) to support the neuron-id map vector as follows:
- Line 6 at Algorithm 4:
- Line 18 at Algorithm 4: for
For dynamic SNP systems with plasticity rules, the pseudocode is based on the optimized matrix representation for static SNP systems (and not on the one for division and budding) in Section 3.2. Algorithm 10 shows the re-definition of the INIT_RULE_MATRICES and COMPUTE_NEXT functions, replacing those from Algorithm 3. As for line 17, we assume that the function SORT exists, which takes a set of neurons, sorts them by id, and generates a vector. Moreover, we assume that vectors can be copied directly from one position with just one assignment. The new PLASTICITY function is defined in Algorithm 11, and its auxiliary functions are defined in Algorithm 12.
| Algorithm 11 Function for plasticity mechanism using optimized compressed matrix representation. | |
|
|
| Algorithm 12 Auxiliary functions for plasticity mechanism using optimized compressed matrix representation. | |
|
|
In order to keep Algorithm 12 simple, we assume that the functions INTERSEC, DIFF, and DELETE_RANDOM are already defined. As mentioned above, INTERSEC and DIFF can be implemented with linear-time algorithms, given that the vectors involved (a column of the synapse matrix and a chunk of the plasticity neuron vector) are already sorted. We also assume that DELETE_RANDOM is a function that randomly selects k elements from a total of n while keeping the order between elements; this can also be done in linear time.
5. Results
In this section we conduct a complexity analysis (for both time and memory) of the algorithms. In order to define the formulas, we need to introduce a set of descriptors for a spiking neural P system. These are described in Table 1. Moreover, Table 2 summarizes the vectors and matrices employed by each representation, and their corresponding sizes defined according to the descriptors. We use the following short names for the representations: sparse (original sparse representation as in Section 3), ELL (ELL compressed representation as in Section 3.1), optimized static (optimized static compressed representation as in Section 3.2), division and budding (optimized dynamic compressed representation for division and budding as in Section 3.3.1), and plasticity (optimized dynamic compressed representation for plasticity as in Section 3.3.2).
Table 1.
Descriptors of an SNP system.
Table 2.
Size of the matrices employed in the representations for SNP systems. Those whose names are in bold were used for the total calculation, which assumes just one spiking vector.
According to Table 2, we can bound the value of $q_{max}$ for dynamic SNP systems with division and budding with a formula that depends on the maximum amount of memory available in the system (measured in the word size employed to encode the elements of all the matrices and vectors; e.g., 4 bytes). Moreover, we can infer when each matrix representation will be smaller for static SNP systems: ELL is better than sparse, and optimized is better than ELL, under conditions that relate the maximum out degree of the neurons to the total number of rules; in particular, optimized is better than sparse when $z < m - 2$. In other words, our optimized compressed representation is worthwhile when the maximum out degree of the neurons is less than the total number of rules minus 2.
For dynamic SNP systems, we can say that a solution to a problem using an SNP system with plasticity rules is better than a solution based on division and budding if the maximum amount of neurons to generate is known and greater than a formula based on the number of initial neurons, the number of rules, the number of elements in the neuron sets, and the maximum out degree; in that case, the solution will need less memory using plasticity.
Finally, Table 3 shows the order of complexity of each function as defined for each representation. We can see that COMPUTE_NEXT is also reduced in complexity when using the optimized static representation against ELL and sparse, given that we expect the maximum out degree and the number of neurons to be smaller than the number of rules. However, we can see that implementing division and budding blows up the complexity of the algorithms, since they need to loop over all the neurons checking for in-synapses; this also depends on the total amount of generated neurons at a given step. This is also the case for the generation of spiking vectors, because the applicability function also needs to loop over all existing neurons. However, for dynamic networks, plasticity keeps the complexity bounded by the amount of neurons, the value z, and the descriptors of plasticity rules (the maximum value of k and the maximum amount of neurons in a neuron set).
Table 3.
Algorithmic order of complexity of the main functions employed in the simulation loop (i.e., excluding init functions) for each representation.
Therefore, we can see that using our compressed representations, both the memory footprint of the simulators and their complexity are reduced, as long as the maximum out degree of neurons is a low number. Furthermore, we can see that for dynamic networks, plasticity is an option that keeps the complexity balanced, since we know in advance the amount of neurons and synapses.
Let us make a simple comparison using an example from the literature. If we take the SNP system for sorting natural numbers as defined in [], then the descriptors are determined by n, the amount of natural numbers to sort. Thus:
- The size of the sparse representation is and the complexity of COMPUTE_NEXT is .
- The size of the ELL representation is and the complexity of COMPUTE_NEXT is .
- The size of the optimized representation is and the complexity of COMPUTE_NEXT is .
The optimized representation drastically decreases the order of complexity and the amount of memory spent by the algorithms. ELL has a similar order of complexity to that of sparse, and the amount of memory is only slightly decreased. Figure 8 shows the point from which the reduction of the memory footprint achieved with the compressed representations takes effect. Figure 9 shows that the optimized representation scales better than ELL and sparse. ELL is only slightly better than the sparse representation, demonstrating the need for the optimized one, which scales significantly better.
Figure 8.
Memory size of the matrix representation (Y-axis) depending on the amount of natural numbers (n, X-axis) for the model of natural number sorting in [], using the sparse, ELL, and optimized representations.
Figure 9.
Memory size of the matrix representation (Y-axis in log scale) depending on the amount of natural numbers (n, X-axis in log scale) for the model of natural number sorting, using sparse, ELL, and optimized representation, for .
Next, we also analyze a uniform solution to 3SAT with SNP systems without delays, as in [] (Figure 10). In this case, the descriptors depend on n, the number of variables in the 3SAT instance, and our optimized implementation is able to save some memory. Therefore:
- The size of the sparse representation is and the complexity of COMPUTE_NEXT is .
- The size of the ELL representation is and the complexity of COMPUTE_NEXT is .
- The size of the optimized representation is and the complexity of COMPUTE_NEXT is .
Figure 10.
Memory size of the matrix representation (Y-axis in log scale) depending on the number of variables in the SAT formula (n, X-axis in log scale) for the model of 3SAT, using the sparse, ELL, and optimized representations.
We can see that the memory footprint is decreased, but it is still of the same order of magnitude, and the same happens with the computational complexity. Thus, our representation helps to reduce memory, although not significantly for this specific solution. This is mainly due to the high value of z. We can see in Figure 10 that, as n increases, the reduction of memory takes place only for the optimized representation. It is interesting to see that the ELL representation is even worse than just using the sparse representation.
Finally, let us analyze the size of the uniform solution to Subset Sum with plasticity rules in []. The descriptors for the matrix representation of this dynamic SNP system with plasticity rules depend on n, the size of the set V in the instance, and so does the resulting memory footprint. If we were using a sparse representation, whose transition matrix grows with the product of the number of rules and neurons, the amount of memory would be of a higher order.
6. Conclusions
In this paper, we addressed the problem of having very sparse matrices in the matrix representation of SNP systems. Usually, the graph defined for an SNP system is not fully connected, leading to sparse matrices. This drastically downgrades the performance of the simulators. However, sparse matrices are a known issue in other disciplines, and efficient representations have been introduced in the literature. There are even solutions tailored for parallel architectures such as GPUs.
We propose two efficient compressed representations for SNP systems, one based on the classic format ELL, and an optimized one based on a combination of CSR and ELL. This representation gives room to support rules for dynamic networks: division, budding, and plasticity. The representation for plasticity poses more advantages than the one for division and budding, since the synapse matrix size can be pre-computed. Thus, no label mapping nor empty columns to host new neurons are required. Moreover, simulating the creation of new neurons in parallel can damage the performance of the simulator significantly, because this operation can be sequential. Plasticity rules do not create new neurons, so this is avoided.
As future work, we plan to provide implementations of these designs within the cuSNP [] and P-Lingua [] frameworks to provide high-performance simulations with real examples from the literature. We believe that these concepts will help to bring efficient tools to simulate SNP systems on GPUs, enabling the simulation of large networks in parallel. Specifically, we will use these designs to develop a new framework for automatically designing SNP systems using genetic algorithms []. Other tools that could benefit from the inclusion of this new type of representation are visual tools for SNP systems []. Moreover, our optimized designs will enable the effective usage of spiking neural P systems in industrial processes such as [,,,], and in optimization applications such as [,]. SNP systems have been used in many applications [], and in order to be used in industrial applications we need efficient simulators, where compressed representations of sparse matrices can help.
Numerical SNP systems (or NSNP systems) [,] are SNP system variants which are largely dissimilar to many variants of SNP systems, especially to the variants considered in this paper, for at least two main reasons: (1) rules in NSNP systems do not use regular expressions, and instead use linear functions, so that rules are applied when certain values or threshold of the variables in such functions are satisfied, and (2) the variables in the functions are real-valued, unlike the natural numbers associated with strings and regular expressions. One of the main goals in [] for introducing NSNP systems is to create an SNP system variant, which in a future work may be more feasible for use with training algorithms in traditional neural networks []. For these reasons, we plan to extend our algorithms and compressed data structures for NSNP systems. We think that simulators for this variant can be effectively accelerated on GPUs. Specifically, GPUs are devices designed for floating point operations and not for integer arithmetic, although the latter is supported.
We also plan to include more models and ingredients into these new methods, such as delays, weights, dendrites, rules on synapses, and scheduled synapses, among others. Moreover, a recent work on SNP systems with plasticity shows that having the same set of rules in all neurons still leads to Turing completeness []. This means that the rule descriptors can be common to all neurons, leading to smaller representations for this kind of system. We plan to study this further and combine it with our representations. Our focus on plasticity is also related to other results involving this ingredient in other fields, such as machine learning [].
Author Contributions
Conceptualization, M.Á.M.-d.-A., and F.G.C.C.; methodology, M.Á.M.-d.-A. and D.O.-M.; validation, M.Á.M.-d.-A., F.G.C.C. and D.O.-M.; formal analysis, D.O.-M., I.P.-H. and H.N.A.; investigation, M.Á.M.-d.-A. and F.G.C.C.; resources, D.O.-M., F.G.C.C. and I.P.-H.; writing—original draft preparation, M.Á.M.-d.-A., D.O.-M., I.P.-H., F.G.C.C., H.N.A.; writing—review and editing, M.Á.M.-d.-A., D.O.-M., I.P.-H., F.G.C.C., H.N.A.; supervision, I.P.-H. and H.N.A.; All authors have read and agreed to the published version of the manuscript.
Funding
Financiado por: FEDER/Ministerio de Ciencia e Innovación—Agencia Estatal de Investigación/_Proyecto (TIN2017-89842-P). F.G.C. Cabarle is supported in part by the ERDT program of the DOST-SEI, Philippines, and the Dean Ruben A. Garcia PCA from UP Diliman. H.N. Adorna is supported by Semirara Mining Corp Professorial Chair for Computer Science, RLC grant from UPD OVCRD, and ERDT-DOST research grant.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Data sharing not applicable.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Păun, G. Computing with membranes. J. Comput. Syst. Sci. 2000, 61, 108–143; first circulated as TUCS Report No. 208, 1998. [Google Scholar] [CrossRef]
- Song, B.; Li, K.; Orellana-Martín, D.; Pérez-Jiménez, M.J.; Pérez-Hurtado, I. A Survey of Nature-Inspired Computing: Membrane Computing. ACM Comput. Surv. 2021, 54. [Google Scholar] [CrossRef]
- Arteta Albert, A.; Díaz-Flores, E.; López, L.F.D.M.; Gómez Blas, N. An In Vivo Proposal of Cell Computing Inspired by Membrane Computing. Processes 2021, 9, 511. [Google Scholar] [CrossRef]
- Ionescu, M.; Păun, G.; Yokomori, T. Spiking Neural P Systems. Fundam. Inform. 2006, 71, 279–308. [Google Scholar]
- Fan, S.; Paul, P.; Wu, T.; Rong, H.; Zhang, G. On Applications of Spiking Neural P Systems. Appl. Sci. 2020, 10, 7011. [Google Scholar] [CrossRef]
- Păun, G.; Pérez-Jiménez, M.J. Spiking Neural P Systems. Recent Results, Research Topics. Algorithmic Bioprocess. 2009, 273–291. [Google Scholar] [CrossRef]
- Rong, H.; Wu, T.; Pan, L.; Zhang, G. Spiking neural P systems: Theoretical results and applications. In Enjoying Natural Computing; Springer: Berlin/Heidelberg, Germany, 2018; pp. 256–268. [Google Scholar]
- Ibarra, O.; Leporati, A.; Păun, A.; Woodworth, S. Spiking Neural P Systems. In The Oxford Handbook of Membrane Computing; Păun, G., Rozenberg, G., Salomaa, A., Eds.; Oxford University Press: Oxford, UK, 2010; pp. 337–362. [Google Scholar]
- Pan, L.; Wu, T.; Zhang, Z. A Bibliography of Spiking Neural P Systems; Technical Report; Bulletin of the International Membrane Computing Society: Sevilla, Spain, 2016. [Google Scholar]
- Wang, J.; Hoogeboom, H.J.; Pan, L.; Păun, G.; Pérez-Jiménez, M.J. Spiking Neural P Systems with Weights. Neural Comput. 2010, 22, 2615–2646. [Google Scholar] [CrossRef]
- Pan, L.; Wang, J.; Hoogeboom, H.J. Spiking Neural P Systems with Astrocytes. Neural Comput. 2012, 24, 805–825. [Google Scholar] [CrossRef]
- Song, X.; Wang, J.; Peng, H.; Ning, G.; Sun, Z.; Wang, T.; Yang, F. Spiking neural P systems with multiple channels and anti-spikes. Biosystems 2018, 169–170, 13–19. [Google Scholar] [CrossRef]
- Peng, H.; Bao, T.; Luo, X.; Wang, J.; Song, X.; Riscos-Núñez, A.; Pérez-Jiménez, M.J. Dendrite P systems. Neural Netw. 2020, 127, 110–120. [Google Scholar] [CrossRef] [PubMed]
- Song, T.; Pan, L.; Păun, G. Spiking neural P systems with rules on synapses. Theor. Comput. Sci. 2014, 529, 82–95. [Google Scholar] [CrossRef]
- Cabarle, F.G.C.; Adorna, H.N.; Jiang, M.; Zeng, X. Spiking neural P systems with scheduled synapses. IEEE Trans. Nanobiosci. 2017, 16, 792–801. [Google Scholar] [CrossRef]
- Lazo, P.P.L.; Cabarle, F.G.C.; Adorna, H.N.; Yap, J.M.C. A return to stochasticity and probability in spiking neural P systems. J. Membr. Comput. 2021, 1–13. [Google Scholar] [CrossRef]
- Wu, T.; Pan, L.; Yu, Q.; Tan, K.C. Numerical Spiking Neural P Systems. IEEE Trans. Neural Netw. Learn. Syst. 2020. [Google Scholar] [CrossRef]
- Macías-Ramos, L.F.; Pérez-Hurtado, I.; García-Quismondo, M.; Valencia-Cabrera, L.; Pérez-Jiménez, M.J.; Riscos-Núñez, A. A P-Lingua Based Simulator for Spiking Neural P Systems. In Proceedings of the 12th International Conference on Membrane Computing; Springer: Berlin/Heidelberg, Germany, 2011; Volume 7184, pp. 257–281. [Google Scholar] [CrossRef]
- Zeng, X.; Adorna, H.; Martínez-del-Amor, M.A.; Pan, L.; Pérez-Jiménez, M.J. Matrix Representation of Spiking Neural P Systems. In Proceedings of the 11th International Conference on Membrane Computing, Jena, Germany, 24–27 August 2010; Volume 6501, pp. 377–391. [Google Scholar] [CrossRef]
- Fatahalian, K.; Sugerman, J.; Hanrahan, P. Understanding the Efficiency of GPU Algorithms for Matrix-Matrix Multiplication. In Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware; Association for Computing Machinery: New York, NY, USA, 2004; pp. 133–137. [Google Scholar] [CrossRef]
- Carandang, J.; Villaflores, J.; Cabarle, F.; Adorna, H.; Martínez-del-Amor, M. CuSNP: Spiking Neural P Systems Simulators in CUDA. Rom. J. Inf. Sci. Technol. 2017, 20, 57–70. [Google Scholar]
- Carandang, J.; Cabarle, F.; Adorna, H.; Hernandez, N.; Martínez-del-Amor, M.A. Nondeterminism in Spiking Neural P Systems: Algorithms and Simulations. In Proceedings of the 6th Asian Conference on Membrane Computing, Chengdu, China, 21–25 September 2017. [Google Scholar]
- Cabarle, F.G.C.; Adorna, H.N.; Martínez-del-Amor, M.Á.; Pérez-Jiménez, M.J. Improving GPU simulations of spiking neural P systems. Rom. J. Inf. Sci. Technol. 2012, 15, 5–20. [Google Scholar]
- Carandang, J.P.; Cabarle, F.G.C.; Adorna, H.N.; Hernandez, N.H.S.; Martínez-del-Amor, M.Á. Handling Non-determinism in Spiking Neural P Systems: Algorithms and Simulations. Fundam. Inform. 2019, 164, 139–155. [Google Scholar] [CrossRef]
- Ochirbat, O.; Ishdorj, T.O.; Cichon, G. An error-tolerant serial binary full-adder via a spiking neural P system using HP/LP basic neurons. J. Membr. Comput. 2020, 2, 42–48. [Google Scholar] [CrossRef]
- Martínez-del-Amor, M.A.; García-Quismondo, M.; Macías-Ramos, L.F.; Valencia-Cabrera, L.; Riscos-Núñez, A.; Pérez-Jiménez, M.J. Simulating P systems on GPU devices: A survey. Fundam. Inform. 2015, 136, 269–284. [Google Scholar] [CrossRef]
- Muniyandi, R.C.; Maroosi, A. A Representation of Membrane Computing with a Clustering Algorithm on the Graphical Processing Unit. Processes 2020, 8, 1199. [Google Scholar] [CrossRef]
- Martínez-del-Amor, M.; Pérez-Hurtado, I.; Orellana-Martín, D.; Pérez-Jiménez, M.J. Adaptative parallel simulators for bioinspired computing models. Future Gener. Comput. Syst. 2020, 107, 469–484. [Google Scholar] [CrossRef]
- Martínez-del-Amor, M.Á.; Orellana-Martín, D.; Cabarle, F.G.C.; Pérez-Jiménez, M.J.; Adorna, H.N. Sparse-matrix representation of spiking neural P systems for GPUs. In Proceedings of the 15th Brainstorming Week on Membrane Computing, Sevilla, Spain, 31 January–5 February 2017; pp. 161–170. [Google Scholar]
- Aboy, B.C.D.; Bariring, E.J.A.; Carandang, J.P.; Cabarle, F.G.C.; de la Cruz, R.T.A.; Adorna, H.N.; Martínez-del-Amor, M.Á. Optimizations in CuSNP Simulator for Spiking Neural P Systems on CUDA GPUs. In Proceedings of the 17th International Conference on High Performance Computing & Simulation, Dublin, Ireland, 15–19 July 2019; pp. 535–542. [Google Scholar] [CrossRef]
- AlAhmadi, S.; Mohammed, T.; Albeshri, A.; Katib, I.; Mehmood, R. Performance Analysis of Sparse Matrix-Vector Multiplication (SpMV) on Graphics Processing Units (GPUs). Electronics 2020, 9, 1675. [Google Scholar] [CrossRef]
- Adorna, H.; Cabarle, F.; Macías-Ramos, L.; Pan, L.; Pérez-Jiménez, M.; Song, B.; Song, T.; Valencia-Cabrera, L. Taking the pulse of SN P systems: A Quick Survey. In Multidisciplinary Creativity; Spandugino: Bucharest, Romania, 2015; pp. 1–16. [Google Scholar]
- Cabarle, F.G.; Adorna, H.N.; Pérez-Jiménez, M.J.; Song, T. Spiking Neural P Systems with Structural Plasticity. Neural Comput. Appl. 2015, 26, 1905–1917. [Google Scholar] [CrossRef]
- Cabarle, F.G.C.; Hernandez, N.H.S.; Martínez-del-Amor, M.Á. Spiking neural P systems with structural plasticity: Attacking the subset sum problem. In International Conference on Membrane Computing; Springer: Berlin/Heidelberg, Germany, 2015; pp. 106–116. [Google Scholar]
- Pan, L.; Păun, G.; Pérez-Jiménez, M. Spiking neural P systems with neuron division and budding. Sci. China Inf. Sci. 2011, 54, 1596–1607. [Google Scholar] [CrossRef]
- Jimenez, Z.; Cabarle, F.; de la Cruz, R.T.; Buño, K.; Adorna, H.; Hernandez, N.; Zeng, X. Matrix representation and simulation algorithm of spiking neural P systems with structural plasticity. J. Membr. Comput. 2019, 1, 145–160. [Google Scholar] [CrossRef]
- Cabarle, F.G.C.; de la Cruz, R.T.A.; Cailipan, D.P.P.; Zhang, D.; Liu, X.; Zeng, X. On solutions and representations of spiking neural P systems with rules on synapses. Inf. Sci. 2019, 501, 30–49. [Google Scholar] [CrossRef]
- Orellana-Martín, D.; Martínez-del-Amor, M.; Valencia-Cabrera, L.; Pérez-Hurtado, I.; Riscos-Núñez, A.; Pérez-Jiménez, M.J. Dendrite P Systems Toolbox: Representation, Algorithms and Simulators. Int. J. Neural Syst. 2021, 31, 2050071. [Google Scholar] [CrossRef] [PubMed]
- Kirk, D.B.; Hwu, W.W. Programming Massively Parallel Processors: A Hands-on Approach, 3rd ed.; Morgan Kaufmann Publishers Inc.: San Francisco, CA, USA, 2016. [Google Scholar]
- NVIDIA CUDA C Programming Guide. Available online: https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html (accessed on 15 February 2021).
- Bell, N.; Garland, M. Efficient Sparse Matrix-Vector Multiplication on CUDA; NVIDIA Technical Report NVR-2008-004; NVIDIA Corporation: Santa Clara, CA, USA, 2008. [Google Scholar]
- Ionescu, M.; Sburlan, D. Some Applications of Spiking Neural P Systems. Comput. Inform. 2008, 27, 515–528. [Google Scholar]
- Leporati, A.; Mauri, G.; Zandron, C.; Păun, G.; Pérez-Jiménez, M.J. Uniform Solutions to SAT and Subset Sum by Spiking Neural P Systems. Nat. Comput. Int. J. 2009, 8, 681–702. [Google Scholar] [CrossRef]
- Pérez-Hurtado, I.; Orellana-Martín, D.; Zhang, G.; Pérez-Jiménez, M.J. P-Lingua in two steps: Flexibility and efficiency. J. Membr. Comput. 2019, 1, 93–102. [Google Scholar] [CrossRef]
- Casauay, L.J.P.; Cabarle, F.G.C.; Macababayao, I.C.H.; Adorna, H.N.; Zeng, X.; Martínez-del-Amor, M.Á. A Framework for Evolving Spiking Neural P Systems. Int. J. Unconv. Comput. 2021, 16, 121–139. [Google Scholar]
- Fernandez, A.D.C.; Fresco, R.M.; Cabarle, F.G.C.; de la Cruz, R.T.A.; Macababayao, I.C.H.; Ballesteros, K.J.; Adorna, H.N. Snapse: A Visual Tool for Spiking Neural P Systems. Processes 2021, 9, 72. [Google Scholar] [CrossRef]
- Lin, H.; Zhao, B.; Liu, D.; Alippi, C. Data-based fault tolerant control for affine nonlinear systems through particle swarm optimized neural networks. IEEE/CAA J. Autom. Sin. 2020, 7, 954–964. [Google Scholar] [CrossRef]
- Zerari, N.; Chemachema, M.; Essounbouli, N. Neural network based adaptive tracking control for a class of pure feedback nonlinear systems with input saturation. IEEE/CAA J. Autom. Sin. 2019, 6, 278–290. [Google Scholar] [CrossRef]
- Gao, S.; Zhou, M.; Wang, Y.; Cheng, J.; Yachi, H.; Wang, J. Dendritic Neuron Model With Effective Learning Algorithms for Classification, Approximation, and Prediction. IEEE Trans. Neural Netw. Learn. Syst. 2019, 30, 601–614. [Google Scholar] [CrossRef] [PubMed]
- Shang, M.; Luo, X.; Liu, Z.; Chen, J.; Yuan, Y.; Zhou, M. Randomized latent factor model for high-dimensional and sparse matrices from industrial applications. IEEE/CAA J. Autom. Sin. 2019, 6, 131–141. [Google Scholar] [CrossRef]
- Liu, W.; Luo, F.; Liu, Y.; Ding, W. Optimal Siting and Sizing of Distributed Generation Based on Improved Nondominated Sorting Genetic Algorithm II. Processes 2019, 7, 955. [Google Scholar] [CrossRef]
- Pan, J.S.; Hu, P.; Chu, S.C. Novel Parallel Heterogeneous Meta-Heuristic and Its Communication Strategies for the Prediction of Wind Power. Processes 2019, 7, 845. [Google Scholar] [CrossRef]
- Yin, X.; Liu, X.; Sun, M.; Ren, Q. Novel Numerical Spiking Neural P Systems with a Variable Consumption Strategy. Processes 2021, 9, 549. [Google Scholar] [CrossRef]
- de la Cruz, R.T.A.; Cabarle, F.G.C.; Macababayao, I.C.H.; Adorna, H.N.; Zeng, X. Homogeneous spiking neural P systems with structural plasticity. J. Membr. Comput. 2021. [Google Scholar] [CrossRef]
- Spiess, R.; George, R.; Cook, M.; Diehl, P.U. Structural plasticity denoises responses and improves learning speed. Front. Comput. Neurosci. 2016, 10, 93. [Google Scholar] [CrossRef] [PubMed]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).