Analysis of Basic Features in Dynamic Network Models

Zufiria, Pedro J.; Barriales-Valbuena, Iker

doi:10.3390/e20090681

Open AccessArticle

Analysis of Basic Features in Dynamic Network Models

by

Pedro J. Zufiria

^1,2,*

and

Iker Barriales-Valbuena

¹

Depto. Matemática Aplicada a las TIC, ETSI Telecomunicación, Universidad Politécnica de Madrid, Avda. Complutense 30, E-28040 Madrid, Spain

²

Information Processing and Telecommunications Center (IPTC), Universidad Politécnica de Madrid, E-28040 Madrid, Spain

^*

Author to whom correspondence should be addressed.

Entropy 2018, 20(9), 681; https://doi.org/10.3390/e20090681

Submission received: 30 June 2018 / Revised: 1 September 2018 / Accepted: 5 September 2018 / Published: 7 September 2018

(This article belongs to the Special Issue Selected Papers from 4th International Electronic Conference on Entropy and Its Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Time evolving Random Network Models are presented as a mathematical framework for modelling and analyzing the evolution of complex networks. This framework allows the analysis over time of several network characterizing features such as link density, clustering coefficient, degree distribution, as well as entropy-based complexity measures, providing new insight on the evolution of random networks. First, some simple dynamic network models, based only on edge density, are analyzed to serve as a baseline reference for assessing more complex models. Then, a model that depends on network structure with the aim of reflecting some characteristics of real networks is also analyzed. Such model shows a more sophisticated behavior with two different regimes, one of them leading to the generation of high clustering coefficient/link density ratio values when compared with the baseline values, as it happens in many real networks. Simulation examples are discussed to illustrate the behavior of the proposed models.

Keywords:

complex networks; stochastic modelling; entropy; estimation

1. Introduction

A large variety of complex systems can be analyzed by constructing a model that relies on some network structure [1,2,3,4]. The model may be dynamical, meaning that the values of some (state) variables do change with time and, depending on the nature of such variables, we can have different types of network models. The first type corresponds to dynamic graphs that follow evolution laws defined explicitly on the network [5,6,7,8]; the second type gathers dynamical systems where the state variables are defined on a network [9,10]; finally, the third type refers to co-evolution models that combine evolving networks and dynamical systems. In the first and third type, the underlying network structure changes with time, defining a time-varying or evolving network [11,12]. In the present work, we first characterize the basic features of some simple models of evolving networks whose evolution does not depend on network structure; the time evolution of these features serves as a reference baseline signature of the behavior of simple models. Then, a model that makes use of network structure is proposed to reflect some real network characteristics. The analysis of this model shows several regimes that indicate a sophisticated behavior; for some regime, the network reaches a high clustering coefficient/link density ratio [13] (when compared to the ratio values of baseline signatures), a common feature in many real networks.

The paper is organized as follows: Section 2 presents the general framework for Dynamic Network Models and their characterization via some basic features, whereas entropy measures are shown in Section 3. Section 4 analyzes some simple evolution models whose basic features’ time evolution serves as a behavior reference baseline. More elaborated evolution models that depend on network structure are studied in Section 5. Simulations of Section 6 comparatively illustrate the time evolution of the different features for the proposed models. Finally, concluding remarks are summarized in Section 7.

2. Characterization of Network Sequences via Basic Features

Following [12], discrete-time network evolution over time can be generally defined by a random sequence or trajectory

{G_{t}}_{t = 0, 1, \dots}

, where each

G_{t}

can take values g from

G

, being

G

the set of all possible networks. The analysis of

{G_{t}}_{t = 0, 1, \dots}

can be framed by considering it as a stochastic process, whose full characterization may be very complex. In the following, we present some basic features that help for a partial characterization of such stochastic process.

Time Evolution of Network Features

In some cases, we may be interested in the evolution of some quantifiable properties or features, f, of the network, defined as follows (see [14] for details):

\begin{matrix} f : G & ⟶ R^{l}, \end{matrix}

(1)

\begin{matrix} g & ⟶ f = f (g), \end{matrix}

(2)

where

f (g)

is the function that computes such quantifiable property (number of links, number of triangles, connectivity, degree of nodes, entropy of degree distribution, etc.) in graph g.

Note that, when

G

is endowed with a probability space, then, under some regularity assumptions on f, this function defines a random vector. Therefore, the sequence

f (G_{t}) \in R^{l}

defines a vector stochastic process that can be analyzed using standard stochastic process techniques. In the following analysis, we will focus on several of these properties such as the number of links, number of triangles, the connectivity and the degree distribution entropy (a scalar summary of the distribution vector). Since for these cases

l = 1

, the study will boil down to the analysis of scalar stochastic processes. A basic analysis would estimate, for instance, the deterministic sequence of expected values

E [f (G_{t})]

.

In the following section, we focus on different entropy measures that can also be employed for characterizing the stochastic process

{G_{t}}_{t = 0, 1, \dots}

.

3. Entropy Measures for Stochastic Processes

The stochastic process

{G_{t}}_{t = 0, 1, \dots}

is an indexed sequence of random variables, which can be completely characterized until time instant

t = T

by its joint probability distribution

P (G_{0}, G_{1}, \dots, G_{T}) .

(3)

This joint distribution may be quite complex to study and, therefore, we may acquiesce in characterizing part of it. For instance, if we consider

G_{i}

for a fixed time

t = i

, this snapshot of the process, also called a cross sectional variable, can be represented by a “static” model such as the ones studied in [14], fully characterized by the marginal distribution of

g_{i}

. Accordingly, when considering entropy measures for characterizing a stochastic process, different distributions associated with such process can be considered, as developed below.

3.1. Snapshot Entropy and Entropy of Network Features

The simplest approach focuses on the entropy analysis of cross sectional variables

G_{i}

. Hence, one can define the snapshot entropy of index i,

H (G_{i})

, of a stochastic process as the entropy of the i-th variable

G_{i}

of the process

H (G_{i}) = - \sum_{g \in G} p (G_{i} = g) \log p (G_{i} = g) .

(4)

When considering a network feature f, the entropy of the associated random variable

F_{i} = f (G_{i})

satisfies the condition

H (G_{i}) = H (G_{i} ∣ F_{i}) + H (F_{i}),

(5)

and therefore

H (F_{i}) = H (f (G_{i})) \leq H (G_{i}),

(6)

where the equality holds only if f is an injection.

Note that

H (G_{i})

in (4) is not to be confused with the feature mentioned in Section 2.1 called degree distribution entropy, associated with a concrete sample of

G_{i}

. For a more detailed explanation of degree distributions in static models, see [14].

The computation of

H (G_{i})

, when performed for every

i \in {0, 1, \dots,}

, would lead to a deterministic time series

{H_{t}}_{t = 0, 1, \dots}

as an alternative partial characterization of the stochastic process

{G_{t}}_{t = 0, 1, \dots}

.

3.2. Trajectory Entropy

Furthermore, one can study the entropy of a whole time period evolution of the process, seen as a sequence of

T + 1

variables. We define the trajectory entropy (

H_{0}^{T}

) of a

T + 1

-length time period of a stochastic process, as the entropy of the joint probability

P (G_{0}, G_{1}, \dots, G_{T})

:

H_{0}^{T} = H (G_{0}, \dots, G_{T}) = - \sum_{G^{T + 1}} p (g_{0}, g_{1}, \dots g_{T}) \log p (g_{0}, g_{1}, \dots g_{T}) .

(7)

If all

G_{i}

are independent variables, then:

H_{0}^{T} = \sum_{i = 0}^{T} H (G_{i}) .

(8)

Note that, in general, as T increases,

H_{0}^{T}

may increase unbounded.

3.3. Normalized Asymptotic Entropy

Finally, one may want to characterize the entropy rate as a normalized entropy measure independent of T, which globally characterizes the asymptotic behavior of the stochastic process. This entropy rate is defined as

H^{R} = \lim_{T \to \infty} \frac{1}{T + 1} H_{0}^{T},

(9)

whenever such limit exists. Alternatively, we can also compute

{H^{'}}^{R} = \lim_{T \to \infty} H (G_{T} | G_{T - 1}, G_{T - 2}, \dots, G_{1}, G_{0}),

(10)

again when this limit does exist. For strongly stationary processes, both measures (9) and (10) do exist and they are equal.

After presenting these measures, some basic evolution models are illustrated in the next section.

4. Basic Evolution Models with a Fixed Number of Nodes: Evolution of Number of Links

Let us consider

G_{V}

the set of all networks (or graphs)

g_{i} = (V, E_{i})

having a fixed set of nodes

V = {v_{1}, \dots, v_{N}}

, with

| V | = N

; each

g_{i} \in G

is then characterized by its corresponding set of links

E_{i} \subset E

with E being determined by V as the set of all pairs of nodes (

| E | = (\binom{N}{2}) = M

).

In this framework, any evolution process

{G_{t}}_{t = 0, 1, \dots}

is characterized by the sequence of the corresponding

{E_{t}}_{t = 0, 1, \dots}

. In addition, since

g_{i} \in G_{V}

can be represented via its corresponding binary adjacency matrix

A (g_{i}) \in R^{n} \times R^{n}

, the evolution process can also be characterized as a sequence of adjacency matrices

{A (g_{t})}_{t = 0, 1, \dots} = {A_{t}}_{t = 0, 1, \dots}

.

4.1. Evolution of the Number of Links

In general, a complete characterization of

{G_{t}}_{t = 0, 1, \dots}

will be very cumbersome. Alternatively, we can partially characterize such process by considering

\begin{matrix} f : G & ⟶ {0, 1, \dots M}, \end{matrix}

(11)

\begin{matrix} g_{i} & ⟶ f (g_{i}) = | E_{i} | = ∥ A_{i} ∥_{1} = m_{i}, \end{matrix}

(12)

where f is the function that computes the number of links in the network. We can partition the set

G_{V}

into equivalence classes

C_{k}, k = 0, \dots M

so that each class

C_{k}

gathers all graphs containing k links:

C_{k} = {g_{i} \in G : f (g_{i}) = k}

. Then, we can define a stochastic process

{M_{t}}_{t = 0, 1 \dots}

with each

M_{t} \in {0, 1, \dots, M}

which characterizes the transition between classes, and whose state space represents such equivalence classes (hence, we identify

C_{k}

with state k).

In general, for a given instant of time i, based on (5), we have that the snapshot entropy of

G_{i}

and the entropy of

M_{i}

will satisfy

\begin{matrix} H (G_{i}) = H (G_{i} ∣ M_{i}) + H (M_{i}) \end{matrix}

(13)

and this relationship will help to characterize

G_{i}

via the analysis of

M_{i}

. Therefore, the following proposed models will be partially characterized by analyzing the associated stochastic process,

M_{t} \in {0, 1, \dots, M}

, for the evolution of the number of links.

4.2. A Simple Structure Independent Evolution Model

We define a simple network evolution process that may serve as a reference baseline for comparison purposes. Given

g_{t}

(equivalently,

E_{t}

or

A_{t}

), the next time step network

g_{t + 1}

is generated by randomly selecting a pair of nodes

(v_{i}, v_{j}) \in E

so that, if there exists a link between them (i.e.,

(v_{i}, v_{j}) \in E_{t}

), such link is removed (

E_{t + 1} = E_{t} \ {(v_{i}, v_{j})}

) and, if there is no link between the nodes (i.e.,

(v_{i}, v_{j}) \notin E_{t}

), then it is created (

E_{t + 1} = E_{t} \cup {(v_{i}, v_{j})}

). Note that if we consider the adjacency matrix representation

A_{t}

, at each stage of time, an element of the matrix

A_{t}

is randomly chosen so that its value is changed (from 0 to 1 or vice versa) to derive

A_{t + 1}

.

Note that the evolution law is determined by the number of links of

g_{t}

. Therefore, as mentioned above, we will start the analysis of this evolution model by characterizing the time evolution of the number of links. The corresponding

M_{t} \in {0, 1, \dots, M}

satisfies:

\begin{matrix} P (M_{t + 1} = 1 ∣ M_{t} = 0) & = 1, \end{matrix}

(14)

\begin{matrix} P (M_{t + 1} = M - 1 ∣ M_{t} = M) & = 1, \end{matrix}

(15)

and for

i \in {1, \dots, M - 1}

:

\begin{matrix} P (M_{t + 1} = j ∣ M_{t} = i) & = \{\begin{matrix} 0, & if j = i or | j - i | > 1, \\ \frac{i}{M}, & if j = i - 1, \\ \frac{M - i}{M}, & if j = i + 1 . \end{matrix} \end{matrix}

(16)

This process is a Markov chain with transition probability matrix

\begin{matrix} P & = [\begin{matrix} 0 & \frac{1}{M} & 0 & \dots & 0 & 0 & 0 \\ 1 & 0 & \frac{2}{M} & \dots & 0 & 0 & 0 \\ 0 & \frac{M - 1}{M} & 0 & \dots & 0 & 0 & 0 \\ 0 & 0 & \frac{M - 2}{M} & \dots & 0 & 0 & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & 0 & \dots & \frac{M - 2}{M} & 0 & 0 \\ 0 & 0 & 0 & \dots & 0 & \frac{M - 1}{M} & 0 \\ 0 & 0 & 0 & \dots & \frac{2}{M} & 0 & 1 \\ 0 & 0 & 0 & \dots & 0 & \frac{1}{M} & 0 \end{matrix}], \end{matrix}

(17)

which is known as the Ehrenfest model [15], and which can be similarly interpreted as representing an urn with white and black balls, where we randomly select a ball and change it by another ball with different color, hence representing a sort of discrete-time birth–death Markov process [16] but with finite number of states (two boundary conditions). Many discrete distributions have been obtained by studying urn models and Markov processes [17,18,19]. Note that these models can be seen as a reference baseline since they do not exploit the network structure properties (i.e., the relative location of white balls and black balls).

The left stochastic, tri-diagonal, irreducible matrix P of Equation (17) has period 2, but it has a unique eigenvector associated with eigenvalue

λ = 1

. This eigenvector defines the stationary distribution of the process, denoted by

M_{s}

, and it can be easily proved that such distribution is binomial:

\begin{matrix} P (g_{s} \in C_{k}) = P (M_{s} = k) = (\binom{M}{k}) {(\frac{1}{2})}^{M}, \end{matrix}

(18)

so that taking a snapshot of the process for large t is equivalent to generating a sample from the Gilbert model [20] with

p = \frac{1}{2}

or, equivalently, the uniform model with maximum entropy (see [14] for details). Note that, given a number of links

M_{s} = k

, the distribution of

G_{i} ∣ (M_{s} = k)

is uniform, each link having probability

\frac{1}{| C_{k} |} = \frac{1}{(\binom{M}{k})}

. Hence, considering (18), the entropy expression provided in (13) becomes

\begin{matrix} H (G_{i}) & = H (M_{s}) + H (G_{i} ∣ M_{s}) = - \sum_{k} p (k) \log p (k) - \sum_{k} p (k) \log \frac{1}{(\binom{M}{k})} \end{matrix}

(19)

\begin{matrix} = - \sum_{k} p (k) \log \frac{p (k)}{(\binom{M}{k})} = M \cdot \log 2 = M = (\binom{N}{2}), \end{matrix}

(20)

measuring the entropy in bits.

Concerning the entropy of

M_{t}

, it is known that Ehrenfest model snapshot (relative) entropy at time t, defined in terms of the Kullback–Leibler divergence between the distribution and the steady state equilibrium distribution

\begin{matrix} H_{rel} (t) = - D_{K L} (P (t) | | P_{s}) = - \sum_{k = 0}^{M} P (M_{t} = k), \log \frac{P (M_{t} = k)}{P (M_{s} = k)} \end{matrix}

(21)

is non-decreasing in time as approaches the maximum value zero, upon the so called H-Theorem [21].

4.3. Extensions of the Model for Asymmetric Evolution

One can extend the symmetric model provided in (17) with the aim of considering cases in which the network may have an uneven tendency to increase or decrease the number of edges.

Let us consider the following transition behavior from

g_{t}

to

g_{t + 1}

: we start selecting a pair of nodes in network

g_{t}

; if the selected pair already has an associated link, such link is removed with probability

p_{r}

, whereas, if such pair does not have an associated link, a link is added between such pair of nodes with probability

p_{a}

. If no change (removal or addition) happens, the process is repeated until the network undergoes some modification, which is registered in

g_{t + 1}

.

Again, if we focus the analysis on the time evolution of the number of links,

M_{t}

, the corresponding transition matrix becomes:

\begin{matrix} P (p_{r}, p_{a}) & = [\begin{matrix} 0 & \frac{p_{r}}{p_{r} + (M - 1) p_{a}} & 0 & \dots & 0 & 0 & 0 \\ 1 & 0 & \frac{2 p_{r}}{2 p_{r} + (M - 2) p_{a}} & \dots & 0 & 0 & 0 \\ 0 & \frac{(M - 1) p_{a}}{p_{r} + (M - 1) p_{a}} & 0 & \dots & 0 & 0 & 0 \\ 0 & 0 & \frac{(M - 2) p_{a}}{2 p_{r} + (M - 2) p_{a}} & \dots & 0 & 0 & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & 0 & \dots & \frac{(M - 2) p_{r}}{(M - 2) p_{r} + 2 p_{a}} & 0 & 0 \\ 0 & 0 & 0 & \dots & 0 & \frac{(M - 1) p_{r}}{(M - 1) p_{r} + p_{a}} & 0 \\ 0 & 0 & 0 & \dots & \frac{2 p_{a}}{(M - 2) p_{r} + 2 p_{a}} & 0 & 1 \\ 0 & 0 & 0 & \dots & 0 & \frac{p_{a}}{(M - 1) p_{r} + p_{a}} & 0 \end{matrix}] . \end{matrix}

(22)

The analysis of this system can be simplified if we denote

\frac{p_{r}}{p_{a}} = u

the unbalance coefficient, since the matrix can be reformulated as

\begin{matrix} P (u) & = [\begin{matrix} 0 & \frac{u}{u + M - 1} & 0 & \dots & 0 & 0 & 0 \\ 1 & 0 & \frac{2 u}{2 u + M - 2} & \dots & 0 & 0 & 0 \\ 0 & \frac{M - 1}{u + M - 1} & 0 & \dots & 0 & 0 & 0 \\ 0 & 0 & \frac{M - 2}{2 u + M - 2} & \dots & 0 & 0 & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & 0 & \dots & \frac{(M - 2) u}{(M - 2) u + 2} & 0 & 0 \\ 0 & 0 & 0 & \dots & 0 & \frac{(M - 1) u}{(M - 1) u + 1} & 0 \\ 0 & 0 & 0 & \dots & \frac{2}{(M - 2) u + 2} & 0 & 1 \\ 0 & 0 & 0 & \dots & 0 & \frac{1}{(M - 1) u + 1} & 0 \end{matrix}] . \end{matrix}

(23)

If

u < 1

, the model has more tendency to add links than to remove them, and vice versa for

u > 1

. The analysis and interpretation of the network behavior can be performed either way due to such symmetry. For instance, if

u < 1

, the model can be interpreted as characterizing the following behavior: if the selected pair in

g_{t}

has an associated link, this link is removed with probability u; if the pair does not have an associated link, then a link is added. Again, the selection procedure is repeated until a link is either removed or added, defining

g_{t + 1}

.

It can be proved that the resulting stationary distribution has the form:

\begin{matrix} P_{u} (M_{s} = k) = \frac{(\binom{M}{k}) \frac{k \cdot u + M - k}{M \cdot u^{k}}}{\sum_{i = 0}^{M} (\binom{M}{i}) \frac{i \cdot u + M - i}{M \cdot u^{i}}}, u \in R^{+}, \end{matrix}

(24)

which can be seen as a generalization of the binomial distribution

Bin (\frac{1}{2}, M)

via the new parameter u.

Repeating a similar procedure to (19) and (20), the corresponding

G_{i}

entropy can be computed as

\begin{matrix} H_{u} (G_{i}) & = H_{u} (M_{s}) + H (G_{i} ∣ M_{s}) = - \sum_{k} p_{u} (k) \log p_{u} (k) - \sum_{k} p_{u} (k) \log \frac{1}{(\binom{M}{k})} \\ = - \sum_{k} p_{u} (k) \log \frac{p_{u} (k)}{(\binom{M}{k})}, \end{matrix}

which for

u = 1

becomes

H_{u = 1} (G_{i}) = (\binom{N}{2}) = M

.

Figure 1 represents smoothed probability mass functions for the baseline, theoretical given by (24) and empirical (based in simulations) with

p_{a} = 0.3

and

p_{r} = 1

. Note that asymmetry of the u value generates a probability function with less entropy than the corresponding to the baseline mass function.

Alternative Simple Model

Another simple model could assume that, whenever an existing edge is selected to be removed, it is removed with probability

p_{r} \in [0, 1]

, whereas, alternatively, a new edge is randomly added. The transition matrix of the corresponding

M_{t} \in {0, 1, \dots, M}

for the number of links would be

\begin{matrix} P & = [\begin{matrix} 0 & \frac{p_{r}}{M} & 0 & \dots & 0 & 0 & 0 \\ 1 & 0 & \frac{2 p_{r}}{M} & \dots & 0 & 0 & 0 \\ 0 & 1 - \frac{p_{r}}{M} & 0 & \dots & 0 & 0 & 0 \\ 0 & 0 & 1 - \frac{2 p_{r}}{M} & \dots & 0 & 0 & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & 0 & \dots & \frac{(M - 2) p_{r}}{M} & 0 & 0 \\ 0 & 0 & 0 & \dots & 0 & \frac{(M - 1) p_{r}}{M} & 0 \\ 0 & 0 & 0 & \dots & 1 - \frac{(M - 2) p_{r}}{M} & 0 & 1 \\ 0 & 0 & 0 & \dots & 0 & 1 - \frac{(M - 1) p_{r}}{M} & 0 \end{matrix}] . \end{matrix}

(25)

Note that an equivalent symmetric model can be defined as follows. If the selected pair of nodes does not have an associated link, we add such a link with probability

p_{a}

; otherwise, an existing link is removed.

It can be proved that the resulting stationary distribution has the form:

\begin{matrix} P_{p_{r}} (M_{s} = k) & = \{\begin{matrix} \frac{1}{1 + \sum_{i = 1}^{M} \frac{M \cdot (M - p_{r}) \dots (M - (i - 1) p_{r})}{i! p_{r}^{i}}} & if k = 0, \\ \frac{\frac{M \cdot (M - p_{r}) \dots (M - (k - 1) p_{r})}{k! p_{r}^{k}}}{1 + \sum_{i = 1}^{M} \frac{M \cdot (M - p_{r}) \dots (M - (i - 1) p_{r})}{i! p_{r}^{i}}} & if k \in {1, \dots, M}, \end{matrix} \end{matrix}

(26)

which can be seen as another generalization of the binomial distribution

Bin (\frac{1}{2}, M)

via the new parameter

p_{r} \in [0, 1]

. Again, the network snapshot entropy can be computed as

\begin{matrix} H_{p_{r}} (G_{i}) & = - \sum_{k} p_{p_{r}} (k) \log \frac{p_{p_{r}} (k)}{(\binom{M}{k})} . \end{matrix}

(27)

Both models (23) and (25) provide respectively stationary distributions (24) and (26), which, in general, are not binomial. Therefore, if we take a snapshot of these stationary distributions, the resulting network will follow a new static model, different from the standard known reference models for static networks.

Note that again these models can be interpreted as urn-derived finite state discrete-time birth-death models, in the sense that they do not incorporate network structural information, but only the total number of links. In other words, these models do not differentiate among networks that belong to the same equivalence class

C_{k}

, i.e., they are structure independent.

The time evolution of the expected value for the number of links, the clustering coefficient, the connectivity and the sample degree distribution entropy define a vector time series that can be employed as a signature that characterizes the evolution models. The signature of the above considered structure independent models can be employed as a reference baseline to assess more complex behaviors.

In Section 6, these signature quantities are estimated via simulation procedures.

5. Evolution Models Depending on Network Structure: Evolution of Clustering Coefficient

Usually, the evolution of networks depends not only on the number of links but also on the network structure. To illustrate this idea, we will analyze the behavior of models whose dynamics depend on the fact that triangles are going to be created or deleted in the network; then, the evolution of the clustering coefficient will be an essential feature to be considered.

Let us consider, for instance, an extension of the asymmetric model of Section 4.3 where the probability of a given link to be added (or removed) may depend on the fact that a triangle will or will not be generated (or eliminated) when adding (or removing) such link. Precisely, the transition behavior from

g_{t}

to

g_{t + 1}

is defined as follows:

a pair of nodes in network $g_{t}$ is uniformly selected.
(a)
If the selected pair already has an associated link, such link is removed
i.
with probability $p_{r, n}$ when the selected nodes have at least one common neighbor (hence, at least one triangle will be deleted), or
ii.
with probability $p_{r, n n}$ when the selected nodes do not have a common neighbor (no triangle will be deleted);
(b)
if the selected pair does not have an associated link, a link is added between these nodes
i.
with probability $p_{a, n}$ when the selected nodes have at least one common neighbor (hence, some new triangles will be generated), or
ii.
with probability $p_{a, n n}$ when the selected nodes do not have common neighbor (no triangle will be generated).
If no change (removal or addition) happens, the process is repeated until the network undergoes some modification, which is registered in $g_{t + 1}$ .

The discrete process

g_{t}

provided by this model remains invariant to a common scaling of all probability values, provided the proportion among them is preserved. Hence, such dynamical model behavior can be reformulated as a function of, for instance, the following three parameters,

α = \frac{p_{a, n}}{p_{a, n n}}

,

β = \frac{p_{r, n n}}{p_{r, n}}

and

u_{1} = \frac{p_{r, n}}{p_{a, n}}

; note that

α

and

β

measure the strength for favoring the creation and preservation of triangles, respectively, and the meaning of

u_{1}

will become clear below. Note that this model is general enough to represent dynamic networks having a tendency to either create (and preserve) or eliminate triangles. In the specific case when

α

and

β

are larger than 1, triangle creation (and preservation) are promoted.

The analysis of this model can be complex since the existence and characterization of a stationary behavior may depend on the network size (number of nodes), the selected parameter values and the initial state

g_{0}

. Note that

M_{t}

cannot be directly defined anymore via a transition matrix of the type of (22), (23) or (25), since the future evolution of such

M_{t}

depends not only on its actual value but also on some structural properties (i.e., the existence and location of triangles) of

g_{t}

. In addition, the existence and form of a limiting stationary distribution for

M_{t}

may be a complicated issue to deal with.

Two Regimes of Behavior

If we assume that

g_{t}

follows approximately a Gilbert model, the probability

p_{i j, t}

of any pair of nodes

(v_{i}, v_{j})

to have a common neighbor (i.e., they may take part of a triangle) depends on the ratio between the number of links

m_{t}

and the total number of node pairs

M = (\binom{N}{2})

in

g_{t}

:

\begin{matrix} p_{i j, t} = 1 - {(1 - {(\frac{m_{t}}{M})}^{2})}^{N - 2} . \end{matrix}

(29)

The value of

p_{i j, t}

is very sensitive to the link density

d = \frac{m_{t}}{M}

. For large values of N, we have that, if

m_{t} \sim o (N^{\frac{3}{2}}),

then

p_{i j, t}

remains small, approaching value 1 otherwise. Therefore, the behavior of the model presented in Section 5 may be approximated considering two possible regimes.

Regime 1. For large enough $m_{t} \sim Ω (N^{\frac{3}{2}}),$ the probability of creating or deleting triangles is not negligible and the dynamics of the system are approximately governed by a model following (23) with $u_{1} = \frac{p_{r, n}}{p_{a, n}}$ .
Regime 2. For small $m_{t} \sim o (N^{\frac{3}{2}}),$ the probability of creating or deleting triangles is small and the dynamics of the system are approximately governed by a model following (23) with $u_{2} = \frac{p_{r, n n}}{p_{a, n n}}$ .

Hence, within each regime, the dynamics can be approximated via the baseline model (23).

We now focus on the analysis for the cases where triangle creation and preservation is favored, meaning that both

α

and

β

would be larger than 1. Then,

u_{2} = α \cdot β \cdot u_{1} > u_{1}

and several behaviors can be found depending on the concrete selected values for

u_{1}, u_{2}

and

α

(or

β

). Precisely, some regime may be transitory, or both may coexist as stationary behaviors depending on the selected initial conditions. In general, regime 1 will be more common since it fits with a wide range of possible values for

u_{1}

; only if

u_{1}

is very large (note that it would imply a huge

u_{2}

) may we start with a

g_{0}

satisfying regime 1 condition, but the expected value of stationary distribution

M_{s}

for such

u_{1}

may correspond to regime 2, so that the system may end up in such second regime. On the other hand, since regime 2 corresponds to a narrow range of small values of

M_{t}

, a very large value of

u_{2}

will be required for such regime to show up as stationary; if

u_{2}

is not large enough, even if we start with a

g_{0}

satisfying regime 2 condition, the expected value of stationary distribution

M_{s}

for such

u_{2}

may lie in the range of values corresponding to regime 1, so that the system may end up in such first regime. Finally, both regimes may coexist with

u_{2}

large enough and

u_{1}

small enough so that the respective expected values of stationary distributions

M_{s}

correspond to each one of the regimes. Note that, if

α

and

β

are large enough to favor triangle creation and preservation,

u_{1}

and

u_{2}

may differ in some orders of magnitude allowing a natural coexistence of both regimes.

In the next section, different simulations are performed to characterize the time evolution of some basic features (e.g., the expected value for the number of links, the clustering coefficient, the connectivity and the sample degree distribution) for the dynamics models presented above.

6. Simulations for the Time Evolution of Features

Numerical simulations have been performed to characterize the time evolution of the number of links, the clustering coefficient and the entropy of the sample degree distribution for the extended model defined by (23) and the structure dependent model presented in Section 5.

6.1. Extended Asymmetric Model

We begin by characterizing the extended model defined by (23). Figure 2 shows the evolution (starting from the empty graph) of the relative number of edges (i.e., edge density

d = \frac{m_{t}}{M}

), the clustering coefficient and the samples degree distribution entropy of a graph that evolves following the extended model defined by (23) with

p_{a} = 0.3

and

p_{r} = 1

. The estimations of relative number of edges and clustering coefficient converge to the same stationary value as the iteration number increases; hence, their ratio converges to one, this value being a reference baseline signature of structure independent models. Note that the variance of the clustering coefficient is significantly larger than the variance corresponding the relative number of edges. The estimated degree distribution presents also a significant variance.

Figure 3 represents the estimated expected value of the number of edges as a function of iteration number (starting from the empty graph) and parameter u. Due to the uniform nature of

P (G_{i} ∣ M_{i}),

the behavior of the clustering coefficient follows a similar behavior; again, this clustering coefficient/link density ratio value close to one is a reference baseline signature for these types of structure independent models.

Figure 4 represents the estimated expected value of the sample degree distribution entropy as a function of iteration number (starting from the empty graph) and parameter u. Larger values are obtained for

u = 1

as also illustrated in Figure 1.

6.2. Structure Depending Model

We now illustrate the behavior of some models of the type presented in Section 5. A network with

n = 100

has been considered, where different parameter values and initial conditions have been tested. In the following, we indicate some cases that illustrate the different behaviors:

If $p_{a n} = 0.5$ , $p_{a n n} = 0.05$ , $p_{r n} = 0.1$ and $p_{r n n} = 1$ (or, equivalently, $u_{1} = 20$ , $u_{2} = 0.2$ , $α = β = 10$ ) the system always converges to a stationary behavior in regime 1, with link density $d = \frac{m_{t}}{M} \approx 0.8$ , and clustering coefficient $c c \approx 0.8$ (hence the ratio $\frac{c c}{d}$ approaches 1 as it happens in Erdős–Renyi [22] or Gilbert models).
If $p_{a n} = 0.01$ , $p_{a n n} = 0.005$ , $p_{r n} = 0.5$ and $p_{r n n} = 1$ (or, equivalently, $u_{1} = 200$ , $u_{2} = 50$ , $α = β = 2$ ), the system always converges to a stationary behavior in regime 2, with link density $d = \frac{m_{t}}{M} \approx 0.005$ , and oscillating clustering coefficient between 0 and $0.2$ (hence the ratio $\frac{c c}{d}$ presents also large oscillations).
If $p_{a n} = 0.05$ , $p_{a n n} = 0.005$ , $p_{r n} = 0.1$ and $p_{r n n} = 1$ (or, equivalently, $u_{1} = 200$ , $u_{2} = 2$ , $α = β = 10$ ), the system presents the two above illustrated regimes; and, depending on the initial condition, there is more or less probability to evolve within each one of these regimes:
(a)
For $g_{0}$ following a Gilbert model with $p ⪆ 0.0375$ , the system is more likely to remain in regime 1 with link density $d = \frac{m_{t}}{M} \approx 0.32$ , and clustering coefficient $c c \approx 0.32$ (hence the ratio $\frac{c c}{d}$ approaches 1 as it happens in ER or Gilbert models).
(b)
For $g_{0}$ following a Gilbert model with $p ⪅ 0.0375$ , the system is more likely to steadily remain (at least for a long time beyond the number of performed iterations) in regime 2 with link density $d = \frac{m_{t}}{M} \approx 0.008$ , and clustering coefficient $c c \approx 0.3$ . Hence, the ratio $\frac{c c}{d}$ evolves around 40; this large clustering coefficient in proportion to link density illustrates a very common feature of many real networks.

The two regime case is illustrated in Figure 5 and Figure 6, where the estimated expected value of the density of edges and the clustering coefficient as a function of the iteration number are presented for different initial condition graphs

g_{0}

. Figure 5 illustrates the behavior when

g_{0}

is either the complete or the empty graph. All simulations starting from the complete graph led to regime 1, whereas all simulations starting from the empty graph led to regime 2.

Figure 6 illustrates the behavior for two cases when

g_{0}

is obtained as a sample of the Gilbert model with

p = 0.05

and

p = 0.03

, respectively. A majority of the simulations starting from a Gilbert

p = 0.05

graph led to regime 1, whereas a majority of the simulations starting from a Gilbert

p = 0.03

graph led to regime 2. These simulations illustrate approximately the size of the stochastic domains corresponding to each regime.

7. Conclusions

Several basic models for dynamic networks, based only on edge density, have been initially proposed and analyzed in terms of the time evolution of the number of links, clustering coefficient, connectivity and entropy of the sample degree distribution; the evolution of these features helps to characterize the proposed models and provides a reference baseline signature to assess more complex behaviors. The proposed model involving network structure presents a more sophisticated behavior and, for some regime, it leads to the generation of a high clustering coefficient/link density ratio when compared with the reference baseline values. This result is promising for the design of network models with tunable clustering coefficient with the aim to replicate some real networks characteristics.

The proposed framework will serve to assess, in a systematic manner, the properties of existing models as well as future more complex models for time evolving networks.

Author Contributions

P.J.Z. developed the conceptualization and formal analysis, wrote the document and helped with the simulations. I.B.-V. developed the simulations and helped with the theoretical content and the writing of the paper.

Funding

This research and the APC were funded by the Ministerio de Economía y Competitividad, Spain, grant number MTM2015-67396-P, and the ETSI Telecomunicación, Universidad Politécnica de Madrid.

Acknowledgments

This work has been partially supported by project MTM2015-67396-P of Ministerio de Economía y Competitividad, Spain.

Conflicts of Interest

The authors declare no conflict of interest.

References

Watts, D.J.; Strogatz, S.H. Collective dynamics of “small-world” networks. Nature 1998, 393, 440–442. [Google Scholar] [CrossRef] [PubMed]
Albert, R.; Barabási, A.L. Statistical mechanics of complex networks. Rev. Mod. Phys. 2002, 74, 47. [Google Scholar] [CrossRef]
Newman, M. Networks: An Introduction; Oxford University Press: Oxford, UK, 2010. [Google Scholar]
Newman, M.; Barabasi, A.L.; Watts, D.J. The Structure and Dynamics of Networks; Princeton University Press: Princeton, NJ, USA, 2011; Volume 19. [Google Scholar]
Dorogovtsev, S.N.; Mendes, J.F. Evolution of Networks: From Biological Nets to the Internet and WWW; Oxford University Press: Oxford, UK, 2013. [Google Scholar]
Siljak, D.D. Dynamic Graphs. Nonlinear Anal. Hybrid Syst. 2008, 2, 544–567. [Google Scholar] [CrossRef]
Kejžar, N.; Nikoloski, Z.; Batagelj, V. Probabilistic inductive classes of graphs. J. Math. Sociol. 2008, 32, 85–109. [Google Scholar] [CrossRef]
Holme, P.; Saramäki, J. Temporal networks. Phys. Rep. 2012, 519, 97–125. [Google Scholar] [CrossRef] [Green Version]
Pastor-Satorras, R.; Vespignani, A. Epidemic spreading in scale-free networks. Phys. Rev. Lett. 2001, 86, 3200. [Google Scholar] [CrossRef] [PubMed]
Rahmani, A.; Ji, M.; Mesbahi, M.; Egerstedt, M. Controllability of multi-agent systems from a graph-theoretic perspective. SIAM J. Control Optim. 2009, 48, 162–186. [Google Scholar] [CrossRef]
Dorogovtsev, S.N.; Mendes, J.F. Evolution of networks. Adv. Phys. 2002, 51, 1079–1187. [Google Scholar] [CrossRef] [Green Version]
Zufiria, P.J.; Barriales-Valbuena, I. Evolution models for dynamic networks. In Proceedings of the 2015 38th International Conference on Telecommunications and Signal Processing (TSP), Prague, Czech Republic, 9–11 July 2015; pp. 252–256. [Google Scholar]
Herrera, C.; Zufiria, P.J. Generating scale-free networks with adjustable clustering coefficient via random walks. In Proceedings of the 2011 IEEE Network Science Workshop (NSW), West Point, NY, USA, 22–24 June 2011; pp. 167–172. [Google Scholar]
Zufiria, P.J.; Barriales-Valbuena, I. Entropy Characterization of Random Network Models. Entropy 2017, 19, 321. [Google Scholar] [CrossRef]
Klein, M.J. Entropy and the Ehrenfest urn model. Physica 1956, 22, 569–575. [Google Scholar] [CrossRef]
Van Doorn, E.A.; Schrijner, P. Geometric ergodicity and quasi-stationarity in discrete-time birth-death processes. ANZIAM J. 1995, 37, 121–144. [Google Scholar]
Feller, W. An Introduction to Probability Theory and Its Applications; Wiley: New York, NY, USA, 1968; Volume 1. [Google Scholar]
Johnson, N.L.; Kemp, A.W.; Kotz, S. Univariate Discrete Distributions; John Wiley & Sons: New York, NY, USA, 2005; Volume 444. [Google Scholar]
Kemp, A.W. Steady-state Markov chain models for certain q-confluent hypergeometric distributions. J. Stat. Plan. Inference 2005, 135, 107–120. [Google Scholar] [CrossRef]
Gilbert, E.N. Random graphs. Ann. Math. Stat. 1959, 30, 1141–1144. [Google Scholar] [CrossRef]
Morimoto, T. Markov processes and the H-theorem. J. Phys. Soc. Jpn. 1963, 18, 328–331. [Google Scholar] [CrossRef]
Erdos, P.; Rényi, A. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci. 1960, 5, 17–60. [Google Scholar]

Figure 1. Simple and extended asymmetric models. Comparison among smoothed probability mass functions: baseline, theoretical and empirical for

p_{a} = 0.3

and

p_{r} = 1

.

Figure 1. Simple and extended asymmetric models. Comparison among smoothed probability mass functions: baseline, theoretical and empirical for

p_{a} = 0.3

and

p_{r} = 1

.

Figure 2. Extended asymmetric model. Estimated expected values of relative number of edges (rnedges), clustering coefficient (clust) and sample degree distribution entropy (entropy), as a function of the iteration number (

p_{a} = 0.3

and

p_{r} = 1

).

Figure 2. Extended asymmetric model. Estimated expected values of relative number of edges (rnedges), clustering coefficient (clust) and sample degree distribution entropy (entropy), as a function of the iteration number (

p_{a} = 0.3

and

p_{r} = 1

).

Figure 3. Extended asymmetric model. Estimated expected value of number of edges as a function of u at iterations 250, 500, 750 and 1000.

Figure 4. Extended asymmetric model. Estimated expected value of the sample degree distribution entropy as a function of u at iterations 250, 500, 750 and 1000.

Figure 5. Structure depending model. Estimated expected value of number of edges and clustering coefficient as a function of number of steps, starting from complete and empty graphs, respectively. Different line colors correspond to different instantiations of the model.

Figure 6. Structure depending model. Estimated expected value of number of edges and clustering coefficient as a function of the number of steps, starting from a sample of the Gilbert model with

p = 0.05

and

p = 0.03

, respectively. Different line colors correspond to different instantiations of the model.

Figure 6. Structure depending model. Estimated expected value of number of edges and clustering coefficient as a function of the number of steps, starting from a sample of the Gilbert model with

p = 0.05

and

p = 0.03

, respectively. Different line colors correspond to different instantiations of the model.

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zufiria, P.J.; Barriales-Valbuena, I. Analysis of Basic Features in Dynamic Network Models. Entropy 2018, 20, 681. https://doi.org/10.3390/e20090681

AMA Style

Zufiria PJ, Barriales-Valbuena I. Analysis of Basic Features in Dynamic Network Models. Entropy. 2018; 20(9):681. https://doi.org/10.3390/e20090681

Chicago/Turabian Style

Zufiria, Pedro J., and Iker Barriales-Valbuena. 2018. "Analysis of Basic Features in Dynamic Network Models" Entropy 20, no. 9: 681. https://doi.org/10.3390/e20090681

APA Style

Zufiria, P. J., & Barriales-Valbuena, I. (2018). Analysis of Basic Features in Dynamic Network Models. Entropy, 20(9), 681. https://doi.org/10.3390/e20090681

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Analysis of Basic Features in Dynamic Network Models

Abstract

1. Introduction

2. Characterization of Network Sequences via Basic Features

Time Evolution of Network Features

3. Entropy Measures for Stochastic Processes

3.1. Snapshot Entropy and Entropy of Network Features

3.2. Trajectory Entropy

3.3. Normalized Asymptotic Entropy

4. Basic Evolution Models with a Fixed Number of Nodes: Evolution of Number of Links

4.1. Evolution of the Number of Links

4.2. A Simple Structure Independent Evolution Model

4.3. Extensions of the Model for Asymmetric Evolution

Alternative Simple Model

5. Evolution Models Depending on Network Structure: Evolution of Clustering Coefficient

Two Regimes of Behavior

6. Simulations for the Time Evolution of Features

6.1. Extended Asymmetric Model

6.2. Structure Depending Model

7. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI