Next Article in Journal
Irreversibility of the Direct and Counter Impinging Liquid Jet onto Profiled Heated Cavity
Previous Article in Journal
UV Total Dose Nonvolatile Sensor Using Fluorine-Treated SOHOS Capacitor Device
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Characterization of Some Dynamic Network Models †

by
Pedro J. Zufiria
1,2,* and
Iker Barriales-Valbuena
1
1
Depto. Matemática Aplicada a las TIC, ETSI Telecomunicación, Universidad Politécnica de Madrid, Avda, Complutense 30, E-28040 Madrid, Spain
2
Information Processing and Telecommunications Center (IPTC), Universidad Politécnica de Madrid, Avda, Complutense 30, E-28040 Madrid, Spain
*
Author to whom correspondence should be addressed.
Presented at the 4th International Electronic Conference on Entropy and Its Applications, 21 November–1 December 2017; Available online: http://sciforum.net/conference/ecea-4.
Proceedings 2018, 2(4), 164; https://doi.org/10.3390/ecea-4-05031
Published: 21 November 2017

Abstract

:
Dynamic random network models are presented as a mathematical framework for modelling and analyzing the time evolution of complex networks. Such framework allows the time analysis of several network characterizing features such as link density, clustering coefficient, degree distribution, as well as entropy-based complexity measures, providing new insight on the evolution of random networks. Some simple dynamic models are analyzed with the aim to provide several basic reference evolution behaviors. Simulation examples are discussed to illustrate the applicability of the proposed framework.

1. Introduction

Many complex systems can be modelled by using some network structure in the model construction. These models may be dynamical, meaning that the values of some (state) variables do change with time, and, depending on the nature of such variables, we can have different types of models. The first type corresponds to dynamic graphs which follow evolution laws defined explicitly on the network [1,2]; the second type gathers dynamical systems where the state variables are defined on a network [3]; finally, the third type refers to co-evolution models which combine evolving networks and dynamical systems. In the first and third type the underlying network structure changes with time, defining a time-varying or evolving network [4]. The characterization of some basic models of evolving networks is the main objective of the present work.

2. Characterization of Network Sequences via Standard Features

Following [4], discrete-time network evolution along time can be generally defined by a random sequence or trajectory { G t } t = 0 , 1 , where each G t can take values g from G , being G the set of all possible networks. The analysis of { G t } t = 0 , 1 , can be framed by considering it as a stochastic process, whose full characterization may be very complex. In the following we present some standard features which help for a partial characterization of such stochastic process.

2.1. Time Evolution of Network Features

In some cases we may be interested in the evolution of some quantifiable properties or features, f, of the network, defined as follows (see [5] for details):
f : G R l ,
g f = f ( g ) ,
where f ( g ) is the function that computes such quantifiable property (number of links, number of triangles, connectivity, degree of nodes, entropy of degree distribution, etc.) in graph g.
Note that when G is endowed with a probability space, then, under some regularity assumptions on f, this function defines a random vector. Therefore the sequence f ( G t ) R l defines a vector stochastic process which can be analyzed using standard stochastic process techniques. In the following analysis we will focus on several of these properties such as the number of links, number of triangles, degree distribution entropy, etc. Since for these cases l   =   1 , the study will boil down to the analysis of scalar stochastic processes. A basic analysis would estimate, for instance, the deterministic sequence of expected values E [ f ( G t ) ] .
In the following section we focus on different entropy measures which can also be employed for characterizing the stochastic process { G t } t = 0 , 1 , .

3. Entropy Measures for Stochastic Processes

The stochastic process { G t } t = 0 , 1 , is an indexed sequence of random variables, which can be completely characterized until time instant t   =   N by its joint probability distribution:
P ( G 0 , G 1 , , G N )
This joint distribution may be quite complex to study and, therefore we may acquiesce in characterizing part of it. For instance, if we consider G i for a fixed time t   =   i , this snapshot of the process, also called a cross-sectional variable, can be represented by a “static” model such as the ones studied in [5], fully characterized by the marginal distribution of g i . Accordingly, when considering entropy measures for characterizing a stochastic process, different distributions associated with such process can be considered, as developed below.

3.1. Cross-Sectional Entropy and Entropy of Network Features

The simplest approach focuses on the entropy analysis of cross sectional variables G i . Hence, one can define the cross-sectional entropy of index i, H ( G i ) , of a stochastic process as the entropy of the i-th variable G i of the process.
H ( G i )   =   g G p ( G i = g ) log p ( G i = g )
When considering a network feature f, the entropy of the associated random variable F i   =   f ( G i ) satisfies the condition
H ( G i )   =   H ( G i / F i )   +   H ( F i )
and therefore
H ( F i )   =   H ( f ( G i ) )     H ( G i )
where the equality holds only if f is an injection.
Note that H ( G i ) in Equation (4) is not to be confused with the feature mentioned in Section 2.1 called degree distribution entropy, associated with a concrete sample of G i . For a more detailed explanation of degree distributions in static models see [5].
The computation of H ( G i ) , when performed for every i { 0 , 1 , , } , would lead to a deterministic time series { H t } t = 0 , 1 , as an alternative partial characterization of the stochastic process { G t } t = 0 , 1 , .

3.2. Trajectory Entropy

Furthermore, one can study the entropy of a whole time period evolution of the process, seen as a sequence of T   +   1 variables. We define the trajectory entropy ( H 0 T ) of a T   +   1 -length time period of a stochastic process, as the entropy of the joint probability P ( G 0 , G 2 , , G T ) .
H 0 T   =   H ( G 0 , , G T )   =   G T + 1 p ( g 0 , g 1 , g T ) log p ( g 0 , g 2 , g T )
If all G i are independent variables, then:
H 0 T   =   i = 0 T H ( G i )
Note that, in general, as T increases, H 0 T may increase unbounded.

3.3. Normalized Asymptotic Entropy

Finally, one may want to characterize the entropy rate as a normalized entropy measure independent of T, which globally characterizes the asymptotic behavior of the stochastic process.
H R   =   lim T 1 T + 1 H 0 T
Or, equivalently,
H R   =   lim T H ( G T | G T 1 , G T 2 , , G 1 , G 0 )
After presenting these measures, the next Section starts considering some basic evolution models.

4. Evolution Models with Fixed Number of Nodes. Evolution of Number of Links

Let us consider G V the set of all networks (or graphs) g i   =   ( V , E i ) having a fixed set of nodes V   =   { v 1 , , v n } , with | V |   =   n ; each g i G is then characterized by its corresponding set of links E i E with E being determined by V as the set of all pairs of nodes ( | E |   =   n 2   =   m ).
In this framework, any evolution process { G t } t = 0 , 1 , is characterized by the sequence of the corresponding { E t } t = 0 , 1 , . In addition, since g i G V can be represented via its corresponding binary adjacency matrix A ( g i ) R n × R n , the evolution process can also be characterized as a sequence of adjacency matrices { A ( g t ) } t = 0 , 1 ,   =   { A t } t = 0 , 1 , .

4.1. Evolution of the Number of Links

In general, a complete characterization of { G t } t = 0 , 1 , will be very cumbersome. Alternatively,we can partially characterize such process by considering
f : G { 0 , 1 , m }
g i f ( g i ) = | E i | = A i 1 = n i
where f is the function that computes the number of links in the network. We can partition the set G V into equivalence classes C k , k   =   0 , m so that each class C k gathers all graphs containing k links: C k   =   { g i G : f ( g i ) = k } . Then, we can define a stochastic process { N t } t = 0 , 1 with each N t { 0 , 1 , , m } which characterizes the transition between classes, and whose state space represents such equivalence classes (hence, we identify C k with state k).
In general, for a given instant of time i, based on Equation (5) we will have that the cross-sectional entropy of G i and the entropy of N i will satisfy
H ( G i )   =   H ( G i / N i )   +   H ( N i )
and this relationship will help to characterize G i via the analysis of N i . Therefore the following proposed models will be partially characterized by analyzing the associated stochastic process, N t { 0 , 1 , , m } , for the evolution of the number of links.

4.2. A Simple Evolution Model

We define a simple network evolution process which may serve as a reference baseline for comparison purposes. Given g t (equivalently, E t or A t ), the next time step network g t + 1 is generated by randomly selecting a pair of nodes ( v i , v j ) E so that if there exists a link between them (i.e., ( v i , v j ) E t ), such link is removed ( E t + 1 = E t { ( v i , v j ) } ) and, if there is no link between the nodes (i.e., ( v i , v j ) E t ), then it is created ( E t + 1 = E t { ( v i , v j ) } ). Note that if we consider the adjacency matrix representation A t , at each stage of time, an element of the matrix A t is randomly chosen so that its value is changed (from 0 to 1 or vice versa) to derive A t + 1 .
Note that the evolution law if determined by the number of links of g t . Therefore, as mentioned above, we will start the analysis of this evolution model by characterizing the time evolution of the number of links. The corresponding N t { 0 , 1 , , m } satisfies:
P ( N t + 1 = 1 / N t = 0 ) =   1 ,
P ( N t + 1 = m 1 / N t = m ) =   1 ,
and for i { 1 , , m 1 } :
P ( N t + 1 = j / N t = i )   =   0 , if j   =   i or | j i |   >   1 , i m , if j   =   i 1 , m i m , if j   =   i + 1 .
This process is a Markov chain with the following matrix of transition probabilities:
P   =   0 1 m 0 0 0 0 1 0 2 m 0 0 0 0 m 1 m 0 0 0 0 0 0 m 2 m 0 0 0 0 0 0 m 2 m 0 0 0 0 0 0 m 1 m 0 0 0 0 2 m 0 1 0 0 0 0 1 m 0
which is known as the Ehrenfest model [6], and which can be similarly interpreted as representing an urn with white and black balls, where we randomly select a ball and change it by another ball with different color, hence representing a sort of discrete-time birth-death Markov process [7] but with finite number of states (two boundary conditions). Many discrete distributions have been obtained by studying urn models and Markov processes [8,9,10]. Note that these models can be seen as a reference baseline since they do not exploit the network structure properties (i.e., the relative location of white balls and black balls).
The left stochastic, tri-diagonal, irreducible matrix P of Equation (17) has period 2, but it has a unique eigenvector associated with eigenvalue λ   =   1 . This eigenvector defines the stationary distribution of the process, denoted by N s , and it can be easily proved that such distribution is binomial:
P ( g s C k )   =   P ( N s = k )   =   m k 1 2 m ,
so that taking a snapshot of the process for large t is equivalent to generating a sample from the Gilbert model with p   =   1 2 or, equivalently, the uniform model with maximum entropy (see [5] for details). Note that given a number of links N s   =   k , the distribution of G i / N s   =   k ) is uniform (following a Erdős-Rényi model [5]), each link having probability 1 | C k |   =   1 m k . Hence, considering Equation (18), the entropy expression provided in Equation (13) becomes
H ( G i )   =   H ( N s )   +   H ( G i / N s )   =   k p ( k ) log p ( k )     k p ( k ) log 1 m k
= k p ( k ) log p ( k ) m k   =   m · log 2
Concerning the entropy of N t , it is known that Ehrenfest model cross-sectional (relative) entropy at time t, defined in terms of the Kullback-Leibler divergence between the distribution and the steady state equilibrium distribution
H rel ( t )   =   D K L ( P ( t ) | | P s )   =   k = 0 n P ( N t = k ) log P ( N t = k ) P ( N s = k )
is non-decreasing in time as approaches the maximum value zero, upon the so called H-Theorem [11].

4.3. Extensions of the Model for Asymmetric Evolution

One can extend the symmetric model provided in Equation (17) with the aim of considering cases in which the network may have an uneven tendency to increase or decrease in the number of edges.
Let us consider the following transition behavior from g t to g t + 1 : we start selecting a pairs of nodes in network g t ; if the selected pair already has an associated link, such link is removed with probability p r , whereas if such pair does not have an associated link, a link is added between such pair of nodes with probability p a . If no change (removal or addition) happens, the process is repeated until the network undergoes some modification, which is registered in g t + 1 .
Again, if we focus the analysis on the time evolution of the number of links, N t , the corresponding transition matrix becomes:
P ( p r , p a ) = 0 p r p r + ( m 1 ) p a 0 0 0 0 1 0 2 p r 2 p r + ( m 2 ) p a 0 0 0 0 ( m 1 ) p a p r + ( m 1 ) p a 0 0 0 0 0 0 ( m 2 ) p a 2 p r + ( m 2 ) p a 0 0 0 0 0 0 ( m 2 ) p r ( m 2 ) p r + 2 p a 0 0 0 0 0 0 ( m 1 ) p r ( m 1 ) p r + p a 0 0 0 0 2 p a ( m 2 ) p r + 2 p a 0 1 0 0 0 0 p a ( m 1 ) p r + p a 0
The analysis of this system can be simplified if we denote p r p a   =   u the unbalance coefficient, since the matrix can be reformulated as
P ( u ) = 0 u u + m 1 0 0 0 0 1 0 2 u 2 u + m 2 0 0 0 0 m 1 u + m 1 0 0 0 0 0 0 m 2 2 u + m 2 0 0 0 0 0 0 ( m 2 ) u ( m 2 ) u + 2 0 0 0 0 0 0 ( m 1 ) u ( m 1 ) u + 1 0 0 0 0 2 ( m 2 ) u + 2 0 1 0 0 0 0 1 ( m 1 ) u + 1 0
If u   <   1 the model has more tendency to add links than to remove them, and vice versa for u   >   1 . The analysis and interpretation of the network behavior can be performed either way due to such symmetry. For instance, if u   <   1 the model can be interpreted as characterizing the following behavior: if the selected pair in g t has an associated link, this link is removed with probability u; if the pair does not have and associated link, then a link is added. Again, the selection procedure is repeated until a link is either removed or added, defining g t + 1 .
In Section 5 the time evolution of the expected value for the number of links, the clustering coefficient, the connectivity and the sample degree distribution are estimated via simulations procedures.
It can be proved that the resulting stationary distribution has the form:
P u ( N s = k )   =   m k k · u + m k m · u k i = 0 m m i i · u + m i m · u i , u R +
which can be seen as a generalization of the binomial distribution Bin ( 1 2 , m ) via the new parameter u.
Figure 1 represents smoothed probability mass functions for the baseline, theoretical given by Equation (23) and empirical (based in simulations) with p a   =   0.3 and p r   =   1 . Note that asymmetry of the u value generates a probability function with less entropy than the corresponding to the baseline mass function.
Repeating a similar procedure to Equations (19) and (20) the corresponding G i entropy can be computed as
H u ( G i ) =   H u ( N s )   +   H ( G i / N s )   =   k p u ( k ) log p u ( k )     k p u ( k ) log 1 m k = k p u ( k ) log p u ( k ) m k
which for u   =   1 becomes H 1 ( G i )   =   m · log 2 .

4.3.1. Alternative Simple Model

Another simple model could assume that whenever an existing edge is selected to be removed, it is removed with probability p r [ 0 , 1 ] , whereas, alternatively, a new edge is randomly added. The transition matrix of the corresponding N t { 0 , 1 , , m } for the number of links would be
P = 0 p r m 0 0 0 0 1 0 2 p r m 0 0 0 0 1 p r m 0 0 0 0 0 0 1 2 p r m 0 0 0 0 0 0 ( m 2 ) p r m 0 0 0 0 0 0 ( m 1 ) p r m 0 0 0 0 1 ( m 2 ) p r m 0 1 0 0 0 0 1 ( m 1 ) p r m 0
Note that an equivalent symmetric model can be defined as follows. If the selected a pair of nodes does not have an associated link, we add such a link with probability p a , otherwise an existing link is removed.
It can be proved that the resulting stationary distribution has the form:
P p r ( N s = k )   =   1 1 + i = 1 m m · ( m p r ) ( m ( i 1 ) p r ) i ! p r i if k = 0 , m · ( m p r ) ( m ( k 1 ) p r ) k ! p r k 1 + i = 1 m m · ( m p r ) ( m ( i 1 ) p r ) i ! p r i if k { 1 , , m } , p r [ 0 , 1 ]
which can be seen as another generalization of the binomial distribution Bin ( 1 2 , m ) via the new parameter p r . Again the network cross-sectional entropy can be computed as
H p r ( G i )   =   k p p r ( k ) log p p r ( k ) m k
Both models Equations (22) and (24) provide respectively stationary distributions Equations (23) and (25) which, in general, are not binomial. Therefore, if we take a snapshot of these stationary distributions, the resulting network will follow a new static model, different from the standard known reference models for static networks.
Note that again these models can be interpreted as urn-derived finite state discrete-time birth-death models, in the sense that they do not incorporate network structural information, but only the total number of links. In other words, these models do not differentiate among networks that belong to the same equivalence class C k .

5. Simulations for the Time Evolution of Features

Numerical simulations have been performed to characterize the time evolution of the number of links, the clustering coefficient and the entropy of the sample degree distribution for the extended model defined by Equation (22).
Figure 2 shows the evolution (starting from the empty graph) of the relative number of edges (number of edges divided by the maximum possible number of edges), the clustering coefficient and the samples degree distribution entropy of a graph that evolves following the extended model defined by Equation (22) with p a   =   0.3 and p r   =   1 . The estimations of relative number of edges and clustering coefficient converge to the same stationary value as the iteration number increases; note that the variance of the clustering coefficient is significantly larger than the variance corresponding the relative number of edges. The estimated degree distribution presents also a significant variance.
Figure 3 represents the estimated expected value of the number of edges as a function of iteration number (starting from the empty graph) and parameter u. Due to the uniform nature of P ( G i / N i ) the behavior of the clustering coefficient follows a similar behavior.
Figure 4 represents the estimated expected value of the sample degree distribution entropy as a function of iteration number (starting from the empty graph) and parameter u. Larger values are obtained for u   =   1 as also illustrated in Figure 1.

6. Concluding Remarks

Several basic models for dynamic networks have been proposed and analyzed in terms of the cross-sectional entropy, and time evolution of the number of links, clustering coefficient and entropy of the sample degree distribution. The evolution of these features seems to be useful to characterize the proposed models. Such models can serve as a reference baseline for future research on more complex models for time evolving networks.

Author Contributions

Pedro J. Zufiria developed the theoretical content, wrote the document and helped with the simulations. Iker Barriales-Valbuena developed the simulations and helped with the theoretical content and the writing of the paper.

Acknowledgments

This work has been partially supported by project MTM2015-67396-P of Ministerio de Economía y Competitividad, Spain.

References

  1. Siljak, D.D. Dynamic Graphs. Nonlinear Anal. Hybrid Syst. 2008, 2, 544–567. [Google Scholar] [CrossRef]
  2. Holme, P.; Saramäki, J. Temporal networks. Phys. Rep. 2012, 519, 97–125. [Google Scholar] [CrossRef]
  3. Rahmani, A.; Ji, M.; Mesbahi, M.; Egerstedt, M. Controllability of multi-agent systems from a graph-theoretic perspective. SIAM J. Control Optim. 2009, 48, 162–186. [Google Scholar] [CrossRef]
  4. Zufiria, P.J.; Barriales-Valbuena, I. Evolution models for dynamic networks. In Proceedings of the 2015 38th International Conference on Telecommunications and Signal Processing (TSP), Prague, Czech, 9–11 July 2015; pp. 252–256. [Google Scholar]
  5. Zufiria, P.J.; Barriales-Valbuena, I. Entropy Characterization of Random Network Models. Entropy 2017, 19, 321. [Google Scholar] [CrossRef]
  6. Klein, M.J. Entropy and the Ehrenfest urn model. Physica 1956, 22, 569–575. [Google Scholar]
  7. van Doorn, E.A.; Schrijner, P. Geometric ergodicity and quasi-stationarity in discrete-time birth-death processes. ANZIAM J. 1995, 37, 121–144. [Google Scholar]
  8. Feller, W. An Introduction to Probability Theory and Its Applications; Wiley: Hoboken, NJ, USA, 1968; Volume 1. [Google Scholar]
  9. Johnson, N.L.; Kemp, A.W.; Kotz, S. Univariate Discrete Distributions; John Wiley & Sons: Hoboken, NJ, USA, 2005; Volume 444. [Google Scholar]
  10. Kemp, A.W. Steady-state Markov chain models for certain q-confluent hypergeometric distributions. J. Stat. Plan. Inference 2005, 135, 107–120. [Google Scholar] [CrossRef]
  11. Morimoto, T. Markov processes and the H-theorem. J. Phys. Soc. Jpn. 1963, 18, 328–331. [Google Scholar] [CrossRef]
Figure 1. Comparison among smoothed probability mass functions: baseline, theoretical and empirical for p a   =   0.3 and p r   =   1 .
Figure 1. Comparison among smoothed probability mass functions: baseline, theoretical and empirical for p a   =   0.3 and p r   =   1 .
Proceedings 02 00164 g001
Figure 2. Estimated expected values of relative number of edges, clustering coefficient and sample degree distribution entropy, as a function of the iteration number ( p a   =   0.3 and p r   =   1 ).
Figure 2. Estimated expected values of relative number of edges, clustering coefficient and sample degree distribution entropy, as a function of the iteration number ( p a   =   0.3 and p r   =   1 ).
Proceedings 02 00164 g002
Figure 3. Estimated expected value of number of edges as a function of u at iterations 250, 500, 750 and 1000.
Figure 3. Estimated expected value of number of edges as a function of u at iterations 250, 500, 750 and 1000.
Proceedings 02 00164 g003
Figure 4. Estimated expected value of the sample degree distribution entropy as function of u at iterations 250, 500, 750 and 1000.
Figure 4. Estimated expected value of the sample degree distribution entropy as function of u at iterations 250, 500, 750 and 1000.
Proceedings 02 00164 g004
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zufiria, P.J.; Barriales-Valbuena, I. Characterization of Some Dynamic Network Models. Proceedings 2018, 2, 164. https://doi.org/10.3390/ecea-4-05031

AMA Style

Zufiria PJ, Barriales-Valbuena I. Characterization of Some Dynamic Network Models. Proceedings. 2018; 2(4):164. https://doi.org/10.3390/ecea-4-05031

Chicago/Turabian Style

Zufiria, Pedro J., and Iker Barriales-Valbuena. 2018. "Characterization of Some Dynamic Network Models" Proceedings 2, no. 4: 164. https://doi.org/10.3390/ecea-4-05031

APA Style

Zufiria, P. J., & Barriales-Valbuena, I. (2018). Characterization of Some Dynamic Network Models. Proceedings, 2(4), 164. https://doi.org/10.3390/ecea-4-05031

Article Metrics

Back to TopTop