You are currently viewing a new version of our website. To view the old version click .
Entropy
  • Article
  • Open Access

7 September 2018

Analysis of Basic Features in Dynamic Network Models

and
1
Depto. Matemática Aplicada a las TIC, ETSI Telecomunicación, Universidad Politécnica de Madrid, Avda. Complutense 30, E-28040 Madrid, Spain
2
Information Processing and Telecommunications Center (IPTC), Universidad Politécnica de Madrid, E-28040 Madrid, Spain
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Selected Papers from 4th International Electronic Conference on Entropy and Its Applications

Abstract

Time evolving Random Network Models are presented as a mathematical framework for modelling and analyzing the evolution of complex networks. This framework allows the analysis over time of several network characterizing features such as link density, clustering coefficient, degree distribution, as well as entropy-based complexity measures, providing new insight on the evolution of random networks. First, some simple dynamic network models, based only on edge density, are analyzed to serve as a baseline reference for assessing more complex models. Then, a model that depends on network structure with the aim of reflecting some characteristics of real networks is also analyzed. Such model shows a more sophisticated behavior with two different regimes, one of them leading to the generation of high clustering coefficient/link density ratio values when compared with the baseline values, as it happens in many real networks. Simulation examples are discussed to illustrate the behavior of the proposed models.

1. Introduction

A large variety of complex systems can be analyzed by constructing a model that relies on some network structure [1,2,3,4]. The model may be dynamical, meaning that the values of some (state) variables do change with time and, depending on the nature of such variables, we can have different types of network models. The first type corresponds to dynamic graphs that follow evolution laws defined explicitly on the network [5,6,7,8]; the second type gathers dynamical systems where the state variables are defined on a network [9,10]; finally, the third type refers to co-evolution models that combine evolving networks and dynamical systems. In the first and third type, the underlying network structure changes with time, defining a time-varying or evolving network [11,12]. In the present work, we first characterize the basic features of some simple models of evolving networks whose evolution does not depend on network structure; the time evolution of these features serves as a reference baseline signature of the behavior of simple models. Then, a model that makes use of network structure is proposed to reflect some real network characteristics. The analysis of this model shows several regimes that indicate a sophisticated behavior; for some regime, the network reaches a high clustering coefficient/link density ratio [13] (when compared to the ratio values of baseline signatures), a common feature in many real networks.
The paper is organized as follows: Section 2 presents the general framework for Dynamic Network Models and their characterization via some basic features, whereas entropy measures are shown in Section 3. Section 4 analyzes some simple evolution models whose basic features’ time evolution serves as a behavior reference baseline. More elaborated evolution models that depend on network structure are studied in Section 5. Simulations of Section 6 comparatively illustrate the time evolution of the different features for the proposed models. Finally, concluding remarks are summarized in Section 7.

2. Characterization of Network Sequences via Basic Features

Following [12], discrete-time network evolution over time can be generally defined by a random sequence or trajectory { G t } t = 0 , 1 , , where each G t can take values g from G , being G the set of all possible networks. The analysis of { G t } t = 0 , 1 , can be framed by considering it as a stochastic process, whose full characterization may be very complex. In the following, we present some basic features that help for a partial characterization of such stochastic process.

Time Evolution of Network Features

In some cases, we may be interested in the evolution of some quantifiable properties or features, f, of the network, defined as follows (see [14] for details):
f : G R l ,
g f = f ( g ) ,
where f ( g ) is the function that computes such quantifiable property (number of links, number of triangles, connectivity, degree of nodes, entropy of degree distribution, etc.) in graph g.
Note that, when G is endowed with a probability space, then, under some regularity assumptions on f, this function defines a random vector. Therefore, the sequence f ( G t ) R l defines a vector stochastic process that can be analyzed using standard stochastic process techniques. In the following analysis, we will focus on several of these properties such as the number of links, number of triangles, the connectivity and the degree distribution entropy (a scalar summary of the distribution vector). Since for these cases l = 1 , the study will boil down to the analysis of scalar stochastic processes. A basic analysis would estimate, for instance, the deterministic sequence of expected values E [ f ( G t ) ] .
In the following section, we focus on different entropy measures that can also be employed for characterizing the stochastic process { G t } t = 0 , 1 , .

3. Entropy Measures for Stochastic Processes

The stochastic process { G t } t = 0 , 1 , is an indexed sequence of random variables, which can be completely characterized until time instant t = T by its joint probability distribution
P ( G 0 , G 1 , , G T ) .
This joint distribution may be quite complex to study and, therefore, we may acquiesce in characterizing part of it. For instance, if we consider G i for a fixed time t = i , this snapshot of the process, also called a cross sectional variable, can be represented by a “static” model such as the ones studied in [14], fully characterized by the marginal distribution of g i . Accordingly, when considering entropy measures for characterizing a stochastic process, different distributions associated with such process can be considered, as developed below.

3.1. Snapshot Entropy and Entropy of Network Features

The simplest approach focuses on the entropy analysis of cross sectional variables G i . Hence, one can define the snapshot entropy of index i, H ( G i ) , of a stochastic process as the entropy of the i-th variable G i of the process
H ( G i ) = g G p ( G i = g ) log p ( G i = g ) .
When considering a network feature f, the entropy of the associated random variable F i = f ( G i ) satisfies the condition
H ( G i ) = H ( G i F i ) + H ( F i ) ,
and therefore
H ( F i ) = H ( f ( G i ) ) H ( G i ) ,
where the equality holds only if f is an injection.
Note that H ( G i ) in (4) is not to be confused with the feature mentioned in Section 2.1 called degree distribution entropy, associated with a concrete sample of G i . For a more detailed explanation of degree distributions in static models, see [14].
The computation of H ( G i ) , when performed for every i { 0 , 1 , , } , would lead to a deterministic time series { H t } t = 0 , 1 , as an alternative partial characterization of the stochastic process { G t } t = 0 , 1 , .

3.2. Trajectory Entropy

Furthermore, one can study the entropy of a whole time period evolution of the process, seen as a sequence of T + 1 variables. We define the trajectory entropy ( H 0 T ) of a T + 1 -length time period of a stochastic process, as the entropy of the joint probability P ( G 0 , G 1 , , G T ) :
H 0 T = H ( G 0 , , G T ) = G T + 1 p ( g 0 , g 1 , g T ) log p ( g 0 , g 1 , g T ) .
If all G i are independent variables, then:
H 0 T = i = 0 T H ( G i ) .
Note that, in general, as T increases, H 0 T may increase unbounded.

3.3. Normalized Asymptotic Entropy

Finally, one may want to characterize the entropy rate as a normalized entropy measure independent of T, which globally characterizes the asymptotic behavior of the stochastic process. This entropy rate is defined as
H R = lim T 1 T + 1 H 0 T ,
whenever such limit exists. Alternatively, we can also compute
H R = lim T H ( G T | G T 1 , G T 2 , , G 1 , G 0 ) ,
again when this limit does exist. For strongly stationary processes, both measures (9) and (10) do exist and they are equal.
After presenting these measures, some basic evolution models are illustrated in the next section.

5. Evolution Models Depending on Network Structure: Evolution of Clustering Coefficient

Usually, the evolution of networks depends not only on the number of links but also on the network structure. To illustrate this idea, we will analyze the behavior of models whose dynamics depend on the fact that triangles are going to be created or deleted in the network; then, the evolution of the clustering coefficient will be an essential feature to be considered.
Let us consider, for instance, an extension of the asymmetric model of Section 4.3 where the probability of a given link to be added (or removed) may depend on the fact that a triangle will or will not be generated (or eliminated) when adding (or removing) such link. Precisely, the transition behavior from g t to g t + 1 is defined as follows:
  • a pair of nodes in network g t is uniformly selected.
  • (a)
    If the selected pair already has an associated link, such link is removed
    i.
    with probability p r , n when the selected nodes have at least one common neighbor (hence, at least one triangle will be deleted), or
    ii.
    with probability p r , n n when the selected nodes do not have a common neighbor (no triangle will be deleted);
    (b)
    if the selected pair does not have an associated link, a link is added between these nodes
    i.
    with probability p a , n when the selected nodes have at least one common neighbor (hence, some new triangles will be generated), or
    ii.
    with probability p a , n n when the selected nodes do not have common neighbor (no triangle will be generated).
  • If no change (removal or addition) happens, the process is repeated until the network undergoes some modification, which is registered in g t + 1 .
The discrete process g t provided by this model remains invariant to a common scaling of all probability values, provided the proportion among them is preserved. Hence, such dynamical model behavior can be reformulated as a function of, for instance, the following three parameters, α = p a , n p a , n n , β = p r , n n p r , n and u 1 = p r , n p a , n ; note that α and β measure the strength for favoring the creation and preservation of triangles, respectively, and the meaning of u 1 will become clear below. Note that this model is general enough to represent dynamic networks having a tendency to either create (and preserve) or eliminate triangles. In the specific case when α and β are larger than 1, triangle creation (and preservation) are promoted.
The analysis of this model can be complex since the existence and characterization of a stationary behavior may depend on the network size (number of nodes), the selected parameter values and the initial state g 0 . Note that M t cannot be directly defined anymore via a transition matrix of the type of (22), (23) or (25), since the future evolution of such M t depends not only on its actual value but also on some structural properties (i.e., the existence and location of triangles) of g t . In addition, the existence and form of a limiting stationary distribution for M t may be a complicated issue to deal with.

Two Regimes of Behavior

If we assume that g t follows approximately a Gilbert model, the probability p i j , t of any pair of nodes ( v i , v j ) to have a common neighbor (i.e., they may take part of a triangle) depends on the ratio between the number of links m t and the total number of node pairs M = N 2 in g t :
p i j , t = 1 1 m t M 2 N 2 .
The value of p i j , t is very sensitive to the link density d = m t M . For large values of N, we have that, if m t o ( N 3 2 ) , then p i j , t remains small, approaching value 1 otherwise. Therefore, the behavior of the model presented in Section 5 may be approximated considering two possible regimes.
  • Regime 1. For large enough m t Ω ( N 3 2 ) , the probability of creating or deleting triangles is not negligible and the dynamics of the system are approximately governed by a model following (23) with u 1 = p r , n p a , n .
  • Regime 2. For small m t o ( N 3 2 ) , the probability of creating or deleting triangles is small and the dynamics of the system are approximately governed by a model following (23) with u 2 = p r , n n p a , n n .
Hence, within each regime, the dynamics can be approximated via the baseline model (23).
We now focus on the analysis for the cases where triangle creation and preservation is favored, meaning that both α and β would be larger than 1. Then, u 2 = α · β · u 1 > u 1 and several behaviors can be found depending on the concrete selected values for u 1 , u 2 and α (or β ). Precisely, some regime may be transitory, or both may coexist as stationary behaviors depending on the selected initial conditions. In general, regime 1 will be more common since it fits with a wide range of possible values for u 1 ; only if u 1 is very large (note that it would imply a huge u 2 ) may we start with a g 0 satisfying regime 1 condition, but the expected value of stationary distribution M s for such u 1 may correspond to regime 2, so that the system may end up in such second regime. On the other hand, since regime 2 corresponds to a narrow range of small values of M t , a very large value of u 2 will be required for such regime to show up as stationary; if u 2 is not large enough, even if we start with a g 0 satisfying regime 2 condition, the expected value of stationary distribution M s for such u 2 may lie in the range of values corresponding to regime 1, so that the system may end up in such first regime. Finally, both regimes may coexist with u 2 large enough and u 1 small enough so that the respective expected values of stationary distributions M s correspond to each one of the regimes. Note that, if α and β are large enough to favor triangle creation and preservation, u 1 and u 2 may differ in some orders of magnitude allowing a natural coexistence of both regimes.
In the next section, different simulations are performed to characterize the time evolution of some basic features (e.g., the expected value for the number of links, the clustering coefficient, the connectivity and the sample degree distribution) for the dynamics models presented above.

6. Simulations for the Time Evolution of Features

Numerical simulations have been performed to characterize the time evolution of the number of links, the clustering coefficient and the entropy of the sample degree distribution for the extended model defined by (23) and the structure dependent model presented in Section 5.

6.1. Extended Asymmetric Model

We begin by characterizing the extended model defined by (23). Figure 2 shows the evolution (starting from the empty graph) of the relative number of edges (i.e., edge density d = m t M ), the clustering coefficient and the samples degree distribution entropy of a graph that evolves following the extended model defined by (23) with p a = 0.3 and p r = 1 . The estimations of relative number of edges and clustering coefficient converge to the same stationary value as the iteration number increases; hence, their ratio converges to one, this value being a reference baseline signature of structure independent models. Note that the variance of the clustering coefficient is significantly larger than the variance corresponding the relative number of edges. The estimated degree distribution presents also a significant variance.
Figure 2. Extended asymmetric model. Estimated expected values of relative number of edges (rnedges), clustering coefficient (clust) and sample degree distribution entropy (entropy), as a function of the iteration number ( p a = 0.3 and p r = 1 ).
Figure 3 represents the estimated expected value of the number of edges as a function of iteration number (starting from the empty graph) and parameter u. Due to the uniform nature of P ( G i M i ) , the behavior of the clustering coefficient follows a similar behavior; again, this clustering coefficient/link density ratio value close to one is a reference baseline signature for these types of structure independent models.
Figure 3. Extended asymmetric model. Estimated expected value of number of edges as a function of u at iterations 250, 500, 750 and 1000.
Figure 4 represents the estimated expected value of the sample degree distribution entropy as a function of iteration number (starting from the empty graph) and parameter u. Larger values are obtained for u = 1 as also illustrated in Figure 1.
Figure 4. Extended asymmetric model. Estimated expected value of the sample degree distribution entropy as a function of u at iterations 250, 500, 750 and 1000.

6.2. Structure Depending Model

We now illustrate the behavior of some models of the type presented in Section 5. A network with n = 100 has been considered, where different parameter values and initial conditions have been tested. In the following, we indicate some cases that illustrate the different behaviors:
  • If p a n = 0.5 , p a n n = 0.05 , p r n = 0.1 and p r n n = 1 (or, equivalently, u 1 = 20 , u 2 = 0.2 , α = β = 10 ) the system always converges to a stationary behavior in regime 1, with link density d = m t M 0.8 , and clustering coefficient c c 0.8 (hence the ratio c c d approaches 1 as it happens in Erdős–Renyi [22] or Gilbert models).
  • If p a n = 0.01 , p a n n = 0.005 , p r n = 0.5 and p r n n = 1 (or, equivalently, u 1 = 200 , u 2 = 50 , α = β = 2 ), the system always converges to a stationary behavior in regime 2, with link density d = m t M 0.005 , and oscillating clustering coefficient between 0 and 0.2 (hence the ratio c c d presents also large oscillations).
  • If p a n = 0.05 , p a n n = 0.005 , p r n = 0.1 and p r n n = 1 (or, equivalently, u 1 = 200 , u 2 = 2 , α = β = 10 ), the system presents the two above illustrated regimes; and, depending on the initial condition, there is more or less probability to evolve within each one of these regimes:
    (a)
    For g 0 following a Gilbert model with p 0.0375 , the system is more likely to remain in regime 1 with link density d = m t M 0.32 , and clustering coefficient c c 0.32 (hence the ratio c c d approaches 1 as it happens in ER or Gilbert models).
    (b)
    For g 0 following a Gilbert model with p 0.0375 , the system is more likely to steadily remain (at least for a long time beyond the number of performed iterations) in regime 2 with link density d = m t M 0.008 , and clustering coefficient c c 0.3 . Hence, the ratio c c d evolves around 40; this large clustering coefficient in proportion to link density illustrates a very common feature of many real networks.
The two regime case is illustrated in Figure 5 and Figure 6, where the estimated expected value of the density of edges and the clustering coefficient as a function of the iteration number are presented for different initial condition graphs g 0 . Figure 5 illustrates the behavior when g 0 is either the complete or the empty graph. All simulations starting from the complete graph led to regime 1, whereas all simulations starting from the empty graph led to regime 2.
Figure 5. Structure depending model. Estimated expected value of number of edges and clustering coefficient as a function of number of steps, starting from complete and empty graphs, respectively. Different line colors correspond to different instantiations of the model.
Figure 6. Structure depending model. Estimated expected value of number of edges and clustering coefficient as a function of the number of steps, starting from a sample of the Gilbert model with p = 0.05 and p = 0.03 , respectively. Different line colors correspond to different instantiations of the model.
Figure 6 illustrates the behavior for two cases when g 0 is obtained as a sample of the Gilbert model with p = 0.05 and p = 0.03 , respectively. A majority of the simulations starting from a Gilbert p = 0.05 graph led to regime 1, whereas a majority of the simulations starting from a Gilbert p = 0.03 graph led to regime 2. These simulations illustrate approximately the size of the stochastic domains corresponding to each regime.

7. Conclusions

Several basic models for dynamic networks, based only on edge density, have been initially proposed and analyzed in terms of the time evolution of the number of links, clustering coefficient, connectivity and entropy of the sample degree distribution; the evolution of these features helps to characterize the proposed models and provides a reference baseline signature to assess more complex behaviors. The proposed model involving network structure presents a more sophisticated behavior and, for some regime, it leads to the generation of a high clustering coefficient/link density ratio when compared with the reference baseline values. This result is promising for the design of network models with tunable clustering coefficient with the aim to replicate some real networks characteristics.
The proposed framework will serve to assess, in a systematic manner, the properties of existing models as well as future more complex models for time evolving networks.

Author Contributions

P.J.Z. developed the conceptualization and formal analysis, wrote the document and helped with the simulations. I.B.-V. developed the simulations and helped with the theoretical content and the writing of the paper.

Funding

This research and the APC were funded by the Ministerio de Economía y Competitividad, Spain, grant number MTM2015-67396-P, and the ETSI Telecomunicación, Universidad Politécnica de Madrid.

Acknowledgments

This work has been partially supported by project MTM2015-67396-P of Ministerio de Economía y Competitividad, Spain.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Watts, D.J.; Strogatz, S.H. Collective dynamics of “small-world” networks. Nature 1998, 393, 440–442. [Google Scholar] [CrossRef] [PubMed]
  2. Albert, R.; Barabási, A.L. Statistical mechanics of complex networks. Rev. Mod. Phys. 2002, 74, 47. [Google Scholar] [CrossRef]
  3. Newman, M. Networks: An Introduction; Oxford University Press: Oxford, UK, 2010. [Google Scholar]
  4. Newman, M.; Barabasi, A.L.; Watts, D.J. The Structure and Dynamics of Networks; Princeton University Press: Princeton, NJ, USA, 2011; Volume 19. [Google Scholar]
  5. Dorogovtsev, S.N.; Mendes, J.F. Evolution of Networks: From Biological Nets to the Internet and WWW; Oxford University Press: Oxford, UK, 2013. [Google Scholar]
  6. Siljak, D.D. Dynamic Graphs. Nonlinear Anal. Hybrid Syst. 2008, 2, 544–567. [Google Scholar] [CrossRef]
  7. Kejžar, N.; Nikoloski, Z.; Batagelj, V. Probabilistic inductive classes of graphs. J. Math. Sociol. 2008, 32, 85–109. [Google Scholar] [CrossRef]
  8. Holme, P.; Saramäki, J. Temporal networks. Phys. Rep. 2012, 519, 97–125. [Google Scholar] [CrossRef]
  9. Pastor-Satorras, R.; Vespignani, A. Epidemic spreading in scale-free networks. Phys. Rev. Lett. 2001, 86, 3200. [Google Scholar] [CrossRef] [PubMed]
  10. Rahmani, A.; Ji, M.; Mesbahi, M.; Egerstedt, M. Controllability of multi-agent systems from a graph-theoretic perspective. SIAM J. Control Optim. 2009, 48, 162–186. [Google Scholar] [CrossRef]
  11. Dorogovtsev, S.N.; Mendes, J.F. Evolution of networks. Adv. Phys. 2002, 51, 1079–1187. [Google Scholar] [CrossRef]
  12. Zufiria, P.J.; Barriales-Valbuena, I. Evolution models for dynamic networks. In Proceedings of the 2015 38th International Conference on Telecommunications and Signal Processing (TSP), Prague, Czech Republic, 9–11 July 2015; pp. 252–256. [Google Scholar]
  13. Herrera, C.; Zufiria, P.J. Generating scale-free networks with adjustable clustering coefficient via random walks. In Proceedings of the 2011 IEEE Network Science Workshop (NSW), West Point, NY, USA, 22–24 June 2011; pp. 167–172. [Google Scholar]
  14. Zufiria, P.J.; Barriales-Valbuena, I. Entropy Characterization of Random Network Models. Entropy 2017, 19, 321. [Google Scholar] [CrossRef]
  15. Klein, M.J. Entropy and the Ehrenfest urn model. Physica 1956, 22, 569–575. [Google Scholar] [CrossRef]
  16. Van Doorn, E.A.; Schrijner, P. Geometric ergodicity and quasi-stationarity in discrete-time birth-death processes. ANZIAM J. 1995, 37, 121–144. [Google Scholar]
  17. Feller, W. An Introduction to Probability Theory and Its Applications; Wiley: New York, NY, USA, 1968; Volume 1. [Google Scholar]
  18. Johnson, N.L.; Kemp, A.W.; Kotz, S. Univariate Discrete Distributions; John Wiley & Sons: New York, NY, USA, 2005; Volume 444. [Google Scholar]
  19. Kemp, A.W. Steady-state Markov chain models for certain q-confluent hypergeometric distributions. J. Stat. Plan. Inference 2005, 135, 107–120. [Google Scholar] [CrossRef]
  20. Gilbert, E.N. Random graphs. Ann. Math. Stat. 1959, 30, 1141–1144. [Google Scholar] [CrossRef]
  21. Morimoto, T. Markov processes and the H-theorem. J. Phys. Soc. Jpn. 1963, 18, 328–331. [Google Scholar] [CrossRef]
  22. Erdos, P.; Rényi, A. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci. 1960, 5, 17–60. [Google Scholar]

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.