Next Article in Journal
Automated Identification of Valid Model Networks Using Model-Based Systems Engineering
Previous Article in Journal
Novel Hybrid MPSI–MARA Decision-Making Model for Support System Selection in an Underground Mine
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Investigation into the Trend Stationarity of Local Characteristics in Media and Social Networks

1
Faculty of Mathematics and Mechanics, Saratov State University, Saratov 410012, Russia
2
Faculty of Computer Science and Information Technology, Saratov State University, Saratov 410012, Russia
3
Philosophy Department, Saratov State University, Saratov 410012, Russia
*
Author to whom correspondence should be addressed.
Systems 2022, 10(6), 249; https://doi.org/10.3390/systems10060249
Submission received: 15 October 2022 / Revised: 19 November 2022 / Accepted: 3 December 2022 / Published: 9 December 2022
(This article belongs to the Section Complex Systems)

Abstract

:
We studied the evolution of complex social networks over time. The elements of the networks are users, and the connections correspond to the interactions between them. At a particular moment in time, each node of a complex network has such characteristics as its degree, as well as the total degree of its neighbors. Obviously, in the process of network growth, these characteristics are constantly changing due to the fact that new edges are attached to this node or its neighbors. In this paper, we study the dynamics of these characteristics over time for networks generated on the basis of a nonlinear preferential attachment mechanism, and we find both the asymptotics of their expected values and the characteristics of their spread around the mean. In addition, we analyze the behavior of these local characteristics for three real social networks. The applicability of the findings to actual problems in the study of social media in the digital humanities is discussed.

1. Introduction

Modeling the behavior of social, biological, and technological systems is an important task. One of the most successful approaches is the representation of a complex system in the form of numerous homogeneous elements that interact with each other. The resulting model is a complex network consisting of nodes and links (edges), and it reflects the interactions among them in a real system.
This paper is devoted to the study of the dynamics for complex networks generated on the basis of a model with a nonlinear preferential attachment mechanism. We are interested in the analysis of stochastic processes that arise during the growth of such networks, namely, ones that describe the dynamics of a node degree and the dynamics of the sum of the degree of its neighbors. All of these characteristics of a node change over time, since each newly born node in the network has a chance to join it or its neighbors in each of the subsequent iterations. These local characteristics are extremely important in the analysis of social networks. Note that in this paper, we do not limit ourselves to studying only the expected behavior of these quantities, but we also analyze the asymptotic behavior of their variances over time in order to better understand the features of their distributions. Determining the characteristics of complex networks is crucial when solving a number of practical problems [1,2,3].
In recent years, many models have emerged to describe the growth of complex networks. One of the difficulties in their creation is the need for the networks generated on their basis to have the features of real networks. These features include the presence of a power law distribution of node degrees in real networks [4,5,6,7,8,9,10]. The Barabási–Albert model [11], which uses the preferential attachment mechanism, successfully reproduces this phenomenon. However, some properties of real networks can only be modeled by adding additional mechanisms to the network generation model [12,13,14,15,16,17,18,19,20,21,22]. Another approach is the modification of the preferential attachment mechanism in such a way to allow a nonlinear dependence of the probability of attaching to a vertex on its degree. This mechanism is called nonlinear preferential attachment (NPA). In this paper, we restrict ourselves to the study of networks built on the use of this rule. Note that the analysis of such networks allows us to shed light on both the properties of real networks that use the NPA mechanism in their development and on the non-obvious limitations of the NPA model.
As networks built on this mechanism grow, the degree of each node may change depending on whether a new node joins it or not. Therefore, all local characteristics of a node are described by some stochastic processes, each subsequent state of which is determined by the current one. Such processes are called Markov processes. For networks built according to the classical Barabási–Albert model [11], the behavior of the expected value, variance, kurtosis, and asymmetry of a node degree over time were studied in [23,24], in addition to other local characteristics, such as the average degree of neighbors and their friendship index. In this paper, we extend this analysis to a model that uses the nonlinear preferential attachment mechanism.
By analyzing the evolution of real social networks over time, we discover the following unusual phenomenon: For a group of nodes that appeared almost simultaneously (in a certain short time interval), the ratio of their average degree to the variance of their degrees is actually a constant at any subsequent point in time. In other words, although the trajectories for the degrees of nodes that appeared almost simultaneously can differ greatly, the coefficient of variation of the values of the degrees of nodes in this group practically does not change in time.
In this regard, the main research question of this paper arises: What mechanisms of evolution of complex networks can explain this phenomenon that is observed in real networks? We consider the Barabási–Albert model and analytically show that the mechanisms of growth and preferential attachment used by this model are sufficient for the appearance of this phenomenon in complex networks.
Thus, in this paper, we are interested in the following research questions:
  • What are the features of stochastic processes that describe the growth of the degree of a node and the growth of the sum of degrees of the neighbors of a node for networks built according to the NPA model?
  • What is the asymptotic behavior of the coefficients of variation for the degree of a node and the summary degree of the neighbors of the node in networks built according to the NPA model?
  • Does the behavior of these local characteristics in simulated networks correspond to their behavior in real social networks?
We answer these questions in our study by using the mean field method to find the asymptotic behavior of the quantities [25,26,27].
Note also that for networks built on the basis of the Barabási–Albert model, the authors of [28] obtained the limit distribution of node degrees, and they showed that after some scaling procedures, this stochastic process converged to a Yule process (in distribution). However, for networks using the nonlinear PA mechanism, finding the degree distributions of nodes seems to be a too difficult task. Therefore, the estimates of the node degree variance obtained in this paper shed light on additional characteristics of this distribution.
The aim of the paper is an analysis of the dynamics of node degrees during the growth of a complex network. In particular, we are interested in how greatly the characteristics of these trajectories depend on the time of node appearance.
First, we present the empirical analysis of real networks in Section 2. The trajectories of node degrees over time for a specific node may vary from those of other nodes that appear in close moments in time. Therefore, we explore the average dynamics of the degree of a group of nodes that appear in one interval of time. We find the average trajectory and its variance for all nodes in a group. The results of the analysis of such trajectories in real social networks show that the average value of the node degrees and their variance are proportional. This means that their ratio remains almost constant for the whole duration of network growth. This phenomenon is inherent in all social networks that are considered. In this study, we observe real networks, the growth of which can be fully recreated. This allows us to study the whole trajectory of the growth of the degree of each node over time.
Next, we examine whether this phenomenon exists in randomly growing networks in Section 4. We choose the Barabási–Albert model with the non-polynomial preferential attachment mechanism, since many other more complex models are based on this mechanism of attachment or modifications thereof. A brief description of the model can be found in Section 3.
We use the following method to analyze the degree trajectories of nodes that appear at some moments in time. First, we simulate network growth many times to obtain trajectories of nodes (fixed by iterations at which they appear). Then, we average these trajectories and obtain their variances for each moment in time. These experiments allow us to prove that the same phenomenon exists for random networks as well.
Our theoretical analysis (Section 4) shows that the coefficient of variation for stochastic processes that describe the dynamics of degree growth tends to be constant with the iteration growth.
The same results are obtained for the total degree of all neighbors for a node in the Barabási–Albert model with the non-polynomial preferential attachment mechanism in Section 5.
The applicability of the findings to actual problems in the study of social media in the digital humanities is discussed in Section 6.

2. Empirical Dynamic Networks

In this section, we study the dynamics of local characteristics in real networks. In order to lower the impacts from individual nodes in the network, we observe the degree and summary degree dynamics not for one node, but for a group of nodes that appear sequentially in the network. Each group contains one hundred nodes. At each iteration, the obtained values of the characteristics are averaged for the group.

2.1. Real Network Overview

We consider three real networks that were taken from online repositories (see Table 1). These networks reflect user behavior on websites where users ask questions and receive answers.

2.2. StackOverflow Network

The StackOverflow service connects millions of software developers from all over the world. This network is represented by a bipartite graph, the nodes of which are users and questions. A user is connected to a question if this user adds the question to bookmarks. One may do this in order to have quick access to the question.
The average degree dynamics obtained for this network are shown in Figure 1. The plots of both the average degrees and variances show logarithmic growth. This network is the first one considered, and both random values follow the logarithmic law. Despite the network growth, the coefficient of variation, which is described as the ratio of the standard deviation to the mean, fluctuates around a constant as shown in Figure 2.

2.3. AskUbuntu Network

AskUbuntu is a popular website for consulting on technological topics related to the Ubuntu operating system. The network data reflect user interactions on the platform. Two nodes (u and v) that represent users are connected by an edge with a time stamp t if user u has either answered or commented on the question or answer of user v at time t.
Plots of the dynamics of the average degree and its variance are presented in Figure 3. A clear logarithmic growth can be seen in the first iterations of network growth. However, for the greater part of the network’s existence, the growth remains linear for both the average degrees in the left plot and the variances in the right plot. The coefficient of variation, which is described as the ratio of the standard deviation to the mean, remains constant for the greater part of the network’s growth as can be seen in Figure 4.

2.4. SuperUser Network

SuperUser is another popular Q&A website for technological topics. As with the previous network, the data represent user interactions on the platform. Two nodes (u and v) that represent users are connected by an edge with a time stamp t if:
  • User u answers the question of user v at time t;
  • User u comments on the question of user v at time t;
  • User u comments on the answer of user v at time t;
After network modeling, we acquired the node trajectories, as shown in Figure 5. It can be clearly seen that these trajectories are very reminiscent of those obtained for the AskUbuntu network (Figure 3). It should be noted that the group of nodes that appeared between iterations 400 and 500 differed significantly from the others. The increased values of the characteristics are explained by the presence of a couple of very large nodes inside this group. In Figure 6 we see that still, as in the previous empirically growing networks, the coefficient of variation fluctuated around the constant during the whole process of network growth.
To sum everything up, in real networks, both the degree and its variance grow over time. Moreover, the coefficient of variation quickly converges to a constant. This phenomenon is observed in many social networks. Since the Barabási–Albert model usually produces networks with characteristics that are similar to those of real social networks, we observe this phenomenon in networks produced by the model.

3. Barabási–Albert Model with Nonlinear Preferential Attachment

3.1. Notations and Definitions

We use G t = { V t , E t } to denote a network at time t with a set of its nodes V t = { v 1 , , v t } and a set of its links E t . We denote the degree of node v i at time t as d i ( t ) . The fixed integer m N is a model parameter and denotes a constant number of attached links at each iteration. The quantity 0 < γ < 1 is another model parameter that describes the power law exponent of the dependence of the node degree on the link probability in the NPA rule.
According to the Barabási–Albert model with the NPA mechanism, the network G t + 1 is evolved from a previous state G t (at moments in time t + 1 = m + 1 , m + 2 , ) as follows:
  • At time t = m , G m = { V m , E m } is a complete graph with m nodes;
  • The network adds one newly born node v t + 1 , i.e., V t + 1 = V t { v t + 1 } ;
  • m links are added to the network; they connect the newly born node v t + 1 with m already existing nodes; each of these links appears as the result of the application of NPA rule: We use the discrete random variable ξ t + 1 , which takes the value i with probability
    P ( ξ t + 1 = i ) = d i γ ( t ) v j V t d j γ ( t ) .
    In the case of ξ t + 1 = i , we add a link ( v t + 1 , v i ) to the network. We make m independent random experiments.
Let the random value ξ i t + 1 be such that it selects the value i if ξ t + 1 takes i in at least one of the m experiments at time t + 1 .
d i ( t ) denotes the degree of node v i at iteration t. s i ( t ) denotes the sum of all neighbor degrees of node v i at iteration t. Then, α i ( t ) : = s i ( t ) d i ( t ) is the average degree of all neighbors of node v i at time t.
The time evolution of these characteristics can be represented as non-stationary Markov processes, i.e., processes for which their random values at time moment t depend only on the network state at the previous moment. Asymptotic estimates of their expected values and variations at iteration t are found in [24,30,31]:
E ( d i ( t ) ) = m t i 1 2 , Var ( d i ( t ) ) = m t i t i 1 2 ,
E ( s i ( t ) ) m 2 2 t i 1 2 ( log t + C ) , Var ( s i ( t ) ) m 3 4 log 2 t i 6 log i log t t i ,
E α i ( t ) m 2 log t + a , Var ( α i ( t ) ) const .
The aim of this paper is to investigate the asymptotic behavior of stochastic processes over time for networks generated by the NPA model. In this paper, we focus on estimating the first two moments, i.e., the expectations and variances.

3.2. The Evolution of Barabási–Albert Networks with the NPA Mechanism 

It follows from the description of the iteration step in the Barabási–Albert model with the NPA mechanism that:
  • If ξ t + 1 = i , then d i ( t + 1 ) = d i ( t ) + 1 and s i ( t + 1 ) = s i ( t ) + m , since node v i links to the newly born node v t + 1 that has degree m.
  • If ξ t + 1 = j and ( v j , v i ) V t , then the newly born node v t + 1 links to one of the neighbors v j of node v i , and we have d i ( t + 1 ) = d i ( t ) and s i ( t + 1 ) = s i ( t ) + 1 .
Let ξ i t + 1 and η i t + 1 be auxiliary random variables defined by
ξ i t + 1 = 1 , ( v t + 1 , v i ) V t + 1 0 , otherwise ,
η i t + 1 = 1 , ( v t + 1 , v j ) V t + 1 and ( v j , v i ) V t , 0 , otherwise .
We denote
μ ( t , γ ) : = 1 m t j = 1 t d j γ ( t ) .
Lemma 1. 
The value of μ ( t , γ ) satisfies 1 m < μ ( t , γ ) < 2 and asymptotically follows
μ ( t , γ ) μ ( γ ) = const , t .
Proof. 
Let us estimate v j V t d j γ ( t ) from below and above. First, we should note that v j V t d j ( t ) = 2 m t . Then, we get
lim γ 0 v j V t d j γ ( t ) = t ,
lim γ 1 v j V t d j γ ( t ) = v j V t d j ( t ) = 2 m t .
Therefore, we have
1 m < μ ( t , γ ) < 2 .
Since μ ( t , γ ) is bounded for every fixed 0 < γ < 1 and increases over γ , there exists lim t μ ( t , γ ) , which we denote by μ ( γ ) : = lim t μ ( t , γ ) (see Figure 7). □
Then, the expectations of ξ i t + 1 and η i t + 1 at iteration t + 1 —conditional with respect to the current network state G t —are equal to
E ( ξ i t + 1 | G t ) = d i γ ( t ) t μ ( t , γ ) , E ( η i t + 1 | G t ) = v j : ( v i , v j ) E t d j γ ( t ) t μ ( t , γ ) .
Lemma 2. 
The value of v j : ( v i , v j ) E t d j γ ( t ) is asymptotically equal to s i γ ( t ) :
v j : ( v i , v j ) E t d j γ ( t ) s i γ ( t ) 1 , t .
Proof. 
By the definition of d j ( t ) , we have d j ( t ) as t . There are a neighbor v k of node v i and an infinite subsequence { t l } such that d j ( t l ) d k ( t i ) 0 as t l for all j such that ( v j , v i ) V t l and j k . Then, we have
lim t l v j : ( v i , v j ) E t d j γ ( t l ) v j : ( v i , v j ) E t l d j γ ( t ) γ ( t l ) = lim t l 1 + v j : ( v i , v j ) E t l , j k d j ( t l ) d k ( t l ) γ 1 + v j : ( v i , v j ) E t l , j k d j ( t l ) d k ( t l ) γ = 1 .
Since we are interested in the behavior of the network characteristics for sufficiently large sizes of networks, we will use the asymptotic estimate (6) in our further reasoning.
It follows from Lemma 1 that
E ( ξ i t + 1 | G t ) d i γ ( t ) t μ ( γ ) .
It follows from Lemmas 1 and 2 that
E ( η i t + 1 | G t ) s i γ ( t ) t μ ( γ ) .

4. An Investigation into the Dynamics of the Node Degree: The Evolution of Its Expectation and Variance over Time

4.1. The Expectation of d i ( t )

Theorem 1. 
If  0 < γ < 1 , then the expectation of d i ( t ) asymptotically follows
E ( d i ( t ) ) 1 γ μ ( γ ) log c 1 t i 1 1 γ ,
where constant c 1 is equal to
c 1 = exp μ ( γ ) m 1 γ 1 γ .
If γ = 1 , then the expectation of d i ( t ) follows
E ( d i ( t ) ) = m t i 1 2 .
Proof. 
It follows from the analysis in Section 3.2 that the difference between values of d i at moments t + 1 and t is equal to
d i ( t + 1 ) d i ( t ) = ( d i ( t ) + 1 ) ξ t + 1 + d i ( t ) ( 1 ξ t + 1 ) d i ( t ) = ξ t + 1 .
Using Equation (7), we get the difference equation
Δ d i ( t ) d i γ ( t ) μ ( γ ) t ,
with the corresponding approximate differential equation (with f ( t ) = d i ( t ) )
d f ( t ) d t = f γ ( t ) μ ( γ ) t ,
from which we get its solution
f ( t ) = d i ( t ) = 1 γ μ ( γ ) log c 1 t i 1 1 γ ,
where c 1 is a constant. Then, (10) follows from the initial condition f ( i ) = m .
Equation (11) is well known [11]. □
Let us now simulate the growth of NPA networks to illustrate the behavior of the mean value of d i ( t ) ) : T = 1000 independent simulations of BA networks with the NPA mechanism were carried out, and each graph had N = 160,000 nodes. Then, we calculated the means of the empirical values of d i ( t ) . We used the value of parameter m = 5 and three different values of γ . The trajectories are shown in Figure 8.

4.2. The Variance of d i ( t )

Lemma 3. 
Let 0 < γ < 1 be a constant. Then, the asymptotic behavior of the expectation of d i γ ( t ) can be described as follows:
E ( d i γ ( t ) ) 1 γ μ ( γ ) log c 1 t i γ 1 γ ,
where the constant c 1 is defined in (10).
Proof. 
We have
Δ d i γ ( t + 1 ) : = d i γ ( t + 1 ) d i γ ( t ) = ( d i ( t ) + 1 ) γ ξ i t + 1 + d i γ ( t ) ( 1 ξ i t + 1 ) d i γ ( t ) = 1 + 1 d i ( t ) γ d i γ ξ i t + 1 + d i γ ( t ) ξ i t + 1 1 + γ d i ( t ) d i γ ξ i t + 1 d i γ ( t ) ξ i t + 1 = γ d i γ 1 ξ i t + 1 .
Using Equation (7), we obtain the conditional expectation of Δ d i 2 ( t ) at moment t + 1 :
E ( Δ d i γ ( t ) | G t ) = γ d i 2 γ 1 ( t ) μ ( γ ) t .
If we denote E ( Δ d i γ ( t ) | G t ) as Δ f ( t ) and substitute Δ f ( t ) with d f ( t ) d t , then Equation (14) can be approximated by
d f ( t ) d t = γ f 2 1 γ ( t ) μ ( γ ) t ,
the solution of which is
f ( t ) = 1 γ μ ( γ ) log c t i γ 1 γ ,
where c is a constant that can be found using the initial condition f ( i ) = m γ . Thus, we get Equation (12). □
Lemma 4. 
If 0 < γ < 1 , then the second moment of d i ( t ) follows:
E ( d i 2 ( t ) ) c 2 1 γ μ ( γ ) log t 2 1 γ ,
where c 2 = m 2 1 γ μ ( γ ) log i 2 1 γ is a constant.
If γ = 1 , then the second moment of d i ( t ) follows:
E ( d i 2 ( t ) ) = m ( m + 1 ) t i m t i 1 2 .
Proof. 
We have
Δ d i 2 ( t + 1 ) : = d i 2 ( t + 1 ) d i 2 ( t ) = ( d i ( t ) + 1 ) 2 ξ i t + 1 + d i 2 ( t ) ( 1 ξ i t + 1 ) d i 2 ( t ) = ( 2 d i ( t ) + 1 ) ξ i t + 1 .
Using Equation (7), we obtain the conditional expectation of Δ d i 2 ( t ) at moment t + 1 :
E ( Δ d i 2 ( t ) | G t ) = 2 d i γ + 1 ( t ) μ ( γ ) t + d i γ ( t ) μ ( γ ) t .
If we denote E ( Δ d i 2 ( t ) | G t ) as Δ f ( t ) , substitute Δ f ( t ) with d f ( t ) d t , and change d i γ ( t ) in the second term with its expectation by using Lemma 3, then Equation (18) can be represented by its approximate version as follows:
d f ( t ) d t = 2 f 1 2 ( γ + 1 ) ( t ) μ ( γ ) t + 1 μ ( γ ) t 1 γ μ ( γ ) log c 1 t i γ 1 γ ,
where c 1 is defined in Equation (10).
The solution of the equation is
f ( t ) c 2 1 γ μ ( γ ) log t 2 1 γ 1 γ μ ( γ ) log t 1 1 γ ,
where c 2 is a constant. Thus, we get (15) if we take into account the initial condition, d i 2 ( i ) = m 2 .
Equation (16) is proved in [23]. □
Theorem 2. 
If  0 < γ < 1 , then the variance of d i ( t ) at moment t asymptotically follows:
Var ( d i ( t ) ) ( c 2 1 ) 1 γ μ ( γ ) log t 2 1 γ ,
where c 2 = m 2 1 γ μ ( γ ) log i 2 1 γ .
If γ = 1 , then the variance of d i ( t ) at moment t is
Var ( d i ( t ) ) = m t i t i 1 2 .
Proof. 
The definition of variance implies
Var ( d i ( t ) ) = E ( d i 2 ( t ) ) E 2 ( d i ( t ) ) .
Then, Theorem follows from Lemma 4 and Theorem 1.
The case of γ = 1 is proved in [23]. □
Let us now simulate the growth of NPA networks to illustrate the behavior of the standard deviation of d i ( t ) : T = 1000 independent simulations were carried out for BA networks with the NPA mechanism, and each graph consisted of N = 160,000 nodes. Then, we calculated the square root of the means of the empirical values of ( d i ( t ) E ( d i ( t ) ) ) 2 over 1000 realizations. We used the parameter value of m = 5 and three different values of γ . The trajectories are shown in Figure 9.
In the case of 0 < γ < 1 , the standard deviation of d i ( t ) , which is equal to Var ( d i ( t ) ) , is the same order of magnitude as E ( d i ( t ) ) :
Var ( d i ( t ) ) E ( d i ( t ) ) c 2 1 as t .
If γ = 1 , then the coefficient of variation is equal to
Var ( d i ( t ) ) E ( d i ( t ) ) m m + 1 as t .
We can conclude that the distribution of d i ( t ) is a high-variance one. The trajectories of the empirical values of the variation coefficient in the BA networks with the NPA generation mechanism are shown in Figure 10 for selected nodes of i = 10 , 50 , 100 as t was iterated up to 160,000. The network in (a) was modeled with m = 5 and γ = 0.25 , that in (b) was modeled with m = 5 and γ = 0.5 , and that in (c) was modeled with m = 5 and γ = 0.75 . The figures clearly show that the expectations and the standard deviations of d i ( t ) are commensurate and the dynamics of their ratio tend to a constant.

5. The Evolution of s i ( t ) : Its Expectation and Variance

In this section, we study the behavior of the random variable s i ( t ) , i.e., the sum of the degrees of the neighbors of node v i at moment t. Here, we find the characterizations of the stochastic process of s i ( t ) for BA networks with the NPA mechanism.

5.1. Dynamics of the Total Degree of Node Neighbors in BA Networks with the NPA Mechanism

Theorem 3. 
If  0 < γ < 1 , then the expectation of s i ( t ) asymptotically follows:
E ( s i ( t ) ) 1 γ μ ( γ ) log t 1 1 γ m 1 γ log log t + c 4 ,
where c 4 > 0 is a constant.
If γ = 1 , then the variance of d i ( t ) at moment t is
E ( s i ( t ) ) m 2 2 t i 1 2 log t .
Proof. 
We estimate the change in the value of s i ( t ) after one iteration t + 1 , in which one newly born vertex is added to the network. The analysis in Section 3.2 gives
Δ s i ( t + 1 ) = s i ( t + 1 ) s i ( t ) =     ( s i ( t ) + m ) ξ i t + 1 + ( s i ( t ) + 1 ) η i t + 1 + s i ( t ) ( 1 ξ i t + 1 η i t + 1 ) s i ( t ) = m ξ i ( t + 1 ) + η i ( t + 1 ) .
Then, from (7), (8), and Lemma 2, we get the difference equation:
E ( Δ s i ( t ) | G t ) m d i γ ( t ) μ ( γ ) t + s i γ ( t ) μ ( γ ) t ,
which corresponds to the differential equation (if we denote E ( Δ s i 2 ( t ) | G t ) as Δ f ( t ) and substitute Δ f ( t ) with d f ( t ) d t ):
d f ( t ) d t = m d i γ ( t ) μ ( γ ) t + f γ ( t ) μ ( γ ) t .
Using Lemma 3, we get the solution of (23) in the form of
f ( t ) = 1 γ μ ( γ ) log t 1 1 γ m 1 γ log log t + c 4 ,
where c 4 is a constant, and we obtain the theorem.
In the case of γ = 1 , the expectation of this random variable E ( s i ( t ) ) was obtained in [30] (for m = 1 ) and [24] (for arbitrary m):
E ( s i ( t ) ) = m 2 2 t i 1 2 log t + 1 m log i 2 m .
To exhibit the results, we carried out T = 1000 independent repetitions; in each of them, the BA graph was simulated for N = 160,000 iterations with m = 5 and three different values of γ . Then, the empirical values of E ( s i ( t ) ) were obtained. The results are presented in Figure 11, which depicts the trajectories of s i ( t ) averaged over 1000 independent simulations. The empirical behavior of s i ( t ) followed the prediction of Equation (20).

5.2. The Variance of s i ( t )

Lemma 5. 
If 0 < γ < 1 , then the second moment of s i ( t ) asymptotically follows:
E ( s i 2 ( t ) ) c 5 m 2 ( 1 γ ) 2 1 γ μ ( γ ) log t 2 1 γ ( log log t ) 2 ,
where c 5 > 1 is a constant.
If γ = 1 , then the second moment of s i ( t ) is equal to
E ( s i 2 ( t ) ) = t i m 3 ( m + 1 ) 4 log 2 t m 3 log t + m 2 ( m + 1 ) log 2 i 4 log i + 1 .
Proof. 
If we take into account the reasoning in Section 3.2, then the difference between the values of s i 2 ( t ) at iterations t and t + 1 is
Δ s i 2 ( t ) : = s i 2 ( t + 1 ) s i 2 ( t ) = ξ i t + 1 ( s i ( t ) + m ) 2 + η i t + 1 ( s i ( t ) + 1 ) 2 + ( 1 ξ i t + 1 η i t + 1 ) s i 2 ( t ) s i 2 ( t ) = ξ i t + 1 ( 2 m s i ( t ) + m 2 ) + η i t + 1 ( 2 s i ( t ) + 1 ) .
If we find the conditional expectation at moment t, then we can get
E ( Δ s i 2 ( t + 1 ) | G t ) = E ( ξ i t + 1 | G t ) ( 2 m s i ( t ) + m 2 ) + E ( η i t + 1 | G t ) ( 2 s i ( t ) + 1 ) = 2 s i 1 + γ μ ( γ ) t + 2 m d i γ ( t ) s i ( t ) μ ( γ ) t + s i γ ( t ) μ ( γ ) t + m 2 d i γ ( t ) μ ( γ ) t .
Using the mean field approach, it can be proved that
E ( d i γ ( t ) s i ( t ) ) c 5 m 1 γ 1 γ μ ( γ ) log t 1 + γ 1 γ log log t ,
where c 5 > 1 is a constant.
By using Equations (20) and (28), passing to the mathematical expectation of both sides, and making the substitution f = E ( s i 2 ( t ) | G t ) for convenience, we get the following approximate differential equation:
d f d t 2 f 1 2 ( 1 + γ ) μ ( γ ) t + 2 c 5 m 2 μ ( γ ) ( 1 γ ) t 1 γ μ ( γ ) log t 1 + γ 1 γ log log t .
Then, f asymptotically follows:
f ( t ) c 5 m 2 ( 1 γ ) 2 1 γ μ ( γ ) log t 2 1 γ ( log log t ) 2 ,
where c 5 > 1 is a constant.
The case of γ = 1 was considered in [23]. Thus, we get the lemma. □
Theorem 4. 
If  0 < γ < 1 , then the variance of s i ( t ) asymptotically follows:
Var ( s i ( t ) ) = ( c 5 1 ) m 2 ( 1 γ ) 2 1 γ μ ( γ ) log t 2 1 γ ( log log t ) 2 .
If γ = 1 , then the variance of s i ( t ) is
Var ( s i ( t ) ) = m 3 4 log 2 t i 6 log i log t t i .
Proof. 
Since Var ( s i ( t ) ) = E ( s i 2 ( t ) ) E 2 ( s i ( t ) ) , the statement is the consequence of Theorem 3 and Lemma 5. □
To exhibit the results, we carried out T = 1000 independent repetitions; in each of them, the BA graph was simulated for N = 160,000 iterations with m = 5 and three different values of γ . Then, the empirical values of mean ( s i ( t ) mean ( s i ( t ) ) ) 2 were obtained. The results are presented in Figure 12, which depicts the trajectories of the standard deviations of s i ( t ) averaged over 1000 independent simulations.
In the case of 0 < γ < 1 , the standard deviation of s i ( t ) , which is equal to Var ( s i ( t ) ) , tends to infinity with the same rate as E ( s i ( t ) ) :
Var ( s i ( t ) ) E ( s i ( t ) ) c 2 1 as t .
If γ = 1 , then the coefficient of variation tends to a constant as well:
Var ( s i ( t ) ) E ( s i ( t ) ) 1 m as t .
The trajectories of the empirical values of the variation coefficient for s i ( t ) in BA networks with the NPA generation mechanism are shown in Figure 13 for selected nodes i = 10 , 50 , 100 as t is iterated up to 160,000. The network in (a) was modeled with m = 5 and γ = 0.25 , that in (b) was modeled with m = 5 and γ = 0.5 , and that in (c) was modeled with m = 5 and γ = 0.75 . The figures clearly show that the expectations and the standard deviations of s i ( t ) are commensurate and the dynamics of their ratio tend to a constant.

6. Conclusions and Discussion

In this paper, we investigated the dynamics of some local characteristics in complex social networks over time. At a particular moment of a network’s evolution, every vertex can be characterized by its degree, as well as the total degree of its neighbors, among other attributes. As networks evolve, these qualities are adjusted as new links are attached to a vertex or its neighbors. We analyzed the evolution of these qualities over time in three real social networks. This empirical evidence showed that in real networks, the ratio of the average degree of neighbors to the variance remains at an almost constant value for groups of nodes that appear in sequential iterations.
Then, we studied the presence of this phenomenon in networks generated on the basis of the nonlinear preferential attachment mechanism. We proved that the asymptotics of their expected values and the characteristics of their spread around the mean are of the same magnitude. Therefore, we found out that the mechanism of preferential attachment explains the phenomenon that is seen in real networks.
The obtained conclusions are heuristic for the digital humanities—first of all, for its directions, with a concentration on the mechanisms of popularity of social media content. The algorithms of the most popular mass social networks are not transparent to researchers, although the political and marketing agenda of the last decade has increasingly demonstrated their manageability and controllability. If the principle of distribution of popularity is justified as a mathematical regularity, it becomes possible to “calibrate” an observed situation and demarcate situations of “natural” and “controlled” content popularity.
The second perspective is related to the study of social media agendas, which are usually considered in analogy with media agendas, where the structuring of the content of an agenda is usually interpreted as a competition of expected and extraordinary events and is enhanced by various forms of institutional censorship. These findings are significant for the understanding of the influence of a node’s activity on the popularity of the content that it produces.
The third perspective is related to the study of digital historical memory. In memory studies, the question of if historical knowledge is a background for the media environment, is present in certain quantities in the agenda in any period, or competes with any other content for popularity and is updated when the subject matter of the agenda allows remains unresolved.

Author Contributions

Conceptualization, S.S. and S.M.; methodology, S.S.; software, S.M. and A.G.; validation, S.S., S.M. and A.G.; formal analysis, S.M.; writing—original draft preparation, S.S. and S.M.; writing—review and editing, S.M. and S.T.; visualization, S.M. and A.G.; supervision, S.S.; data preparation, A.G.; discussion, S.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Russian Science Foundation, project 22-18-00153.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
BABarabási–Albert
PAPreferential attachment
NPANonlinear preferential attachment

References

  1. Ye, H.; Sun, X.C.; Lu, Z.M.; Jiang, X.; Zhao, K.; Nie, T. Interest Point Detection Using a Composite Index of Complex Networks. J. Inf. Hiding Multimed. Signal Process. 2018, 9, 265–273. [Google Scholar]
  2. Guo, X.L.; Lu, Z.M. Urban Road Network and Taxi Network Modeling Based on Complex Network Theory. J. Inf. Hiding Multimed. Signal Process. 2016, 7, 558–568. [Google Scholar]
  3. Chen, X.; Lu, Z.M. Protection on Complex Networks with Geometric and Scale-Free Properties. J. Inf. Hiding Multimed. Signal Process. 2016, 7, 569–575. [Google Scholar]
  4. Faloutsos, M.; Faloutsos, P.; Faloutsos, C. On Power-Law Relationships of the Internet Topology. SIGCOMM Comput. Commun. Rev. 1999, 29, 251–262. [Google Scholar] [CrossRef]
  5. Kleinberg, J.M.; Kumar, R.; Raghavan, P.; Rajagopalan, S.; Tomkins, A.S. The Web as a Graph: Measurements, Models, and Methods. In Proceedings of the Computing and Combinatorics; Asano, T., Imai, H., Lee, D.T., Nakano, S.i., Tokuyama, T., Eds.; Springer: Berlin/Heidelberg, Germany, 1999; pp. 1–17. [Google Scholar] [CrossRef]
  6. Clauset, A.; Shalizi, C.R.; Newman, M.E.J. Power-Law Distributions in Empirical Data. SIAM Rev. 2009, 51, 661–703. [Google Scholar] [CrossRef] [Green Version]
  7. De Solla Price, D. Networks of scientific papers. Science 1976, 149, 292–306. [Google Scholar]
  8. Klaus, A.; Yu, S.; Plenz, D. Statistical Analyses Support Power Law Distributions Found in Neuronal Avalanches. PLoS ONE 2011, 6, e19779. [Google Scholar] [CrossRef] [Green Version]
  9. Newman, M.E.J. The Structure and Function of Complex Networks. SIAM Rev. 2003, 45, 167–256. [Google Scholar] [CrossRef] [Green Version]
  10. Albert, R.; Barabási, A.L. Statistical mechanics of complex networks. Rev. Mod. Phys. 2002, 74, 47–97. [Google Scholar] [CrossRef] [Green Version]
  11. Barabási, A.L.; Albert, R. Emergence of Scaling in Random Networks. Science 1999, 286, 509–512. [Google Scholar] [CrossRef] [Green Version]
  12. Dorogovtsev, S.N.; Mendes, J.F.F.; Samukhin, A.N. Structure of growing networks with preferential linking. Phys. Rev. Lett. 2000, 85, 4633–4636. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Krapivsky, P.L.; Redner, S. Organization of growing random networks. Phys. Rev. E 2001, 63, 066123. [Google Scholar] [CrossRef] [Green Version]
  14. Krapivsky, P.L.; Redner, S.; Leyvraz, F. Connectivity of growing random networks. Phys. Rev. Lett. 2000, 85, 4629–4632. [Google Scholar] [CrossRef] [PubMed]
  15. Sidorov, S.; Mironov, S. Growth network models with random number of attached links. Phys. Stat. Mech. Its Appl. 2021, 576, 126041. [Google Scholar] [CrossRef]
  16. Tsiotas, D. Detecting differences in the topology of scale-free networks grown under time-dynamic topological fitness. Sci. Rep. 2020, 10, 10630. [Google Scholar] [CrossRef]
  17. Pal, S.; Makowski, A.M. Asymptotic Degree Distributions in Large (Homogeneous) Random Networks: A Little Theory and a Counterexample. IEEE Trans. Netw. Sci. Eng. 2020, 7, 1531–1544. [Google Scholar] [CrossRef] [Green Version]
  18. Rak, R.; Rak, E. The fractional preferential attachment scale-free network model. Entropy 2020, 22, 509. [Google Scholar] [CrossRef]
  19. Cinardi, N.; Rapisarda, A.; Tsallis, C. A generalised model for asymptotically-scale-free geographical networks. J. Stat. Mech. Theory Exp. 2020, 2020, 043404. [Google Scholar] [CrossRef] [Green Version]
  20. Shang, K.K.; Yang, B.; Moore, J.M.; Ji, Q.; Small, M. Growing networks with communities: A distributive link model. Chaos 2020, 30, 041101. [Google Scholar] [CrossRef]
  21. Bertotti, M.L.; Modanese, G. The configuration model for Barabasi-Albert networks. Appl. Netw. Sci. 2019, 4, 32. [Google Scholar] [CrossRef] [Green Version]
  22. Pachon, A.; Sacerdote, L.; Yang, S. Scale-free behavior of networks with the copresence of preferential and uniform attachment rules. Phys. D Nonlinear Phenom. 2018, 371, 1–12. [Google Scholar] [CrossRef] [Green Version]
  23. Sidorov, S.; Mironov, S.; Agafonova, N.; Kadomtsev, D. Temporal Behavior of Local Characteristics in Complex Networks with Preferential Attachment-Based Growth. Symmetry 2021, 13, 1567. [Google Scholar] [CrossRef]
  24. Sidorov, S.P.; Mironov, S.V.; Grigoriev, A.A. Friendship paradox in growth networks: Analytical and empirical analysis. Appl. Netw. Sci. 2021, 6, 35. [Google Scholar] [CrossRef]
  25. Barabási, A.; Albert, R.; Jeong, H. Mean-field theory for scale-free random networks. Phys. A Stat. Mech. Its Appl. 1999, 272, 173–187. [Google Scholar] [CrossRef] [Green Version]
  26. Kadanoff, L. More is the Same; Phase Transitions and Mean Field Theories. J. Stat. Phys. 2009, 137, 777–797. [Google Scholar] [CrossRef]
  27. Parr, T.; Sajid, N.; Friston, K.J. Modules or Mean-Fields? Entropy 2020, 22, 552. [Google Scholar] [CrossRef]
  28. Pachon, A.; Polito, F.; Sacerdote, L. On the continuous-time limit of the Barabási-Albert random graph. Appl. Math. Comput. 2020, 378, 125177. [Google Scholar] [CrossRef] [Green Version]
  29. Paranjape, A.; Benson, A.R.; Leskovec, J. Motifs in Temporal Networks. In Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, New York, NY, USA, 6–10 February 2017. [Google Scholar] [CrossRef] [Green Version]
  30. Sidorov, S.; Mironov, S.; Malinskii, I.; Kadomtsev, D. Local Degree Asymmetry for Preferential Attachment Model. Stud. Comput. Intell. 2021, 944, 450–461. [Google Scholar] [CrossRef]
  31. Sidorov, S.; Mironov, S.; Tyshkevich, S. Surprising Behavior of the Average Degree for a Node’s Neighbors in Growth Networks. In Proceedings of the Complex Networks & Their Applications X; Benito, R.M., Cherifi, C., Cherifi, H., Moro, E., Rocha, L.M., Sales-Pardo, M., Eds.; Springer: Cham, Switzerland, 2022; pp. 463–474. [Google Scholar]
Figure 1. Trajectories of the averaged degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of users adding posts to their favourites on StackOverflow.
Figure 1. Trajectories of the averaged degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of users adding posts to their favourites on StackOverflow.
Systems 10 00249 g001
Figure 2. Trajectories of the averaged sum of degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of users adding posts to their favourites on StackOverflow.
Figure 2. Trajectories of the averaged sum of degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of users adding posts to their favourites on StackOverflow.
Systems 10 00249 g002
Figure 3. Trajectories of the averaged degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of user interactions inside the question and answer service of AskUbuntu.
Figure 3. Trajectories of the averaged degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of user interactions inside the question and answer service of AskUbuntu.
Systems 10 00249 g003
Figure 4. Trajectories of the averaged sum of degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of user interactions inside the question and answer service of AskUbuntu.
Figure 4. Trajectories of the averaged sum of degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of user interactions inside the question and answer service of AskUbuntu.
Systems 10 00249 g004
Figure 5. Trajectories of the averaged degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of user interactions inside the question and answer service of SuperUser.
Figure 5. Trajectories of the averaged degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of user interactions inside the question and answer service of SuperUser.
Systems 10 00249 g005
Figure 6. Trajectories of the averaged sum of degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of user interactions inside the question and answer service of SuperUser.
Figure 6. Trajectories of the averaged sum of degrees (top left), variances (top right), and coefficients of variation (bottom) for three groups of nodes from iterations (21, 121), (401, 501), and (901, 1001) over the growth of the network based on the data of user interactions inside the question and answer service of SuperUser.
Systems 10 00249 g006
Figure 7. The dynamics of the ratio 1 t v j V t d j γ ( t ) in BA networks with t up to 160,000 iterations. Networks are generated with m = 5 and γ = 0.25 , γ = 0.5 , and γ = 0.75 .
Figure 7. The dynamics of the ratio 1 t v j V t d j γ ( t ) in BA networks with t up to 160,000 iterations. Networks are generated with m = 5 and γ = 0.25 , γ = 0.5 , and γ = 0.75 .
Systems 10 00249 g007
Figure 8. Mean value of the node degree in networks generated by the BA model with nonlinear PA for the node v i , i = 10 , 50 , 100 , with t going up to 160,000 iterations. Networks were generated for m = 5 and for the values of γ = 0.25 (a), γ = 0.5 (b), and γ = 0.75 (c).
Figure 8. Mean value of the node degree in networks generated by the BA model with nonlinear PA for the node v i , i = 10 , 50 , 100 , with t going up to 160,000 iterations. Networks were generated for m = 5 and for the values of γ = 0.25 (a), γ = 0.5 (b), and γ = 0.75 (c).
Systems 10 00249 g008
Figure 9. The dynamics of the standard deviation of node degrees in networks generated by the BA model with nonlinear PA for the node v i , i = 10 , 50 , 100 , with t going up to 160,000 iterations. The networks were generated with m = 5 and the values of γ = 0.25 (a), γ = 0.5 (b), and γ = 0.75 (c).
Figure 9. The dynamics of the standard deviation of node degrees in networks generated by the BA model with nonlinear PA for the node v i , i = 10 , 50 , 100 , with t going up to 160,000 iterations. The networks were generated with m = 5 and the values of γ = 0.25 (a), γ = 0.5 (b), and γ = 0.75 (c).
Systems 10 00249 g009
Figure 10. The dynamics of the empirical values of the variation coefficient of the node degrees in networks generated by the BA model with nonlinear PA for the node v i , i = 10 , 50 , 100 , with t going up to 160,000 iterations. The networks were generated with m = 5 and the values of γ = 0.25 (a), γ = 0.5 (b), and γ = 0.75 (c).
Figure 10. The dynamics of the empirical values of the variation coefficient of the node degrees in networks generated by the BA model with nonlinear PA for the node v i , i = 10 , 50 , 100 , with t going up to 160,000 iterations. The networks were generated with m = 5 and the values of γ = 0.25 (a), γ = 0.5 (b), and γ = 0.75 (c).
Systems 10 00249 g010
Figure 11. Dynamics of the sum of the neighbors’ degrees in BA networks with the NPA generation mechanism for selected nodes v i , i = 10 , 50 , 100 , as t grows to 25,000. The network in (a) was modeled with m = 3 and γ = 0.25 , (b) was simulated with m = 5 and γ = 0.5 , and (c) was simulated with m = 5 and γ = 0.75 .
Figure 11. Dynamics of the sum of the neighbors’ degrees in BA networks with the NPA generation mechanism for selected nodes v i , i = 10 , 50 , 100 , as t grows to 25,000. The network in (a) was modeled with m = 3 and γ = 0.25 , (b) was simulated with m = 5 and γ = 0.5 , and (c) was simulated with m = 5 and γ = 0.75 .
Systems 10 00249 g011
Figure 12. Trajectories of empirical values for mean ( s i ( t ) mean ( s i ( t ) ) ) 2 in BA networks with the NPA mechanism for selected nodes i = 10 , 50 as t was iterated up to 160,000. The network in (a) was modeled with m = 5 and γ = 0.25 , that in (b) was modeled with m = 5 and γ = 0.5 , and that in (c) was modeled with m = 5 and γ = 0.75 .
Figure 12. Trajectories of empirical values for mean ( s i ( t ) mean ( s i ( t ) ) ) 2 in BA networks with the NPA mechanism for selected nodes i = 10 , 50 as t was iterated up to 160,000. The network in (a) was modeled with m = 5 and γ = 0.25 , that in (b) was modeled with m = 5 and γ = 0.5 , and that in (c) was modeled with m = 5 and γ = 0.75 .
Systems 10 00249 g012
Figure 13. Trajectories of empirical values for the variation coefficient of s i ( t ) in BA networks with the NPA mechanism for selected nodes i = 10 , 50 as t iterates up to 160,000. The network in (a) was modeled with m = 5 and γ = 0.25 , that in (b) was modeled with m = 5 and γ = 0.5 , and that in (c) was modeled with m = 5 and γ = 0.75 .
Figure 13. Trajectories of empirical values for the variation coefficient of s i ( t ) in BA networks with the NPA mechanism for selected nodes i = 10 , 50 as t iterates up to 160,000. The network in (a) was modeled with m = 5 and γ = 0.25 , that in (b) was modeled with m = 5 and γ = 0.5 , and that in (c) was modeled with m = 5 and γ = 0.75 .
Systems 10 00249 g013
Table 1. Dynamic network statistics.
Table 1. Dynamic network statistics.
Real Network | N | | E | Other Traits
StackOverflow users and posts [29]545,2001,300,000Bipartite networks
AskUbuntu reactions [29]159,000964,000-
SuperUser reactions [29]194,0001,400,000-
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Sidorov, S.; Mironov, S.; Grigoriev, A.; Tikhonova, S. An Investigation into the Trend Stationarity of Local Characteristics in Media and Social Networks. Systems 2022, 10, 249. https://doi.org/10.3390/systems10060249

AMA Style

Sidorov S, Mironov S, Grigoriev A, Tikhonova S. An Investigation into the Trend Stationarity of Local Characteristics in Media and Social Networks. Systems. 2022; 10(6):249. https://doi.org/10.3390/systems10060249

Chicago/Turabian Style

Sidorov, Sergei, Sergei Mironov, Alexey Grigoriev, and Sophia Tikhonova. 2022. "An Investigation into the Trend Stationarity of Local Characteristics in Media and Social Networks" Systems 10, no. 6: 249. https://doi.org/10.3390/systems10060249

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop