Next Article in Journal
Network Security Node-Edge Scoring System Using Attack Graph Based on Vulnerability Correlation
Next Article in Special Issue
Reliability of Social Networks on Activity-on-Node Binary-State with Uncertainty Environments
Previous Article in Journal
A Parametric Approach for Conceptual Integration and Performance Studies of Liquid Hydrogen Short–Medium Range Aircraft
Previous Article in Special Issue
Modeling and Profiling of Aggregated Industrial Network Traffic
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New BAT and PageRank Algorithm for Propagation Probability in Social Networks

1
Department of Industrial Engineering and Engineering Management, National Tsing Hua University, Hsinchu 300044, Taiwan
2
School of Mechatronical Engineering and Automation, Foshan University, Foshan 528000, China
3
Department of International Logistics and Transportation Management, Kainan University, Taoyuan 33857, Taiwan
*
Author to whom correspondence should be addressed.
Appl. Sci. 2022, 12(14), 6858; https://doi.org/10.3390/app12146858
Submission received: 29 May 2022 / Revised: 5 July 2022 / Accepted: 5 July 2022 / Published: 6 July 2022
(This article belongs to the Special Issue Smart Manufacturing Networks for Industry 4.0)

Abstract

:
Social networks have increasingly become important and popular in modern times. Moreover, the influence of social networks plays a vital role in various organizations, including government organizations, academic research organizations and corporate organizations. Therefore, strategizing the optimal propagation strategy in social networks has also become more important. Increasing the precision of evaluating the propagation probability of social networks can indirectly influence the investment of cost, manpower and time for information propagation to achieve the best return. This study proposes a new algorithm, which includes a scale-free network, Barabási–Albert model, binary-addition tree (BAT) algorithm, PageRank algorithm, Personalized PageRank algorithm and a new BAT algorithm to calculate the propagation probability of social networks. The results obtained after implementing the simulation experiment of social network models show that the studied model and the proposed algorithm provide an effective method to increase the efficiency of information propagation in social networks. In this way, the maximum propagation efficiency is achieved with the minimum investment.

1. Introduction

With the commercial effect of viral marketing and word-of-mouth marketing, social networks are regarded as a medium for propagating information, ideas and influence between nodes [1,2,3,4]. The propagated influence of information propagation in social networks, which use the behavior of user information propagation in social networks to evaluate propagated influence and find the key nodes in order to achieve the greatest influence on social networks during the propagation of these nodes [1,5], is a hot research area [1,5,6,7,8,9]. In other words, investors focus on the high connectivity and high propagation of social networks and accurately invest information in the source nodes of propagation rather than invest many resources in many nodes. The use of social network propagation achieves the purpose of information propagation, which is also an advantage of social networks [10].
In fact, people’s lives and sources of information are largely affected by social networks [11], so understanding how information propagates within social networks is crucial for designing effective promotion strategies and preventing the propagation of malicious information. It is found social networks are scale-free networks in the real world [12,13], and the degree distribution of scale-free networks follows the power law distribution [14]. Many practical and important networks—such as the World Wide Web, power grids, etc.—are scale-free networks, which share information between nodes in the network through network connections [14,15].
In the era of social networking, such networks have both flaws and virtues [16]. The benefits include the rapid transmission of information [16] and the development of e-commerce platforms [17], interpersonal relationships [17], etc., while there are also many disadvantages, such as internet viruses [18], rumors [19], etc. The effective promotion of policies, placement of commercial advertisements and prevention of the spread of malicious information in social networks are important topics in social network analysis. For example, if a node with a high probability of spreading is identified when the expected spread is large, better spread can be achieved by placing advertisements on this node. On the contrary, when misinformation spreads among social networks, identifying nodes with a higher probability of spreading in the early stage of dissemination can effectively prevent the dissemination of information in a timely and effective manner. By predicting the source node with the best communication ability in the social network, providing them with information and asking them to use their own influence to publicize that information, a series of communication effects can be triggered.
Therefore, the motivation of this research is to propose a new method for the prediction and modeling of a social network’s propagation probability so as to quickly determine the nodes with greater propagation influence in social networks and their propagation probabilities. In addition, the personalization vector in the Personalized PageRank is used to adjust the distribution of node propagation probability in the network, and the influence of user preference on the propagation probability is analyzed.
Much research has been devoted to modeling the propagation process in social networks [20,21,22,23,24]. Among the different models of scale-free networks, the Barabási–Albert model proposed by Barabási and Albert is more common and has many applications [15]. Therefore, this study adopts the Barabási–Albert model to simulate propagation in social networks and generate test data to verify the performance of the proposed algorithm.
The method of measuring the probability of propagation in social networks will vary depending on the focus of research. The most intuitive method is to use the number of node connections with a high probability of page appearance when the number of node connections is large, which is called in-degree [25,26]. However, using this method will lead to a large number of spam links intentionally generated by people in order to increase the probability of the page. The TwitterRank algorithm assigns different propagation weights for Twitter users to track the similarity of different specific topics and web page structures, while the probability will vary with the posting rate of the post [27]. The PageRank algorithm is based on web page links; a page’s PageRank value represents random clicks on the page as the probability distribution of reaching any particular node, and the source of its value is based on the PageRank values of all other pages linked to it [28]. This study adopts the PageRank algorithm originally used by Google to analyze the relevance and importance of web pages and uses the PageRank value to measure the importance of nodes to convert it into an appropriate propagation probability. However, studies in [29,30,31] pointed out that users of social networks have different topic preferences, resulting in different topics with different propagation effects. Therefore, in order to analyze the influence of different user preferences on propagation probability, this study also uses Personalized PageRank to measure the different node weights due to different node themes [32,33,34].
There are many methods which can be used to search for the propagation path of social networks. Clauset et al. proposed that the hierarchical structure can be inferred from the network and applied to solve the path prediction problem in social networks [35]. Backstrom and Leskovec proposed a prediction algorithm based on a supervised random propagation path; the random path is more convincing because the transfer probability between nodes is different. However, this has not been applied in practice because the model requires significant computing time [36]. Whether based on random paths or fully supervised, these methods are generally complex and difficult-to-understand and are more suitable for problems with a fixed number of sink nodes. However, the propagation process in the social networks studied in this work does not have a fixed sink node, so the propagation problem in social networks to be predicted in this study requires an easy-to-understand new method to predict its propagation path and propagation probability.
This study adopts a binary-addition tree algorithm (BAT) to predict propagation in social networks [37]. BAT algorithms are similar to depth-first search (DFS) [38] and breadth-first search (BFS) [39] algorithms. However, BAT algorithms–in addition to being easy to understand and program, have the main advantage of flexibly in searching all propagation states in the binary network, which makes path exhaustion simpler and easier to change for different applications [37].
This study aims to propose a new method to calculate the probability of propagation areas in social networks and to quickly identify nodes in the network that require close attention. In addition to adopting the traditional PageRank algorithm and the Personalized PageRank algorithm, this study also proposes a novel state concept and a new method based on the combination of a BAT algorithm and the PageRank algorithm to predict the area and probability of possible impacts of propagation in the modeled social network and perform social network analysis (SNA).
Hence, the manuscript’s value is emphasized in following list:
  • Firstly, we prove that the social network is a scale-free network and use the Barabási–Albert model to construct a prediction model.
  • After enumerating all possible propagation states through the BAT algorithm, the PageRank algorithm is used to calculate the occurrence probability and propagation probability of each social network node, and the Personalized PageRank algorithm is used to meet the user’s preference.
  • Finally, a new BAT algorithm is used to calculate the probability prediction values of all nodes to different propagation ranges.
  • Furthermore, we demonstrate that the maximum propagation state PageRank value proposed in this study can be used as an effective tool for identifying propagation probability and illustrate the influence of user preference topicality on propagation probability.
  • Overall, the value of this prediction model is in providing a new method for predicting social network dissemination, in helping social networks create effective information dissemination effects and in achieving maximum dissemination efficiency with the smallest delivery cost in order to increase the influence of network nodes.
The structure of the remainder of this study is as follows. Section 2 introduces the social network, its propagation model, the scale-free network, the Barabási–Albert model, the PageRank algorithm, the Personalized PageRank algorithm and the BAT algorithm. Section 3 shows the research method and proposes the propagation status in social networks, demonstrates how to numerically present the propagation path and the calculation of the propagation probability, and combines the BAT algorithm with the PageRank algorithm to propose a new BAT algorithm for this study. Propagation in social networks is simulated, and the experimental results are presented to assist the analysis in Section 4. Finally, we summarize the research contributions and identify future work in Section 5.

2. Preliminaries

This study uses the Barabási–Albert model to model the scale-free problem of propagation probability in social networks and adopts the modified BAT algorithm combined with the PageRank algorithm to make predictions. Before proceeding to research methods, this section introduces the background of social networks, scale-free networks, the Barabási–Albert model, the PageRank algorithm and the BAT algorithm.

2.1. Social Networks

The term “social network” was proposed by Barnes in 1954 [24]; it is used to describe a community structure of the relationship between members in a complex social system and can be presented graphically. Theoretically, a social network should contain two elements: nodes and edges (connection relationships) [23]; the former can represent organizations, communities, individuals, etc., and the latter can represent relationships such as friendships, business partners, etc. In this study, all web pages on the Internet, such as social platforms or fan pages, are regarded as nodes, and hyperlinks are regarded as edges [40].
Due to the availability of large datasets and multidomain applications, social networks have been rapidly developed and applied in many fields in the past decade [41,42,43,44,45,46]. The customer data of social networks are used to discover social network groups and develop a comprehensive algorithm to enhance the model of the advertising system [41]. Travelers use online social networks to maintain personal information, and backend systems analyze and record the traveler’s location to provide the best itinerary according to the route and traveler’s preferences [42]. By analyzing the social behavior of social structure nodes, the social attributes of the social network are mined to provide humanized services [43]. Considering the social network in the driving environment, the structure of the video sensor node (VSN) and communication architecture is described to simulate the similar goals of vehicles, passengers and drivers on the road in order to to contribute to future intelligent traffic control [44].
A large number of studies have found that the real social network is a network which appears to be random but actually follows the degree distribution of nodes, which presents a power distribution [47,48,49]. In research [11], a propagation tree of 227 actual blog page nodes is presented. The propagation tree takes the source node as the center point. Nodes with fewer than four propagation layers have an average branch count greater than 1, while the number of branches for nodes with higher propagation layers is almost 1, as shown in Figure 1.
Coincidentally, in order to present the backend computing system of its own platform and prove that it has 2.2 billion monthly active users, Facebook presents all Facebook social platform users with blue dots on the black layer and connects the relationship between users with blue lines, which is drawn in Figure 2. An interesting note is that is that the black layer gradually takes on the shape of the world map. Through Figure 2, we can know the actual communication characteristics of the social network.
In the branch structure of nodes of the propagation tree in Figure 1 and Figure 2, it is found that the propagation path of the social network is centered on the source node. Most nodes in the network are only connected to a few key nodes. Relatively speaking, there are very few key nodes connected to more nodes in the network. A network with this feature is a scale-free network, proposed by Barabási in 1999 [15], and a few key nodes are called a hub.
Many large and important networks are scale-free networks, such as the World Wide Web (WWW) [15], the Internet [50], and the metabolic network [51], for which the degree distribution of the network follows the power distribution. The social network is one of the applications of the WWW [52], and many studies have proved that the social network is a scale-free network [15,53,54,55].
The degree distribution of a scale-free network is a power distribution [15], and most nodes are only connected to the hub node so that the hub node can master the overall propagation of the social network [56]. Moreno and Vazquez, in their research on whether the proportion of immune nodes on scale-free network can eradicate epidemic viruses, believe that this approach would be better than randomly picking nodes across the network to make them immune if the hub nodes on a scale-free network have immune effects against epidemics [48]. Therefore, mastering hub nodes is also very important in the prediction of propagation in social networks.

2.2. Scale-Free Networks and the Barabási–Albert Model

As mentioned in Section 2.1, many scholars have proved that social networks are scale-free networks. In a scale-free network, when a node has k connections, their degree distribution follows a power distribution [15], as in Equation (1).
P(k) ~ k−η
The power distribution has a higher incidence when the value of k is small, and occurrences are very rare when its value grows [13,55]. In the power distribution function, even if the value of k only increases by a small amount, the effect on P(k) can be affected by the power series; when the power is negative, that is k−η as shown in Figure 3.
The scale-free web following a power distribution was discovered by Barabási and Albert in their collaborative study of the WWW in 1998. The WWW consists of several highly connected pages. Most of these pages have no more than 4 hyperlinks, but very few pages have thousands of links [15]. Social networks in the real world will grow in size by adding new nodes, which start from a small source node. By adding new nodes, the number of nodes increases regularly throughout the network [13,14,15,57,58]. For example, the WWW grows exponentially with the addition of new pages, and research citations grow with the publication of new papers, which Barabási and Albert call the scale-free network [15].
In networks in the real world, newly added nodes are generally connected to the nodes with more connections in the original network; therefore, this phenomenon is called “rich-get-richer” [58,59,60,61]. For example, new webpages tend to link to already-popular webpages such as the homepage of the website because such webpages are easy to find and widely known. The generation of new nodes in the network and the preferential connection of nodes with more connections in the original network prompted Barabási and Albert to propose the Barabási–Albert model in 1999 [15].
The algorithm of the Barabási–Albert model is based on two modes [59,61], as follows:
  • Growth mode: The network in the real world continues to expand and grow, such as by adding new friends on social platforms or creating new pages on the Internet.
  • Preferential connection mode: New nodes tend to connect to nodes with more connections in the original network when they join. When selecting a new node to connect to, it is assumed that there are n nodes, where n = 0, 1, 2, …, (n − 1); the probability of the new node connecting to node i, namely Pi, depends on the degree of node i, namely ki, as in Equation (2).
P i = k i j = 0 n 1 k j
To understand how scale-free networks propagate, Barabási and Albert presented a simple process for generating scale-free networks in Figure 4 [61]. Starting from three connected nodes (t = 1), the original nodes in the layer are represented by solid lines, and the newly added nodes are represented by hollow circles.
Due to the growth mode and priority connection mode, the newly added nodes prefer to connect to the original nodes with more connections, leading to the natural output of several highly connected nodes, which means that the probability of selecting a new node is proportional to the number of node connections.
A typical feature of scale-free networks is that the node degree obeys a power distribution. Most nodes in the network are only connected to a few nodes, and very few nodes are connected to most nodes; such key nodes are called hubs. The existence of such nodes makes the scale-free network more resistant to sudden failures, but it is also vulnerable to collective attacks [15], as shown in Figure 5. From Equation (2), it can be seen that most nodes are non-hub nodes and that the degree of connection ki is very small in the scale-free network. Based on the preferential connection mode, some highly numbered non-hub nodes can be gradually transformed into hubs, as new nodes are more likely to be connected to it [15,57,58,59,60,61]. In other words, all nodes approximately follow a power distribution [14,54].
Among different models of scale-free networks, the Barabási–Albert model proposed by Barabási and Albert is the most common and has many applications for scale-free networks [15]. Therefore, this study uses the Barabási–Albert model to construct a scale-free network to predict the propagation probability of social networks.

2.3. PageRank Algorithm

The PageRank algorithm is an algorithm previously invented by Google Inc. to rank the importance of pages in search results by their company’s search engine [62,63]. The PageRank value refers to the possibility of a web page being seen. Each web page has an individual PageRank value, which depends on the link relationship between web pages. A higher PageRank value means that a web page is more popular [26,28,62,63,64].

2.3.1. PageRank

The PageRank algorithm considers the connection structure of a network to measure probability, and its complete definition is as Equation (3) [28]. Equation (3) implements the idea of the PageRank value; that is, the PageRank value of each web page is positively correlated with the PageRank value of web pages linked to the web page [28,62]. Hence, the more pages that are linked to a page, the higher its PageRank value can be. A website with high PageRank value can more easily obtain an even higher PageRank value. The calculation process is shown in Table 1 [65].
P R ( u ) = ( 1 d ) + d v B ( u ) P R ( v ) N v
where u: web page. B(u): a set of pages connecting u. PR(u): the PageRank value of page u. PR(v): the PageRank value of page v. Nv: the number of outgoing links from page v. d: a damping factor between 0 and 1, with a default value of 0.85. When d = 0.85, there is an 85% probability that Internet users will continue to the next page after visiting a particular page.
The PageRank algorithm iteratively updates a node’s PageRank value by accumulating the PageRank value of each node connected to it, then dividing by the number of the referencing node. First, initialize all PageRank values with the same weight, then update the PageRank values of all nodes with Equation (3). The Nv = 0 in step P2 means that the number of outgoing links from page v is 0, such as going to the page directly through the URL. At this time, the PageRank value of other nodes is evenly distributed. The update is repeated until the PageRank value converges to a unique constant.

2.3.2. Personalized PageRank

The PageRank algorithm introduces the concept of random viewers into the calculation process, which gives equal weight to all pages of incoming links. In other words, the PageRank algorithm only uses the link structure of network and cannot judge the similarity in the content of web pages. In addition, the PageRank algorithm distributes the weights evenly according to the number of outgoing links so that the pages that are not related to the theme receive the same attention as the pages that are related to the theme, resulting in a theme shift [66]. No matter how high the PageRank value is, if the page is off-topic, the PageRank value returned to the user has little value [34,66,67].
In order to solve the problem of topic shift in PageRank algorithm, Page et al. proposed a Personalized PageRank method to measure the different importance of nodes [68]. Personalized PageRank affects the assigned weight of each node in the random walk and uses the personalized vector, namely ru, to bias the random walk process to a specific node [32,33,34,67,69,70,71,72,73], as shown in Equation (4).
P R ( u ) = ( 1 d ) r u + d v B ( u ) P R ( v ) N v
where u: web page. B(u): a set of pages connecting u. PR(u): the PageRank value of page u. PR(v): the PageRank value of page v. Nv: the number of outgoing links from page v. d: a damping factor between 0 and 1. i: the preferred target page, when u = i, ru = 1; otherwise, ru = 0.
From Equations (3) and (4), it can be found that the difference between Personalized PageRank and PageRank is that the personalized vector in PageRank moves to any random node with the same probability, while Personalized PageRank adjusts the personalized vector so that random viewers always move to a node or set of nodes that they are interested in rather than from all nodes in the network to any node. If we want random viewers to move to a specific node, such as page i, the personalized vector is 1—that is, ru = 1—and the personalized vectors of the remaining nodes are assigned 0 [32,33,34].
Personalized PageRank can send random viewers to nodes related to preferences according to user-specified preferences, change node weights [33], and improve the problem of PageRank and topic bias. This study uses Personalized PageRank as a way to adjust the topicality of nodes and explore the impact of user preferences on the probability of propagation.

2.4. BAT Algorithm

A binary state network, which indicates that the element state in the network is only two-state (success or failure), is the most basic type of network and is widely used in the path architecture of various fields, including transmission [74,75], network [76,77,78], data analysis [79] and others [80,81,82,83,84]. Therefore, in recent years, research on binary state networks has been carried out and applied to all the above systems. With the growing scale of networks, a simpler and more efficient algorithm is needed to calculate the reliability of binary state networks to evaluate the performance and stability of networks.
The BAT algorithm was developed by Yeh in 2021 [37], which is an exhaustive method to find all possible state vectors and proposed a path-based hierarchical search algorithm to filter all connection vectors. This algorithm has been used in network reliability [37,85], network resilience [86], wildfire propagation path prediction [87], and others [88,89,90,91,92,93,94,95,96]. Compared with other path finding algorithms, such as Minimal Path algorithm (MP) or Minimal Cut algorithm (MC), the time complexity of BAT algorithm is the most efficient in reliability calculation of binary state network [37]. In other exhaustive algorithms, the experimental results confirm that the computer memory and efficiency required by the BAT algorithm are also better than the common depth-first search algorithm and breadth-first search algorithm [37].
The BAT algorithm uses simple binary addition to generate all possible state vectors. Since it is a binary state, the value of any coordinate in the state vector is 0 or 1. In terms of social network propagation, when the status is 1, it means that the linked page has been propagated; when the status is 0, it means that the linked page has not been propagated.
Suppose a binary state network graph G(V, E) has m coordinates, and each edge has only two states, i.e., 0 or 1. V = { 0, 1, 2, …, n − 1}, Xk is the kth obtained state vector, and the ith coordinate X(ai) is the state of one of the edges ai, where aiE. The complete BAT algorithm is shown in Table 2 [37].
As shown in Step B0, all coordinates start with a zero vector. The binary number of Xk is updated by adding 1, as listed in Steps B1 to B3. The exhaustive process can be stopped by repeating the above steps until all coordinates are equal to 1, i.e., SUM = m.
The BAT algorithm proposes a more efficient, easy-to-understand, easy-to-program and easy-to-modify path search algorithm for binary state networks. Therefore, we apply the BAT algorithm to the problem formulated in this study.

3. Research Methods

The main purpose of this study is to predict the propagation probability of social networks in different propagation ranges and identify the key nodes in the network, which enables the attainment of the maximum propagation benefits in a social network through small investment costs. In this study, the spread of propagation paths in social networks is considered, and its final destination and number of destinations are unknown. Thus, it is difficult to estimate what might occur compared to regular network problems. This section focuses on the state presentation of network propagation and the application of algorithms and establishes a series of research methods.

3.1. Mathematics Notations

G(V, E)Graph of binary state network;
VThe total number of nodes included in the network graph, V = {0, 1, 2, …, n − 1};
EThe total number of edges included in the network graph;
i Node, iV;
V(i)The set of adjacent points of node i;
Deg(i)The total number of neighbors of node i;
PR(i)PageRank value of node i;
Sk (i)The kth state of node i, k = 0, 1, …, 2|Deg(i)| − 1;
XState vector of propagation path;
X(i)The state of node i;
PR(Sk(i))PageRank value of state Sk (i);
Pr(Sk(i))The probability of occurrence of state Sk (i);
C(i)Total number of state combinations for node i, C(i) = 2|Deg(i)|;
PRmax(i)The PageRank value of maximum propagation state for node i;
NpageAt least the number of nodes to be propagated including the propagation source node;
Pr (s, Npage) = RProbability of propagating to Npage nodes starting from s;
j Newly propagated node;
sThe source node of the propagation;
TlThe set of nodes that have propagated at state stage l;
T*The set of nodes to be propagated at state stage l;
PlPropagation probability for nodes that have already propagated (Tl);
RlThe newly added path probability at state stage l.

3.2. Propagation Status

The node status of a social network is divided into two types: propagated and not propagated. This section uses a bridge network as an example, as shown in Figure 6. Each node i belongs to the node set V of the network graph, i.e., iV, and each node can be regarded as a user or page that is propagated in the social network.
Node 0 connects node 1 and node 2, i.e., Deg(i) = 2 and V(i) = {2, 1}. The propagation path of node 0 includes propagation to Ø, {1}, {2} and {1,2}, respectively, representing that information is propagated from node 0 but not propagated; information is propagated from node 0 and is not propagated to other nodes after being propagated to node 1; information is propagated from node 0 and is not propagated to other nodes after being propagated to node 2; information is propagated from node 0 and propagates to node 1 and node 2 and then stops propagating.
The PageRank value of node i: PR(i) is obtained using the PageRank algorithm in Section 2.3. Because the PageRank value represents the probability of the node appearing in the network, all PageRank values in the network add up to 1. Table 3 lists the total number of neighbors, the set of neighbors and the PageRank value of node i in the network in Figure 6.

3.2.1. State Labels and State Vectors

Let Sk (i) be the kth state of node i, which represents the different propagation states of nodes and also known as state labels. Because BAT algorithm representations are all binary—and in order to successfully enumerate all node states without repetition—the binary state labels are converted to node state labels according to the rules in Table 4 [77].
Taking the network G(V, E) in Figure 6 as an example, enumerate all state vectors of node 0 by BAT algorithm. According to rule 1, the number of bits of the state value of node 0 is 2. According to rule 2, the V(0) equals {2, 1} by descending arrangement. According to rule 3, the state labels exhaustively enumerated by the BAT algorithm can be represented by binary state labels as 00, 01, 10 and 11, respectively. That is, S0(0) = Ø, S1(0) = {1}, S2 (0) = {2}, S3(0) = {1, 2}. Hence, it can be seen that node 0 is propagated to Ø, {1}, {2} and {1, 2}. Table 5 lists the node state labels and binary state labels of the example network.
In order to record the propagated path of a social network, this study uses X as the state vector to record the node state label of propagation. The state vector is presented with a diagonal line to identify the propagation state, the right side of diagonal line is the propagation node, and the left side of the diagonal line is the current state of propagation node. Taking X = (2/1) of the network G(V, E) in Figure 6 as an example, it represents state 2 of node 1, which is transferred to node 2, as can be found from Table 5. If X = (2/0, 4/2, 0/3), it means that the propagation starts from node 0, it propagates to node 2 after state 2 of node 0 occurs, then state 4 of node 2 occurs after passing to node 2. and the propagation is passed to node 3—that is, the sink node—and node 3 is in state 0 at this time, indicating that propagation has stopped.
In addition to recording the propagated path in social networks, the source node and sink node can also be clearly determined from the state vector. Exhausting the propagation paths in the form of state vectors can reduce the repetition of the propagation paths on one hand and have a comprehensive understanding of the propagation paths is on the other hand.

3.2.2. State Probability and Propagation Probability

In Section 2, it is mentioned that the social network is a scale-free network; that is, it conforms to the Barabási–Albert model with a growth mode and a preferential connection mode. In preferential connection mode, the probability that a new node can be found to connect to node i is proportional to the node’s connectivity ki, so the connection probability of a web page is particularly important. Using the PageRank value introduced in Section 2.3 as probability, the PR(Sk (i)) is defined as the probability of Sk (i), called state probability. When this node occurs in state Sk (i), define Pr(Sk (i)) as the probability of occurrence of state Sk (i). In other words, Pr(Sk (i)) can be regarded as the probability of node propagation. PR(Sk (i)) is to calculate the connection probability obtained by a node according to the PageRank values of all nodes in the state corresponding to node i [82], as shown in Equation (5).
P R ( S k ( i ) ) = | S j ( i ) | = 1 and S j ( i ) S k ( i ) P R ( S j ( i ) )
Because all possible state probabilities must sum to 1, the PageRank value is normalized as in Equation (6).
P r ( S k ( i ) ) = P R ( S k ( i ) ) k = 0 2 | Deg ( i ) | 1 P R ( S k ( i ) )
In all nodes, the probability of state 0 is non-transitive. From the preferential connection mode in the Barabási–Albert model, a newly added node can freely connect to any node in the network, and the probability of connecting to two nodes is twice as high as connecting to one node [82]. Hence, the probability of state 0 is defined as Equation (7).
PR(S0(u)) = 0.5 × Min{PR(i), iV(u)}
Taking node 1 of the network in Figure 6 as an example, the state probability and propagation probability are shown in Table 6.
According to the above method, the state probability and the propagation probability of the network in Figure 6 are obtained as shown in Table 7 and Table 8.

3.2.3. PageRank Value of Maximum Propagation State

When a node reaches the maximum propagation state, it means that the node propagates to the subset of all possible nodes of V(i) at the same time. Because the propagation state of each node is obtained by the BAT algorithm, propagation stops when all state vector coordinates are 1 during the exhaustive process so that the maximum propagation state of each node is the last state vector in the process of BAT algorithm. For example, the maximum state of node 0 is to propagate to node 1 and node 2 at the same time; that is, the binary state label is [1, 1], and C(i) is used to present the total number of state combinations. Because it is a binary network model, C(i) = 2|Deg(i)|, and the value of k in the state label Sk(i) is encoded from 1 so that k = 2|Deg(i)| − 1 for the maximum propagation state.
The PageRank value of each node represents the importance of the corresponding page, and a page linked to by many pages will have a higher PageRank value. When the node propagation state reaches the maximum state, it propagates to all adjacent nodes at the same time. At this time, let PRmax(i) be the PageRank value of the maximum propagation state. Each node has a maximum propagation state. Taking the network in Figure 6 as an example, the maximum propagation state of node 1 is to propagate to node 0, node 2 and node 3 at the same time. At this time, the PageRank value of the maximum propagation state of node 1 is the sum of the PageRank values of node 0, node 2 and node 3 [73]; that is, PRmax(1) = 0.204787 + 0.295213 + 0.204787 = 0.704787. Table 9 shows the set of adjacent points, the total number of state combinations, the PageRank value and the PageRank value of maximum propagation state for each node of the network in Figure 6.

3.3. A New BAT Algorithm

The BAT algorithm can satisfy most of the networks with known conditions, such as the path exhaustion of a network with one source node and one sink node, one source node with multiple sink nodes, or multiple source nodes to multiple sink nodes [80]. However, the propagation of social networks involves a source node and an unknown number of sink nodes. In addition to the unknown number of sink nodes, the node propagation state is also unknown. In order to predict the actual propagation situation of a social network and realize a path search by combining different states, the traditional BAT algorithm is modified to increase the suitability of the BAT algorithm to the practical problems discussed in this study. Assuming that information is propagated from node s, the purpose is to find all possible situations and the corresponding probabilities that the propagation of social network propagates to at least Npage pages. Table 10 shows the process of the new BAT algorithm.
As shown in Step A0, the initial state vector includes the first node to start propagation, thus, its state starts from X(i) = 1. The other nodes that are not the initial propagation node must start from state 1, as shown in Step A5. At this time, Step A1 determines whether the node has reached the maximum state. If the node reaches its maximum state, the algorithm must replace the node or stop searching, as shown in Step A8. If the node does not reach its maximum state, then go to Step A2 and select the unconnected node in the adjacent node set as the node set to be propagated. Step A3 determines whether the node set to be propagated is an empty set. If it is an empty set, go to Step A6 to update the node or go to Step A7 to add the node state. If Step A3 judges that the node set to be propagated has newly added nodes, then go to Step A4 to judge whether it matches the number of Npage. If it matches the number of Npage, add its probability Rl to the required R, then go to Step A7 to increase the state and continue to search for other propagation states. If the node has reached the maximum state and no other nodes can be updated, stop searching, as shown in Step A8.
For example, the propagation probability of propagating at least 2 pages from node 0 is Pr (s = 0, Npage = 2). Based on step A2, node 0 has propagated to adjacent node 1, i.e., |T*Tl| ≥ 2, which means that this state is guaranteed to be propagated to at least 2 nodes, and we can add the propagation probability of this step to Pr (s = 0, Npage = 2), as shown in step A4. Then, increase the original current state of X(i) = 1 by 1 and continue to search for other possible propagation nodes, as shown in step A7. At this time, if Npage = 3, the node should be updated to the next node in Tl, namely node 1, as shown in step A6. At this point, the state of node 1 will start from X(i) = 0 and continue to search for other possible propagation nodes, as shown in step A5. The above analysis is based on steps A4 and A7, which are similar to the steps of the traditional BAT.
Table 11 and Table 12 show the calculation process that starts with node 1 and propagates to Npage = 2 and Npage = 3, respectively.
In Table 11, node 0 is used as the source node for propagation. To calculate Npage = 2, it is only necessary to to propagate to 1 node. First, node 0 starts to search for a path from state 1; that is, S1(0). At this time, according to step A4, it is judged that the propagation of node 0 to node 1 has satisfied |T*Tl| ≥ 2, so the probability of this path (R0) can be added to Pr (s = 0, Npage = 2), and the state of node 1 is added by 1 according to step A7; that is, S2(0). The propagation of node 0 to node 2 also satisfies | T*Tl | ≥ 2 so the probability of this path (R1) can be added to Pr (s = 0, Npage = 2) and the state of node 1 is added by 1 and S3(0) can be calculated in the same way. When node 0 reaches state 4, it has exceeded X(i) = 2|Deg(i)| − 1; that is, the maximum propagation state. Jump to step A8 according to step A2. If there are no other nodes in Tl that can be searched, stop searching. At this time, R = Pr (s = 0, Npage = 2) = R0 + R1 + R2 is the final propagation probability.
In Table 12, when Npage = 3 is calculated with node 0 as the source node for propagation—except for state S3(0)—the remaining two states, namely S1(0) and S2(0), must be reconnected to other nodes to reach Npage = 3. When l = 0, node 0 starts from state 1 as step A0; that is, it propagates to node 1. At this time, | T*Tl | is not greater than or equal to Npage = 3, so the probability R0 at this time cannot be added to the probability R and enter node 1 to find other connection nodes as shown in step A6. When l = 1, node 1 starts from state S0(1), S0(1) is a new empty set and does not satisfy |T*Tl| ≥ 3, so probability R1 cannot be added to probability R. Similarly, l = 2. When l = 3, node 1 reaches state S2(1); that is, it propagates to node 2. At this time, T* adds a new node 2, which satisfies | T*Tl |≧ 3, so the probability R3 is added to the probability R = Pr (s = 0, Npage = 3). The state of node 1 is increased by 1—that is, S3(1)—and continues to find a feasible path, as shown in steps A4 and A7. When node 1 reaches state 7, that is, S7(1), X(i) = 2|Deg(i)| − 1 means node i has reached its maximum state at this time. After calculating R8, find the previous node in Tl to update i; that is, node 0. This means that the path under the premise of S1(0) has been searched as shown in step A8. After updating node i to be node 0, add 1 to its state; that is, S2(0) = {2}. The steps of subsequent state increase and screening probability are the same as S1(0). Until there is no previous node in Tl that can be updated and i also reaches the maximum state, stop finding the point. At this time, R = Pr (s = 0, Npage = 3) is the final propagation probability.

4. Simulation Experiment

This study simulates the propagation in a social network and presents the research results in Section 4. The proposed algorithms in this study are all coded in Python with version 3.6, and the simulation experiments are all executed on Spyder using a Windows 10 laptop with an Intel Core i5-8250U CPU (1.60 GHz, 8 GB RAM).

4.1. Model Introduction

NetworkX is a Python package that can randomly generate different types of networks and can also be used to analyze network structures, build network models or draw networks. Using NetworkX to generate Barabási–Albert models with different numbers of nodes can ensure that the network model conforms to the power distribution and preferential connection mode [97], as shown in Figure 7 and Figure 8.
A Barabási–Albert model with 8 nodes and 12 edges is randomly generated by NetworkX in Python as shown in Figure 7. Its degree distribution conforms to the power distribution as shown in Figure 8.
Figure 7 shows the network graph used to simulate the scale-free network in this study, which contains 8 nodes and 12 edges where i = 0, 1,…, 7 and iV. In the experiment, we assume that the source node of propagation starts from i = 0, 1, …, 7, respectively, and consider all the propagation probabilities to be Npage = 1, 2, …, 8. Therefore, this experiment contains a total of 64 experimental results.
Due to the influence of user preference on the propagation probability [29,30,31], the assigned weight of each node in the random walk varies according to user preference on the web page, as shown in Section 2.3.2. For example, an advertiser plans to advertise a food. One of the nodes in the network of Figure 7 is a food-related website. At this time, the personalization vector of this node is ru = 1, and the personalization vectors of the other nodes are ru = 0. Thus, the model performs an experiment of assigning weights to three different nodes as follows.
Case 1:
All nodes are assigned the same weight; that is, the traditional PageRank is used to calculate the page probability, and the propagation probability is Pr(s, Npage) = R1.
Case 2:
Set node 3, for which the total number of original adjacent nodes is the highest, as the preference node and use Personalized PageRank to calculate the probability of the page; that is, r3 = 1, r0 = r1 = r2 = r4 = r5 = r6 = r7 = 0. At this time, the propagation probability Pr(s, Npage) equals R2.
Case 3:
Set node 1, for which the original PageRank value is the lowest, as the preference node and use the personalized PageRank to calculate the probability of the page; that is, r1 = 1, r0 = r2 = r3 = r4 = r5 = r6 = r7 = 0. At this time, the propagation probability Pr(s, Npage) equals R3.
In addition to analyzing and predicting the propagation probability of a social network, this section also changes theme of the node in disguise through the personalized vector. If the node is more in line with the user’s preference, the allocation weight of the node is increased to interfere with the PageRank value. Table 13 presents the total number of adjacent nodes for node i, the set of adjacent nodes, the total number of state combinations, the PageRank value, the PageRank value of maximum propagation state, the Personalized PageRank value and the Personalized PageRank value of the maximum propagation state, which are denoted as Deg(i), V(i), C(i), PR(i), PRmax(i), PPR(i) and PPRmax(i), respectively.

4.2. Results and Discussion

Table 14 shows the probability of propagation to Npage nodes and the running time of Case 1 with node i as the propagation source node.
From Table 14, it can be found that the propagation probability is 100% when Npage is 1 because the propagation source node in a social network starts to propagate itself in each of Case 1, Case 2 and Case 3. That is Pr(i, Npage = 1) = 1. When the propagation range increases—that is, Npage increases—the propagation probability of each node decreases, which means that the propagation probability of reaching more nodes is lower and that the propagation probability is negatively correlated with the propagation range.

4.2.1. Neighbors and Propagation Probability of Node i

From the results of R1 and R2 in Table 14, if the adjacent nodes of two different nodes are the same, the probability of propagating to Npage nodes is the same; that is, Pr(i, Npage) = Pr(j, Npage) if V(i) = V(j). For example, in Figure 7, the adjacent nodes of node 1 and node 4 are node 5 and node 3, that is V(1) = V(4). At this time, the probability of propagating to Npage = 1, 2, …, 8 is all the same that are 1.0000, 0.9079, 0.9007, 0.8878, 0.8536, 0.7997, 0.6704 and 0.3942, respectively. However, in R3, because the personalized vector changes the assigned weight of node 1, the PageRank value of node 1 is increased resulting in different propagation probabilities even if the adjacent nodes of node 1 and node 4 are the same. Therefore, it can be judged that if the adjacent connected nodes are the same and the PageRank values of the nodes are the same, the probability of the final propagation to Npage nodes is the same although the propagation paths are different. And the propagation probability is different if the PageRank values are different due to the intervention of the personalized vector.
The total number of adjacent nodes also affects the propagation probability. It can be seen from Table 14 that when the number of adjacent nodes is propagated to a smaller number of Npage, the node with more total adjacent nodes has a higher propagation probability; on the contrary, when it propagates to a larger number of Npage, the node with fewer total adjacent nodes has a higher propagation probability. For example, the total number of adjacent nodes of node 3 and node 5 are the first and the second, respectively, which are the nodes with the highest total number of adjacent nodes in Figure 7. Whether in Case 1, Case 2 or Case 3, when Npage = 0, 1, …, 4, node 3 and node 5 have higher propagation probabilities than other nodes—especially node 3, which has the largest number of adjacent nodes. However, when Npage = 8, node 3 has the lowest propagation probability compared to other nodes. On the contrary, the total number of adjacent nodes of node 0, node 1, node 4 and node 6 is 2, which is the lowest total number of adjacent nodes in Figure 7. When Npage = 0, 1, …, 7, node 0 has the lowest propagation probability compared to other nodes. However, when Npage =8, the propagation probability of node 0 compared to other nodes: R1 = 0.4056 and R3 = 0.4214 ranks first, and R2 = 0.3947 ranks second. Hence, the node with a larger total number of adjacent nodes can be selected as the source node when a small range must be propagated. However, in the final stage of propagation, nodes with fewer adjacent nodes have a higher probability of propagating to more pages.

4.2.2. PageRank Value and Propagation Probability of Maximum Propagation State

The PageRank value achieving the maximum propagation state means that it will be propagated to all adjacent nodes at the same time if the node is the source node; that is, if the maximum propagation state is reached. The PageRank value reaching the maximum propagation state is also related to the propagation probability. Table 15 shows the ranking of PageRank value of maximum propagation state nodes relative to other nodes in the three cases. Combining the information analysis of Table 14 and Table 15, the following results are obtained.
In terms of a smaller propagation range, the node with a larger PageRank value of maximum propagation state has a higher propagation probability. For the larger propagation range, the node with a smaller PageRank value of maximum propagation state has a higher propagation probability. For example, the PageRank values of the maximum propagation states of node 3 and node 5 are ranked higher than those of other nodes, and the propagation probabilities corresponding to Npage = 1, 2, …, 4 are all among the top. However, the propagation probabilities of node 3, including R1, R2 and R3, are all the lowest when Npage = 8. On the other hand, nodes 0, 6 and 7—with lower PageRank values of maximum propagation state—have higher propagation probabilities when the propagation range is large.
It can be seen from the above results that to prevent the propagation of malicious messages, it is recommended to prioritize the protection of nodes with a higher PageRank value of maximum propagation state at the beginning of the propagation of messages, which can effectively and quickly prevent the propagation of messages.
In order to verify that the conclusions in Section 4.2.1 and Section 4.2.2 are not mode coincidences, several scale-free networks with different numbers of nodes and edges are built with NetworkX. The models and propagation probabilities are attached in Appendix A Table A1. Judging from the propagation probability of other models, if the set of adjacent nodes of a node is the same, the probability of propagating to Npage nodes is the same; when the propagation range is small, nodes with a higher total number of adjacent nodes and a larger PageRank value of maximum propagation state have a higher propagation probability, which is consistent with the above conclusion.

4.2.3. Personalization Vectors and Propagation Probability

In the propagation process of social network, the authority, professionalism, trust and influence of propagation nodes are all related to user preferences, but most influence studies of social networks largely ignore this [31]. In this study, the personalized vector in the Personalized PageRank algorithm is used to intervene in the random viewers in the PageRank algorithm to prefer a specific node in the random walk process so as to increase the distribution weight of the node to represent the user’s preference node or relevance of information propagation.
In Case 2, in which node 3 was originally the node with the highest number of adjacent nodes and the highest PageRank value, we set random viewers to prefer a specific node 3 in the random walk process. From Table 11, it can be found that when the personalized vector r3 = 1 of node 3, PPR(3) = 0.3438 is higher than the original PR(3) = 0.2372. However, in the R2R1 column in Table 14, it can be found that most of the propagation probabilities increase, while the propagation probability of node 3 decreases. The reason for this is that node 3 connects most of the nodes, resulting in it affecting the PageRank value of maximum propagation state of other nodes, causing them to also increase when the PageRank value of node 3 increases, as shown in Table 16.
In Case 3, in which node 1 was originally the node with the lowest number of adjacent nodes and the lowest PageRank value, we set random viewers to prefer a specific node 1 in the random walk process. When the PageRank value of node 1 increases, it affects the PageRank value of maximum propagation state of other nodes. However, because there are not many nodes connected to node 1, the PageRank value of maximum propagation state does not increase much. As shown in Table 17, the number of nodes with increased propagation probability is small. From the R3R1 column in Table 14, it can be found that most of the propagation probabilities decrease, which means that increasing the personalized vector of nodes with fewer connections has no obvious effect on the propagation probability of nodes.
When we bias the random viewers to the nodes with more adjacent nodes in the random walk process, the propagation probability of more nodes can be increased. However, increasing the distribution weight of nodes with fewer adjacent nodes does not have much influence. Therefore, it is assumed that an advertiser plans to place an advertisement and that there are two nodes with the same theme. If a node with a higher number of adjacent nodes is selected, it relatively increases the probability of propagating more pages.

5. Conclusions

The contribution of this research is to propose a new method, which is novel, simple and easy to apply, to calculate the propagation probability of social network propagation to other web pages. After confirming that the social network is a scale-free network, the Barabási–Albert model is used to build a model. Then, the PageRank algorithm and the Personalized PageRank algorithm are adopted to calculate the occurrence probability and propagation probability of each node. After enumerating all possible propagation states of each node using the BAT algorithm, the new BAT algorithm is used to calculate the propagation probability under different propagation ranges.
The new BAT algorithm can calculate the predicted value of Pr (s, Npage) for all nodes i and Npage; that is, the propagation probability of a social network, as shown in Table 12. In addition, the personalized vector in the Personalized PageRank algorithm is used to intervene the preferences of random viewers in the random walk process so that the prediction model is more in line with the actual user situation.
The Barabási–Albert model is used to construct a scale-free network to simulate the spread of social networks. However, the amount of computation will increase exponentially according to the network scale and spread because the degree distribution of the scale-free network follows a power distribution. The limitation of this model is that running time increases with the expansion of the network scale. If it must be applied to a large scale-free network, the GPU requirement must be increased to overcome the computing speed, or it can be applied to a small network such as a company intranet, etc.
The method proposed in this study could be applied in many areas, such as to cybersecurity problems. The proposed scale-free network model, new BAT algorithm, PageRank value of maximum propagation state and personalized vector concept can be extended to other network problems in the future.

Author Contributions

Conceptualization, W.-C.Y., W.Z., C.-L.H., T.-Y.H., Z.L. and S.-Y.T.; methodology, W.-C.Y., W.Z., C.-L.H. and T.-Y.H.; software, W.-C.Y., W.Z., C.-L.H. and T.-Y.H.; validation, W.-C.Y., W.Z., C.-L.H. and T.-Y.H.; formal analysis, W.-C.Y., W.Z., C.-L.H. and T.-Y.H.; investigation, W.-C.Y., W.Z. and C.-L.H.; resources, W.-C.Y., W.Z., C.-L.H. and T.-Y.H.; data curation, W.-C.Y., W.Z., C.-L.H. and T.-Y.H.; writing—original draft preparation, W.-C.Y., W.Z., C.-L.H. and T.-Y.H.; writing—review and editing, W.-C.Y., W.Z. and C.-L.H.; visualization, W.-C.Y. and W.Z.; supervision, W.-C.Y. and W.Z.; project administration, W.-C.Y. and W.Z.; funding acquisition, W.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by National Natural Science Foundation of China (Grant No. 621060482), Research and Development Projects in Key Areas of Guangdong Province (Grant No. 2021B0101410002) and Ministry of Science and Technology, R.O.C (MOST 107-2221-E-007-072-MY3, MOST 110-2221-E-007-107-MY3, MOST 109-2221-E-424-002 and MOST 110-2511-H-130-002).

Acknowledgments

We wish to thank the anonymous editor and the referees for their constructive comments and recommendations, which significantly improved this paper. This article was once submitted to arXiv as a temporary submission that was only for reference and did not provide the copyright.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Propagation probability results for different numbers of nodes and edges.
Table A1. Propagation probability results for different numbers of nodes and edges.
Model 1: 10 nodes and 16 edges.
Applsci 12 06858 i001
Npage12345678910
Node 010.93430.91120.90780.90450.89760.86920.77300.55480.2411
Node 110.98300.97540.97250.96600.94860.89380.75050.49580.2067
Node 210.93350.92640.92390.91770.90350.86100.74110.50650.2279
Node 310.94580.92400.92190.91680.90660.87510.77660.55640.2408
Node 410.90270.90140.89830.89250.87690.82590.70030.46570.2137
Node 510.99840.99400.98890.98230.96040.88930.72560.45990.1879
Node 610.99960.99790.99240.98120.95400.88070.71840.45600.1867
Node 710.90270.90140.89830.89250.87690.82590.70030.46570.2137
Node 810.98300.97540.97250.96600.94860.89380.75050.49580.2067
Node 910.93350.92640.92390.91770.90350.86100.74110.50650.2279
Model 2: 14 nodes and 13 edges.
Applsci 12 06858 i002
Npage12345678910
Node 010.66670.63090.55700.54240.50070.42540.32640.22330.1329
Node 110.66670.63090.55700.54240.50070.42540.32640.22330.1329
Node 210.66670.58360.57970.55880.50890.42730.32470.22060.1307
Node 310.98210.90900.82560.78520.69950.57020.41880.27290.1536
Node 410.95850.87270.85530.80430.70800.57130.41640.26990.1524
Node 510.66670.66390.64640.60020.51910.41270.29500.18860.1050
Node 610.99860.98580.94000.84630.70610.53760.36810.22400.1183
Node 710.95850.87270.85530.80430.70800.57130.41640.26990.1524
Node 810.66670.66390.64640.60020.51910.41270.29500.18860.1050
Node 910.66670.58360.57970.55880.50890.42730.32470.22060.1307
Node 1010.98440.92800.84630.75960.68860.57290.42600.27620.1521
Node 1110.93250.77260.69600.63060.60040.53290.42590.29740.1762
Node 1210.66670.63540.59340.52150.48840.42430.33210.22780.1332
Node 1310.66670.53170.49170.42500.41390.38110.31900.23460.1472
Npage11121314
Node 00.06620.02600.00710.0011
Node 10.06620.02600.00710.0011
Node 20.06640.02720.00800.0012
Node 30.07160.02600.00650.0009
Node 40.07300.02780.00740.0010
Node 50.04970.01890.00520.0008
Node 60.05270.01860.00470.0006
Node 70.07300.02780.00740.0010
Node 80.04970.01890.00520.0008
Node 90.06640.02720.00800.0012
Node 100.06880.02420.00610.0008
Node 110.08520.03180.00830.0011
Node 120.06450.02440.00670.0010
Node 130.07570.03020.00860.0013
Model 3: 20 nodes and 19 edges.
Applsci 12 06858 i003
Npage12345678910
Node 010.99680.97080.89130.80250.74390.68190.59570.49700.3929
Node 110.66670.66020.62130.55390.51080.47880.42690.36330.2949
Node 210.66670.66020.62130.55390.51080.47880.42690.36330.2949
Node 310.66670.66020.62130.55390.51080.47880.42690.36330.2949
Node 410.66670.66020.62130.55390.51080.47880.42690.36330.2949
Node 510.66670.66550.65570.63000.59150.53470.45540.36870.2867
Node 610.66670.66550.65570.63000.59150.53470.45540.36870.2867
Node 710.66670.66550.65570.63000.59150.53470.45540.36870.2867
Node 810.66670.66550.65570.63000.59150.53470.45540.36870.2867
Node 910.99940.99310.96750.91940.84860.74750.62280.49550.3760
Node 1010.92410.90160.82180.74600.71230.65890.58150.48800.3881
Node 1110.96700.80990.59750.54000.52930.50380.46270.40770.3379
Node 1210.66670.60060.42510.36140.35780.34540.32170.28900.2464
Node 1310.66670.60060.42510.36140.35780.34540.32170.28900.2464
Node 1410.97780.94920.87990.81240.73790.66830.59950.51080.4093
Node 1510.93130.77190.73130.68710.61550.56130.52010.46030.3834
Node 1610.66670.52920.49390.47840.42910.38360.36090.32660.2788
Node 1710.97260.85130.70050.64220.58480.54230.50560.45120.3805
Node 1810.66670.61190.48350.44510.40520.36980.35030.31910.2761
Node 1910.66670.61190.48350.44510.40520.36980.35030.31910.2761
Npage11121314151617181920
Node 00.28970.19760.12480.07260.03800.01720.00640.00180.00032.77 × 10−5
Node 10.22440.15760.10230.06150.03360.01600.00630.00190.00043.51 × 10−5
Node 20.22440.15760.10230.06150.03360.01600.00630.00190.00043.51 × 10−5
Node 30.22440.15760.10230.06150.03360.01600.00630.00190.00043.51 × 10−5
Node 40.22440.15760.10230.06150.03360.01600.00630.00190.00043.51 × 10−5
Node 50.21010.14250.08950.05220.02790.01330.00530.00160.00033.05 × 10−5
Node 60.21010.14250.08950.05220.02790.01330.00530.00160.00033.05 × 10−5
Node 70.21010.14250.08950.05220.02790.01330.00530.00160.00033.05 × 10−5
Node 80.21010.14250.08950.05220.02790.01330.00530.00160.00033.05 × 10−5
Node 90.26720.17600.10760.06090.03140.01420.00530.00150.00032.38 × 10−5
Node 100.28930.20140.13130.07900.04260.01970.00740.00210.00043.33 × 10−5
Node 110.25960.18660.12680.08020.04530.02170.00820.00230.00043.59 × 10−5
Node 120.19430.14220.09850.06430.03790.01910.00770.00230.00044.14 × 10−5
Node 130.19430.14220.09850.06430.03790.01910.00770.00230.00044.14 × 10−5
Node 140.30560.21100.13360.07670.03950.01780.00660.00180.00032.95 × 10−5
Node 150.29890.21640.14310.08570.04640.02220.00870.00250.00054.14 × 10−5
Node 160.22270.16600.11340.06990.03900.01950.00810.00250.00054.7 × 10−5
Node 170.29870.21570.14220.08540.04600.02150.00810.00230.00043.52 × 10−5
Node 180.22350.16660.11330.07020.03930.01930.00770.00230.00044.17 × 10−5
Node 190.22350.16660.11330.07020.03930.01930.00770.00230.00044.17 × 10−5

References

  1. Ebrahimi, P.; Basirat, M.; Yousefi, A.; Nekmahmud, M.; Gholampour, A.; Fekete-Farkas, M. Social Networks Marketing and Consumer Purchase Behavior: The Combination of SEM and Unsupervised Machine Learning Approaches. Big Data Cogn. Comput. 2022, 6, 35. [Google Scholar] [CrossRef]
  2. Mallipeddi, R.R.; Kumar, S.; Sriskandarajah, C.; Zhu, Y. A Framework for Analyzing Influencer Marketing in Social Networks: Selection and Scheduling of Influencers. Manag. Sci. 2022, 68, 75–104. [Google Scholar] [CrossRef]
  3. Popova, O.I.; Gagarina, N.M.; Minina, T.B.; Holodilov, A.A. Digital Marketing of Social Networks as a Factor for Sustainable Business Development During the COVID-19 Pandemic. In Advances in Social Science, Education and Humanities Research, Proceedings of the International Scientific and Practical Conference “Sustainable Development of Environment after COVID-19”, Yekaterinburg, Russia, 7–8 December 2021; Atlantis Press: Paris, France, 2022; pp. 249–252. [Google Scholar] [CrossRef]
  4. Sugiyantoro, N.L.A.; Wijaya, M.; Supriyadi, S. Benefits of WhatsApp as a Communication Media on Small Business Social Networks. J. Soc. Media 2022, 6, 1–16. [Google Scholar] [CrossRef]
  5. Luo, J.; Wu, J.; Yang, W. A relationship matrix resolving model for identifying vital nodes based on community in oppor-tunistic social networks. Trans. Emerg. Tel. Tech. 2022, 33, e4389. [Google Scholar]
  6. Zhang, Q.; Li, X.; Fan, Y.; Du, Y. An SEI3R information propagation control algorithm with structural hole and high influential infected nodes in social networks. Eng. Appl. Artif. Intell. 2022, 108, 104573. [Google Scholar]
  7. Bouyer, A.; Beni, H.A. Influence maximization problem by leveraging the local traveling and node labeling method for discovering most influential nodes in social networks. Phys. A Stat. Mech. Its Appl. 2022, 592, 126841. [Google Scholar] [CrossRef]
  8. Bahutair, M.; Al Aghbari, Z.; Kamel, I. NodeRank: Finding influential nodes in social networks based on interests. J. Supercomput. 2021, 78, 2098–2124. [Google Scholar] [CrossRef]
  9. Aghaalizadeh, S.; Afshord, S.T.; Bouyer, A.; Anari, B. Improving the stability of label propagation algorithm by propagating from low-significance nodes for community detection in social networks. Computing 2021, 104, 21–42. [Google Scholar] [CrossRef]
  10. Borgatti, S.P.; Ofem, B. Social network theory and analysis. Soc. Netw. Theory Educ. Chang. 2010, 17, 29. [Google Scholar]
  11. Pei, S.; Muchnik, L.; Tang, S.; Zheng, Z.; Makse, H.A. Exploring the Complex Pattern of Information Spreading in Online Blog Communities. PLoS ONE 2015, 10, e0126894. [Google Scholar] [CrossRef]
  12. Onnela, J.-P.; Saramäki, J.; Hyvönen, J.; Szabó, G.; Lazer, D.; Kaski, K.; Kertész, J.; Barabási, A.-L. Structure and tie strengths in mobile communication networks. Proc. Natl. Acad. Sci. USA 2007, 104, 7332–7336. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Clauset, A.; Shalizi, C.R.; Newman, M.E.J. Power-Law Distributions in Empirical Data. SIAM Rev. 2009, 51, 661–703. [Google Scholar] [CrossRef] [Green Version]
  14. Bornholdt, S.; Ebel, H. World Wide Web scaling exponent from Simon’s 1955 model. Phys. Rev. E 2001, 64, 035104. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Barabaási, A.-L.; Albert, R. Emergence of Scaling in Random Networks. Science 1999, 286, 509–512. [Google Scholar] [CrossRef] [Green Version]
  16. Nadaraja, R.; Yazdanifard, R. Social Media Marketing: Advantages and Disadvantages; Center of Southern New Hampshire University: Manchester, NH, USA, 2013; pp. 1–10. [Google Scholar]
  17. Ellison, N.B.; Steinfield, C.; Lampe, C. The benefits of Facebook “friends:” Social capital and college students’ use of online social network sites. J. Comput.-Mediat. Commun. 2007, 12, 1143–1168. [Google Scholar] [CrossRef] [Green Version]
  18. Sun, M.; Zhang, H.; Kang, H.; Zhu, G.; Fu, X. Epidemic spreading on adaptively weighted scale-free networks. J. Math. Biol. 2016, 74, 1263–1298. [Google Scholar] [CrossRef]
  19. Dong, S.; Deng, Y.-B.; Huang, Y.-C. SEIR Model of Rumor Spreading in Online Social Network with Varying Total Population Size. Commun. Theor. Phys. 2017, 68, 545. [Google Scholar] [CrossRef]
  20. Zhu, Y.-X.; Zhang, X.-G.; Sun, G.-Q.; Tang, M.; Zhou, T.; Zhang, Z.-K. Influence of Reciprocal Links in Social Networks. PLoS ONE 2014, 9, e103007. [Google Scholar] [CrossRef] [Green Version]
  21. Liu, C.; Zhang, Z.K. Information spreading on dynamic social networks. Commun. Nonlinear Sci. Numer. Simul. 2014, 19, 896–904. [Google Scholar] [CrossRef] [Green Version]
  22. Lü, L.; Chen, D.-B.; Zhou, T. The small world yields the most effective information spreading. New J. Phys. 2011, 13, 123005. [Google Scholar] [CrossRef]
  23. Haythommthwaite, K. Characterized Social Networks as Having the Following Components: Actors; Nodes: New York, NY, USA, 2005. [Google Scholar]
  24. Barnes, J.A. Class and Committees in a Norwegian Island Parish. Hum. Relat. 1954, 7, 39–58. [Google Scholar] [CrossRef]
  25. Kempe, D.; Kleinberg, J.; Tardos, É. Maximizing the spread of influence through a social network. In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 24–27 August 2003. [Google Scholar]
  26. Litvak, N.; Scheinhardt, W.R.; Volkovich, Y. In-degree and PageRank: Why do they follow similar power laws? Internet Math. 2007, 4, 175–198. [Google Scholar] [CrossRef]
  27. Weng, J.; Lim, E.P.; Jiang, J.; He, Q. Twitterrank: Finding topic-sensitive influential twitterers. In Proceedings of the Third ACM International Conference on Web Search and Data Mining, New York, NY, USA, 3–6 February 2010. [Google Scholar]
  28. Xing, W.; Ghorbani, A. Weighted pagerank algorithm. In Proceedings of the Second Annual Conference on Communication Networks and Services Research, Fredericton, NB, Canada, 19–21 May 2004. [Google Scholar]
  29. Liu, L.; Tang, J.; Han, J.; Jiang, M.; Yang, S. Mining topic-level influence in heterogeneous networks. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management, Toronto, ON, Canada, 26–30 October 2010. [Google Scholar]
  30. Chen, S.; Fan, J.; Li, G.; Feng, J.; Tan, K.-L.; Tang, J. Online topic-aware influence maximization. Proc. VLDB Endow. 2015, 8, 666–677. [Google Scholar] [CrossRef] [Green Version]
  31. Barbieri, N.; Bonchi, F.; Manco, G. Topic-aware social influence propagation models. Knowl. Inf. Syst. 2013, 37, 555–584. [Google Scholar] [CrossRef]
  32. Agirre, E.; Soroa, A. Personalizing pagerank for word sense disambiguation. In Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009), Athens, Greece, 30 March–3 April 2009. [Google Scholar]
  33. Xie, W.; Bindel, D.; Demers, A.; Gehrke, J. Edge-weighted personalized pagerank: Breaking a decade-old performance barrier. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Sydney, Australia, 10–13 August 2015. [Google Scholar]
  34. Pirouz, M.; Zhan, J. Toward Efficient Hub-Less Real Time Personalized PageRank. IEEE Access 2017, 5, 26364–26375. [Google Scholar] [CrossRef]
  35. Clauset, A.; Moore, C.; Newman, M.E.J. Hierarchical structure and the prediction of missing links in networks. Nature 2008, 453, 98–101. [Google Scholar] [CrossRef]
  36. Backstrom, L.; Leskovec, J. Supervised random walks: Predicting and recommending links in social networks. In Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, Hong Kong, China, 9–12 February 2011. [Google Scholar]
  37. Yeh, W.-C. Novel binary-addition tree algorithm (BAT) for binary-state network reliability problem. Reliab. Eng. Syst. Saf. 2021, 208, 107448. [Google Scholar] [CrossRef]
  38. Tarjan, R. Depth-First Search and Linear Graph Algorithms. SIAM J. Comput. 1972, 1, 146–160. [Google Scholar] [CrossRef]
  39. Bundy, A.; Wallen, L. Breadth-first Search, in Catalogue of Artificial Intelligence Tools; Springer: Berlin/Heidelberg, Germany, 1984; p. 13. [Google Scholar]
  40. Garton, L.; Haythornthwaite, C.; Wellman, B. Studying online social networks. J. Comput.-Mediat. Commun. 1997, 31, JCMC313. [Google Scholar] [CrossRef]
  41. Tripathi, S.; Verma, S. Analysing Technological Dimensions for Engagement with NGOs on Social Media; John Wiley & Sons: Hoboken, NJ, USA, 2017; pp. 1–11. [Google Scholar]
  42. Khoo, D. Travel Planning for Social Networks. Google Patents US20040193488A1, 15 March 2010. [Google Scholar]
  43. Qiu, T.; Chen, B.; Sangaiah, A.K.; Ma, J.; Huang, R. A Survey of Mobile Social Networks: Applications, Social Characteristics, and Challenges. IEEE Syst. J. 2017, 12, 3932–3947. [Google Scholar] [CrossRef]
  44. Rahim, A.; Kong, X.; Xia, F.; Ning, Z.; Ullah, N.; Wang, J.; Das, S.K. Vehicular Social Networks: A survey. Pervasive Mob. Comput. 2018, 43, 96–113. [Google Scholar] [CrossRef]
  45. Alavi, S.S.; Mehdinezhad, I.; Kahshidinia, B.; Jenab, K. A trend study on the impact of social media on advertisement. Int. J. Data Netw. Sci. 2019, 3, 185–200. [Google Scholar] [CrossRef]
  46. Khoo, D. Travel Planning for Social Networks. Google Patents US20060004590A1, 19 December 2013. [Google Scholar]
  47. Pei, S.; Makse, H.A. Spreading dynamics in complex networks. J. Stat. Mech. Theory Exp. 2013, 2013, P12002. [Google Scholar] [CrossRef]
  48. Moreno, Y.; Vazquez, A. Disease spreading in structured scale-free networks. Eur. Phys. J. B-Condens. Matter Complex Syst. 2003, 31, 265–271. [Google Scholar] [CrossRef] [Green Version]
  49. Boguñá, M.; Pastor-Satorras, R.; Vespignani, A. Absence of Epidemic Threshold in Scale-Free Networks with Degree Correlations. Phys. Rev. Lett. 2003, 90, 028701. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. Faloutsos, M. On power-law relationships of the internet topology. ACM SIGCOMM Comput. Commun. Rev. 2011, 29, 195–206. [Google Scholar] [CrossRef]
  51. Albert, R.; Jeong, H.; Barabasi, A. Error and attack tolerance of complex networks. Nature 2000, 406, 378–382. [Google Scholar] [CrossRef] [Green Version]
  52. Blanka, G.; Reisdorf, B. The participatory web: A user perspective on Web 2. Information. Commun. Soc. 2012, 15, 537–554. [Google Scholar] [CrossRef]
  53. Kleinberg, J.M.; Kumar, R.; Raghavan, P.; Rajagopalan, S.; Tomkins, A.S. The Web as a Graph: Measurements, Models, and Methods; Springer: Berlin/Heidelberg, Germany, 1999. [Google Scholar]
  54. Broder, A.; Kumar, R.; Maghoul, F.; Raghavan, P.; Rajagopalan, S.; Stata, R.; Tomkins, A.; Wiener, J. Graph Structure in the Web, in the Structure and Dynamics of Networks; Princeton University Press: Princeton, NJ, USA, 2011; pp. 183–194. [Google Scholar]
  55. Adamic, L.A.; Huberman, B.A. Power-Law Distribution of the World Wide Web. Science 2000, 287, 2115. [Google Scholar] [CrossRef] [Green Version]
  56. Lin, F.R. Contagion Dynamics of Scale-Free Networks with Resource Limitation. Master Thesis, Department of Computer Science, National Chiao Tung University, Hsinchu, Taiwan, 2005. [Google Scholar]
  57. Barabási, A.-L. Scale-Free Networks: A Decade and Beyond. Science 2009, 325, 412–413. [Google Scholar] [CrossRef] [Green Version]
  58. Bollobás, B.; Riordan, O.; Spencer, J.; Tusnády, G. The degree sequence of a scale-free random graph process. Random Struct. Algorithms 2001, 18, 279–290. [Google Scholar] [CrossRef] [Green Version]
  59. Chen, Q.; Shi, D. The modeling of scale-free networks. Phys. A Stat. Mech. Its Appl. 2004, 335, 240–248. [Google Scholar] [CrossRef]
  60. Albert, R.; Barabási, A.-L. Statistical mechanics of complex networks. Rev. Mod. Phys. 2002, 74, 47–97. [Google Scholar] [CrossRef] [Green Version]
  61. Holme, P.; Kim, B.J. Growing scale-free networks with tunable clustering. Phys. Rev. E 2002, 65, 026107. [Google Scholar] [CrossRef] [Green Version]
  62. Brin, S.; Page, L. The anatomy of a large-scale hypertextual Web search engine. Comput. Netw. ISDN Syst. 1998, 30, 107–117. [Google Scholar] [CrossRef]
  63. Luo, S. Distributed PageRank Computation: An Improved Theoretical Study. Proc. Conf. AAAI Artif. Intell. 2019, 33, 4496–4503. [Google Scholar] [CrossRef] [Green Version]
  64. Langville, A.N.; Meyer, C.D. Deeper inside pagerank. Internet Math. 2004, 1, 335–380. [Google Scholar] [CrossRef] [Green Version]
  65. Valente, T. Network Models of the Diffusion of Innovations; Hampton Free Press: Creswell, NJ, USA, 1995. [Google Scholar]
  66. Bharat, K.; Mihaila, G.A. When experts agree: Using non-affiliated experts to rank popular topics. In Proceedings of the 10th International Conference on World Wide Web, Hong Kong, China, 1–5 May 2001. [Google Scholar]
  67. Haveliwala, T. Topic-sensitive pagerank: A context-sensitive ranking algorithm for web search. IEEE Trans. Knowl. Data Eng. 2003, 15, 784–796. [Google Scholar] [CrossRef] [Green Version]
  68. Page, L.; Brin, S.; Motwani, R.; Winograd, T. The PageRank Citation Ranking: Bringing Order to the Web; Stanford University InfoLab: Stanford, CA, USA, 1999. [Google Scholar]
  69. Scozzafava, F.; Maru, M.; Brignone, F.; Torrisi, G.; Navigli, R. Personalized PageRank with syntagmatic information for multilingual word sense disambiguation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Online, 5–10 July 2020; pp. 37–46. [Google Scholar]
  70. Hou, G.; Chen, X.; Wang, S.; Wei, Z. Massively Parallel Algorithms for Personalized PageRank. VLDB 2021, 14, 1668–1680. [Google Scholar] [CrossRef]
  71. Wang, H.; Yuan, Y.; Wei, Z.; Du, X.; Gan, J.; Wen, J.R. Edge-based Local Push for Personalized PageRank. VLDB 2022, 15, 1376–1389. [Google Scholar]
  72. Mo, D.; Luo, S. Agenda: Robust Personalized PageRanks in Evolving Graphs. CIKM 2021. [Google Scholar] [CrossRef]
  73. Wu, H.; Gan, J.; Wei, Z.; Zhang, R. Unifying the Global and Local Approaches: An Efficient Power Iteration with Forward Push. In Proceedings of the 2021 International Conference on Management of Data, Xi’an, China, 20–25 June 2021; pp. 1996–2008. [Google Scholar] [CrossRef]
  74. Aven, T. Availability evaluation of oil/gas production and transportation systems. Reliab. Eng. 1987, 18, 35–44. [Google Scholar] [CrossRef]
  75. Bhavathrathan, B.; Patil, G.R. Analysis of Worst Case Stochastic Link Capacity Degradation to Aid Assessment of Transportation Network Reliability. Procedia-Soc. Behav. Sci. 2013, 104, 507–515. [Google Scholar] [CrossRef] [Green Version]
  76. Yeh, W.-C. A Squeezed Artificial Neural Network for the Symbolic Network Reliability Functions of Binary-State Networks. IEEE Trans. Neural Netw. Learn. Syst. 2016, 28, 2822–2825. [Google Scholar] [CrossRef] [PubMed]
  77. Kakadia, D.; Ramirez-Marquez, J.E. Quantitative approaches for optimization of user experience based on network resilience for wireless service provider networks. Reliab. Eng. Syst. Saf. 2019, 193, 106606. [Google Scholar] [CrossRef]
  78. Yeh, W.-C.; Lin, J.-S. New parallel swarm algorithm for smart sensor systems redundancy allocation problems in the Internet of Things. J. Supercomput. 2016, 74, 4358–4384. [Google Scholar] [CrossRef]
  79. Yeh, W.C. A Novel Generalized Artificial Neural Network for Mining Two-Class Datasets. arXiv 2019, arXiv:1910.10461. [Google Scholar]
  80. Yeh, W.-C. A Modified Universal Generating Function Algorithm for the Acyclic Binary-State Network Reliability. IEEE Trans. Reliab. 2012, 61, 702–709. [Google Scholar] [CrossRef]
  81. Yeh, W.C. An Evaluation of the Multi-state Node Networks Reliability Using the Traditional Binary-State Networks Reliability Algorithm. Reliab. Eng. Syst. Saf. 2003, 81, 1–7. [Google Scholar] [CrossRef]
  82. Yeh, W.-C. A new exact solution algorithm for a novel generalized redundancy allocation problem. Inf. Sci. 2017, 408, 182–197. [Google Scholar] [CrossRef]
  83. Zhou, J.; Coit, D.W.; Felder, F.A.; Wang, D. Resiliency-based restoration optimization for dependent network systems against cascading failures. Reliab. Eng. Syst. Saf. 2020, 207, 107383. [Google Scholar] [CrossRef]
  84. Singh, A.P.; Luhach, A.K.; Gao, X.-Z.; Kumar, S.; Roy, D.S. Evolution of wireless sensor network design from technology centric to user centric: An architectural perspective. Int. J. Distrib. Sens. Netw. 2020, 16, 1550147720949138. [Google Scholar] [CrossRef]
  85. Yeh, W.C. Novel bounded binary-addition tree algorithm for binary-state network reliability problems. arXiv 2020, arXiv:2011.14832. [Google Scholar]
  86. Su, Y.-Z.; Yeh, W.-C. Binary-Addition Tree Algorithm-Based Resilience Assessment for Binary-State Network Problems. Electronics 2020, 9, 1207. [Google Scholar] [CrossRef]
  87. Yeh, W.-C.; Kuo, C.-C. Predicting and Modeling Wildfire Propagation Areas with BAT and Maximum-State PageRank. Appl. Sci. 2020, 10, 8349. [Google Scholar] [CrossRef]
  88. Yeh, W.C. Self-Adaptive Binary-Addition-Tree Algorithm-Based Novel Monte Carlo Simulation for Binary-State Network Reliability Approximation. arXiv 2022, arXiv:2201.05764. [Google Scholar]
  89. Yeh, W.-C.; Tan, S.-Y.; Forghani-Elahabad, M.; El Khadiri, M.; Jiang, Y.; Lin, C.-S. New binary-addition tree algorithm for the all-multiterminal binary-state network reliability problem. Reliab. Eng. Syst. Saf. 2022, 224, 108557. [Google Scholar] [CrossRef]
  90. Yeh, W.-C.; Hao, Z.; Forghani-Elahabad, M.; Wang, G.-G.; Lin, Y.-L. Novel Binary-Addition Tree Algorithm for Reliability Evaluation of Acyclic Multistate Information Networks. Reliab. Eng. Syst. Saf. 2021, 210, 107427. [Google Scholar] [CrossRef]
  91. Yeh, W.C. Application of Long Short-Term Memory Recurrent Neural Networks Based on the BAT-MCS for Binary-State Network Approximated Time-Dependent Reliability Problems. arXiv 2022, arXiv:2202.07837. [Google Scholar]
  92. Su, Y.Z.; Yeh, W.C. The protection and recovery strategy development of dynamic resilience analysis and cost consideration in the infrastructure network. J. Comput. Des. Eng. 2022, 9, 168–186. [Google Scholar] [CrossRef]
  93. Yeh, W.-C.; Tan, S.-Y.; Zhu, W.; Huang, C.-L.; Yang, G.-Y. Novel binary addition tree algorithm (BAT) for calculating the direct lower-bound of the highly reliable binary-state network reliability. Reliab. Eng. Syst. Saf. 2022, 223, 108509. [Google Scholar] [CrossRef]
  94. Yeh, W.-C.; Lin, E.; Huang, C.-L. Predicting Spread Probability of Learning-Effect Computer Virus. Complexity 2021, 2021, 6672630. [Google Scholar] [CrossRef]
  95. Yeh, W.-C. New Method in Searching for All Minimal Paths for the Directed Acyclic Network Reliability Problem. IEEE Trans. Reliab. 2016, 65, 1263–1270. [Google Scholar] [CrossRef]
  96. Yeh, W.-C. A Novel Cut-Based Universal Generating Function Method. IEEE Trans. Reliab. 2013, 62, 628–636. [Google Scholar] [CrossRef]
  97. Hagberg, A.; Swart, P.; Chult, D.S. Exploring Network Structure, Dynamics, and Function Using NetworkX; No. LA-UR-08-05495; LA-UR-08-5495; Los Alamos National Lab. (LANL): Los Alamos, NM, USA, 2008. [Google Scholar]
Figure 1. Propagation tree and propagation layers of 227 blog page nodes.
Figure 1. Propagation tree and propagation layers of 227 blog page nodes.
Applsci 12 06858 g001
Figure 2. Facebook depicts all connected community users.
Figure 2. Facebook depicts all connected community users.
Applsci 12 06858 g002
Figure 3. Power distribution plot.
Figure 3. Power distribution plot.
Applsci 12 06858 g003
Figure 4. Scale-free network generation process.
Figure 4. Scale-free network generation process.
Applsci 12 06858 g004
Figure 5. Scale-free network.
Figure 5. Scale-free network.
Applsci 12 06858 g005
Figure 6. Network G(V, E).
Figure 6. Network G(V, E).
Applsci 12 06858 g006
Figure 7. Network of Barabási–Albert model with 8 nodes and 12 edges.
Figure 7. Network of Barabási–Albert model with 8 nodes and 12 edges.
Applsci 12 06858 g007
Figure 8. Degree distribution of network in Figure 7.
Figure 8. Degree distribution of network in Figure 7.
Applsci 12 06858 g008
Table 1. PageRank algorithm.
Table 1. PageRank algorithm.
Input:G(V, E), where V = {0, 1, 2, …, u − 1}.
Output:PageRank value of nodes.
Step P0:Let t = 0, set the initial PageRank value of all nodes to 1/u.
Step P1:According to Equation (3), update the PageRank values of all nodes.
Step P2:For nodes with Nv = 0, evenly distribute the PageRank values of other nodes.
Step P3:If the PageRank value of all nodes does not change, stop; otherwise, let t = t + 1 and go to P0.
Table 2. BAT algorithm.
Table 2. BAT algorithm.
Input:G(V, E)
Output:All possible non-repetitive state vectors.
Step B0:Let SUM = 0, k = 1, X1 = X, and X is a zero vector with m coordinates.
Step B1:Assume i = m.
Step B2:If X(ai) = 0, then X(ai) = 1, k = k + 1, Xk = X, SUM = SUM + 1, proceed to Step B4.
Step B3:If i > 1, let X(ai) = 0, SUM = SUM − 1, i = i − 1, and go to Step B2.
Step B4:If SUM = m, stop. At this time, X1, X2, …, Xk are all possible state vectors. Otherwise, proceed to Step B1.
Table 3. Basic information of network in Figure 6.
Table 3. Basic information of network in Figure 6.
iDeg(i)V(i)PR(i)
02{2, 1}0.204787
13{3, 2, 0}0.295213
23{3, 1, 0}0.295213
32{2, 1}0.204787
Table 4. Transition rules of binary state label.
Table 4. Transition rules of binary state label.
Rule 1:In binary status labels, the number of bits (m) of the state value is equal to Deg(i).
Rule 2:In V(i), node i is in descending node order according to its label number.
Rule 3:In the binary state label, the mth digit is equal to 0 or 1, 0 means that the mth node is not included in the state, and 1 means that the mth node is included in the state.
Table 5. State label of network in Figure 6.
Table 5. State label of network in Figure 6.
Node0123
State
V(i){2, 1}{3, 2, 0}{3, 1, 0}{2, 1}
Sk (i)Node State LabelBinary State LabelNode State LabelBinary State LabelNode State LabelBinary State LabelNode State LabelBinary State Label
S0 (i)Ø[0 0]Ø[0 0 0]Ø[0 0 0]Ø[0 0]
S1 (i){1}[0 1]{0}[0 0 1]{0}[0 0 1]{1}[0 1]
S2 (i){2}[1 0]{2}[0 1 0]{1}[0 1 0]{2}[1 0]
S3 (i){2, 1}[1 1]{2, 0}[0 1 1]{1, 0}[0 1 1]{2, 1}[1 1]
S4 (i) {3}[1 0 0]{3}[1 0 0]
S5 (i) {3, 0}[1 0 1]{3, 0}[1 0 1]
S6 (i) {3, 2}[1 1 0]{3, 1}[1 1 0]
S7 (i) {3, 2, 0}[1 1 1]{3, 1, 0}[1 1 1]
Table 6. State probabilities and propagation probabilities for node 1 of network in Figure 6.
Table 6. State probabilities and propagation probabilities for node 1 of network in Figure 6.
StatePropagation NodePR(Sk(i))Pr(Sk(i))
S0(1)Ø0.5 × Min{PR(0), PR(2), PR(3)}
= 0.5 × 0.204787 = 0.102394
0.102394/2.921542 = 0.0350478
S1(1){0}PR(0) = 0.2047870.204787/2.921542 = 0.0700956
S2(1){2}PR(2) = 0.2952130.295213/2.921542 = 0.101047
S3(1){0, 2}PR(0) + PR(2) = 0.50.5/2.921542 = 0.171142
S4(1){3}PR(3) = 0.2047870.204787/2.921542 = 0.0700956
S5(1){0, 3}PR(0) + PR(3) = 0.4095740.409574/2.921542 = 0.140191
S6(1){2, 3}PR(2) + PR(3) = 0.50.5/2.921542 = 0.171142
S7(1){0, 2, 3}PR(0) + PR(2) + PR(3) = 0.7047870.704787/2.921542 = 0.241238
Sum 2.9215421
Table 7. State probability for network in Figure 6.
Table 7. State probability for network in Figure 6.
i0123
V(i){2 1}{3 2 0}{3 1 0}{2 1}
PR(S0(i))0.1476060.1023940.1023940.147606
PR(S1(i))0.2952130.2047870.2047870.295213
PR(S2(i))0.2952130.2952130.2952130.295213
PR(S3(i))0.5904260.50.50.590426
PR(S4(i)) 0.2047870.204787
PR(S5(i)) 0.4095740.409574
PR(S6(i)) 0.50.5
PR(S7(i)) 0.7047870.704787
Table 8. Propagation probability for network in Figure 6.
Table 8. Propagation probability for network in Figure 6.
i0123
V(i){2 1}{3 2 0}{3 1 0}{2 1}
PR(S0(i))0.1111110.0350480.0350480.111111
PR(S1(i))0.2222220.0700960.0700960.222222
PR(S2(i))0.2222220.1010470.1010470.222222
PR(S3(i))0.4444440.1711420.1711420.444444
PR(S4(i)) 0.0700960.070096
PR(S5(i)) 0.1401910.140191
PR(S6(i)) 0.1711420.171142
PR(S7(i)) 0.2412380.241238
Table 9. Values related to the propagation state of network in Figure 6.
Table 9. Values related to the propagation state of network in Figure 6.
iV(i)C(i)PR(i)PRmax(i)
0{2, 1}40.2047870.295213 + 0.295213 = 0.590426
1{3, 2, 0}80.2952130.204787 + 0.295213 + 0.204787 = 0.704787
2{3, 1, 0}80.2952130.204787 + 0.295213 + 0.204787 = 0.704787
3{2, 1}40.2047870.295213 + 0.295213 = 0.590426
Table 10. The new BAT algorithm.
Table 10. The new BAT algorithm.
Input:G(V, E), V(i), Sk(i), s, Npage
Output:Pr (s, Npage) = R
Step A0:Let i = s, l = 0, R = 0, Tl = { i }, Pl = 1, X(i) = 1, go to Step A1.
Step A1:If X(i) = 2|Deg(i)|, then node i has exceeded the maximum propagation state, go to Step A8.
Step A2:Let T* = {j | j ∈ [V(i) − Tl]}.
Step A3:Let Tl+1 = TlT*. If T* = Ø, then go to Step A6.
Step A4:When the number of nodes in T*Tl is greater than or equal to Npage, then Rl = Pl × Pr(SX(i)(i)), R = R + Rl, and let l = l + 1, Tl = Tl−1 and go to Step A7.
Step A5:X(j) = 0 for all jT*, Pl = Pl−1 × Pr(SX(i)(i)).
Step A6:Let l = l + 1, if i has the next node in Tl, update i to be the next node and go to Step A1.
Step A7:Let X(i) = X(i) + 1, go to Step A1.
Step A8:If i has a previous node in Tl, update i to be the previous node and go to Step A7; if i does not have a previous node in Tl, stop searching, and R is the final propagation probability at this time.
Table 11. Process of finding Pr (s = 0, Npage = 2) for the example network.
Table 11. Process of finding Pr (s = 0, Npage = 2) for the example network.
liX(i)TlT*PlPr(SX(i)(i))R
001{0}{1}1Pr(S1(0))R0
102{0}{2}1Pr(S2(0))R0 + R1
203{0}{1, 2}1Pr(S3(0))R0 + R1 + R2
Table 12. Process of finding Pr (s = 0, Npage = 3) for the example network.
Table 12. Process of finding Pr (s = 0, Npage = 3) for the example network.
liX(i)TlT*PlPr(SX(i)(i))R
001{0}{1}1Pr(S1(0))0
110{0, 1}ØPr(S1(0))Pr(S0(1))0
211{0, 1}ØPr(S1(0))Pr(S1(1))0
312{0, 1}{2}Pr(S1(0))Pr(S2(1))R3
413{0, 1}{2}Pr(S1(0))Pr(S3(1))R3 + R4
514{0, 1}{3}Pr(S1(0))Pr(S4(1))R3 + R4 + R5
615{0, 1}{3}Pr(S1(0))Pr(S5(1))R3 + R4 + R5 + R6
716{0, 1}{2, 3}Pr(S1(0))Pr(S6(1))R3 + R4 + R5 + R6 + R7
817{0, 1}{2, 3}Pr(S1(0))Pr(S7(1))R3 + R4 + R5 + R6 + R7 + R8
902{0}{2}1Pr(S2(0))R3 + R4 + R5 + R6 + R7 + R8
1020{0, 2}ØPr(S2(0))Pr(S0(2))R3 + R4 + R5 + R6 + R7 + R8
1121{0, 2}ØPr(S2(0))Pr(S1(2))R3 + R4 + R5 + R6 + R7 + R8
1222{0, 2}{1}Pr(S2(0))Pr(S2(2))R3 + R4 + R5 + R6 + R7 + R8 + R12
1323{0, 2}{1}Pr(S2(0))Pr(S3(2))R3 + R4 + R5 + R6 + R7 + R8 + R12 + R13
1424{0, 2}{3}Pr(S2(0))Pr(S4(2))R3 + R4 + R5 + R6 + R7 + R8 + R12 + R13 + R14
1525{0, 2}{3}Pr(S2(0))Pr(S5(2))R3 + R4 + R5 + R6 + R7 + R8 + R12 + R13 + R14 + R15
1626{0, 2}{1, 3}Pr(S2(0))Pr(S6(2))R3 + R4 + R5 + R6 + R7 + R8 + R12 + R13 + R14 + R15 + R16
1727{0, 2}{1, 3}Pr(S2(0))Pr(S7(2))R3 + R4 + R5 + R6 + R7 + R8 + R12 + R13 + R14 + R15 + R16 + R17
1803{0}{1, 2}1Pr(S3(0))R3 + R4 + R5 + R6 + R7 + R8 + R12 + R13 + R14 + R15 + R16 + R17 + R18
Table 13. Basic information of the network graph in Figure 7.
Table 13. Basic information of the network graph in Figure 7.
CaseCase 1Case 2 (r3 = 1)Case 3 (r1 = 1)
iDeg(i)V(i)C(i)PR(i)PRmax(i)PPR(i)PPRmax(i)PPR(i)PPRmax(i)
02{3 2}40.08790.36260.07950.45260.05690.3191
12{5 3}40.08670.39900.07890.48590.22260.4203
23{7 3 0}80.12540.45120.10880.51620.08240.3807
36{6 5 4 2 1 0}640.23720.63670.34380.56330.23670.6762
42{5 3}40.08670.39900.07890.48590.07260.4203
54{7 4 3 1}160.16180.53680.14210.59450.18360.6189
62{7 3}40.08810.36330.07500.43670.05820.3238
73{6 5 2}80.12610.37530.09290.32600.08710.3242
Table 14. Propagation probability results for the network in Figure 7.
Table 14. Propagation probability results for the network in Figure 7.
iNpageR1 RuntimeR1R2R2R1R3R3R1
010.00001.00001.00000.00001.00000.0000
20.00200.92040.94330.02290.93940.0190
30.00300.90710.93450.02740.93130.0242
40.00400.89230.91870.02650.91970.0274
50.01200.86810.88900.02090.89540.0273
60.07880.81630.82610.00980.83950.0233
70.66820.69920.6940−0.00520.71640.0172
84.75750.40560.3947−0.01090.42140.0159
110.00001.00001.00000.00001.00000.0000
20.00200.90790.93190.02390.9015−0.0064
30.00300.90070.92640.02570.8884−0.0123
40.00400.88780.91360.02590.8703−0.0174
50.01200.85360.87340.01980.8261−0.0274
60.08380.79970.80810.00850.7610−0.0386
70.74160.67040.6660−0.00440.6198−0.0506
84.64960.39420.3836−0.01060.3484−0.0458
210.00001.00001.00000.00001.00000.0000
20.00200.97620.98110.00490.98170.0054
30.00300.95630.96850.01220.96950.0132
40.00500.94010.95420.01410.95880.0187
50.01400.91710.92750.01040.93800.0209
60.10870.86440.86580.00130.88650.0221
70.89530.71450.7020−0.01250.73010.0156
85.52360.37560.3608−0.01480.38490.0093
310.00001.00001.00000.00001.00000.0000
20.00300.99790.99790.00000.99870.0008
30.00300.99010.9891−0.00090.99060.0005
40.00500.96740.9626−0.00480.96740.0000
50.01990.91960.9073−0.01230.9111−0.0085
60.14370.83830.8169−0.02140.8253−0.0131
71.07670.66830.6404−0.02790.6514−0.0169
86.19670.34710.3259−0.02130.3403−0.0068
410.00001.00001.00000.00001.00000.0000
20.00200.90790.93190.02390.9015−0.0064
30.00300.90070.92640.02570.8961−0.0047
40.00300.88780.91360.02590.8814−0.0064
50.01300.85360.87340.01980.8388−0.0148
60.09110.79970.80810.00850.7786−0.0210
70.71260.67040.6660−0.00440.6458−0.0246
84.59090.39420.3836−0.01060.3881−0.0061
510.00001.00001.00000.00001.00000.0000
20.00200.99000.99180.00180.99270.0027
30.00300.97450.98170.00720.9720−0.0026
40.00600.95820.96830.01020.9518−0.0063
50.02090.92330.92980.00640.9068−0.0166
60.13370.85610.8520−0.00410.8304−0.0257
70.99950.68940.6736−0.01580.6599−0.0294
85.96960.35740.3413−0.01610.3447−0.0127
610.00001.00001.00000.00001.00000.0000
20.00300.92010.94950.02930.93700.0168
30.00300.90460.93870.03410.92730.0228
40.00300.89400.92880.03480.91860.0246
50.01000.87390.90490.03100.89910.0252
60.07600.82110.84230.02110.84270.0215
70.65800.70120.70860.00740.71610.0149
84.41820.40360.3967−0.00690.41860.0150
710.00001.00001.00000.00001.00000.0000
20.00200.97150.97200.00050.97810.0065
30.00300.94630.95550.00920.96170.0154
40.00490.93330.94860.01530.95080.0176
50.01800.92070.93820.01760.93960.0189
60.12710.87250.88710.01460.89300.0205
70.89320.72000.72450.00450.73350.0135
85.76680.37730.3721−0.00520.38570.0084
Table 15. Ranking for PageRank value of maximum propagation state nodes.
Table 15. Ranking for PageRank value of maximum propagation state nodes.
iCase 1
PRmax(i)
RankingCase 2
PPRmax(i)
RankingCase 3
PPRmax(i)
Ranking
00.362670.452660.31918
10.399040.485940.42033
20.451230.516230.38075
30.636710.563320.67621
40.399040.485940.42033
50.536820.594510.61892
60.363360.436770.32387
70.375380.326080.32426
Table 16. In Case 2, the PageRank value of maximum propagation state increases and decreases.
Table 16. In Case 2, the PageRank value of maximum propagation state increases and decreases.
iPRmax(i)PPRmax(i)Increase/DecreaseRank Change
00.36260.45260.09007→6
10.39900.48590.08694→4
20.45120.51620.06503→3
30.63670.5633−0.07341→2
40.39900.48590.08694→4
50.53680.59450.05772→1
60.36330.43670.07346→7
70.37530.3260−0.04948→8
Table 17. In Case 3, the PageRank value of maximum propagation state increases and decreases.
Table 17. In Case 3, the PageRank value of maximum propagation state increases and decreases.
iPRmax(i)PPRmax(i)Increase/DecreaseRank Change
00.36260.3191−0.0436 7→8
10.39900.42030.0213 4→3
20.45120.3807−0.0705 3→5
30.63670.67620.0395 1→1
40.39900.42030.0213 4→3
50.53680.61890.0821 2→2
60.36330.3238−0.0395 6→7
70.37530.3242−0.0511 8→6
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Yeh, W.-C.; Zhu, W.; Huang, C.-L.; Hsu, T.-Y.; Liu, Z.; Tan, S.-Y. A New BAT and PageRank Algorithm for Propagation Probability in Social Networks. Appl. Sci. 2022, 12, 6858. https://doi.org/10.3390/app12146858

AMA Style

Yeh W-C, Zhu W, Huang C-L, Hsu T-Y, Liu Z, Tan S-Y. A New BAT and PageRank Algorithm for Propagation Probability in Social Networks. Applied Sciences. 2022; 12(14):6858. https://doi.org/10.3390/app12146858

Chicago/Turabian Style

Yeh, Wei-Chang, Wenbo Zhu, Chia-Ling Huang, Tzu-Yun Hsu, Zhenyao Liu, and Shi-Yi Tan. 2022. "A New BAT and PageRank Algorithm for Propagation Probability in Social Networks" Applied Sciences 12, no. 14: 6858. https://doi.org/10.3390/app12146858

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop