Dynamic Community Discovery Method Based on Phylogenetic Planted Partition in Temporal Networks

: As most of the community discovery methods are researched by static thought, some community discovery algorithms cannot represent the whole dynamic network change process efﬁciently. This paper proposes a novel dynamic community discovery method (Phylogenetic Planted Partition Model, PPPM) for phylogenetic evolution. Firstly, the time dimension is introduced into the typical migration partition model, and all states are treated as variables, and the observation equation is constructed. Secondly, this paper takes the observation equation of the whole dynamic social network as the constraint between variables and the error function. Then, the quadratic form of the error function is minimized. Thirdly, the Levenberg–Marquardt (L–M) method is used to calculate the gradient of the error function, and the iteration is carried out. Finally, simulation experiments are carried out under the experimental environment of artiﬁcial networks and real networks. The experimental results show that: compared with FaceNet, SBM + MLE, CLBM, and PisCES, the proposed PPPM model improves accuracy by 5% and 3%, respectively. It is proven that the proposed PPPM method is robust, reasonable, and effective. This method can also be applied to the general social networking community discovery ﬁeld.


Background
Complex network analysis is an interdisciplinary research field which can be applied in a lot of areas such as computer science [1,2] and social, biological and physical sciences [3][4][5], and it is capturing the attention of many scholars. A complex network is a simple graph defined as a set of nodes connected by a set of edges. Nodes can represent individuals or organizations. Edges are relational ties between two nodes, e.g., friendship relationships between two social users. Graphs are one of the most important and powerful data structures. Complex network analysis and modeling can be used to reveal patterns of social interaction, to study recommendation systems, or protein complexes and protein functional modules. By far the most basic tasks in complex networks are node identification, link prediction, and information dissemination. These tasks have received extensive research and attention. In addition, community structure discovery is also one of the most important tasks; it is usually defined as identifying tightly connected subgraphs from a complex network. Because communities help to reveal the structure-function relationship of the network, it has been studied extensively. For example, communities within cancer networks mark key pathways associated with cancer progression [6], and the communities in the networks mark key pathways associated with cancer progression [6], and the communities in the multi-layer transportation network correspond to common practices, which provides clues for airline management [7]. Therefore, a great deal of work has been carried out in the discovery of communities in the network [8][9][10]. A lot of work has been proposed for community discovery, existing algorithms either optimize predefined quantitative functions or acquire potential feature matrices for community detection. Typical methods include modularization-based methods [11], model-based methods [12,13] and random-walk-based methods [14][15][16]. S. Fortunato et al. [17,18] have conducted a comprehensive survey.
However, all these methods assume that the target network is static and ignore the timeliness of the network. In reality, many networks from society and nature are dynamic, meaning that the network structure changes over time; that is, it performs the dynamic network. More specifically, in a dynamic network, nodes may appear or disappear over time, and links between two nodes may also appear or disappear. For example, interpersonal relations often change due to individual behavior [19]. For another example, tumor cell migration leads to metastasis, which is crucial for the diagnosis and treatment of tumors [20]. Therefore, it is worthwhile to track how a community evolves in a dynamic network (also known as a dynamic or evolutionary community).
For dynamic network modeling, the most widely used method is to introduce explicit smoothing frameworks, which quantifies the similarity between snapshots in two subsequent steps by introducing the Temporal Smoothed Framework (TSF). Various TSF-based algorithms have been proposed to evolve the community by extending the static community discovery algorithm. For example, for topological connectivity, the Kim-Han algorithm [21] found dynamic communities by optimizing modularity, and the DYNMOGA method [22] is presented for the multi-objective genetic algorithm to simultaneously optimize clustering accuracy and clustering drift. Regarding matrix decomposition, the ESPC method [23] is used with matrix spectrum, the ECKF method [24] is proposed by using kernel ENMF, and the Se-NMF method [25] is used by a semi-supervised strategy to develop community testing. The Gr-NMF method [26] is adopted by graph-regularized NMF for community discovery in evolution. In the probabilistic model: FaceNet method [27] is researched by using Maximum a Posteriori (MAP) estimate, DSBM method [28] adopted the Bayesian method to obtain the evolving community by extending the random block model. According to the existing literature [29], there are six evolutionary events in the community (as shown in Figure 1), including birth, death, growth, contraction, merging, and splitting. Sometimes, a seventh event is added to these, i.e., continuing. Finally, an eight event was proposed by Cazabet and Amblard [30] and it is resurgence. A generic dynamic community discovery algorithm does not necessarily have to handle all these events, which can be differently managed in different works [31].

Motivation
While much work has been carried out to address the problem of dynamic community discovery, there are still some issues that need to be addressed.
Firstly, most existing dynamic network models assume that it is a hidden Markov structure, in this structure, when the current network state is given, a network snapshot at any given time is conditionally independent for all previous snapshots. This approach may not be flexible enough to replicate some of the observations in real network data.
Secondly, the dynamic system is used by filters, even for Gaussian distributions. However, after a nonlinear transformation, Gaussian terms are lost. Mean and covariance are the only measures computed by the filter. This is the result of a nonlinear transformation approximated by Gauss, and therefore this approximation may be poor.
Finally, how to combine the information of the community structure available at the previous moment with the information available at the current moment is an important question. In the traditional hidden Markov dynamic Bayesian network model, the probability of an edge appearing in a dynamic network is realized by the estimated state.
Therefore, this paper proposes a phylogenetic planted partition method, which uses the graph optimization strategy to continuously discover the evolving communities.

Our Work and Contributes
The main contributions of this paper can be summarized as follows: (1) The time dimension is introduced into the typical stochastic block-model, and all states in the whole dynamic network system are treated as variables, and the observation equation is taken as the constraint between variables to construct an error function about the whole dynamic network system. (2) By adopting the graph-based optimization strategy, the constraints in the entire motion trajectory can be considered once. In the linearization process, only the Jacobian matrix is calculated, and the calculation process is also relative to the entire motion trajectory. Therefore, the entire system evolution process is transformed into the nonlinear system optimization process. (3) In natural ecosystems, inspired by the evolutionary thinking of species populations and combined with the typical probability model of stochastic block-model in community discovery, a phylogenetic planted partition method (PPPM) for dynamic community discovery is proposed. (4) The proposed PPPM method in the two scenarios of artificial network and the real network is verified by experiments, which proves that the performance of the novel method is better than the four state-of-the-art methods (FaceNet, SBM + MLE, CLBM, and PisCES).

Organization
The remainder of this paper is structured as follows: In Section 2, related work is discussed; Section 3 introduces the proposed model in detail, describes the proposed PPPM method, and gives the derivation process. In Section 4, the experimental results of the novel PPPM method in the artificial dynamic network and real dynamic network are presented, and then compared with other existing models. Finally, in Section 5, some conclusions are given and future directions are discussed.

Related Work
According to the research of Aynaud et al. [32], the dynamic community discovery algorithms can be divided into four categories: coupling network, two-stage algorithm, evolutionary clustering, and probability model. However, Hartmann et al. [33] believed that all existing dynamic community discovery methods can be identified as online or offline methods. Rossetti and Cazabet [31] proposed a new survey on community detection in dynamic networks, which proposed the unique functions and challenges of dynamic community discovery algorithms. The first kind of coupled network-based algorithm firstly builds the network by fusing edges at different times. Then, the classical static community detection algorithm is used to find the communities in the coupled network. For example, Agarwal et al. [34] discovered the ongoing events in the microblog message flow by adding edges between vertex instances at different times to build the coupling network, in which the dynamic community corresponds to the community in the built network. Because coupled networks cannot fully describe the dynamic characteristics of networks, these algorithms have been shown to accurately discover only short-cycle communities. To overcome this problem, the second kind of two-stage algorithm separated the community detection from the community dynamic, avoiding the coupling of the dynamic network.
Specifically, these algorithms used static community detection algorithms to find the community each time and then connected the community the next two times to extract the evolving community. Typical algorithms included GraphScope [35] and TRMMC [36] coupled networks and two-stage algorithms that detected dynamic communities in a dynamic network by simply extending static community detection methods and detecting dynamic communities in each operation dynamic network or static community. In general, these algorithms can achieve better performance in the case of weak network dynamics. In this case, the dynamic update method can accurately identify the dynamic community without running the community detection algorithm each time, and only need to update the previously discovered community. However, the accuracy of these algorithms is low.
The third type of dynamic community discovery method is related to clustering evolutionary, which is proposed by Chakrabarti et al. [37]. They extracted the implicit community structure in each snapshot, which is one of the most widely used methods for dynamic community discovery. The evolutionary clustering algorithm adopted the assumption of time smoothness. The community structure will not change much over a continuous-time slice. This time smoothing method can be used to overcome the randomness. Compared with other algorithms, the evolutionary community discovery algorithm aims to discover a smooth sequence of communities in a series of network snapshots (as can be seen in Figure 2). The overall objective function of the evolutionary algorithm can be decomposed into two parts: Snapshot Cost (CS) and Temporal Cost (CT) [38].
The first kind of coupled network-based algorithm firstly builds the network by fus-ing edges at different times. Then, the classical static community detection algorithm is used to find the communities in the coupled network. For example, Agarwal et al. [34] discovered the ongoing events in the microblog message flow by adding edges between vertex instances at different times to build the coupling network, in which the dynamic community corresponds to the community in the built network. Because coupled networks cannot fully describe the dynamic characteristics of networks, these algorithms have been shown to accurately discover only short-cycle communities. To overcome this problem, the second kind of two-stage algorithm separated the community detection from the community dynamic, avoiding the coupling of the dynamic network.
Specifically, these algorithms used static community detection algorithms to find the community each time and then connected the community the next two times to extract the evolving community. Typical algorithms included GraphScope [35] and TRMMC [36] coupled networks and two-stage algorithms that detected dynamic communities in a dynamic network by simply extending static community detection methods and detecting dynamic communities in each operation dynamic network or static community. In general, these algorithms can achieve better performance in the case of weak network dynamics. In this case, the dynamic update method can accurately identify the dynamic community without running the community detection algorithm each time, and only need to update the previously discovered community. However, the accuracy of these algorithms is low.
The third type of dynamic community discovery method is related to clustering evolutionary, which is proposed by Chakrabarti et al. [37]. They extracted the implicit community structure in each snapshot, which is one of the most widely used methods for dynamic community discovery. The evolutionary clustering algorithm adopted the assumption of time smoothness. The community structure will not change much over a continuous-time slice. This time smoothing method can be used to overcome the randomness. Compared with other algorithms, the evolutionary community discovery algorithm aims to discover a smooth sequence of communities in a series of network snapshots (as can be seen in Figure 2). The overall objective function of the evolutionary algorithm can be decomposed into two parts: Snapshot Cost (CS) and Temporal Cost (CT) [38].
Among them, CS measures the adaptability of the community structure and network at the time t , while CT measures how similar the two community structures (the community structure is acquired at the time t to the structure is obtained at previous time -1 t ). The parameter  is for balancing the importance between CS and CT . By introducing different object functions based on modularity, normalization of mutual information and spectrum clustering et al., this framework has been used in much of the literature [39,40] to discover communities in dynamic networks.   Among them, CS measures the adaptability of the community structure and network at the time t, while CT measures how similar the two community structures (the community structure is acquired at the time t to the structure is obtained at previous time t − 1). The parameter α is for balancing the importance between CS and CT. By introducing different object functions based on modularity, normalization of mutual information and spectrum clustering et al., this framework has been used in much of the literature [39,40] to discover communities in dynamic networks.
Folino and Pizzuti [39] formalized the dynamic community discovery algorithm as a multi-objective optimization problem, which maximized the clustering accuracy of the current time step and minimized the clustering drift from one time step to one successive time step. Ma and Dong [40] proposed two evolutionary non-negative matrix decomposition (ENMF) frameworks and proved the equivalence relation between evolutionary module density and evolutionary spectrum clustering. In addition, they introduced a semi-supervised approach, which is called sE-NMF, that incorporated prior information into the ENMF.
Chi et al. [23] extended this idea with two frameworks of evolutionary spectral clustering, which are defined as Preserving Cluster Quality (PCQ) and Preserving Cluster Membership (PCM). Both frameworks have proposed the optimization and correction cost functions, but they differ in how to define the CT. In the PCQ framework, the CT is the cost of the clustering results at the time t applied to the similarity matrix at the time t − 1. In PCM, the CT is defined as a measure of the distance between the clustering results at the time t and t − 1. In the PCQ: where W t−1 and W t represent the adjacency matrix at the time t − 1 and t, respectively.
Finally, the membership of community members can be obtained by calculating the eigenvector of Formula (3).
After the above work, an evolutionary community discovery algorithm is proposed to try to optimize the modified cost function in the definition. Since the user definition of snapshot and CT of community discovery results varies with community discovery algorithms, the aim of the above work is to solve the problem of how to select the parameter α, which can determine how much weight to assign to previous data or community discovery results.
Xu et al. [41] proposed an adaptive evolutionary clustering algorithm, using the following smooth approximation matrixΨ t to better estimate the network state.
where the parameter α t controls the rate of forgetting past information, so it is also defined as a forgetting factor. Ma et al. [26] proposed a non-negative matrix decomposition for co-regularization evolution to identify dynamic communities under a time-smoothing framework.
where β and γ are regularized parameters. In recent years, researchers have proposed some excellent techniques to improve the performance of dynamic community discovery algorithms. In the probability model, the researchers have put forward an innovative model, and this paper puts forward a new dynamic model, which is suitable for dynamic Bayesian networks, namely the system evolution partition transplantation model. In this method, the model parameters are tracked by using the graph of optimization strategy. Table 1 compares some recent representative dynamic social network discovery algorithms based on a probability model.

Formal Definition
In this paper, a novel phylogenetic planted partition model is defined for temporal social networks by the following definitions: Definition 1. A social network can be represented by a graph, G, on a set of nodes, V, and a set of edges, E. Nodes and edges are represented by an adjacency matrix A, where ρ ij = 1 represents the existence of an edge from node i ∈ V to node j ∈ V − {i}, and ρ ij = 0 represents the absence of an edge. This paper assumes the network is a directed graph, which is generally ρ ij = ρ ji and has no self-loop, namely ρ ii = 0. Definition 2. The positive integer n represents the number of nodes, and ε = (ε 1 , . . . , ε k ) is the probability vector on [k] = {1, . . . , k} (k is the number of network communities). W is a symmetric matrix whose element is k × k between [0,1]. (X, G) is defined under the Stochastic Block-Mode (SBM) SBM(n, θ, W). X is a n dimensional random vector; it is independent and identically distributed under ε. Let the community set C r = C r (X) = {C 1 , . . . , C k }, r ∈ [k] denotes the division of V into k communities. In this paper, we use the symbols p and q to indicate two generic communities, c i and c j , where i and j are the two nodes of a simple graph G of n vertices, which respectively belong to c i (i.e., i ∈ p) and c j (i.e., j ∈ q) at time t. Nodes i and j are connected according to a probability ε c t i c t j , independently from the other node pairs. Therefore, the probability distribution of (X, G) [50] is as follows, where G = (V, E).
where N pq (x, y) represents the observed value of the number of edges in the partition, which can be expressed as represents the probability value of the number of edges in the partition, which can be expressed as follows.
Equation (8) can be rewritten by: Definition 3. Define an evolutionary sequence of discrete time steps for social network (dynamic Bayesian network); the nodes and edges may appear or disappear with time. The temporal social networks can be expressed as G 1 , G 2 , . . . , G T , where G t = (V t , E t ), superscript t represents the time step, V t and E t , respectively, in the time step are t collection of nodes and edges. Let A (T) represent the sequence of adjacency matrix on the node-set sequence V (T) = ∪ T t=1 V t , and let c (T) represent the sequence of community member membership vectors of the node. For this dynamic social network, the probability distribution of edges can be defined as: For any pair of nodes i ∈ p and j ∈ q at t − 1 and t, s.t. ρ t−1 ij = 1. Namely, there is an edge from node i to the node j at the time t − 1 and ρ t ij is Independent Identically Distributed (IID). The same is true for ρ t−1 ij = 0. The mapping process of random sampling and probability allocation is shown in Figure 3.
Equation (8) For any pair of nodes ip  and jq  at Namely, there is an edge from node i to the node j at the time . The mapping process of random sampling and probability allocation is shown in Figure 3. ij and allocate  with probability Bernoulli  . Figure 3 shows the hypothesis of this paper. There are only two possibilities for random events: existence or non-existence of edges. In this paper, we randomly sample ( , ) ij and allocate  , with probability 1 Bernoulli  . Each term of the adjacency matrix t A is independent. Therefore, Equation (10) . Figure 3 shows the hypothesis of this paper. There are only two possibilities for random events: existence or non-existence of edges. In this paper, we randomly sample (i, j) and allocate ε, with probability . Each term of the adjacency matrix A t is independent. Therefore, Equation (10) can be rewritten as the likelihood form with parameter W t .
where α t pq and β t pq are, respectively, denoted as follows:

A Migration Partitioning Model for Phylogenetic Evolution
The proposed dynamic community discovery method can track the status of the target over time to discover community results. Therefore, this paper constructs an observation model, which can be described by where Z t is an independent Gaussian noise matrix with zero mean and variance (σ t pq ) 2 = ε t pq (1 − ε t pq )/β t pq . This matrix reflects the transient variations caused by noise. In this paper, we assume that Z t , Z t−1 , . . . are independent of each other.
In the dynamic system model, S (t) expresses the set of observed values, and W (t) represents the state of the sequence of observed values that generate noise in the dynamic system. This paper refines the final model by modeling the evolution of specified states over time. Because ε t is a probability between 0 ∼ 1, and this paper deals with ε t in logarithmic form, that is, y t = log(ε t pq /(1 − ε t pq )), then a time-series dynamic observation model of system evolution can be constructed as follows: where H t denotes the state transition model, y t represents the vector metric representation of matrix W t , z t implies the process noise, z t is a random vector with zero mean and Θ t is the covariance matrix. According to the vectorization expression of S t and observation noise Z t , the observation model (15) can be rewritten as: The logical activation function g(·) is handled by the Sigmod function, which is: This paper assumes that the initial state of the dynamic system obeys the Gaussian distribution, namely, y 0 ∼ N(µ 0 , Θ 0 ). The nonlinear optimization problem in the timeseries dynamic observation model of system evolution is constructed in this paper, which is the problem of calculating the Maximum a Posteriori (MAP) of c t , while for Gaussian distributions, the maximization problem can be translated into the negative logarithm problem of minimizing the target probability c t . Therefore, Equation (12) is converted into the following logarithmic likelihood: The following error function will be constructed: Then, minimize the quadratic form of the error function: Finally, make the first-order expansion of f (x): where J(x) is the derivative of f (x) with respect to x, which is actually a matrix of m × n, which is also Jacobian. The derivative problem can be turned into a recursive approximation problem; therefore, L-M method is adopted in this paper to determine the step size ∆x, the L-M method avoids the non-singular and morbid state properties of the coefficient matrix of linear equations and can provide a more stable and accurate increment ∆x. In the previous methods, as the approximate second-order Taylor expansion adopted in the GaUSs-Newton method could only have a good approximation effect near the expansion point, a trust-region is added to ∆x. It should be noted that the trust-region should not be so large that the approximation is inaccurate. The approximate value in the trust region is considered to be valid; when it is outside of this region, the approximation might go wrong. The scope of the trust region is determined by the difference between the approximate model and the actual function. Determine rules: if the differences are small, let the scope be as large as possible; if the difference is large, narrow the approximation. Therefore, Equation (22) is used to judge whether the Taylor approximation is good enough or not.
where the numerator ρ is the decreasing value of the actual function, and the denominator is the decreasing value of the approximate model. If ρ is close to 1, then the approximation is good. If ρ is too small, meaning that the actual reduced value is far less than the approximate reduced value, then the approximate result is considered to be poor and the approximate range needs to be narrowed. On the contrary, when ρ is large, it means that the actual decline is larger than expected, and the approximate range can be enlarged. Because the temporal dynamic observation model of system evolution constructed is nonlinear and d f /dx is not easy to obtain, this paper intends to adopt an iterative method (if there is an extreme value, then convergence to approximation) to converge the approximation. The steps are shown in Algorithm 1.

Algorithm 1:
Main procedure of the iterative method 1. Given an initial value x 0 , radius r and parameter k 2. for the k-th iteration, solving: min Compute ρ 4.
if convergence 10. break 11. end where the limiting condition r is the radius of the trust region. In Equation (21), the incremental range is limited to a sphere of radius r, which is seen as an ellipsoid after multiplying by D. D is taken as a non-negative diagonal matrix, usually with the square root of the diagonal element J T J, and it is equivalent to directly constraining ∆x in the ball.
where λ is the Lagrange multiplier. Finally, this paper needs to obtain the gradient by solving the objective function (23). Since it is an optimization problem with inequality constraints, the Lagrange multiplier is used in this paper to transform the objective function into an unconstrained optimization problem. Additionally, then the target function is transformed.
Let us expand out the square of the target function of (23).
Then, solve the derivative of ∆x k in Equation (24) and set it to zero: The following equations are obtained: Let J(x k ) T J(x k ) = H, the right-hand side of the equation be defined as g, and the equation can be simplified as follows: In the initial time step of the algorithm, the proposed PPPM method is initialized with the spectral clustering algorithm; that is, the initial estimation of community is generated at the time t = 1. The advantage of using the spectral clustering algorithm as the initialization algorithm here is that it can prevent the local search from falling into c t poor local maximum in the initial time step. The main procedure of the proposed PPPM method can be shown in Algorithm 2. Initialize c 0 by using spectral clustering applied on W 0 3.
if iteration ≤ max iteration//hill-climbing algorithm 5.ĉ t 0 ← −∞ //negative Log of the best adjacent case till to a constant 6.
. c t ← c t //currently being traversing case 7. for i = 1 to V t do//traverse all adjacent solutions 8.
for j = 1 to k; s.t. c t i = j do 9.
. c t i ← j //change community of a node 10.
. c t i ← c t i //refresh community of current node 15.
ifĉ t 0 >ĉ t then//the best adjacent case is better than the current best case 16.

Results
In order to prove the rationality of the novel proposed method, four algorithms are compared, namely FaceNet [27], SBM + MLE [48], CLBM [49], and PisCES [51]. Firstly, FaceNet was chosen because it was the first proposed dynamic web community discovery algorithm that could be compared as a baseline; secondly, SBM + MLE and CLBM were used because they are the latest proposed probabilistic model-based algorithms; finally, PisCES is also a recently proposed non-probabilistic model algorithm. In this paper, the indicators of the following two evaluation models are adopted.
where E(RI) represents the expected value of RI and max(RI) denotes the maximum value of RI. (2) Mean-squared errors (MSE), the smaller the value, the smaller the error, that is, the better the result.
where y i is the real data,ŷ i expresses the fitting data, and n implies the number of samples.   In Figure 4, there are eight rhombic blocks, and the whole dynamic social network can be represented by the evolution of these eight rhombic blocks over time. The upper part represents the dynamic network of a time step, the lower part denotes the community where the current time step may exist, and the lower part is composed of the nodes with the highest degree of nodes of each color in the network absorbing nearby nodes to form larger nodes. (Absorption rule: connected with the node with the greatest degree and with the same color). For example, in the lower half 1 t = , there are three submodules, each of which represents a possible community. More specifically, each submodule can be composed of nodes of different colors and sizes, and each color can represent nodes with the same characteristics in the dynamic social network. it can be seen that the community structure of dynamic social networks is phylogenetic over time.

Synthetic Networks
The artificial network is generated in this paper, which consists of 128 nodes, initially divided into four communities, where each community has 32 nodes. At the initial time step, the edge probability of the system evolution migration partition is set as  In Figure 4, there are eight rhombic blocks, and the whole dynamic social network can be represented by the evolution of these eight rhombic blocks over time. The upper part represents the dynamic network of a time step, the lower part denotes the community where the current time step may exist, and the lower part is composed of the nodes with the highest degree of nodes of each color in the network absorbing nearby nodes to form larger nodes. (Absorption rule: connected with the node with the greatest degree and with the same color). For example, in the lower half t = 1, there are three submodules, each of which represents a possible community. More specifically, each submodule can be composed of nodes of different colors and sizes, and each color can represent nodes with the same characteristics in the dynamic social network. it can be seen that the community structure of dynamic social networks is phylogenetic over time.

Synthetic Networks
The artificial network is generated in this paper, which consists of 128 nodes, initially divided into four communities, where each community has 32 nodes. At the initial time step, the edge probability of the system evolution migration partition is set as ε 1 pp = 0.3109 and ε 1 pq = 0.0765 (p, q = 1, 2, 3, 4 and p = q). The initial covariance Θ 1 is set to the identity matrix 0.04I. The state vector G evolves according to the Gaussian random walk model, namely H t = I in Equation (15). This paper generates 25 and 50 time steps. At each time step, nodes are randomly selected to leave their communities and randomly assigned to one of the other three communities. Table 2 statistically compares the proposed PPPM method with the average ARI experimental results of multiple parameters of four representative models in an artificial network environment. In Table 2, bold font indicates that the result is the best. It can be clearly seen that the proposed PPPM method has the best performance under all parameters. It can be calculated that the average performance of the novel method is improved by 0.05 compared with the other four best models. Figure 5 shows the comparison of average ARI results between the proposed method and four representative models in an artificial network environment.

022, 12, x FOR PEER REVIEW
13 of 20 calculated that the average performance of the novel method is improved by 0.05 compared with the other four best models. Figure 5 shows the comparison of average ARI results between the proposed method and four representative models in an artificial network environment. As shown in Figure 5a, 25 time steps are generated by the artificial network. On each time step, the randomly selected parameter is set to 10%. In this experiment, the parameters of the noise item are changed at the 15th time step (the left line) and set back to the original state at the 16th time step (the right line). It is evident that in the 15th time step, Figure 5. Comparison of the proposed model with 4 different models on Mean ARI (synthetic network). (a) indicates that the time step is 25 and the randomly selected parameter is 10%, (b) indicates that the time step is 50 and the randomly selected parameter is 10%, (c) indicates that the time step is 25 and the randomly selected parameter is 20%, (d) indicates that the time step is 50 and the randomly selected parameter is 20%.
As shown in Figure 5a, 25 time steps are generated by the artificial network. On each time step, the randomly selected parameter is set to 10%. In this experiment, the parameters of the noise item are changed at the 15th time step (the left line) and set back to the original state at the 16th time step (the right line). It is evident that in the 15th time step, the only two models with SBM and PPPM + MLE line charts show the correct change in trend, i.e., a downward trend, and the proposed PPPM method declines faster, and the increasing trend of PisPCES, CLBM, and FaceNet are unaffected and keep the previous state, after the 16th time step, which also can obviously show that compared with the other four kinds of models, the novel method callback trend is more obvious. This indicates that the proposed novel method has a more consistent response to noise terms.
In Figure 5b, the artificial network generates 50 time steps, randomly selects parameters and sets them to 10%, changes the parameters of the noise item at the 20th time step (the left line), and sets them back to the original state at the 21st time step (the right line). It is evident that in the 20th time step, the only two models with PPPM and CLBM line charts show the correct change trend, i.e., a downward trend, and PPPM declines faster, and PisPCES and FaceNet are on the rise, with SBM CLBM + MLE remaining unaffected and they to keep the previous state, after the 16th time step, which also can obviously show that compared with the other four kinds of models, PPPM callback trend is more apparent in terms of reverting to the previous state, which shows that this paper proposed the model of response that is more consistent in noise.
In Figure 5c, the artificial network generates 25 time steps, randomly selects parameters and sets them to 20%, changes the parameters of the noise item at the 15th time step (the left line), and sets them back to the original state at the 16th time step (the right line). It is obvious that at the 15th time step, the line graph of all models shows the correct trend of change, namely the downward trend. It is worth noting that the downward trend of PPPM is the most obvious. After the 16th time step, it is also obvious that compared with the other four models, the proposed novel method has a more obvious callback trend, which also indicates that the novel method has a more consistent response to the noise term. Figure 5d shows that the artificial network generates 50 time steps, randomly selects the parameters and sets them to 20%, changes the parameters of the noise item at the 20th time step (the left line), and sets them back to the original state at the 21st time step (the right line). It is evident that in the 20th time step, only two models with PPPM and FaceNet line charts show the correct change trend downward trend, and PPPM decline faster, while the remaining three kinds of model, CLBM, SBM + MLE, PisPCES, are not affected, and keep the previous state; after the 16th time step, PPPM and FaceNet all can to go back to the previous state, which demonstrates that the proposed method has a more consistent response in noise.
In conclusion, in the artificial network, this paper proposed a dynamic communityfound PPPM method compared with the other four kinds of a typical model. The model is tested in the perturbation parameter test (the noise is changed in a particular time step). The prediction accuracy of the model index (ARI) increased by 5% on average, and the experimental results show that the proposed model is robust.

MIT Reality Mining
This experiment is conducted on the MIT dataset [52]. The dataset is collected by recording the mobile phone activity of 94 students and employees over a year. The dataset built a dynamic network based on physical distance, which is measured by scanning nearby Bluetooth devices every 5 min. Data collected near the beginning and end of experiments with low participation rates are excluded in this experiment. Each time step corresponds to one week, so there are 37 time steps between August 2004 and May 2005. Figure 6 shows the mean-variance error results of the proposed novel method and four representative models under the artificial network.
recording the mobile phone activity of 94 students and employees over a year. The dataset built a dynamic network based on physical distance, which is measured by scanning nearby Bluetooth devices every 5 min. Data collected near the beginning and end of experiments with low participation rates are excluded in this experiment. Each time step corresponds to one week, so there are 37 time steps between August 2004 and May 2005. Figure 6 shows the mean-variance error results of the proposed novel method and four representative models under the artificial network.   Figure 6 shows that, under the MSE evaluation index, the smaller the error, the better the result; that is, the closer the model image is to the x-axis. Obviously, compared with other colors (the other four models), the image with blue color (the proposed method in this paper) is closer to the x-axis; that is, the proposed PPPM method has a lower MSE value and a smaller error. Table 3 compares the average ARI results of the proposed method with those of the four representative models in the real network (MIT reality mining) environment. In Table 3, the bold font shows that the result is the best, you can clearly see that the proposed PPPM in all parameters (the maximum value; the first 75% of the value; the median; the first 25% of the value; minimum value) cases are the best and clear, the average performance of PPPM performance (median) than the best model is increased by 3% in the other four. Figure 7 shows the comparison of MARI values on the MIT dataset between the proposed method and four state-of-the-art models. Appl. Sci. 2022, 12, x FOR PEER REVIEW 16 of 21 Figure 6 shows that, under the MSE evaluation index, the smaller the error, the better the result; that is, the closer the model image is to the x-axis. Obviously, compared with other colors (the other four models), the image with blue color (the proposed method in this paper) is closer to the x-axis; that is, the proposed PPPM method has a lower MSE value and a smaller error. Table 3 compares the average ARI results of the proposed method with those of the four representative models in the real network (MIT reality mining) environment. In Table 3, the bold font shows that the result is the best, you can clearly see that the proposed PPPM in all parameters (the maximum value; the first 75% of the value; the median; the first 25% of the value; minimum value) cases are the best and clear, the average performance of PPPM performance (median) than the best model is increased by 3% in the other four. Figure 7 shows the comparison of MARI values on the MIT dataset between the proposed method and four state-of-the-art models. In Figure 7, the upper and lower edges of each box in the boxplot represent 25% and 75% values, respectively, and the middle red line denotes the median. It is obvious that PPPM, SEM + MLE, and the three boxes perform better than the other two model boxes. In Figure 7, the upper and lower edges of each box in the boxplot represent 25% and 75% values, respectively, and the middle red line denotes the median. It is obvious that PPPM, SEM + MLE, and the three boxes perform better than the other two model boxes. Among the three models with better performance, the PPPM box position is slightly higher than that of SEM + MLE and PisCES boxes, and the median value is also slightly higher than that of SEM + MLE and PisCES models. In conclusion, compared with the other four representative models in the real network, the proposed dynamic community PPPM method performs better under the two evaluation indexes of prediction accuracy and error.

Enron Email Data
The experiment is conducted on the dynamic social network, which is built by Enron [53], and it consisted of about 500,000 emails between 184 Enron employees from 1998 to 2002. The directional edge between the employee and the time point occurs if at least one email is sent within the first week. Each time step corresponds to an interval of 1 week. This dataset does not distinguish between emails sent to "recipients," "CC" or "BCC." In addition to email dataset, most employee roles (such as CEO, president, manager, employee) exist within the company and they are used as known communities. The first 56 weeks and the last 13 weeks are filtered because only a few emails are sent. Figure 8 compares the estimated community probability between a normal week and an event week. The higher the probability, the higher the community activity. Both the x-axis and the y-axis denote the estimated communities, and the color blocks on the diagonals express the activity within each community, and the color blocks of the diagonals imply the activity between each community. The experiment is conducted on the dynamic social network, which is built by Enron [53], and it consisted of about 500,000 emails between 184 Enron employees from 1998 to 2002. The directional edge between the employee and the time point occurs if at least one email is sent within the first week. Each time step corresponds to an interval of 1 week This dataset does not distinguish between emails sent to "recipients," "CC" or "BCC." In addition to email dataset, most employee roles (such as CEO, president, manager, em ployee) exist within the company and they are used as known communities. The first 56 weeks and the last 13 weeks are filtered because only a few emails are sent. Figure 8 com pares the estimated community probability between a normal week and an event week The higher the probability, the higher the community activity. Both the x-axis and the y axis denote the estimated communities, and the color blocks on the diagonals express the activity within each community, and the color blocks of the diagonals imply the activity between each community. As shown in Figure 8a, in a normal week (week 59), the president community is the most active, followed by managers and employees, and the CEO community is the least active. It is also worth noting that from the color block distribution of managers and employees, the two communities may merge into one large community. This phenomenon can be reflected in the fact that communication between department managers and employees is usually close, and managers and employees are more likely to get along with each other. As shown in Figure 8b, in the event week (the 89 weeks when CEO Jeffrey resigns), the most active community is that of the managers, followed by president community, and the brightest color block is the managers to the employee community. This is reflected in the fact that in real life when CEOs resign, the discussion is most intense among managers because it is directly related to their personal interests. Discussions between managers and employees also proliferate for the same simple reason that it is indi- As shown in Figure 8a, in a normal week (week 59), the president community is the most active, followed by managers and employees, and the CEO community is the least active. It is also worth noting that from the color block distribution of managers and employees, the two communities may merge into one large community. This phenomenon can be reflected in the fact that communication between department managers and employees is usually close, and managers and employees are more likely to get along with each other. As shown in Figure 8b, in the event week (the 89 weeks when CEO Jeffrey resigns), the most active community is that of the managers, followed by president community, and the brightest color block is the managers to the employee community. This is reflected in the fact that in real life when CEOs resign, the discussion is most intense among managers because it is directly related to their personal interests. Discussions between managers and employees also proliferate for the same simple reason that it is indirectly related to the employees' personal interests. Figure 9 reveals the estimated edge connections between communities in Enron's email network under the proposed dynamic community's discovery approach PPPM and shows a 95% confidence interval (note: the lines on the left and right of the figure are for weeks 59 and 89, respectively). As shown in Figure 9a, it is the edge probability of presidents to CEOs; it can be seen that the presidents to CEOs edge probability increased slightly at week 59 (normal) and 89 (Jeffrey CEO resigned), which corresponds to presidents to CEOs activity (increased) from Figure 8a,b. Figure 9b shows the side probability within the president community In the 59th and 89th weeks, the side probability inside the community shows a downward trend. This also corresponds to the active state (decreased activity) within the president community from Figure 8a,b. Figure 9c,d show the side probabilities between managers and the manager community and between the managers and the employee community respectively. It can be seen that in the 59th and 89th weeks, the changing trend of the side probabilities of these two communities is consistent with that in Figure 9a.
Similarly, this change also corresponds to the changes in active state between managers and manager community and the employee community (increased activity) from Figure 8a,b. Figure 9e shows the edge probability between the employees and the manager community. It is not difficult to see that there is no obvious trend of change in week 59 and 89. Similarly, this situation also corresponds to the consistent change in the active state between the employees and the manager community from Figure 8a,b (there is no significant change in the activity). Finally, Figure 9f shows the edge probability between employees and the employee community. In week 59 and 89, similarly, the changing trend of edge probability of these two communities is consistent with the change in Figure 9b As shown in Figure 9a, it is the edge probability of presidents to CEOs; it can be seen that the presidents to CEOs edge probability increased slightly at week 59 (normal) and 89 (Jeffrey CEO resigned), which corresponds to presidents to CEOs activity (increased) from Figure 8a,b. Figure 9b shows the side probability within the president community. In the 59th and 89th weeks, the side probability inside the community shows a downward trend. This also corresponds to the active state (decreased activity) within the president community from Figure 8a,b. Figure 9c,d show the side probabilities between managers and the manager community and between the managers and the employee community, respectively. It can be seen that in the 59th and 89th weeks, the changing trend of the side probabilities of these two communities is consistent with that in Figure 9a.
Similarly, this change also corresponds to the changes in active state between managers and manager community and the employee community (increased activity) from Figure 8a,b. Figure 9e shows the edge probability between the employees and the manager community. It is not difficult to see that there is no obvious trend of change in week 59 and 89. Similarly, this situation also corresponds to the consistent change in the active state between the employees and the manager community from Figure 8a,b (there is no significant change in the activity). Finally, Figure 9f shows the edge probability between employees and the employee community. In week 59 and 89, similarly, the changing trend of edge probability of these two communities is consistent with the change in Figure 9b; namely, it displays the downward trend. At the same time, it also corresponds to the consistent change in active state between employees and the employee community from Figure 8a,b (decreased activity).
To sum up, the proposed PPPM method can well reflect some phenomena existing in the real network, and the probability estimated by the novel method can make relatively consistent predictions with the advance of time and the occurrence of specific events.

Conclusions
The proposed model has practical theoretical and practical significance to mine and it also simulates deeper hidden information that is present in dynamic social networks. At present, the dynamic social network community discovery method cannot effectively represent the entire dynamic network evolution process. Therefore, inspired by the evolution theory of natural biosensors, this paper proposes a community discovery method based on phylogenetic planted partition. Firstly, the time dimension is added to the transplant partition model, all states in the whole dynamic network system are treated as variables, the observation equation is used as a constraint between variables, and an error function about the whole dynamic network system is constructed. Then, the quadratic form of the error function is minimized, which can abstract the observation results of the network more realistically. Secondly, a graph optimization strategy is used to consider the constraints in the whole motion trajectory at one time, and the Jacobian matrix is calculated during the linearization process. Because the calculation process is relative to the whole motion trajectory, the whole system evolution process is transformed into a nonlinear system optimization process. The gradient of the error function is obtained by using the L-M method, and then the iteration is carried out according to the direction of the gradient; finally, the proposed method is compared with four state-of-the-art representative models under two scenarios of artificial network and real network. The experimental results show that the PPPM method has better performance than the other four representative models in building a dynamic network model and mining dynamic network hidden information.
Next, this paper will consider how to integrate the multi-layer model mechanism into the proposed model and will study dynamic network hiding information with multi-layer information in future research.