Next Article in Journal
Statistical and Proactive Analysis of an Inter-Laboratory Comparison: The Radiocarbon Dating of the Shroud of Turin
Previous Article in Journal
Time-Limited Codewords over Band-Limited Channels: Data Rates and the Dimension of the W-T Space
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Characterizing the Complexity of Weighted Networks via Graph Embedding and Point Pattern Analysis

1
Division of Biostatistics and Bioinformatics, Department of Epidemiology and Public Health, School of Medicine, University of Maryland, Baltimore, MD 21201, USA
2
Maryland Psychiatric Research Center, Department of Psychiatry, School of Medicine, University of Maryland, Baltimore, MD 21228, USA
3
Department of Accounting, College of Business and Economics, Towson University, Towson, MD 21252, USA
4
Department of Mathematics, University of Maryland, College Park, MD 20742, USA
*
Author to whom correspondence should be addressed.
Entropy 2020, 22(9), 925; https://doi.org/10.3390/e22090925
Submission received: 18 July 2020 / Revised: 13 August 2020 / Accepted: 21 August 2020 / Published: 23 August 2020

Abstract

:
We propose a new metric to characterize the complexity of weighted complex networks. Weighted complex networks represent a highly organized interactive process, for example, co-varying returns between stocks (financial networks) and coordination between brain regions (brain connectivity networks). Although network entropy methods have been developed for binary networks, the measurement of non-randomness and complexity for large weighted networks remains challenging. We develop a new analytical framework to measure the complexity of a weighted network via graph embedding and point pattern analysis techniques in order to address this unmet need. We first perform graph embedding to project all nodes of the weighted adjacency matrix to a low dimensional vector space. Next, we analyze the point distribution pattern in the projected space, and measure its deviation from the complete spatial randomness. We evaluate our method via extensive simulation studies and find that our method can sensitively detect the difference of complexity and is robust to noise. Last, we apply the approach to a functional magnetic resonance imaging study and compare the complexity metrics of functional brain connectivity networks from 124 patients with schizophrenia and 103 healthy controls. The results show that the brain circuitry is more organized in healthy controls than schizophrenic patients for male subjects while the difference is minimal in female subjects. These findings are well aligned with the established sex difference in schizophrenia.

1. Introduction

The research of complex networks has attracted significant attention in the last few decades. Complex networks are a natural representation of real-world interactive processes among multiple units [1]. For example, social, financial, gene-regulation, and brain networks are complex networks, which are neither purely random nor regular [2,3,4,5]. Complex networks consist of organized (although often latent) network topological structures, and they exhibit properties such as scale-free and small-worldness [6]. Analytical models have become fundamental tools to characterize the complex structure and intrinsic mechanisms of complex networks [7,8].
The quantification of the intrinsic network complexity of complex networks is a fundamental problem in network analysis. The complexity, as a permutation invariant graph characteristic, has proved to be more effective than a number of simple alternatives, including edge density, perimeter, and path length. Recently, advanced computational models have been developed to calculate the von Neumann entropy and Shannon entropy of an unweighted graph/network [9,10,11]. The network entropy provides a robust measurement of the complexity of a weighted network/graph. The network entropy can also be used as a graph descriptive statistic and applied to group-level network analysis, for example, to compare the complexities between networks that were collected from two cohorts of subjects: a group of patients with brain diseases and a group of healthy controls [12].
The brain network analysis has become an active research area in the last decade [13]. The recent development in neuroimaging technology has facilitated the non-invasive measurement of brain circuitry. For example, diffusion tensor imaging (DTI) and tractography can be used to construct white matter structural networks, while the coactivation patterns of blood-oxygen-level-dependent (BOLD) signals from functional magnetic resonance imaging (fMRI) can be calculated in order to estimate functional brain networks [14]. The human brain network is a complex network, where brain areas are denoted by graph nodes, and the structural/functional connections are represented by edges [4]. The brain network consists of complex, organized, yet latent topological structures and exhibits network properties, including scale-free, small-worldness, and high-modularity [3]. These network characteristics have been associated with brain diseases and behaviors. In these studies, brain imaging data are acquired for each individual to compute the brain network, and then network complexity scores can be calculated for all participants to perform group-wise statistical analyses (see Figure 1).
The network entropy-based complexity measurement is undoubtedly critical and informative because the complex human cognitive functions and behaviors are associated with organized brain networks [15]. However, the calculation of complexity of brain networks has been challenging because (i) edge weights in the brain network are continuous variables following unknown distributions (e.g., an infinite mixture model of multivariate distributions with unknown covariance); (ii) the empirical distributions of edges can be misleading due to the substantial noise of edge; and, (iii) edges are correlated with each with a large and unknown covariance structure constrained by network and spatial structures. Moreover, simply binarizing weighted edges into binary edges by an arbitrary threshold is subject to massive information loss and reduced statistical power. A straightforward alternative approach to measure the complexity and non-randomness of a weighted network is to directly extract the network topological structure from the weighted adjacency matrix. However, the false positive and false negative noise in the edge weights can easily distort the recovery of the true underlying topological structure. This is more challenging when the structure is complicated, such as a mixture of k-partite and rich-club [16]. To address this unmet need, we propose a new computational method to measure the complexity via graph embedding and statistical point pattern analysis. We first project each weighted complex network into a low dimensional vector space and perform point pattern analysis via the statistical point process. The distribution of points from an unorganized Erdős Renyi random weighted graph is distinct from an organized brain network, which can be reflected by our complexity measurement. Note that our method does not require learning the distribution of edges in the weighted adjacency matrix nor the underlying topological structure, thus it is computationally convenient and is robust to false positive noise.
The rest of this paper is organized, as follows. In Section 2, we introduce the definition of the metric of complexity for weighted complex network. Section 3 describes the validation of the proposed approach using various simulated data and the results of an application to schizophrenia functional network analysis. The results, for the first time, reveal that the complexity of brain networks significantly differs between male healthy controls and schizophrenic patients, whereas they are similar in females, which may further explain the well-established difference between sex in schizophrenia.

2. Methods

2.1. Background

The graph notation G = { V , E , W } is often used to denote the weighted network, where V is the set of nodes/vertices, E is the edge set (links between nodes). We use a weighted adjacency matrix W n × n to denote the edge weights, where an entry w i j is a real number and represents the connectivity strength between the two nodes i , j , 0 < j < i < n . For example, the correlation coefficient can be used and 1 w i j 1 .
The network entropy has been well established for binary networks that are based on a Bernoulli distribution model [9,11]. However, the network entropy for the weighted network is challenging because the edge weights in W a mixture multivariate normal distributions with a large and unknown covariance matrix (the number of parameters is at the order of n 4 ) and unknown mixture component [17]. The parameter estimation is often intractable, and the computation of entropy is then challenging.
In the current research, we are facing a more challenging task, because our data are a group of weighted networks. For a group-level brain connectivity analysis, we denote the data { W n × n 1 , W n × n 2 , , W n × n S } . For one study subject, we have a weighted network W n × n s , s = 1 , , S , and a vector of clinical covariates Z s (e.g., clinical status, age, and sex). The complexity of W n × n s can represent the level of brain organization and efficiency, which influences the symptoms of brain disorders (e.g., cognitive deficit).
We develop a novel framework to quantify the complexity and non-randomness based on graph embedding and point pattern analysis to overcome the computational challenge and understand the association between the organization of brain connectivity networks and the clinical symptoms of mental disorders.

2.2. Graph Embedding of a Weighted Network

Graph embedding projects a graph into a low dimensional vector space R k while preserving the graph information and facilitates the efficient computation of graph analytics [18]. A key advantage of graph embedding in our application is its invariance to the isomorphic mapping of the graph: the projection of our weighed network in a low dimensional vector space remains unchanged when the order of nodes shuffles (see Figure 2). This property alleviates the challenging task of extracting complicated and unknown topological structure from the weighted graph W, because the distribution of points in R k is independent of the detection of graph topological structure (e.g., allocating points to clusters). The complexity and non-randomness of the original weighted network can be well captured by the points in R k .
Graph embedding maps the weighed network W : N × N R and a set of nodes V = { v 1 , v 2 , , v n } into vectors X ˜ k × n = { X 1 , X 2 , , X n } in R k , such that
arg min X ˜ | | X ˜ T X ˜ 1 2 JWJ | | F ,
where J = I n 1 / n 1 n 1 n t and I n and 1 n are the identity matrix and a vector of ones respectively. X ˜ k × n provide a set of k-dimensional coordinates for the nodes V = { v 1 , v 2 , , v n } . Next, we apply spatial statistical methods to analyze the pattern of points in the R k space and measure the complexity and non-randomness of the connectivity that is based on the point distribution patterns.

2.3. Point Pattern Analysis in the R k Space

In the R k space, each point represents a node in the weighted network. The distribution patterns of the points in R k reflect their connectivity in the weighted adjacency matrix. For example, the points that are mapped from a random weighted graph show a point pattern of complete spatial randomness (CSR). In contrast, points mapped from a weighted network with an organized topological structure show a clustered pattern (see Figure 3).
In spatial statistics, stochastic point processes, for example, the Poisson process and Cox process, are often used to model the point patterns [19]. In that, the number of nodes located within a radius of r centered at a point/node i follows a Poisson distribution. Under the assumption of a random graph (i.e., CSR in R k ), the point process can be modeled as a homogeneous point process with the density λ . Hence, we can use the cross entropy (CE) between the observed point pattern from an organized weighted network and the point process by the random graph to characterize the complexity of a weighted network. Specifically, the cross entropy is
L ( r ) = 1 n i = 1 n j i { I ( d i j < r ) } log ( π r 2 λ ) ,
where r is the radius, d i j denotes the Euclidean distance between points i and j, λ is density parameter, and π r 2 λ is the expected number of points within an area centered at point i and radius r. In practice, we estimate L ( r ) similarly to the estimate of Ripley’s K function since L ( r ) = c K ( r ) (c is a constant). The Poisson distribution for the nodes at the boundary of the point distributed area can be distorted because of the edge effect [20]. The distribution function presented in Figure 3d is example of the boundary/edge effect. Thus, the boundary/edge effect can be corrected, which also has been well developed by the Ripley’s K function estimation. Specifically, the K function is estimated by
K ^ ( r ) = λ 1 1 n i = 1 n j i η i j { I ( d i j < r ) } ,
where the weight function η i j provides the edge correction. Then, we can conveniently calculate L ^ ( r ) = c 1 K ^ ( r ) . Last, we integrate the measure by r and name the measurement as graph embedding based point process cross entropy (Geme ).
Geme ^ = L ^ ( r ) d r ,
Geme is built on the contrast between the observed point pattern of the weighted network vs. the point pattern of CSR (projected from a non-organized weighted network random graph) in the R k space. In the low dimensional space R k , the complexity of weighted networks is well captured by the point/node distribution patterns because the Euclidean distance between nodes can appropriately represent the edge weights in the weighted adjacency matrix. The point patterns hence reflect the complexity of weighted networks. The point process methods developed in spatial statistics then provide a convenient pathway to characterize the ‘complexity’ of the point patterns. The computational complexity of Geme is O ( n 2.367 ) , and thus Geme can be scalable to large networks. In summary, Geme is a new metric to measure the complexity and non-randomness of a weighted network and graph. Geme can avoid the arbitrary cut-off to binarize a weighted graph to a binary graph and is invariant to all isomorphic forms of a weighted graph and can quantify the complexity robustly.

3. Results

3.1. Simulations

We first validate and evaluate the proposed complexity metric Geme using synthetic data sets. In the current research, we focus on group level comparison and test whether Geme can accurately capture the difference of complexity between groups of networks with different topological structures. We illustrate examples using the commonly used two group setting; however, the conclusion can be easily extended to the regression setting. Specifically, we simulate a set of weighted networks (i.e., brain networks for multiple subjects) from two covariance matrices Σ n × n 0 Σ n × n 1 , where n is the number of nodes. For subjects s = 1 , 2 , , S 0 in group one, we simulate Z n × T s N ( μ 0 , Σ n × n 0 ) and Z n × T s N ( μ 1 , Σ n × n 1 ) with s = S 0 + 1 , S 0 + 2 , , S 0 + S 1 . T is the number of volumes of the image, for example, the number of time points in fMRI data. In this application, we let S 0 = S 1 = 100 . For the sake of simplicity, we let μ 0 = μ 1 = 0 . For a subject s, we can calculate the weighted network by W n × n s = d i a g ( U ) 1 / 2 U d i a g ( U ) 1 / 2 , U = 1 T 1 Z s ( Z s ) T for each subject. As a result, we obtain a set of weighted adjacency matrix { W 1 , , W S 0 , W S 1 + 1 , , W S 1 } . We set the two covariance matrices Σ 0 and Σ 1 as
Σ 0 = I a 1 ( 1 ρ 1 ) + J a 1 ρ 1 I a 2 ( 1 ρ 2 ) + J a 1 ρ 2 1 , Σ 1 = I b 1 ( 1 ζ 1 ) + J b 1 ζ 1 I b 2 ( 1 ζ 2 ) + J b 1 ζ 2 1 .
Both matrices exhibit a community network structure, where { a 1 , a 2 } and { b 1 , b 2 } are sizes of community subnetworks and { ρ 1 , ρ 2 } and { ζ 1 , ζ 2 } are correlations between nodes in the communities. The off-diagonal blocks are zeros. We simulate data using different settings by set various values of { a 1 , a 2 } and { b 1 , b 2 } , and { ρ 1 , ρ 2 } and { ζ 1 , ζ 2 } . The larger sizes of communities indicate a greater complexity of the weighted network because it is more organized. We consider the variation of correlations as noises because point pattern in the low dimensional space is more clear when { ρ 1 , ρ 2 } and { ζ 1 , ζ 2 } are larger and more likely to distinguish from false positive edges. Because we have two matrices, we set Σ 0 as fixed and vary the parameters in Σ 0 . We let { a 1 , a 2 } = { 20 , 30 } and { b 1 , b 2 } = { 20 , 30 } , { 20 , 20 } , { 10 , 15 } . We set Σ 0 as a fixed matrix with { ρ 1 , ρ 2 } = { 0.5 , 0.6 } and vary the parameters in Σ 1 by letting { ζ 1 , ζ 2 } = { 0.5 , 0.6 } , { 0.4 , 0.5 } , { 0.3 , 0.4 } , { 0.2 , 0.3 } . For each setting, we simulate { Z s } for 100 times, calculate Geme for all subjects, and then test whether the difference of complexity of weighted networks (Geme) can be detected between the two groups. In the Method section, we demonstrate one example subject with the parameters of { ρ 1 , ρ 2 } = { 0.6 , 0.5 } and { a 1 , a 2 } = { 20 , 30 } in Figure 2 and Figure 3. The average computational time for a subject with n = 100 nodes is 0.11 s on a PC with a CPU i7 3.6G HZ and 64G ROM.
The results presented in Table 1 show that Geme can sensitively reflect the complexity of weighted networks. When the difference of complexity between the two groups of weighted networks is small (e.g., one subnetwork size in group one is larger than group two by ten nodes), we have the power of 73 % ± 3.2 % to detect the difference of complexity between the two groups of weighted networks with a statistical significance α = 0.05 , when ( ζ 1 , ζ 2 ) = ( ρ 1 , ρ 2 ) . The power is reduced when ( ζ 1 , ζ 2 ) < ( ρ 1 , ρ 2 ) and connected edges are less different from unconnected edges. We consider the signal to noise ratio (SNR) in a weighted network is lower when ( ζ 1 , ζ 2 ) is closer to zero. When the difference of complexity is larger (i.e., the subnetwork size difference is larger), we are more likely to detect the difference using Geme. When the subnetwork difference is about 20, we have a power of 100%. In addition, the statistical power is related to the sample size, and we use a sample size that is close to our motivation data set.

3.2. Application to Brain Network Analysis for Schizophrenia Research

We apply the proposed new metric to a resting state fMRI data that were collected from 103 patients with schizophrenia (SZ) and 124 healthy controls (HC). This average age of patients with schizophrenia is 36.88 ± 14.17, and 33.75 ± 14.22 for healthy controls. There are 62 males and 41 females in the SZ cohort and 61 males and 63 females in the HC cohort. There are no systematic differences in age ( p = 0.10) or gender (p = 0.13) between the two groups. We would refer readers to Adhikari et al. [21] for the details of imaging acquisition and preprocessing procedure. We denote the nodes of a network based on a brain connectivity-based atlas that parcellates the brain into 246 regions of interest (ROIs) (see http://atlas.brainnetome.org/ and [22]). The functional connection (edge weight) between a pair of nodes for each subject is calculated by the covariation between averaged time series from the two corresponding brain ROIs. The Fisher’s Z transformed Pearson correlation coefficient is then used as the edge connection strength and, thus, a group of weighted networks { W 246 × 246 s } . Our goal is to examine whether the complexity of weighted networks is different between SZ and HC.
We first map each weighted network to a low dimensional space R k via graph embedding and then calculate the Geme metric for each subject. Further, we perform regression analysis to examine the association between disease and network complexity while adjusting for age and sex. We use a step-wise regression model selection procedure to determine the optimal model regarding the interaction terms. In the final model, the sex and disease interaction term is included. The results show that the interaction term is significant ( p = 0.032 ). The network complexity difference between HC and SZ in males is significant ( p = 0.0026 ), whereas the group difference not significant in females ( p = 0.90 ) based on the stratified analysis. Figure 4 shows the group difference in males and females.

4. Discussion

We have developed a new metric Geme to characterize the complexity of the weighted complex network. This current research is motivated by a group-level brain network study, which includes a set of weighted brain networks with different clinical statuses. We aim to investigate whether brain disorder can be associated with the complexity of the functional brain networks. The weighted network is different from the binary network, as the values of edges are continuous variables. Unlike binary edges, the distribution of continuous edge variables is unknown because it can be a mixture distribution with infinite unknown components. Thus, the existing network entropy metrics are not applicable. On the other hand, the extraction of network topological structure from the weighted network can be difficult due to the distortion of false positive and negative noise. To overcome these challenges, we implement an integrative procedure of graph embedding and statistical point pattern analysis that is invariant to the permutation of the network. Interestingly, we find that the point distribution pattern of points projected from the nodes of weighted networks in the low dimensional space can reflect the complexity of weighted networks. We utilize the statistical techniques for point process analysis to implement a cross entropy metric in order to capture the deviation of point patterns in complex networks from random graph networks. We propose a computationally efficient method to calculate the metric. We perform extensive simulation analysis to validate the new metric. The results show that our metric can sensitively detect the difference in network complexity based weighted network data. We also note that this method is rust to noise.
We further apply the new metric to our data example. In this study, brain imaging data and weighted brain networks were acquired for each of the 124 patients with SZ and 103 healthy controls. We calculate the network complexity metric for each subject and perform statistical analysis. The results reveal new neuropathological findings that the complexity of functional brain networks is significantly higher for HC than SZ in male subjects, while the difference is not significant in females. The sex difference in schizophrenia has been long established [23,24]. Many studies have reported that onset time of females are later than males [25,26]. In addition, the symptoms of female patients with SZ are milder than males [27,28,29]. These external phenotypes can be well reflected by the organization and the complexity of functional connectivity networks [30]. Our findings may provide a viable neurological explanation for the long-established sex difference in schizophrenia. The functional brain networks are more disturbed and less organized in male patients than female patients when comparing to healthy controls, which is associated with more severe symptoms regarding cognitive functions, anhedonia, and social functioning in male patients. We can further the location-specific edges and nodes that cause the complexity difference using recently developed network methods [16,17,31].
In the data example, we show the application of Geme to a functional brain connectivity study. In general, Geme can be applied to any weighted (functional and structural) network with a positive semidefinite weighted adjacency matrix. In white matter structural network analysis, we find that clinical symptoms (e.g., cognitive functions) are related to the SC connection strengths (e.g., fractional anisotropy—FA levels) instead of the network complexity [32]. Therefore, these SC connection differences cannot be detected Geme. In contrast, the complexity of functional networks is more informative than the absolute connectivity strengths because the connectivity scale can be influenced by procedures, like global regression. Thus, Geme is more suited to identify the difference of network organization patterns for functional brain network analysis.
In summary, Geme is a computationally efficient metric for measuring the complexity of weighted networks. Geme can become a complement to the existing von Neumann network entropy metrics [11]. The graph embedding and point pattern analysis strategy may also provide an alternative pathway for network entropy analysis in addition to the exiting spectral entropy methods.

Author Contributions

Conceptualization, S.C.; methodology, S.C.; simulation and motivation data analysis, S.C., Z.Z., C.M. and Q.W.; biological explanation and data acquisition, P.K. and L.E.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by an NIH grant 1DP1DA048968-01.

Conflicts of Interest

L.E.H. has received or plans to receive research funding or consulting fees on research projects from Mitsubishi, Your Energy Systems LLC, Neuralstem, Taisho, Heptares, Pfizer, Luye Pharma, Sound Pharma, Takeda, and Regeneron. None was involved in the design, analysis or outcomes of the study.

References

  1. Latora, V.; Nicosia, V.; Russo, G. Complex Networks: Principles, Methods and Applications; Cambridge University Press: Cambridge, UK, 2017. [Google Scholar]
  2. Yang, Y.; Yang, H. Complex network-based time series analysis. Phys. A Stat. Mech. Its Appl. 2008, 387, 1381–1386. [Google Scholar] [CrossRef]
  3. Rubinov, M.; Sporns, O. Complex network measures of brain connectivity: Uses and interpretations. Neuroimage 2010, 52, 1059–1069. [Google Scholar] [CrossRef] [PubMed]
  4. Simpson, S.L.; Bowman, F.D.; Laurienti, P.J. Analyzing complex functional brain networks: Fusing statistics and network science to understand the brain. Stat. Surv. 2013, 7, 1. [Google Scholar] [CrossRef] [PubMed]
  5. Pagani, G.A.; Aiello, M. The power grid as a complex network: A survey. Phys. A Stat. Mech. Its Appl. 2013, 392, 2688–2700. [Google Scholar] [CrossRef] [Green Version]
  6. Sporns, O. The human connectome: A complex network. Ann. N. Y. Acad. Sci. 2011, 1224, 109–125. [Google Scholar] [CrossRef] [PubMed]
  7. Motter, A.E.; Zhou, C.; Kurths, J. Enhancing complex-network synchronization. EPL 2005, 69, 334. [Google Scholar] [CrossRef] [Green Version]
  8. Csardi, G.; Nepusz, T. The igraph software package for complex network research. InterJ. Complex Syst. 2006, 1695, 1–9. [Google Scholar]
  9. Passerini, F.; Severini, S. The von Neumann entropy of networks. arXiv 2008, arXiv:0812.2597. [Google Scholar] [CrossRef] [Green Version]
  10. Estrada, E. The Structure of Complex Networks: Theory and Applications; Oxford University Press: Oxford, UK, 2012. [Google Scholar]
  11. Han, L.; Escolano, F.; Hancock, E.R.; Wilson, R.C. Graph characterizations from von Neumann entropy. Pattern Recognit. Lett. 2012, 33, 1958–1967. [Google Scholar] [CrossRef]
  12. Bassett, D.S.; Sporns, O. Network neuroscience. Nat. Neurosci. 2017, 20, 353–364. [Google Scholar] [CrossRef] [Green Version]
  13. Sporns, O.; Chialvo, D.R.; Kaiser, M.; Hilgetag, C.C. Organization, development and function of complex brain networks. Trends Cogn. Sci. 2004, 8, 418–425. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Bowman, F.D.; Zhang, L.; Derado, G.; Chen, S. Determining functional connectivity using fMRI data with diffusion-based anatomical weighting. NeuroImage 2012, 62, 1769–1779. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Viol, A.; Palhano-Fontes, F.; Onias, H.; de Araujo, D.B.; Hövel, P.; Viswanathan, G.M. Characterizing complex networks using entropy-degree diagrams: Unveiling changes in functional brain connectivity induced by Ayahuasca. Entropy 2019, 21, 128. [Google Scholar] [CrossRef] [Green Version]
  16. Chen, S.; Bowman, F.D.; Xing, Y. Detecting and testing altered brain connectivity networks with k-partite network topology. Comput. Stat. Data Anal. 2020, 141, 109–122. [Google Scholar] [CrossRef]
  17. Chen, S.; Xing, Y.; Kang, J.; Kochunov, P.; Hong, L.E. Bayesian modeling of dependence in brain connectivity data. Biostatistics 2020, 21, 269–286. [Google Scholar] [CrossRef]
  18. Cai, H.; Zheng, V.W.; Chang, K.C.C. A comprehensive survey of graph embedding: Problems, techniques, and applications. IEEE Trans. Knowl. Data Eng. 2018, 30, 1616–1637. [Google Scholar] [CrossRef] [Green Version]
  19. Waller, L.A.; Gotway, C.A. Applied Spatial Statistics for Public Health Data; John Wiley & Sons: Hoboken, NJ, USA, 2004; Volume 368. [Google Scholar]
  20. Ripley, B.D. Spatial Statistics; John Wiley & Sons: Hoboken, NJ, USA, 2005; Volume 575. [Google Scholar]
  21. Adhikari, B.M.; Hong, L.E.; Sampath, H.; Chiappelli, J.; Jahanshad, N.; Thompson, P.M.; Rowland, L.M.; Calhoun, V.D.; Du, X.; Chen, S.; et al. Functional network connectivity impairments and core cognitive deficits in schizophrenia. Hum. Brain Mapp. 2019, 40, 4593–4605. [Google Scholar] [CrossRef] [Green Version]
  22. Fan, L.; Li, H.; Zhuo, J.; Zhang, Y.; Wang, J.; Chen, L.; Yang, Z.; Chu, C.; Xie, S.; Laird, A.R.; et al. The human brainnetome atlas: A new brain atlas based on connectional architecture. Cereb. Cortex 2016, 26, 3508–3526. [Google Scholar] [CrossRef]
  23. Castle, D.J.; Murray, R.M. The neurodevelopmental basis of sex differences in schizophrenia. Psychol. Med. 1991, 21, 565–575. [Google Scholar] [CrossRef] [Green Version]
  24. Abel, K.M.; Drake, R.; Goldstein, J.M. Sex differences in schizophrenia. Int. Rev. Psychiatry 2010, 22, 417–428. [Google Scholar] [CrossRef]
  25. Loranger, A.W. Sex difference in age at onset of schizophrenia. Arch. Gen. Psychiatry 1984, 41, 157–161. [Google Scholar] [CrossRef] [PubMed]
  26. Aleman, A.; Kahn, R.S.; Selten, J.P. Sex differences in the risk of schizophrenia: Evidence from meta-analysis. Arch. Gen. Psychiatry 2003, 60, 565–571. [Google Scholar] [CrossRef] [PubMed]
  27. Rubin, L.H.; Haas, G.L.; Keshavan, M.S.; Sweeney, J.A.; Maki, P.M. Sex difference in cognitive response to antipsychotic treatment in first episode schizophrenia. Neuropsychopharmacology 2008, 33, 290–297. [Google Scholar] [CrossRef] [PubMed]
  28. Leung, A.; Chue, P. Sex differences in schizophrenia, a review of the literature. Acta Psychiatr. Scand. 2000, 101, 3–38. [Google Scholar] [CrossRef]
  29. Guillem, F.; Mendrek, A.; Lavoie, M.E.; Pampoulova, T.; Stip, E. Sex differences in memory processing in schizophrenia: An event-related potential (ERP) study. Prog. Neuro-Psychopharmacol. Biol. Psychiatry 2009, 33, 1–10. [Google Scholar] [CrossRef]
  30. Sokunbi, M.O.; Gradin, V.B.; Waiter, G.D.; Cameron, G.G.; Ahearn, T.S.; Murray, A.D.; Steele, D.J.; Staff, R.T. Nonlinear complexity analysis of brain FMRI signals in schizophrenia. PLoS ONE 2014, 9, e95146. [Google Scholar] [CrossRef] [Green Version]
  31. Chen, S.; Kang, J.; Xing, Y.; Wang, G. A parsimonious statistical method to detect groupwise differentially expressed functional connectivity networks. Hum. Brain Mapp. 2015, 36, 5196–5206. [Google Scholar] [CrossRef] [Green Version]
  32. Kochunov, P.; Coyle, T.R.; Rowland, L.M.; Jahanshad, N.; Thompson, P.M.; Kelly, S.; Du, X.; Sampath, H.; Bruce, H.; Chiappelli, J.; et al. Association of white matter with core cognitive deficits in patients with schizophrenia. JAMA Psychiatry 2017, 74, 958–966. [Google Scholar] [CrossRef]
Figure 1. Functional brain connectivity can be denoted by a weighted network, where each node is a brain area and the continuous edge weight represents the connection strength between a pair of brain locations (e.g., calculated by the Fisher’s Z transformed Pearson correlation coefficient between the two blood-oxygen-level-dependent (BOLD) time sequences).
Figure 1. Functional brain connectivity can be denoted by a weighted network, where each node is a brain area and the continuous edge weight represents the connection strength between a pair of brain locations (e.g., calculated by the Fisher’s Z transformed Pearson correlation coefficient between the two blood-oxygen-level-dependent (BOLD) time sequences).
Entropy 22 00925 g001
Figure 2. The demonstration of permutation invariance for points (nodes) in a low dimensional vector space R k via graph embedding. (a,b) are the weighted adjacency matrix for two isomorphic weighted graphs, in that, (b) can be transformed to (a) by rearranging the order of nodes. Nodes in (a,b) are projected into R k (k = 2 for demonstration). The distribution pattern of nodes in R k for the two weighted networks. (c,d) are contour plots, and (e,f) are density plots, which all show very similar point patterns of two weighted networks. Therefore, the point pattern in R k can reflect the intrinsic complexity and is permutation invariant.
Figure 2. The demonstration of permutation invariance for points (nodes) in a low dimensional vector space R k via graph embedding. (a,b) are the weighted adjacency matrix for two isomorphic weighted graphs, in that, (b) can be transformed to (a) by rearranging the order of nodes. Nodes in (a,b) are projected into R k (k = 2 for demonstration). The distribution pattern of nodes in R k for the two weighted networks. (c,d) are contour plots, and (e,f) are density plots, which all show very similar point patterns of two weighted networks. Therefore, the point pattern in R k can reflect the intrinsic complexity and is permutation invariant.
Entropy 22 00925 g002
Figure 3. The demonstration of point patterns in a low dimensional vector space R k for an organized graph (a) vs. a random graph (b) with an identical edge weight distribution. Nodes in (a,b) are projected into R k (k = 2 for demonstration). The distribution pattern of nodes in R k for the two weighted networks. (c,d) are contour plots, and (e,f) are density plots. Plots (cf) show distinct point patterns between the weighted network with an organized structure and a random weighted graph. The difference can be quantified by a cross entropy measurement.
Figure 3. The demonstration of point patterns in a low dimensional vector space R k for an organized graph (a) vs. a random graph (b) with an identical edge weight distribution. Nodes in (a,b) are projected into R k (k = 2 for demonstration). The distribution pattern of nodes in R k for the two weighted networks. (c,d) are contour plots, and (e,f) are density plots. Plots (cf) show distinct point patterns between the weighted network with an organized structure and a random weighted graph. The difference can be quantified by a cross entropy measurement.
Entropy 22 00925 g003
Figure 4. The results show that the weighted network complexity metric Geme is lower in patients with schizophrenia than healthy controls for male subjects, whereas the difference is minimal for female subjects. Thus, the brain is less organized in male patients when comparing to male healthy controls while the brain is organized at a similar level in female patients and female healthy controls. The sex moderated difference in brain organization level between SZ and HC may explain the commonly observed phenomenon that male patients have more severe clinical symptoms than female patients. The figure above demonstrates the subgroup analysis for the female and male cohorts, and the figure below shows the results for all subjects.
Figure 4. The results show that the weighted network complexity metric Geme is lower in patients with schizophrenia than healthy controls for male subjects, whereas the difference is minimal for female subjects. Thus, the brain is less organized in male patients when comparing to male healthy controls while the brain is organized at a similar level in female patients and female healthy controls. The sex moderated difference in brain organization level between SZ and HC may explain the commonly observed phenomenon that male patients have more severe clinical symptoms than female patients. The figure above demonstrates the subgroup analysis for the female and male cohorts, and the figure below shows the results for all subjects.
Entropy 22 00925 g004
Table 1. Simulation results show that Geme can reflect the complexity of weighted networks. We summarize the mean and standard error of the chance to reject the null hypothesis (no difference of network complexity between two groups) based on 100 data sets.
Table 1. Simulation results show that Geme can reflect the complexity of weighted networks. We summarize the mean and standard error of the chance to reject the null hypothesis (no difference of network complexity between two groups) based on 100 data sets.
Subnetwork Sizes
( ζ 1 , ζ 2 ) (20, 30)(20, 20)(20, 15)(10, 15)(10,10)
(0.6, 0.5)0% ± 0%73% ± 3.2%100% ± 0%100% ± 0%100% ± 0%
(0.4, 0.5)2% ± 1%53% ± 3.5%92% ± 1.9%100% ± 0%100% ± 0%
(0.4, 0.3)5% ± 1.5%46% ± 3.5%90% ± 2.1%100% ± 0%100% ± 0%
(0.3, 0.2)9% ± 2%43% ± 3.5%91% ± 2%96% ± 1.4 %100% ± 0%

Share and Cite

MDPI and ACS Style

Chen, S.; Zhang, Z.; Mo, C.; Wu, Q.; Kochunov, P.; Hong, L.E. Characterizing the Complexity of Weighted Networks via Graph Embedding and Point Pattern Analysis. Entropy 2020, 22, 925. https://doi.org/10.3390/e22090925

AMA Style

Chen S, Zhang Z, Mo C, Wu Q, Kochunov P, Hong LE. Characterizing the Complexity of Weighted Networks via Graph Embedding and Point Pattern Analysis. Entropy. 2020; 22(9):925. https://doi.org/10.3390/e22090925

Chicago/Turabian Style

Chen, Shuo, Zhen Zhang, Chen Mo, Qiong Wu, Peter Kochunov, and L. Elliot Hong. 2020. "Characterizing the Complexity of Weighted Networks via Graph Embedding and Point Pattern Analysis" Entropy 22, no. 9: 925. https://doi.org/10.3390/e22090925

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop