Review on Graph Clustering and Subgraph Similarity Based Analysis of Neurological Disorders

Thomas, Jaya; Seo, Dongmin; Sael, Lee

doi:10.3390/ijms17060862

Open AccessReview

Review on Graph Clustering and Subgraph Similarity Based Analysis of Neurological Disorders

by

Jaya Thomas

^1,2,*,

Dongmin Seo

³

and

Lee Sael

^1,2,*

¹

Department of Computer Science, Stony Brook University, Stony Brook, NY 11794, USA

²

Department of Computer Science, State University New York Korea, Incheon 406-840, Korea

³

Korea Institute of Science and Technology Information, 245 Daehak-ro, Yuseong-gu, Daejeon 34141, Korea

^*

Authors to whom correspondence should be addressed.

Int. J. Mol. Sci. 2016, 17(6), 862; https://doi.org/10.3390/ijms17060862

Submission received: 18 March 2016 / Revised: 10 May 2016 / Accepted: 24 May 2016 / Published: 1 June 2016

(This article belongs to the Section Molecular Pathology, Diagnostics, and Therapeutics)

Download

Browse Figures

Versions Notes

Abstract

:

How can complex relationships among molecular or clinico-pathological entities of neurological disorders be represented and analyzed? Graphs seem to be the current answer to the question no matter the type of information: molecular data, brain images or neural signals. We review a wide spectrum of graph representation and graph analysis methods and their application in the study of both the genomic level and the phenotypic level of the neurological disorder. We find numerous research works that create, process and analyze graphs formed from one or a few data types to gain an understanding of specific aspects of the neurological disorders. Furthermore, with the increasing number of data of various types becoming available for neurological disorders, we find that integrative analysis approaches that combine several types of data are being recognized as a way to gain a global understanding of the diseases. Although there are still not many integrative analyses of graphs due to the complexity in analysis, multi-layer graph analysis is a promising framework that can incorporate various data types. We describe and discuss the benefits of the multi-layer graph framework for studies of neurological disease.

Keywords:

graph clustering; graph similarity; neurological disease; biological network; structural brain network; functional brain network; multi-layer graphs

Graphical Abstract

1. Introduction

The study of neurological disorders involves a wide range of specialties and experiments resulting from various data types of various levels of detail. To understand neurological disorders more comprehensively, it is worthwhile to look at the various types of studies performed. We focus our attention on the graph analysis aspect in a wide spectrum of studies in the hopes to find a joining framework for integrative analysis that does not just involve the molecular level or the tissue level of data, but all available types of data being accumulated for the study of a neurological disorder. We focus on graph analysis methods because neurological disorders are caused by and characterized by a complex interplay of various genomic and environmental features that are often represented as graphs. Graphs are able to model the relationship between the features, as well as listing features that are important in the data analysis. Among the various graph analysis methods, graph clustering and subgraph similarity search are two of the most widely-used methods. They have been applied to study the biological data, brain images and neural signaling data in the studies of neurological disorders.

The goal of the review is to provide a wide spectrum of graph analysis applications in the study of neurological disorder. We do not attempt to summarize the finding of neurological disorders. We first look at the properties of neurological disorders and list out graph analysis measures, including graph clustering and graph similarity search, used for analyzing neurological disorders. Then, we review bio-network types and how graph clustering and similarity measures are used for causal and susceptible gene finding and disease characterization in the area of systems biology. We also review how graph analysis techniques are used to analyze structural and functional brain networks constructed from the brain images and neural signals. After a review of the existing work on graph analysis in neurological disorders, we find the need for an integrative analysis that incorporates various data sources in one analysis framework. For this purpose, we suggest that a multi-layer graph is the most appropriate data structure and further review studies on multi-layer graphs.

1.1. Characterizing Neurological Disorders with Graphs

Neurological disorders are characterized by an abnormality of the structure and function of the central nervous system or peripheral nervous system. There are a number of causes associated with neurological disorders, including genetic, environmental influence, physical injuries, infections and nutrition imbalance. Some neurological disorders are strictly inherited, i.e., Huntington’s disease [1], while others are caused by a combination of genetic and environmental factors, i.e., Alzheimer’s disease [2] and Parkinson’s disease [3]. Furthermore, neurological disorders can affect an entire neurological pathway or a single neuron, and such dysfunction can be quite diverse. Thus, various perspectives of neurological diseases are studied, and various bio-medical data types, including the genomic data, bio-specimens and brain images, are generated. These data usually contain features that have a complex relationship, and graphs are often used to capture these complex relationships.

In systems biology, relationships between biological components are represented as a graph, such as Protein-Protein Interaction (PPI), disease-gene association, metabolic pathways, biochemical networks and regulatory networks. The biological network (bio-network) studies the biological system at the genetic and molecular level. This is essential for understanding the causes and mechanisms of disease progression and, thus, aiding in better treatments and novel drug developments. On the other hand, graphs are also frequently used to model and analyze the anatomy and function of brain [4,5]. Brain networks generated from a variety of diagnostic brain imaging and signaling techniques are used for neurological disease. These brain analyses help to monitor the resulting changes in brain structure and function and how they ultimately shape behavior.

1.2. Graph Clustering and Graph Similarity

A graph structure is a representation of a set of entities; these entities constitute the nodes that represent the biological components and edges that describe the association between pairs of nodes. Biological and brain network analysis often involves graph clustering and graph similarity. Analysis of bio-networks using graph clustering and subgraph identification is often applied to understand the complex pathology and to address the important translational challenges. This is also applied in brain networks to improve the understanding of the complex brain structure and of the complex functional relationship between sub-regions of the brain.

Graph clustering methods and module detection methods group the nodes into clusters, or modules, on the basis of graph topology, such that the resulting clusters have high intra-homogeneity/connectivity and low inter-homogeneity/connectivity among the sub-graphs formed. Four commonly-used graph clustering algorithms are the Markov Cluster Algorithm (MCL) [6], Molecular COmplex DEtection(MCODE) [7], Highly Connected Subgraphs (HCS) [8] and Restricted Neighborhood Search Clustering (RNSC) [9]. MCL is based on flow simulations in graphs and works to increase the contrast between regions of high and low flow, by evaluating the successive power of the adjacency matrix. The algorithm converges, resulting in a graph partition with high flow regions separated from regions with no flow. The MCL algorithm is widely applied on protein-protein interaction networks [10,11]. The MCODE algorithm is designed to detect densely-connected regions in PPI for the purpose of predicting protein complexes [12]. Initially, MCODE assigns weights to the vertices based on their density, i.e., local connectivity. It then selects seed vertices based on the weights as the initial clusters. These seed vertices are further augmented by outward traversal of the isolated dense regions in accordance with preset parameters. The HCS algorithm recursively finds the minimum graph cut that leads to a graph partition that outputs highly connected components or subgraphs. HCS have been applied for gene expression analysis [13]. RNSC is a partition-based algorithm that starts with a random cluster assignment and proceeds by reassigning nodes to clusters. The quality of the clusters thus formed is then evaluated using a cost function. The final obtained clusters are filtered based on their size, density and functional homogeneity.

Subgraph isomorphism, given a pair of graphs A and B, is defined as a problem of determining whether graph A contains a subgraph that is isomorphic, i.e., identical in structure, to B. Subgraph similarity is a relaxation of this problem that instead of determining a match or no match, determines the similarity of the subgraph measured by a “similarity score”. The core of the graph similarity search is the similarity scores that each method proposes. Three commonly-used subgraph theoretical algorithms are GraphGrep [14], NetworkBlast [15] and SAGA [16]. GraphGrep is a hash indexing-based method for subgraph matching, which allows efficient filtering by selecting the most relevant subgraphs from the relevant graphs. NetworkBlast has a graph similarity model that is designed for comparing and analyzing multiple protein networks. The log likelihood ratio scoring method was used to evaluate the subnetwork fit to the desired structure. SAGA is an approximate subgraph matching technique based on node gaps, node mismatch and graph structural differences. The distance measure is used for matching the subgraph, where the measures include StructDist to measure the structural difference, NodeMismatches that estimate the penalty for the mismatch of labels and NodeGaps that compute the penalty on the gap nodes.

Graph theoretical measures can be used as similarity values in the subgraph similarity problem, or they can be used to characterize graphs. The two most common graph theoretical measures are the clustering coefficient and characteristic path length, which are often used to distinguish between regular, random and small-world networks [17]. The small-world network refers to a network for which the mean shortest-path distance between nodes increases slowly as a function of the number of nodes in the network. The small-world structure is hypothesized to reflect an optimal situation associated with rapid synchronization and information transfer [18], minimal wiring costs, as well as a balance between local processing and global integration [4]. The clustering coefficient is defined as the ratio of the number of existing connections among the node’s immediate neighbors to all of their possible connections. It defines the local efficiency of the information transfer of a network. The characteristic path length quantifies the average minimum number of connections that link any two nodes. It defines the global efficiency, indicating the capability of the parallel information propagation of a network. There are also centrality measures that identify important nodes in a network, such as hubs [4]. Depending on the characteristic of “important” nodes, different centrality measures can be used. The degree centrality of a node computes the number of edges extending from it. The closeness centrality of a node computes how close it is to all of the other nodes. It is defined as the inverse of the sum of distances based on the length of the average shortest path between a node and all nodes in the graph. The betweenness centrality of a node on the other hand measures how many time the node was part of a communication path between a pair of nodes. Other local measures, including eccentricity, i.e., the maximum distance between that node and any other node of the graph, and radiality, i.e., node centrality index, can also be used to find significant nodes. There are also other global measures, including modularity and the minimum spanning tree of a graph. Modularity is a global measure of the network that measures the separability of nodes to modules. A network with high modularity is able to group nodes to modules with high intra-module node connectivity and low inter-module node connectivity. The minimum spanning tree of a network is also being extensively used to characterize the topology in more recent studies. It has been shown to avoid several methodological biases in the study of brain networks [19]. Table 1 summarizes the graph theoretical measures and the formulae for computing them.

2. Types of Bio-Networks and Applied Analysis on Neurological Disorders

In this section, we first look at various types of bio-networks and publicly available resources. Then, we review how these bio-networks are analyzed to study the neurological disorders in two sub-categories of problems: causal and susceptible gene finding and disease characterization.

2.1. Types of Bio-Networks

There are several types of bio-networks. We classify them as gene networks, protein-protein interaction networks and biological pathways. We describe each type and provide a brief summary of publicly available databases for the biological data in Table 2.

Gene networks model the various types of associations among the genes. A gene network is an undirected graph with nodes representing genes and edges representing a type of association. A gene co-expression network is a kind of gene network in which the edges denote the correlation in the expression patterns of the genes [21]. Gene networks constructed from the analysis of expression patterns are often associated with specific conditions of the conducted studies. Another variant of a gene network is the gene-disease network, which has been extensively studied in the past few years. In a gene-disease network, nodes can also represent the various disease types, and edges denote the associations of gene to diseases. Gene networks are used to study complex neurological disorders, including autism [22] and Alzheimer’s disease [23].

The physical interaction among proteins constitutes a large part of the physiology of living things. PPI networks represent proteins as nodes, and the physical interactions or functional associations between the proteins are represented as an undirected edge. Proteins play a central role in biological function, and their interactions control the mechanism, which may lead to healthy or disease states. Many disease states result due to the change in protein functioning, i.e., change in PPI topology. Experiments, such as yeast-two-hybrid screening, affinity purification coupled to mass spectrometry, co-immunoprecipitation and chemical cross-linking, have been used to determine the PPI. Furthermore, the indirect approach of text mining methods that mine the co-occurrence of gene names in the literature have also been used to construct PPI. A detailed description of PPI can be found in a review by Rivas et al. [24].

Biological pathways are small to medium-sized directed bio-networks representing the chain of actions among bio-molecules that lead to a certain functionality, such as the production of other biomaterial or that trigger changes in cellular content. Most pathways are manually curated results of accumulated information from several experiments. Thus, although smaller in size, they have a more complex form. The nodes in the graph denote genes, proteins, metabolites, small molecules, reactions, chemical compound, diseases or drugs. Edges in this bio-network show various biological reactions, such as a modification in gene expression, alteration in a protein or other biochemical reactions. There are various types of biological pathways, including metabolic pathways, regulatory pathways, signaling pathway and disease pathways. Metabolic pathways represent the series of chemical reactions in a cell. A node of the metabolic pathways represents an enzyme (protein) that catalyzes the reactions, a metabolite (initial chemical compound), a product of a substrate (intermediate chemical compound) of enzymes or other co-factors. Regulatory pathways describe the regulation of gene activities. Nodes of regulatory pathways are often composed of transcription factors (proteins) and DNA (genes). Regulatory networks are a general form of regulatory pathways that can be either directed or undirected and are often composed of transcription factors and genes to which they bind. High-throughput technologies for obtaining the binding information include ChIP-chip and ChIP-seq. Signaling pathways describe the complex signal transduction process that integrates information from PPI, regulatory pathways and metabolite pathways. Disease pathways represent the relationships among the critical components, including genes, regulators and metabolites, of a disease. The Alzheimer’s disease pathway outlines the mechanisms of the formation of amyloid plaques and neurofibrillary tangles [25]. The Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway displays current genes, proteolytic events and other processes associated with the progression of Alzheimer’s disease [26].

2.2. Bio-Network-Based Neurological Disorder Analysis

The applications of bio-network analysis for neurological disorders can be largely categorized into finding causative and susceptible genes and disease characterization. We selectively review studies to show the methodological diversity in each category and summarize their characteristics from the perspective of the analysis approaches.

2.2.1. Causal and Susceptible Gene Finding

Graph clustering methods and local graph theoretical measure for finding significant nodes have been dominantly used in the studies of causal and susceptible gene finding in neurological disorders.

The abundance of molecular interaction network data turns out to be particularly powerful for disease gene prediction [42]. These bio-networks provide the basic framework of the cellular processes, and graph analysis helps to model the complex interactions among multiple genes and their higher level organizations. The closely-connected genes are assumed to have similar functions in bio-networks. With this assumption, the functional annotation of genes or unknown proteins can be predicted. Lopez et al. [43] observed that Alzheimer’s disease-related genes are highly interconnected. Based on this observation, they used a graph clustering approach to find novel Alzheimer’s causative and susceptible genes. To do this, a combined bio-network was initially constructed by considering 22,194 interactions between 8347 proteins derived from databases, such as DIP, IntAct, MINTand HPRD. The Markov clustering algorithm [6] was then used to identify the cluster representing functional modules. Similarly, Diao et al. [44] performed a comprehensive gene-level assessment to search important molecular markers and pathways of Parkinson’s disease. The correlation network for Parkinson’s disease was identified by applying the DPClus [45] graph clustering algorithm. The densely-connected nodes of the cluster were assessed by associated GO terms and KEGG pathways. The approach resulted in finding significant pathways, such as the Parkinson’s disease pathway.

Talwar et al. [46] also applied graph clustering to find novel genes in Alzheimer’s disease. However, instead combining existing networks, they generated a consensus PPI from a PPI network generated from three different data type, i.e., genome-wide linkage analysis, genome-wide association studies and genome-wide expression profiling, to identify the candidate genes involved in Alzheimer’s disease development and progression. The consensus PPI was constructed by integrating Alzheimer’s disease linkage, genetic association and gene expression data proceeding by modeling PPI by identifying overlapping genes in the PPI generated form the three sources ranked by cumulative rank score. The MCL clustering of the consensus PPI comprised of 640 nodes and 2214 edges resulted in six significant clusters with seven genes forming the central hub nodes. The majority of top ranked candidate genes found were shown to be associated with the molecular mechanisms and pathways of Alzheimer’s, which may be crucial for predicting Alzheimer’s risk. Graph clustering in the bio-network is also used to determine the molecular markers and significant pathways.

Local characteristics of networks are also used for finding significant genes. Guney et al. [47] proposed a gene prioritization approach for a multiple gene-phenotype association and interaction dataset. For Alzheimer’s, a disease-association score was assigned to the genes in the PPI. The score was computed by determining multiple shorter paths between nodes. The path with more gene association was considered shorter as compared to others. The score helped to evaluate the biological significance of the neighbors. The network was constructed from 11,250 proteins, and the top 1% of the protein, i.e., 116 proteins uniquely mapped to gene for Alzheimer’s, was identified.

Winkler et al. [48] considers both graph characteristic, as well as graph clustering methods to find the roll of sex steroids in the degeneration of hippocampal neurons in Alzheimer’s disease. They considered graph theoretical measures as closeness centrality, eccentricity and radiality to determine the crucial, highly-centered nodes in the PPI network of Alzheimer’s to find causal genes. These measures were used to compute the shortest paths between nodes in the graph. The resulting two highly-connected nodes identified were the Androgen Receptor (AR) and the estrogen receptor alpha (ESR1). Furthermore, applying MCODE [7], a graph theoretic clustering algorithm, they identified five dense subgraph. Three subgraphs were composed of transcription factors that belong to the same protein subfamily, and two subgraphs contained kinases and ligases in the signal and degradation pathways.

Whether obtaining data by publicly available bio-networks, by generating a new bio-network or by combining existing networks with new data, the methods used for gene finding are dominated by graph clustering methods and significant node finding through local graph theoretical measures.

2.2.2. Disease Characterization

There are several approaches for characterizing a disease. Predicting new functions of genes or proteins in the disease pathway, finding abnormal interactions in the bio-network of patients compared to healthy individuals and the analysis of the graph properties of the bio-network of patients are a few ways that have been successful.

Bio-networks are analyzed to find new functions of proteins involved in neurological disorders in the work of Silva et al. [49]. They analyzed Amyloid Precursor Protein (APP) to determine its involvement in several biological functions, such as male fertility, cell adhesion, cell motility, signaling and apoptosis. The clustering coefficient was used to characterize the APP network, and betweenness centrality and closeness centrality were used to determine the relevant proteins involved in the pathways. Figure 1 shows the APP interactors involved in cell adhesion extracted from the extended APP/APLP2 network. It shows the APP interactors involved in vesicle-mediated transport extracted from the extended network. Different colors are used to denote interactors extracted from different sources; red nodes denote proteins from the yeast-two-hybrid screen; whereas blue nodes from the databases. The work provides an insight about the interactions of key APP proteins and on the function of APP in the male reproductive system.

Functional summaries of disease-specific bio-networks can also characterize the disease of interest. Seah et al. [50] proposed a Functional Summary Generator (FUSE), which generates functional maps using graph theoretical analysis that enables the investigation of the higher level organization and modularity within the PPI. FUSE is a greedy approach based on the profit maximization problem [51]. It first determines the functional clusters and then iteratively selects clusters that result in an increased profit. They considered Alzheimer’s as a case study and constructed low- and high-resolution functional summaries that help to understand process-process interactions. The low-resolution summary gave a functional overview of the processes related to the disease, whereas a high-resolution summary provides the in-depth functional landscape of the disease, revealing associations between processes related to the disease.

Unusual PPIs have been implicated in a number of neurological disorders, such as Parkinson’s disease [52], Alzheimer’s disease [53], autism [54] and Huntington’s disease [55]. Hence, detecting abnormal PPI can give a better insight into the root cause of these diseases. For example, Li et al. [56] proposed a systems framework involving the interactome, gene expression and genome sequencing to identify a protein interaction module with members strongly enriched for autism candidate genes. They constructed the PPI network using the human protein interactome from BioGrid comprising 13,039 proteins and 69,113 curated interactions. The network was clustered by Blondel et al. [57], which resulted in smaller modules. To determine the associations of the network modules with autism spectrum disorder, they first considered the curated genes implicated in Autism Spectrum Disorder (ASD); among a total of 484 genes in the database (https://gene.sfari.org/autdb/), 383 were on the protein interaction network. Enrichment tests for each module in the network revealed that for some specific modules’ de novo copy number variations, the rare copy number variations and the disruptive mutations each displayed a significant enrichment.

Graph characterization of a disease-specific bio-networks is also important for understanding diseases. Goñi et al. [58] aided the understanding of the basis of Alzheimer’s Disease (AD) and Multiple Sclerosis (MS) by considered PPI and gene expression to study the centrality-related features of proteins whose genes were differentially expressed (seed proteins) with respect to their protein neighbors. Four disease-specific bio-graph networks were used: an MS network from blood tissue (MS-blood), an MS network from brain tissue (MS-brain), an Alzheimer’s network from blood tissue (AD-blood) and an Alzheimer’s network from brain tissue (AD-brain). The main topological properties considered were the average degree and the betweenness centrality, which were analyzed using the shortest path’s degree and the cluster coefficient. Figure 2 shows the Alzheimer’s network constructed from the brain tissue with 25 seed proteins and 109 neighbors. The purple-colored nodes indicate the seed protein; nodes in orange denote neighbor proteins whose nodes are connected either directly or indirectly; and nodes in green represent neighbor proteins that are not part of the considered network. They observed that there were only two direct interaction links between the seed proteins among the total 191 links. The results presented indicated that both diseases shared common characteristics as the lowest average degree of seed-proteins and a higher degree of betweenness. Their finding also shows that the seed proteins in peripheral regions of the PPI map are involved in different pathways and are integrated into subnetworks of the complete human proteome network.

3. Types of Brain Networks Used in the Studies of Neurological Disorder

Brain images and neural signals are another major type of data used to study neurological diseases. Brain images or signals are often segmented into subsections, and their associations are often represented as brain networks (or biological neural networks). In this section, we first briefly review how such brain networks can be constructed from the brain image or signal data in general and review specific network construction processes and analysis methods applied in each study of neurological disorders.

3.1. Types of Brain Networks

There are two major types of brain networks: structural brain networks and functional brain networks. In structural brain networks, edges connect modules that are structurally connected; while in functional brain networks, edges connect modules that are functionally associated. There are no central depositories for brain networks for neurological disorders, since brain network construction is dependent on the types of experiments performed. In the following, we look at general network construction processes and experiments that generate the data for each type of network.

3.1.1. Functional Brain Networks

A functional brain network is derived from physiological observation of brain activities through brain signals or brain images. The edges of the functional brain network represent the functional connectivity between nodes that represent either anatomical segments or voxels in brain images [59]. Functional connectivities are determined by statistical dependencies among remote components measured in various neuro-physiological events. Furthermore, functional connectivity is dynamic in nature, that is it is highly time-dependent and changes frequently in milliseconds being modulated by sensory stimuli and task context.

There are two types of techniques to derive dynamic functional events: electro-magnetic techniques and hemodynamic techniques. Electro-magnetic techniques measure post-synaptic current flow created by magnetic fields that can be recorded outside and inside the skull that have high temporal resolution, but low spatial resolution. Hemodynamic techniques measure the blood follow in the brain, which results in high spatial resolution, but poor temporal resolution. Electroencephalography (EEG) and Magnetoencephalography (MEG) are two representative non-invasive electro-magnetic techniques. EEG measures the change in electrical signals as the clusters of neurons become active, and MEG measures the change in magnetic fields relative to the change in electrical activity. intracranial Electroencephalography (iEEG), also called Electrocorticography (ECoG), is an invasive electro-magnetic technique that monitors the electrical activity of the cerebral cortex, which uses electrodes placed directly on the exposed surface of the brain. Compared to EEG, iEEG and MEG have better spatial resolution. iEEG has the obvious disadvantage of being invasive. functional Magnetic Resonance Imaging (fMRI) and Positron Emission Tomography (PET) are the two representative hemodynamic techniques. fMRI measures the brain activity during rest or while doing tasks by the correlated fluctuations in the Blood Oxygen Level-Dependent (BOLD)-signal with respect to time. Blood is more oxygenated in an active region of the brain, and the difference in the magnetic susceptibility between oxyhemoglobin and deoxyhemoglobin is detected by MR. PET is a test that uses a radioactive chemical tracer that is traced by a special camera that tracks the positrons, i.e., positively-charged particles, that are emitted by the tracer. Although PET has high spatial resolution, because of the use of radioactive materials, it is hard to find human subjects.

Among the listed experiments, the most popular experimental measure for the functional study of brains is through fMRI. Crosson et al. [60] presents the specific advantages of fMRI over other functional imaging techniques. Three advantages of fMRI are as follows: fMRI is non-invasive and does not expose the subject to radiation; fMRI shows an image activity in deep, subcortical structures, whereas this is difficult if not impossible with EEG- and MEG-based techniques; and in fMRI, the same platform used to acquire functional images can be used to acquire high resolution anatomic images. The general fMRI studies often measure the brain signals given a certain task, but recent studies also apply the fMRI in the resting state to find patterns in neurological disorders. Brain imaging studies have suggested that the brain is in a state of activation even during the resting period. These resting state fMRI (rs-fMRI) data have been applied to a number of different problems in neuroscience, which include diseases, such as Alzheimer’s [61,62], Parkinson’s [63] and schizophrenia [64]. A detailed review of rs-fMRI is presented by Heuvel et al. [65]. Figure 3, shows a general overview of functional brain network analysis, where the brain image data are first subjected to parcellation that divides the brain into a number of regions or parcels with homogeneous characteristics. The functional connectivity matrix forms a full symmetric matrix between elements (voxels, neurons and recording sites) that provides a simple characterization of functional interactions. The network constructed by the functional connectivity is analyzed by the graph clustering approach to find similarity clusters. A detailed review of the construction of functional brain networks can be found in [66,67,68,69,70].

3.1.2. Structural Brain Networks

A structural brain network is derived from anatomical observations from diffusion-based neuro-imaging data, such as diffusion tensor imaging or diffusion spectrum imaging [71]. The edges denote anatomical connections between sets of neural components. More specifically, they refer to the connectivity of axons. Determining the existence of the connectivity is based on the delineation and subsequent assessment of white matter fiber tracts within the brain

The network of structural connectivity in the human brain can be constructed by using structural Magnetic Resonance Imaging (MRI) and diffusion MRI. MRI is a widely-used imaging technique that uses strong static magnetic fields to line up and fall back the nuclei of hydrogen atoms, sending out radio waves that are detectable. Diffusion MRI, also referred to as Diffusion Tensor magnetic resonance Imaging (DTI), is a more recent method that measures the diffusion of water molecules in white matter fibers in the brain, called anisotropic diffusion. In DTI, unlike MRI, which often focuses on the grey matter, white matter fiber tractography is used to build the structural brain network. An assumption states that more directionally-dependent water flow indicates the presence of more axons running through the underlying white matter and, thereby, a greater structural connectivity. The network formed can be analyzed for the structural characteristics. The most common measure used along DTI is Fractional Anisotropy (FA), which specifies the directional dependency of water diffusion in the brain. These measures help in understanding the abnormalities that may be present in the brain network [72]. A detailed review of the construction of structural brain networks can be found in [66,73,74].

3.2. Graph Analysis Applications on Brain Networks

Studies of neurological disorders based on brain images and signals are closely related to the types of experiments performed and, thus, to the types of brain networks constructed and analyzed. The most prevalent types of brain network studies are functional brain networks followed by structural brain networks. In the following, we review studies of neurological disorders classified by the types of brain networks that are analyzed.

3.2.1. Analysis of Functional Brain Networks

In most of the functional brain network analyses, the topological characteristics of the networks of patients in comparison with the healthy individual are analyzed through graph theoretical measures. We look at functional brain graph analysis applied to Alzheimer’s disease, Parkinson’s Disease (PD), MS, Autism Spectrum Disorder (ASD), epilepsy and Attention Deficit Hyperactivity Disorder (ADHD).

The difference in the global topological features of functional brain networks for Alzheimer’s patients compared to the control was analyzed by Supekar et al. [62] and Stam et al. [4]. In the work of Supekar et al. [62], functional graphs were constructed based on the correlation matrix for the anatomical region extracted from wavelet analysis of the rs-fMRI data. In their graph, the edges between the nodes are determined based on a threshold value for wavelet correlation, i.e., edges are formed between two nodes with a wavelet correlation higher than the threshold. The clustering coefficient and characteristic path length were used to characterize the constructed functional graph for Alzheimer’s patients and the healthy control. They observed that the Alzheimer’s patients showed lower clustering coefficients, but similar characteristic path lengths compared to controls, which suggests disrupted global functional organization. They suggested that the graph analysis measures might be useful as an imaging-based biomarker to distinguish Alzheimer’s disease from healthy aging. Similarly, Stam et al. [4] analyzed the changes in the large-scale graph structures of resting-state brain. They used MEG-based images of both cases and controls to generate a functional brain network and then computed graph topological measurements, such as the clustering coefficient and path length, to characterize how the brain functions differently from case to control. The graph consists of 149 nodes, where each node represents the matching MEG channel, edges represent the link between the pair of channels and the edge weight denoted the phase log index values between all pairs of MEG channels. The results show that patients with Alzheimer’s disease had reduced connectivity shown by decreased clustering, similar to Supekar et al. [62], in addition to increased path length. The general pattern that emerged from their study showed that the bio-networks associated with various types of brain disease, such as Alzheimer’s disease, schizophrenia, brain tumors and epilepsy, are closer to random networks, and healthy networks are closer to small-world networks.

Multiple Sclerosis (MS) is another neurological disorder that is studied by brain networks. Tewarie et al. [75] analyzed the functional brain network of MS patients compared to healthy controls using MEG images. To construct the function network, initially, the sensor channel data of MEG were projected onto the Automatic Anatomical Labeling (AAL) atlas using beamforming that resulted in 78 nodes with time series data mapped. Beamforming or spatial filtering is a signal processing technique used in sensor arrays for directional signal transmission or reception. Next, the adjacency matrix was constructed for each frequency band separately based on the functional connectivity between each pair of nodes or time series. The Phase Lag Index (PLI) was used [4], which calculates the asymmetry of the distribution of (instantaneous) phase differences between the two time series. Subsequently, Kruskal’s algorithm [20] was applied to obtain Minimum Spanning Trees (MSTs) to measure the functional connectivity. A comparative analysis was carried out between healthy controls and MS patients by finding the dissimilarity between the respective MSTs using the information theoretic dissimilarity measure that calculates the information change between the two MSTs. The analysis showed a decrease in global integration and hierarchy, which explains the reduced cognitive performance in MS. The finding indicates that MST analysis helped to detect network changes in the core of functional brain networks for MS patients. Moreover, they were able to identify functional brain network differences between early MS patients and healthy controls.

Parkinson’s disease is another widely-studied neurological disorder. Göttlich et al. [63] and Sang et al. [76] investigated the difference of Parkinson patients against the controls in the rs-fMRI-based functional networks. Networks generated by Göttlich et al. [63] and Sang et al. [76] were constructed in a similar manner. The nodes were obtained by segmentation of the brain functional network using the Automatic Anatomical Labeling (AAL) atlas [77]. Edges were obtained by measuring the temporal correlation between each paired node, calculated by the Pearson correlation coefficient in the time series acquired for each node. A threshold was selected to mark an edge between the nodes. If the absolute correlation coefficient was higher than the threshold, an edge is formed. Göttlich et al. [63] measured the cluster coefficient on a local level and the characteristic path length at the global level to characterize the obtained functional brain networks. They observed that the increased characteristic path length indicates a less efficient organization of the brain network for Parkinson’s patients. The finding indicates reduced efficiency in the brain network topology of patients as compared to controls. Sang et al. [76] focused on the topological characteristics of the large-scale functional brain network in early-stage Parkinson’s patients. Their analysis showed that early-stage Parkinson’s patients had a significant decrease in global efficiency, but no significant difference in the local efficiency of the brain network as compared to the control. Significantly decreased global efficiency was associated with decreased long-range connections across remote cortical regions in Parkinson’s patients. This decreased global efficiency indicated a reduced capacity of information transfer across the entire brain. Dubbelink et al. [17,78] used MEG data to characterize the functional brain network of Parkinson’s in different stages or clinical measures of disease progression. In their studies, clustering coefficients, averaged shortest path lengths and MST were to characterize the constructed networks. They showed that impaired local efficiency, i.e., the inverse of the average shortest path connecting all neighbors of a node, and network decentralization are early features of Parkinson’s disease. As the disease progresses with time, changes in brain functional graph topology accumulate and result in reductions in global efficiency, i.e., the inverse of the average shortest path, which have a close association with the deteriorating cognitive and motor function. The results of Dubbelink et al. [17] show that the early-stage non-medicated patients were characterized by lower clustering coefficients, but preserved path lengths, which indicates reduced local integration with a preserved global efficiency of the brain network in the early motor stage of the disease. The results seemed similar to the one obtained from Tewarie et al. [75] and Dubbelink et al. [17] for MS indicating that brain networks move towards a more random network organization in both disease.

Other neurological diseases, including tuberous sclerosis complex, epilepsy and attention deficit hyperactivity disorder, have also been studied by constructing functional brain networks. Peters et al. [79] analyzed functional connectivity through EEG coherence in a large sample set of children with Tuberous Sclerosis Complex (TSC), a disorder with a high prevalence of autism spectrum disorder. An undirected weighted graph was built using the 19 electrodes as nodes and inter-electrode coherence values as edge weights. The graph analysis for autism spectrum disorder showed the absence of a higher clustering coefficient and of a longer path length despite the decrease in long-over-short range coherence. The result shows that the nodes are spatially more clustered, but are not functionally clustered, indicating an altered network topology. These altered network topologies in TSC represent a functional correlate of structural abnormalities and may play a role in the pathogenesis of neurological deficits. Ortega et al. [80] analyzed iEEG data for the localization of the Epileptogenic Zone (EZ), also called focus, responsible for seizures or the ictal state in epilepsy patients. Recurrent seizures are characteristic of epilepsy, and the localization of the seizure sites is crucial for the treatment and prevention of the spread of ictal activity in different regions of the brain. The localization of the focus is done by identifying the crucial nodes in the iEEG-based functional brain network. In the functional network, each node represents electrodes’ time series data, and the edges are weighted by the Pearson correlation coefficient between the two electrodes. The crucial nodes used for localization of the focus are selected to be the nodes with the highest local synchronization power, the most connectedness and the highest seizure interaction load. This approach helped to identify nodes that seem relevant from the global interaction perspective, one being the most connected node, i.e., the node with the highest number of links, and other with the highest load, which is computed by the node betweenness centrality measure. Using these observations, they address the question of whether removal of these nodes during surgery is crucial in the suppression or reduction of the quantity of postoperative seizures. The findings for five ECoG records show that local areas with high synchronization power appear to be significantly involved in the development of epileptic seizures. Hu et al. [81] evaluated the structural symmetry of the weighted brain network for Attention Deficit Hyperactivity Disorder (ADHD) using graph isomorphism. The brain network was built using resting state fMRI (rs-fMRI) data for ADHD. The graph was constructed by first preprocessing of rs-FMRI data and then extracting the anatomical regions of interest using the AAL atlas. The nodes denote the anatomical region, and the edges represent the functional connectivity between the nodes. Isomorphism was used to investigate the structural symmetry of every node pair in a weighted brain network. For a given weighted graph, the symmetry between the two nodes was defined based on the isomorphism level of the residual graphs of those two nodes, and the isomorphism approximation error was computed using the suboptimal eigen decomposition algorithm by Umeyama [82]. The experimental results indicated that for the inattentive type of ADHD subjects, higher network symmetry was observed as compared to the typically development of children.

Many of the functional brain network analysis of existing studies on neurological disorders observe an altered network topology compared to healthy individuals. In the case of Alzheimer’s, similar to the abnormal topological observation in the structural brain network, the functional brain network analysis also shows disrupted organization. The graph theoretical measure on the functional brain network reveals that the small-world metrics can characterize the functional organization of the brain in Alzheimer’s disease. Most of the studies on Alzheimer’s showed that an increased path length has been interpreted to result due to the loss of connectivity. The modularity graph metric for Alzheimer’s indicated a decrease in value for the beta (13–30 Hz) and gamma (>30 Hz) band, implying decreased connectivity due to loss of connector hubs. The functional brain network analysis for MS using the minimum spanning tree measure shows that the altered functional networks in the theta and alpha2 frequency bands of time series data are indicative of large-scale changes in the functional brain network for relapsing-remitting MS patients. The changes in the alpha2 band, such as loss of hierarchical structure, results in poorer cognitive performance. In the case of Parkinson’s, the observed studies reports common evidence for altered resting-state networks on a global, intermediate and local level in Parkinson’s patients. The Parkinson’s patients often show cognitive impairments, effective changes and other non-motor symptoms, suggesting system-wide effects on brain function. The functional graph analysis for both Parkinson’s and Alzheimer’s reveals that with the disease progression, the brain networks move towards a more random network organization.

3.2.2. Analysis of Structural Brain Networks

The focus of most research that is based on the structural brain networks of neurological disorders is on the topological characterization and finding structural abnormality of the brain networks.

The most extensively-studied neurological disorder using structural networks is Alzheimer’s disease. A work by He et al. [83] reports the changes in the coordination of large-scale structural brain networks due to Alzheimer’s disease by considering cortical thickness data from structural MRI. The structural brain network that is constructed denotes the cortical regions as nodes and the physical connectivity of these regions as edges. The graph is analyzed using graph theoretical measures, such as the clustering coefficient, path length and betweenness centrality, to determine abnormalities in Alzheimer’s patients, which are associated with alterations in cortical thickness correlations, small-world parameters, nodal centrality and network robustness. It was observed that these variable values were larger in the Alzheimer’s brain graph as compared to the control cases. The results obtained were consistent with existing studies showing increased structural and functional asymmetry in Alzheimer’s patients that suggested that the widely-distributed cortical networks are altered in Alzheimer’s patients. Additionally, the analysis findings in the paper suggest that Alzheimer’s disease-related alterations in structural networks and their internal topology are biologically meaningful and are likely to explain the functional impairments associated with Alzheimer’s disease. Furthermore, Lo et al. [84] considered graph theoretical measures, including the clustering coefficient and shortest path length, and showed disrupted topological organization in the large-scale white matter structural networks constructed from diffusion MRI of Alzheimer’s patients. Information is processed in the gray matter (cortex and subcortical structures) and passed along the network via the white matter. The result shows an increased ratio of the characteristic shortest path length predominantly in the Alzheimer’s group. This may be due to the degeneration of fiber bundles for information transmission and suggests that the connections between cortical areas have been changed with less strength (reduced white matter integrity) or longer pathways (disconnection). Their work provided the structural evidence for abnormalities of systematic integrity in Alzheimer’s disease.

Structural brain networks have also been analyzed for schizophrenia. Bassett et al. [85] investigated the topological characteristics of the hierarchical structure (global, divisional and regional) of brain networks constructed from whole-brain anatomical networks constructed from structural MRI of 259 healthy individuals and compared those of 203 people with schizophrenia. Measuring the degree, path length, clustering and small-worldness of the brain network, they showed that people with schizophrenia had reduced hierarchy, loss of frontal hub and increased non-frontal hubs, as well as increased connection distance. The analysis shows for people with schizophrenia, highly clustered nodes are more evenly distributed in terms of their degree, and frontal hubs are less prominent. The finding indicates that the neuro-developmental abnormalities in schizophrenia specifically impact multi-modal cortical organization.

4. Need for Integrative Analysis on Large Graphs

Most of the studies that have been performed on neurological disorders use limited types of data to generate single layer networks. However, as neurological disorders are complex systems, integrative analysis that involves various data types over various dimensions is often more informative and will allow better analysis and predictions. However, the analysis of a graph for such a system may turn out to be incomprehensible due to the increased complexity representing heterogeneous information within the same view. Thus, the major bottleneck is in the advancement of the design of algorithms for constructing and analyzing graphs representing multiple levels of information from multiple data types to reveal and characterize the complex pathology of neurological disorders. One of the most information-preserving abstractions of the complex data is multi-layered graphs. However, the actual application of multi-layered graphs is not yet widely studied due to the infancy in the consensus representations and the development of analysis methodologies. In this section, we review existing methods for integrative methods on graphs and look at the possibilities of multi-layer graphs as frameworks for the integrative analysis of various data types.

4.1. Integrative Analysis for Single-Layered Graphs

One of the trends in data science, especially in bioinformatics, is the integrative analysis of various data types. However, existing studies that involve integrative analysis of neurological disease based on graph structures are limited. Following are two examples of the integrative analysis of bio-graphs and brain networks.

Multiple types of bio-networks can be analyzed together. In a work by Hwang et al. [86], the OMIM phenotype-gene relation of the disease phenotype similarity network, the human gene interaction network, the disease categorization and the molecular pathways were analyzed together to determine phenotype, gene clusters and their associations in Alzheimer’s patients. They found that association information can provide a global pathway activity view of human disease classes and can facilitate the understanding of the underlying molecular mechanisms of disease. The work reports finding TMED10, which is a newly predicted gene in the Alzheimer’s pathway. This gene leads to the production of amyloid beta peptides, which is a critical feature of Alzheimer’s disease. They analyze the newly-predicted member gene by providing a network view of the disease pathway for Alzheimer’s. They also considered two-way hierarchical clustering in order to predict disease phenotype cluster-gene cluster associations. They observed predicted associations between 20 disease phenotype clusters and 200 gene clusters (pathways). Some of the pathways are predicted to be associated with neurological and psychiatric disease classes, including the prion disease pathway and the MAPK pathway associated with neurological disease class.

Separate studies of structural brain networks and functional brain networks have and continue to advance our knowledge of neurological disorders. On the other hand, integrative studies can further advance our knowledge of the structural-functional interconnection of the brain components in neurological disorders. Rudie et al. [87] carried out structural and functional brain network analysis of autism spectrum disorders. They analyzed the structural and functional brain networks separately and integratively using Principle Component Analysis (PCA) of graphical features extracted from the networks. Their analysis starts by constructing structural and functional networks with the same set of nodes. The functional network was constructed using the whole-brain parcellation scheme discussed by Power et al. [88], which is based on a large meta-analysis of fMRI studies combined with whole brain functional connectivity mapping. After construction of both the functional and structural networks, six graph features were extracted, and structural and functional were examined separately and interactively. Following are the six graph theoretical properties: (1) clustering coefficient; (2) characteristic path length (3) normalized clustering coefficient (λ); (4) normalized characteristic path length (γ); (5) small-worldness represented as the ratio of λ to γ; and (6) modularity. The graph features of the functional network showed that the individual with autism spectrum disorder had a lower clustering coefficient for nodes within default systems and secondary visual areas. They displayed less robust modular organization implying less distinct communities and indicated higher nodal participation coefficients. A high level of global efficiency was also observed, which reflects a less organized or more random distribution of functional edges. The graph features of the structural network indicated a high level of local and global efficiency in both the typically-developed and autism spectrum disorder groups. An important finding shows that in the autism spectrum disorder group, modularity decreased at a slower rate. This finding is contrary to the findings reported by Hagmann et al. [89], where decreasing modularity and increasing global efficiency of structural networks with development are shown. In a separate analysis of ASD and controls, a similar level of correlation between raw measures of structural and functional connectivity was observed. However, after combining the structural and functional network properties and applying PCA, a reduced balance of local and global efficiency between structural and functional networks was reported in autism spectrum disorder, which displayed association with age and inversely related with autism spectrum disorder symptom severity. They observed weaker connectivity within visual (largely secondary areas) and sensorimotor systems, supporting more widespread alterations in functional connectivity. Their analysis shows that integrative analysis has the potential for unraveling the pathology of neurological disorders that may not be possible by individual analysis.

4.2. Integrative Analysis of Multi-Layer Graphs

In the previous subsection, we have shown that integrative analysis provided more incite into the neurological disorder compared to the analysis of a single type of data. We now show how multi-layer graphs can further aid in the integrative analysis process by first looking at the description of multi-layer graphs and the availability of analysis methods. Then, we review the applications of the multi-layer graphs in practice.

4.2.1. Multi-Layer Graphs

Multi-layer graphs are used to represent the complex behavior of different types of entities or how the relationship of entities changes over different aspects, such as time. The graph constructed includes multiple sub-graphs and the layer of connectivity between them. The multi-layer graph represents a much fairer amount of information than individual layers separately. It gives a suitable framework for integrative analysis.

Kivelä et al. [90] wrote an extensive description of multi-layer graphs. In their description, the most general form of multi-layer graph allows each node to belong to any subset of layers and allow edges to connect any node in any layer. In addition, the multi-layer graphs can have a ‘multi-dimensional’ property that can include every type of data. That is, multi-layer graph MG, with L layers is given as MG_L = (G₁, G₂, …, G_l, E_inter, E_intra ), where L is the total number of layers, G₁, G₂, …, G_L are the subgraphs on each layer with n nodes. E_inter is used to denote inter-layer links, whereas, E_intra denotes the intra-layer links. Each sub-graph is defined as G_l = (V_l, E_l) where V_l = v₁, v₂, …, v_l is the set of nodes common in all of the subgraph, and E_l stands for the intra-layer connectivity for layer l. The adjacency matrix of each sub-graph is denoted by A_l ∈ (0, 1)^(n×n) [90]. In this general form of multi-layer graph, not only can different types of bio-networks be integrated, but also bio-networks with various brain networks can be integrated, as well. Although the definition of multi-layer graphs can be generalized to describe networks that combine bio-networks with brain networks and, maybe, even over time, the analysis of such complex system is not currently possible, especially for large graphs.

Tensors, high dimension arrays, are the most common form of representation for multi-layer graphs being studied. Although higher mode tensors are much more complex than matrix, i.e., two-mode tensor, to analyze, they are still the simplest form of representation of limited types of multi-layer graphs. A three-modetensor, for example, can be describe as follows: the first two modes describe the node to node relationship, while the third mode can be an additional dimension, such as time or types of experiment. In this description, the set of nodes considered in each layer remains the same. De Domenico et al. [91] describes formulas for graph theoretical measures, including degree centrality, clustering coefficients, eigenvector centrality, modularity, von Neumann entropy and diffusion, for multi-layer graphs that can be represented in a three-mode tensor. In addition, tensor factorization methods [92,93], such as for clustering purposes, can also be used to analyze the multi-layer graphs.

4.2.2. Existing Application of Multi-Layer Graph Analysis

The multi-layer graph finds its applicability in various domains, such as social networks, telecommunication networks, smart grids and biological networks. There are still several challenges involved in multi-layer graph analysis. There is still no consensus on the representation or recognized analysis approach. Although with technical challenges still remaining, the multi-layer graph is being recognized for being beneficial in organizing and analyzing disease simultaneously, which enables improvement of the decisions made by medical professionals and patients [94]. We examine a few applications of multi-layer graphs in biological data analysis and brain network analysis.

Multi-layer graphs are being recognized in systems biology applications. We look at just two among several. First, Salem et al. [95] considered a multi-layer graph to integrate the co-expression pattern of genes to find biological modules and showed improved performance in gene function prediction. Multiple independent gene expressions were integrated for module discovery and functional annotation. The multi-layer weighted graph was constructed by considering the topology and co-occurrence between the co-expression link. The biological modules were discovered using the edge-based graph clustering approach on the weighted link graph. A work by Didier et al. [96] showed the use of multi-layer graphs to identify communities from multiplex biological networks. Four biological networks were constructed from different sources of interactions between human genes or proteins. The biological networks were PPI, co-expression network, pathway and network of complexes. Here, each biological network corresponds to the different layers of the multi-layer graph. The community structure for individual layers was computed using a network modularity-based clustering algorithm [97]. Further, consensus-clustering approaches computed a unique community structure from the community structures obtained on each graph independently. They showed that the use of multiplex-modularity better recovers communities in heterogeneous density and missing data contexts.

The multi-layer graph is also used to model time-varying brain networks. Bassett et al. [69] modeled the dynamic reconfiguration of functional brain networks during learning using a multi-layer graph constructed from fMRI imaging brain data during cognitive processing. Using a multi-layer graph, they identified functional modules over short time intervals and characterized their changes over time. The study provides insight to expose the learning-induced autonomy of sensorimotor systems. It also uncovers a distributed network of frontal and anterior cingulate cortices whose disengagement predicted individual differences in learning. Moreover, the availability of neuroimaging modalities with different spatial resolution levels enables integrating the functional connectivity between brain regions that helps to figure out the disruptedfunctional connectivity based on the BOLD signal in neurodegenerative diseases, such as Alzheimer’s [70].

We can see that multi-layer graphs have been applied with improved performance for integrative analysis of various bio-networks and for analyzing time varying brain networks. This shows the potential success of multi-layer graphs for more complex forms of data combination, such bio-networks with brain networks, bio-networks of different data types over time or structural networks in combination with functional networks over time.

5. Discussion and Conclusions

In this review, we showed how complex biological networks are analyzed using graph clustering and other graph theoretical measures in the study of neurological disorders. Various neurological diseases, including Alzheimer’s, Parkinson’s and autism, are analyzed using biological, as well as brain networks. The analysis of these networks help to get a better understanding of the biological changes that contribute to the dysfunctional state in these diseases. For biological networks, we highlighted the different types of data used to construct the networks and showed the significance of the analysis of PPI, gene-gene association and biological pathways in the study of neurological disorders. In the case of brain image networks, we reviewed both structural and functional brain networks constructed from brain signals and images from experiments, including MRI, EEG, MEG and fMRI, and show their importance in understanding neurological disorders.

We showed that graphical analysis of individual data types in the form of single layer graphs is well studied; however, there is a need for the integrative analysis of several data types. We suggest that multi-layer graphs are a good data structure for integrative representations and analysis for obtaining better insight for complex neurological problems. Existing studies of multi-layer graphs in the analysis of neurological disorder are few. However, all are shown to improve the analysis. Thus, we can predict increased analysis performance in many more scenarios of multi-layer graphs. For example, multi-layer graphs can be used to model spatial temporal changes in the brain structural and functional networks for the resting state and during the execution of cognitive tasks. In a dynamic system, firing of an action potential in a neuron takes a few milliseconds; the plastic change in synaptic strength operates over time scales of minutes to hours; and the repair of cognitive function after brain damage occurs over the duration of years. The multi-layer graph data structure seems appropriate to conceptualize the brain graph dynamics of various time scales. In the generalized framework, each layer models the interactions of the system at time t, and the extracted time-varying graph metrics quantify the evolution of the topological properties across time. Moreover, additional modes can be added to incorporate structural and functional brain networks to determine the effects of structural topology on networks and dynamics. The computational studies have shown that brains’ structural and functional networks are intimately related and share common topological features [66], which show high promise in the success of the analysis. Similarly, bio-network analysis can benefit from the use of multi-layer graphs. Multi-layer graphs can integrate the existing bio-network with other sources of related molecular information, such as Gene Ontology, biological processes and pathways. These integrative association models may enable findings in neurological disorders that may not be possible in individual analysis. In the most complex form, multi-layer graphs can represent bio-brain networks, in which the biological network based on molecular data is combined with a network constructed from brain image/signaling data. The bio-networks are informative and reveal the genetic causality of the disease; on the other hand, brain networks help in determining the structural and functional changes of the human brain. Modeling of such a network could help with associating the genetic factors with the observed functional changes. The multi-layer graphs can help to capture the dynamic behavior of the network and allow integrative analysis of data. However, there are still several technical challenges, including efficient representation and inference algorithms, remaining in multi-layer graph analysis.

Acknowledgments

This work was supported by K-15-L03-C02-S01 funded by Korea Institute of Science and Technology Information , by the Basic Science Research Program through the National Research Foundation of Korea (2013R1A1A3005259, 2015R1C1A2A01055739) funded by Ministry of Science, ICT and Future Planning of Korea (MSIP), and by the MSIP of Korea under IITP-2016-R0346-15-1007 supervised by the Institute for Information & Communications Technology.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ridley, R.M.; Frith, C.D.; Crow, T.J.; Conneally, P.M. Anticipation in Huntington’s disease is inherited through the male line but may originate in the female. J. Med. Genet. 1988, 25, 589–595. [Google Scholar] [CrossRef] [PubMed]
Gatz, M.; Pedersen, N.L.; Berg, S.; Johansson, B.; Johansson, K.; Mortimer, J.A.; Posner, S.F.; Viitanen, M.; Winblad, B.; Ahlbom, A. Heritability for Alzheimers Disease: The study of dementia in Swedish twins. J. Gerontol. A Biol. Sci. Med. Sci. 1997, 52A, M117–M125. [Google Scholar] [CrossRef] [PubMed]
Warner, T.T.; Schapira, A.H. Genetic and environmental factors in the cause of Parkinsons disease. Ann. Neurol. 2003, 53, S16–S23. [Google Scholar] [CrossRef] [PubMed]
Stam, C.J.; Jones, B.F.; Nolte, G.; Breakspear, M.; Scheltens, P. Small-world networks and functional connectivity in Alzheimer’s disease. Cereb. Cortex 2007, 17, 92–99. [Google Scholar] [CrossRef] [PubMed]
He, Y.; Evans, A.C. Graph theoretical modeling of brain connectivity. Curr. Opin. Neurol. 2010, 23, 341–350. [Google Scholar] [CrossRef] [PubMed]
Van Dongen, S. Graph Clustering by Flow Simulation. Ph.D. Thesis, University of Utrecht, Utrecht, The Netherlands, 1 May 2000. [Google Scholar]
Bader, G.D.; Hogue, C.W. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinform. 2003, 4, 1. [Google Scholar] [CrossRef] [Green Version]
Hartuv, E.; Shamir, R. A clustering algorithm based on graph connectivity. Inf. Process. Lett. 2000, 76, 175–181. [Google Scholar] [CrossRef]
King, A.D.; Przulj, N.; Jurisica, I. Protein complex prediction via cost-based clustering. Bioinformatics 2004, 20, 3013–3020. [Google Scholar] [CrossRef] [PubMed]
Brohée, S.; van Helden, J. Evaluation of clustering algorithms for protein-protein interaction networks. BMC Bioinform. 2006, 7, 488. [Google Scholar] [CrossRef] [PubMed]
Vlasblom, J.; Wodak, S.J. Markov clustering versus affinity propagation for the partitioning of protein interaction graphs. BMC Bioinform. 2009, 10, 99. [Google Scholar] [CrossRef] [PubMed]
Asur, S.; Ucar, D.; Parthasarathy, S. An ensemble framework for clustering protein-protein interaction networks. Bioinformatics 2007, 23, i29–i40. [Google Scholar] [CrossRef] [PubMed]
Hartuv, E.; Schmitt, A.O.; Langeb, J.; Meier-Ewert, S.; Lehrach, H.; Shamir, R. An algorithm for clustering cDNA fingerprints. Genomics 2000, 66, 249–256. [Google Scholar] [CrossRef] [PubMed]
Shasha, D.; Wang, J.T.L.; Rosalba, G. Algorithmics and applications of tree and graph searching. In Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, Madison, WI, USA, 3–5 June 2002; pp. 39–52.
Sharan, R.; Suthram, S.; Kelley, R.M.; Kuhn, T.; McCuine, S.; Uetz, P.; Sittler, T.; Karp, R.M.; Ideker, T. Conserved patterns of protein interaction in multiple species. Proc. Natl. Acad. Sci. USA 2005, 102, 1974–1979. [Google Scholar] [CrossRef] [PubMed]
Tian, Y.; McEachin, R.C.; Santos, C.; States, D.J.; Patel, J.M. SAGA: A subgraph matching tool for biological graphs. Bioinformatics 2007, 23, 232–239. [Google Scholar] [CrossRef] [PubMed]
Dubbelink, K.T.E.O.; Hillebrand, A.; Stoffers, D.; Deijen, J.B.; Twisk, J.W.R.; Stam, C.J.; Berendse, H.W. Disrupted brain network topology in Parkinson’s disease: A longitudinal magnetoencephalography study. Brain 2014, 137, 197–207. [Google Scholar] [CrossRef] [PubMed]
Watts, D.J.; Strogatz, S.H. Collective dynamics of ’small-world’ networks. Lett. Nat. 1997, 393, 440–442. [Google Scholar] [CrossRef] [PubMed]
Tewarie, P.; van Dellen, E.; Hillebrand, A.; Stam, C. The minimum spanning tree: An unbiased method for brain network analysis. NeuroImage 2015, 104, 177–188. [Google Scholar] [CrossRef] [PubMed]
Kruskal, J.B. On the shortest spanning subtree of a graph and the traveling salesman problem. Proc. Am. Math. Soc. 1956, 7, 48–50. [Google Scholar] [CrossRef]
Andrei, A.; Kendziorski, C. An efficient method for identifying statistical interactors in gene association networks. Biostatistics 2009, 10, 706–718. [Google Scholar] [CrossRef] [PubMed]
Nelson, T.H.; Jung, J.Y.; DeLuca, T.F.; Hinebaugh, B.K.; Gabriel, K.C.S.; Wall, D.P. Autworks: A cross-disease network biology application for Autism and related disorders. BMC Med. Genom. 2012, 5, 56. [Google Scholar] [CrossRef] [PubMed]
Zhang, B.; Gaiteri, C.; Bodea, L.G.; Wang, Z.; McElwee, J.; Podtelezhnikov, A.A.; Zhang, C.; Xie, T.; Tran, L.; Dobrin, R.; et al. Integrated systems approach identifies genetic nodes and networks in late-onset Alzheimer’s disease. Cell 2013, 153, 707–720. [Google Scholar] [CrossRef] [PubMed]
Rivas, J.D.L.; Fontanillo, C. Protein-Protein interactions essentials: Key concepts to building and analyzing interactome networks. PLoS Comput. Biol. 2010, 6, e1000807. [Google Scholar]
Liang, W.S.; Dunckley, T.; Beach, T.G.; Grover, A.; Mastroeni, D.; Ramsey, K.; Caselli, R.J.; Kukull, W.A.; McKeel, D.; Morris, J.C.; et al. Altered neuronal gene expression in brain regions differentially affected by Alzheimer’s disease: A reference data set. Physiol. Genom. 2008, 33, 240–256. [Google Scholar] [CrossRef] [PubMed]
Kanehisa, M.; Goto, S.; Sato, Y.; Furumichi, M.; Tanabe, M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012, 40, D109–D114. [Google Scholar] [CrossRef] [PubMed]
Chatr-aryamontri, A.; Ceol, A.; Palazzi, L.M.; Nardelli, G.; Schneider, M.V.; Castagnoli, L.; Cesareni, G. MINT: The Molecular INTeraction database. Nucleic Acids Res. 2007, 35, D572–D574. [Google Scholar] [CrossRef] [PubMed]
Mering, C.V.; Jensen, L.J.; Snel, B.; Hooper, S.D.; Krupp, M.; Foglierini, M.; Jouffre, N.; Huynen, M.A.; Bork, P. STRING: Known and predicted protein protein associations, integrated and transferred across organisms. Nucleic Acids Res. 2005, 33 (Suppl. S1), D433–D437. [Google Scholar] [CrossRef] [PubMed]
Jensen, L.J.; Kuhn, M.; Stark, M.; Chaffron, S.; Creevey, C.; Muller, J.; Doerks, T.; Julien, P.; Roth, A.; Simonovic, M.; et al. STRING 8-a global view on proteins and their functional interactions in 630 organisms. Nucleic Acids Res. 2009, 37, D412–D416. [Google Scholar] [CrossRef] [PubMed]
Xenarios, I.; Rice, D.W.; Salwinski, L.; Baron, L.; Marisa, K.A.; Marcotte, E.M.; Eisenberga, D. DIP: The database of interacting proteins. Nucleic Acids Res. 2000, 28, 289–291. [Google Scholar] [CrossRef] [PubMed]
Prasad, T.; Goel, R.; Kandasamy, K.; Keerthikumar, S.; Kumar, S.; Mathivanan, S.; Telikicherla, D.; Raju, R.; Shafreen, B.; Venugopal, A.; et al. Human protein reference database-2009 update. Nucleic Acids Res. 2009, 37, D767–D772. [Google Scholar] [CrossRef] [PubMed]
Kanehisa, M.; Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000, 28, 27–30. [Google Scholar] [CrossRef] [PubMed]
Milacic, M.; Haw, R.; Rothfels, K.; Wu, G.; Croft, D.; Hermjakob, H.; D’Eustachio, P.; Stein, L. Annotating cancer variants and anti-cancer therapeutics in reactome. Cancers 2012, 4, 1180–1211. [Google Scholar] [CrossRef] [PubMed]
Croft, D.; Mundo, A.F.; Haw, R.; Milacic, M.; Weiser, J.; Wu, G.; Caudy, M.; Garapati, P.; Gillespie, M.; Kamdar, M.R.; et al. The reactome pathway knowledgebase. Nucleic Acids Res. 2014, 42, D472–D477. [Google Scholar] [CrossRef] [PubMed]
Mizuno, S.; Iijima, R.; Ogishima, S.; Kikuchi, M.; Matsuoka, Y.; Ghosh, S.; Miyamoto, T.; Miyashita, A.; Kuwano, R.; Hiroshi, T. AlzPathway: A comprehensive map of signaling pathways of Alzheimer’s disease. BMC Syst. Biol. 2012, 6, 52. [Google Scholar] [CrossRef] [PubMed]
Cerami, E.G.; Gross, B.E.; Demir, E.; Rodchenkov, I.; Babur, O.; Anwar, N.; Schultz, N.; Bader, G.D.; Sander, C. Pathway commons, a web resource for biological pathway data. Nucleic Acids Res. 2011, 39, D685–D690. [Google Scholar] [CrossRef] [PubMed]
Bauer-Mehren, A.; Rautschka, M.; Sanz, F.; Furlong, L.I. Disgenet: A cytoscape plugin to visualize, integrate, search and analyze gene-disease networks. Bioinformatics 2010, 26, 2924–2926. [Google Scholar] [CrossRef] [PubMed]
Davis, A.P.; Grondin, C.J.; Lennon-Hopkins, K.; Saraceni-Richards, C.; Sciaky, D.; King, B.L.; Wiegers, T.C.; Mattingly, C.J. The comparative toxicogenomics database’s 10th year anniversary: Update 2015. Nucleic Acids Res. 2015, 43, D914–D920. [Google Scholar] [CrossRef] [PubMed]
Aranda, B.; Achuthan, P.; Alam-Faruque, Y.; Armean, I.; Bridge, A.; Derow, C.; Feuermann, M.; Ghanbarian, A.T.; Kerrien, S.; Khadake, J.; et al. The IntAct molecular interaction database in 2010. Nucleic Acids Res. 2010, 38, D525–D531. [Google Scholar] [CrossRef] [PubMed]
Kerrien, S.; Aranda, B.; Breuza, L.; Bridge, A.; Broackes-Carter, F.; Chen, C.; Duesbury, M.; Dumousseau, M.; Feuermann, M.; Hinz, U.; et al. The IntAct molecular interaction database in 2012. Nucleic Acids Res. 2012, 40, D841–D846. [Google Scholar] [CrossRef] [PubMed]
Stark, C.; Breitkreutz, B.J.; Reguly, T.; Boucher, L.; Breitkreutz, A.; Tyers, M. BioGRID: A general repository for interaction datasets. Nucleic Acids Res. 2006, 34, D535–D539. [Google Scholar] [CrossRef] [PubMed]
Kohler, S.; Bauer, S.; Horn, D.; Robinson, P.N. Walking the interactome for prioritization of candidate disease genes. Am. J. Hum. Genet. 2008, 82, 888–905. [Google Scholar] [CrossRef] [PubMed]
Soler-López, M.; Zanzoni, A.; Lluis, R.; Stelzl, U.; Aloy, P. Interactome mapping suggests new mechanistic details underlying Alzheimer’s disease. Genome Res. 2011, 21, 364–376. [Google Scholar] [CrossRef] [PubMed]
Diao, B.; Liu, Y.; Zhang, Y.; Xu, G.Z. A graph-clustering approach to search important molecular markers and pathways of Parkinson’s disease. Afr. J. Biotechnol. 2011, 10, 15656–15661. [Google Scholar] [CrossRef]
Altaf-Ul-Amin, M.; Shinbo, Y.; Mihara, K.; Kurokawa, K.; Kanaya, S. Development and implementation of an algorithm for detection of protein complexes in large interaction networks. BMC Bioinform. 2006, 7, 207. [Google Scholar] [CrossRef] [PubMed]
Talwar, P.; Silla, Y.; Grover, S.; Gupta, M.; Agarwal, R.; Kushwaha, S.; Kukreti, R. Genomic convergence and network analysis approach to identify candidate genes in Alzheimer’s disease. BMC Genom. 2014, 15, 199. [Google Scholar] [CrossRef] [PubMed]
Guney, E.; Oliva, B. Exploiting protein-protein interaction networks for genome-wide disease-gene prioritization. PLoS ONE 2012, 7, e43557. [Google Scholar] [CrossRef] [PubMed]
Winkler, J.M.; Fox, H.S. Transcriptome meta-analysis reveals a central role for sex steroids in the degeneration of hippocampal neurons in Alzheimer’s disease. BMC Syst. Biol. 2013, 7, 51. [Google Scholar] [CrossRef] [PubMed]
Silva, J.V.; Yoon, S.; Domingues, S.; Guimarães, S.; Goltsev, A.V.; da Cruz e Silva, E.F.; Mendes, J.F.F.; da Cruz e Silva, O.B.A.; Fardilha, M. Amyloid precursor protein interaction network in human testis: Sentinel proteins for male reproduction. BMC Bioinform. 2015, 16, 1. [Google Scholar] [CrossRef] [PubMed]
Seah, B.S.; Bhowmick, S.S.; Dewey, C.F.; Yu, H. FUSE: Towards multi-level functional summarization of protein interaction networks. In Proceedings of the 2nd ACM Conference on Bioinformatics, Computational Biology and Biomedicine, Chicago, IL, USA, 31 July–3 August 2011; ACM: New York, NY, USA, 2011; pp. 2–11. [Google Scholar]
Khuller, S.; Mossb, A.; Naor, J. The budgeted maximum coverage problem. Inf. Process. Lett. 1999, 70, 39–45. [Google Scholar] [CrossRef]
Rakshit, H.; Rathi, N.; Roy, D. Construction and analysis of the protein-protein interaction networks based on gene expression profiles of Parkinson’s disease. PLoS ONE 2014, 9, e103047. [Google Scholar]
Mukherjeee, S.; Kaeberlein, M.; Kauwe, J.; Naj, A.C.; Crane, P. A systems-biology approach to identify candidate genes for Alzheimer’s disease by integrating protein-protein interaction network and subsequent in vivo validation of candidate genes using A C. Elegans model of AB toxicity. Alzheimer’s Dement. J. Alzheimer’s Assoc. 2013, 10, 298–299. [Google Scholar] [CrossRef]
Correia, C.; Oliveira, G.; Vicente, A.M. Protein interaction networks reveal novel autism risk genes within GWAS statistical noise. PLoS ONE 2014, 9, e112399. [Google Scholar] [CrossRef] [PubMed]
Giorgini, F.; Muchowsk, P.J. Connecting the dots in huntington’s disease with protein interaction networks. Genome Biol. 2005, 6, 210. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Li, J.; Shi, M.; Ma, Z.; Zhao, S.; Euskirchen, G.; Ziskin, J.; Urban, A.; Hallmayer, J.; Snyder, M. Integrated systems analysis reveals a molecular network underlying autism spectrum disorders. Mol. Syst. Biol. 2014, 10, 774. [Google Scholar] [CrossRef] [PubMed]
Blondel, V.D.; Guillaume, J.L.; Lambiotte, R.; Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. 2008, 2008, P10008. [Google Scholar] [CrossRef]
Goñi, J.; Esteban, F.J.; Mendizabal, N.V.D.; Sepulcre, J.; Ardanza-Trevijano, S.; Agirrezabal, I.; Villoslada, P. A computational analysis of protein-protein interaction networks in neurodegenerative diseases. BMC Syst. Biol. 2008, 2, 1. [Google Scholar] [CrossRef] [PubMed]
Stanley, M.L.; Moussa, M.N.; Paolini, B.M.; Lyday, R.G.; Burdette, J.H.; Laurienti, P.J. Defining nodes in complex brain networks. Front. Comput. Neurosci. 2013, 7. [Google Scholar] [CrossRef] [PubMed]
Crosson, B.; Ford, A.; McGregor, K.M.; Meinzer, M.; Cheshkov, S.; Li, X.; Walker-Batson, D.; Briggs, R.W. Functional imaging and related techniques: An introduction for rehabilitation researchers. J. Rehabil. Res. Dev. 2010, 47, 7–34. [Google Scholar] [CrossRef]
Rombouts, S.A.; Barkhof, F.; Goekoop, R.; Stam, C.J.; Scheltens, P. Altered resting state networks in mild cognitive impairment and mild Alzheimer’s disease: An fMRI study. Hum. Brain Mapp. 2005, 26, 231–239. [Google Scholar] [CrossRef] [PubMed]
Supekar, K.; Menon, V.; Rubin, D.; Musen, M.; Greicius, M.D. Network analysis of intrinsic functional brain connectivity in Alzheimer’s disease. PLoS Comput. Biol. 2008, 4, e1000100. [Google Scholar] [CrossRef] [PubMed]
Göttlich, M.; Münte, T.F.; Heldmann, M.; Kasten, M.; Hagenah, J.; Krämer, U.M. Altered resting state brain networks in Parkinson’s Disease. PLoS ONE 2013, 8, e77336. [Google Scholar]
Bluhm, R.L.; Miller, J.; Lanius, R.A.; Osuch, E.A.; Boksman, K.; Neufeld, R.; Théberge, J.; Schaefer, B.; Williamson, P. Spontaneous low-frequency fluctuations in the BOLD signal in Schizophrenic patients: Anomalies in the default network. Schizophr. Bull. 2007, 33, 1004–1012. [Google Scholar] [CrossRef] [PubMed]
Heuvel, M.P.v.d.; Hulshoff Pol, H.E. Exploring the brain network: A review on resting-state fMRI functional connectivity. Eur. Neuropsychopharmacol. 2010, 20, 519–534. [Google Scholar] [CrossRef] [PubMed]
Bullmore, E.; Sporns, O. Complex brain networks: Graph theoretical analysis of structural and functional systems. Nat. Rev. Neurosci. 2009, 10, 186–198. [Google Scholar] [CrossRef] [PubMed]
Vissersa, M.E.; Cohena, M.X.; Geurtsa, H.M. Brain connectivity and high functioning Autism: A promising path of research that needs refined models, methodological convergence, and stronger behavioral links. Neurosci. Biobehav. Rev. 2012, 36, 604–625. [Google Scholar] [CrossRef] [PubMed]
Goldenberg, D.; Galvána, A. The use of functional and effective connectivity techniques to understand the developing brain. Dev. Cogn. Neurosci. 2015, 12, 155–164. [Google Scholar] [CrossRef] [PubMed]
Bassett, D.S.; Yang, M.; Wymbs, N.F.; Grafton, S.T. Learning-Induced autonomy of sensorimotor systems. Nat. Neurosci. 2015, 18, 744–751. [Google Scholar] [CrossRef] [PubMed]
Fallani, F.D.V.; Richiardi, J.; Chavez, M.; Achard, S. Graph analysis of functional brain networks: Practical issues in translational neuroscience. Philos. Trans. R. Soc. B Biol. Sci. 2014. [Google Scholar] [CrossRef]
Cheng, H.; Wang, Y.; Sheng, J.; Kronenberger, W.G.; Mathews, V.P.; Hummer, T.A.; Saykin, A.J. Characteristics and variability of structural networks derived from diffusion tensor imaging. Neuroimage 2012, 61, 1153–1164. [Google Scholar] [CrossRef] [PubMed]
Koch, K.; Wagner, G.; Dahnke, R.; Schachtzabel, C.; Schultz, C.; Roebel, M.; Güllmar, D.; Reichenbach, J.R.; Sauer, H.; Schlösser, R.G.M. Disrupted white matter integrity of corticopontine-cerebellar circuitry in Schizophrenia. Eur. Arch. Psychiatry Clin. Neurosci. 2010, 260, 419–426. [Google Scholar] [CrossRef] [PubMed]
Clayden, J.D. Imaging connectivity: MRI and the structural networks of the brain. Funct. Neurol. 2013, 28, 197–203. [Google Scholar] [PubMed]
Wang, Z.; Dai, Z.; Gong, G.; Zhou, C.; He, Y. Understanding structural-functional relationships in the human brain: A large-scale network perspective. Neuroscientist 2014, 21, 290–305. [Google Scholar] [CrossRef] [PubMed]
Tewarie, P.; Hillebrand, A.; Schoonheimc, M.; Dijk, B.V.; Geurts, J.; Barkhof, F.; Polman, C.; Stamb, C. Functional brain network analysis using minimum spanning trees in multiple sclerosis: An meg source-space study. NeuroImage 2014, 88, 308–318. [Google Scholar] [CrossRef] [PubMed]
Sang, L.; Zhang, J.; Wang, L.; Zhang, J.; Zhang, Y.; Li, P.; Wang, J.; Qiu, M. Alteration of brain functional networks in early-stage Parkinson’s disease: A resting-state fmri study. PLoS ONE 2015, 10, e0141815. [Google Scholar] [CrossRef] [PubMed]
Tzourio-Mazoyer, N.; Landeau, B.; Papathanassiou, D.; Crivello, F.; Etard, O.; Delcroix, N.; Mazoyer, B.; Joliot, M. Automated anatomical labeling of activations in spm using a macroscopic anatomical parcellation of the mni mri single-subject Brain. NeuroImage 2002, 15, 273–289. [Google Scholar] [CrossRef] [PubMed]
Dubbelink, K.T.E.O.; Stoffers, D.; Deijen, J.B.; Twisk, J.W.; Stam, C.J.; Hillebrand, A.; Berendse, H.W. Resting-state functional connectivity as a marker of disease progression in Parkinson’s disease: A longitudinal MEG study. NeuroImage Clin. 2013, 2, 612–619. [Google Scholar] [CrossRef] [PubMed]
Peters, J.M.; Taquet, M.; Vega, C.; Jeste, S.S.; Fernández, I.S.; Tan, J.; Nelson, C.A.; Sahin, M.; Warfield, S.K. Brain functional networks in syndromic and non-syndromic autism: A graph theoretical study of EEG connectivity. BMC Med. 2013, 11, 54. [Google Scholar] [CrossRef] [PubMed]
Ortega, G.J.; Sola, R.G.; Pastor, J. Complex network analysis of human ECoG data. Neurosci. Lett. 2008, 447, 129–133. [Google Scholar] [CrossRef] [PubMed]
Hu, C.; Fakhri, G.E.; Li, Q. Evaluating structural symmetry of weighted brain networks via graph matching. In Medical Image Computing and Computer-Assisted Intervention-MICCAI 2014; Lecture Notes in Computer Science; Golland, P., Hata, N., Barillot, C., Hornegger, J., Howe, R., Eds.; Springer International Publishing: Berlin/Heidelberg, Germany, 2014; Volume 8674, pp. 733–740. [Google Scholar]
Umeyama, S. An eigendecomposition approach to weighted graph matching problems. IEEE Trans. Pattern Anal. Mach. Intell. 1988, 10, 695–703. [Google Scholar] [CrossRef]
He, Y.; Chen, Z.; Gong, G.; Evans, A. Structural insights into aberrant topological patterns of large-scale cortical networks in Alzheimer’s disease. J. Neurosci. 2008, 28, 4756–4766. [Google Scholar] [CrossRef] [PubMed]
Lo, C.Y.; Wang, P.N.; Chou, K.H.; Wang, J.; He, Y.; Lin, C.P. Diffusion tensor tractography reveals abnormal topological organization in structural cortical networks in Alzheimer’s disease. J. Neurosci. 2010, 30, 16876–16885. [Google Scholar] [CrossRef] [PubMed]
Bassett, D.S.; Bullmore, E.; Verchinski, B.A.; Mattay, V.S.; Weinberger, D.R.; Meyer-Lindenberg, A. Hierarchical organization of human cortical networks in health and schizophrenia. J. Neurosci. 2008, 28, 9239–9248. [Google Scholar] [CrossRef] [PubMed]
Hwang, T.; Atluri, G.; Xie, M.; Dey, S.; Hong, C.; Kumar, V.; Kuang, R. Co-clustering phenome-genome for phenotype classification and disease gene discovery. Nucleic Acids Res. 2012, 40, e146. [Google Scholar] [CrossRef] [PubMed]
Rudie, J.D.; Brown, J.; Beck-Pancer, D.; Hernandez, L.; Dennis, E.; Thompson, P.; Bookheimer, S.; Daprettoa, M. Altered functional and structural brain network organization in Autism. NeuroImage Clin. 2013, 2, 79–94. [Google Scholar] [CrossRef] [PubMed]
Power, J.D.; Cohen, A.L.; Nelson, S.M.; Wig, G.S.; Barnes, K.A.; Church, J.A.; Vogel, A.C.; Laumann, T.O.; Miezin, F.M.; Schlaggar, B.L.; Petersen, S.E. Functional network organization of the human brain. Neuron 2011, 72, 665–678. [Google Scholar] [CrossRef] [PubMed]
Hagmann, P.; Sporns, O.; Madan, N.; Cammoun, L.; Pienaar, R.; Wedeen, V.J.; Meuli, R.; Thiran, J.P.; Grant, P.E. White matter maturation reshapes structural connectivity in the late developing human brain. Proc. Natl. Acad. Sci. USA 2010, 107, 19067–19072. [Google Scholar] [CrossRef] [PubMed]
Kivelä, M.; Arenas, A.; Barthelemy, M.; Gleeson, J.P.; Moreno, Y.; Porter, M.A. Multilayer Networks. J. Complex Netw. 2014, 2, 203–271. [Google Scholar] [CrossRef]
De Domenico, M.; Solé-Ribalta, A.; Cozzo, E.; Kivelä, M.; Moreno, Y.; Porter, M.A.; Gómez, S.; Arenas, A. Mathematical Formulation of Multilayer Networks. Phys. Rev. X 2013, 3, 041022. [Google Scholar] [CrossRef]
Sael, L.; Jeon, I.; Kang, U. Scalable Tensor Mining. Big Data Res. 2015, 2, 82–86. [Google Scholar] [CrossRef]
Jeon, B.S.; Jeon, L.S.I.; Kang, U. SCouT: Scalable coupled matrix-tensor factorization—Algorithm and discoveries. Int. Conf. Data Eng. 2016, in press. [Google Scholar]
Gustafsson, M.; Nestor, C.E.; Zhang, H.; Barabási, A.L.; Baranzini, S.; Brunak, S.; Chung, K.F.; Federoff, H.J.; Gavin, A.C.; Meehan, R.R.; et al. Modules, networks and systems medicine for understanding disease and aiding diagnosis. Genome Med. 2014, 6, 82. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Salem, S.; Ozcaglar, C. Hybrid coexpression link similarity graph clustering for mining biological modules from multiple gene expression datasets. Bio. Data Mining 2014, 7, 1. [Google Scholar] [CrossRef] [PubMed]
Didier, G.; Brun, C.; Baudot, A. Identifying communities from multiplex biological networks. Peer J. 2015, 3, e1525. [Google Scholar] [CrossRef] [PubMed]
Newman, M.E.J.; Girvan, M. Finding and evaluating community structure in networks. Phys. Rev. E 2004, 69, 026113. [Google Scholar] [CrossRef] [PubMed]

Figure 1. APP protein-protein interaction sub-networks. Red nodes represent the proteins from Yeast Two-Hybrid screening and blue nodes indicate interactors extracted from the databases. Adapted from “Amyloid precursor protein interaction network in human testis: sentinel proteins for male reproduction”, 2015, BMC Bioinformatics, 16:12, p. 5. Copyright 2015 Silva et al. [49]; licensee BioMed Central.

Figure 2. The Alzheimer’s brain network showing connectivity of seed proteins. Purple nodes indicate the seed-proteins with their name. Orange nodes indicate neighboring proteins that belong to the giant component, i.e., the largest section of a network whose nodes are connected. Green nodes indicate neighbors that are not included in the giant component. Adapted from “A computational analysis of protein-protein interaction networks in neurodegenerative diseases”, 2008, BMC Systems Biology, 2:52, p. 7. Copyright 2008 Goni et al. [58]; licensee BioMed Central Ltd., London, U.K.

Figure 3. Overview of brain network analysis. Clusters are color coded in the rightmost figure.

Table 1. Graph theoretical measures for network analysis.

**Table 1.** Graph theoretical measures for network analysis.
Measure	Scope	Computation
Clustering coefficient	Local	$c_{i} = \frac{2 e_{i}}{k_{i} (k_{i} - 1)}$ , where $k_{i}$ is the degree and $e_{i}$ is the number of links between neighbors of the i-th node.
Local efficiency	Local	$E_{l o c} = \frac{1}{N} \sum_{i \in G} E (G_{i})$ , where $E (G_{i}) = \frac{2}{N (N - 1)} \sum_{i < j \in G_{i}} \frac{1}{d (i, j)}$ , where $G_{i}$ is the subgraph of G that consists of node i immediate neighbors excluding i and $d (i, j)$ is the shortest path length between nodes i and j.
Degree centrality	Local	number of edges emanating from a node
Betweenness centrality	Local	$b_{i} = \sum_{j, k \in N, j \neq k} \frac{n_{i, k} (i)}{n_{j, k}}$ where $n_{i, k} (i)$ is number of shortest paths between j and k that run through iand $n_{i, k}$ is the number of shortest paths between j and k
Closeness centrality	Local	$c l o s e n e s s (i) = \sum_{j} {[d (i, j)]}^{- 1}$
Eccentricity	Local	$e c c (i) = \frac{1}{m a x {d (i, j) : j \in V}}$ where V is the set of nodes
Radiality	Local	$r a d (i) = \frac{\sum_{j \in V} (△_{G} + 1 - d (i, j))}{N - 1}$ where $△_{G}$ is the value of the diameter
Characteristic path length	Global	$L = \frac{1}{N (N - 1)} \sum_{i, j \in N, i \neq j} d (i, j)$
Global efficiency	Global	$E_{g l o b} = \frac{1}{N (N - 1)} \sum_{i, j \in N, i \neq j} \frac{1}{d (i, j)}$
Minimum spanning tree	Global	Kruskal’s algorithms [20], etc.
Modularity	Global	$Q = \sum_{i = 1}^{k} (e_{i i} - a_{i}^{2})$ where $e_{i i}$ is the fraction of edges that connects nodes in module i, $a_{i}^{2}$ is the fraction of edges that connect at least one node in the module i and k is the number of modules

Table 2. Biological network public resources. PPI, Protein-Protein Interactions.

**Table 2.** Biological network public resources. PPI, Protein-Protein Interactions.
Group	Name	Description	Uniform Resource Locator and Reference
PPI	Mint	Collects experimentally-verified PPIs in a binary or complex representation. Merged with InAct since 2013.	http://mint.bio.uniroma2.it/mint/ [27]
	String	The known and predicted protein interactions. The interactions include direct (physical) and indirect (functional) associations derived from genomic context, high-throughput experiments, coexpression, previous knowledge.	http://string-db.org/ [28,29]
	DIP	Manually- and automatically-curated database. Experimentally-determined interactions between proteins.	http://dip.doe-mbi.ucla.edu/dip/Main.cgi [30]
Biological Pathway	HPRD	Human PPI manually extracted from the literature.	http://www.hprd.org/ [31]
	KEGG	Manually-curated pathway maps representing knowledge of the molecular interaction and reaction networks.	http://www.genome.jp/kegg/ [32]
	Reactome	Manually-curated pathway.	http://www.reactome.org/ [33,34]
	Alz-Pathway	Manually-curated; comprehensively catalogs signaling pathways for Alzheimer’s disease.	http://alzpathway.org/ [35]
	Pathway-Common	Collection of publicly available pathway information from multiple organisms.	http://www.pathwaycommons.org/pc [36]
Gene Disease Network (GDAs)	DisGeNET	Integrated database from various expert-curated databases and text-mining-derived associations, including Mendelian, complex and environmental diseases.	http://www.disgenet.org/web/DisGeNET [37]
Gene Disease Network (GDAs)	CTDTM	Integrated chemical-gene, chemical-disease and gene-disease interactions manually-curated from the literature.	http://ctdbase.org/ [38]
Multiple Type	InAct	Standards-compliant repository of molecular interactions, including protein-protein, protein-small molecule and protein-nucleic acid interactions.	https://www.ebi.ac.uk/intact/ [39,40]
Multiple Type	BioGrid	Curated biological database of protein-protein interactions, genetic interactions, chemical interactions and post-translational modifications.	http://thebiogrid.org/ [41]

© 2016 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Thomas, J.; Seo, D.; Sael, L. Review on Graph Clustering and Subgraph Similarity Based Analysis of Neurological Disorders. Int. J. Mol. Sci. 2016, 17, 862. https://doi.org/10.3390/ijms17060862

AMA Style

Thomas J, Seo D, Sael L. Review on Graph Clustering and Subgraph Similarity Based Analysis of Neurological Disorders. International Journal of Molecular Sciences. 2016; 17(6):862. https://doi.org/10.3390/ijms17060862

Chicago/Turabian Style

Thomas, Jaya, Dongmin Seo, and Lee Sael. 2016. "Review on Graph Clustering and Subgraph Similarity Based Analysis of Neurological Disorders" International Journal of Molecular Sciences 17, no. 6: 862. https://doi.org/10.3390/ijms17060862

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Review on Graph Clustering and Subgraph Similarity Based Analysis of Neurological Disorders

Abstract

1. Introduction

1.1. Characterizing Neurological Disorders with Graphs

1.2. Graph Clustering and Graph Similarity

2. Types of Bio-Networks and Applied Analysis on Neurological Disorders

2.1. Types of Bio-Networks

2.2. Bio-Network-Based Neurological Disorder Analysis

2.2.1. Causal and Susceptible Gene Finding

2.2.2. Disease Characterization

3. Types of Brain Networks Used in the Studies of Neurological Disorder

3.1. Types of Brain Networks

3.1.1. Functional Brain Networks

3.1.2. Structural Brain Networks

3.2. Graph Analysis Applications on Brain Networks

3.2.1. Analysis of Functional Brain Networks

3.2.2. Analysis of Structural Brain Networks

4. Need for Integrative Analysis on Large Graphs

4.1. Integrative Analysis for Single-Layered Graphs

4.2. Integrative Analysis of Multi-Layer Graphs

4.2.1. Multi-Layer Graphs

4.2.2. Existing Application of Multi-Layer Graph Analysis

5. Discussion and Conclusions

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI