Identifying Brain Abnormalities with Schizophrenia Based on a Hybrid Feature Selection Technology

Qiao, Chen; Lu, Lujia; Yang, Lan; Kennedy, Paul J.

doi:10.3390/app9102148

by Chen Qiao ¹,*, Lujia Lu ¹, Lan Yang ¹ and Paul J. Kennedy ²

¹ School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, China

² Center for Artificial Intelligence, University of Technology Sydney, Sydney 2007, Australia

* Author to whom correspondence should be addressed.

Appl. Sci. 2019, 9(10), 2148; https://doi.org/10.3390/app9102148

Submission received: 17 April 2019 / Revised: 16 May 2019 / Accepted: 16 May 2019 / Published: 26 May 2019

(This article belongs to the Special Issue Optical Methods for Tissue Diagnostics)

Featured Application

The hybrid feature selection method, which combines machine learning and traditional statistical methods, is proposed to identify the brain abnormalities of schizophrenia (SZ). The results suggest that brain regions and connectivity in SZ patients are disrupted compared with healthy controls (HCs), which may cause the cognitive deficits and autistic thinking seen in SZ. The findings support the validity of the proposed method, which is therefore promising for other kinds of medical data analysis, to enhance diagnostic ability and, further, precision medicine.

Abstract

Many medical imaging data, especially magnetic resonance imaging (MRI) data, have a small sample size but a large number of features. Effectively reducing the data dimension and accurately locating biomarkers in such data are crucial for diagnosis and, further, for precision medicine. In this paper, we propose a hybrid feature selection method based on machine learning and traditional statistical approaches and explore the brain abnormalities of schizophrenia using functional and structural MRI data. The results show that the abnormal brain regions are mainly distributed in the supramarginal gyrus, cingulate gyrus, frontal gyrus, precuneus and caudate, and that the abnormal functional connections are related to the caudate nucleus, insula and rolandic operculum. In addition, complex network analyses based on graph theory are applied to the functional connection data, and the results demonstrate that the located abnormal functional connections can distinguish schizophrenia patients from healthy controls. The abnormalities identified by the proposed hybrid feature selection method show that schizophrenia involves abnormal brain regions and a disruption of network segregation and network integration. These changes may lead to inaccurate and inefficient information processing and synthesis in the brain, which provides further evidence for the cognitive dysmetria of schizophrenia.

Keywords:

magnetic resonance imaging; schizophrenia; feature selection; brain abnormalities; biomarkers

1. Introduction

Schizophrenia (SZ) is a mental disorder characterized by abnormal social behaviour and a failure to understand reality. Decades of research on brain structure and function have provided some understanding of the neurobiological mechanisms underlying its symptoms [1,2]. For example, studies on brain structure suggest that neuroanatomical alterations may underlie the clinical onset of psychotic symptoms. The findings from functional brain imaging studies support a leading hypothesis that SZ stems from disconnectivity, namely abnormal interactions between widespread brain networks. Recently, neuroimaging techniques such as structural magnetic resonance imaging (sMRI) and functional magnetic resonance imaging (fMRI) have become powerful tools for examining the abnormal regions and aberrant connectivity of brain networks in SZ, moving psychiatry from subjective descriptive classification towards objective and tangible brain-based measures [3]. For example, Du et al. applied a novel group information guided method to estimate inherent dynamic functional brain networks and found that the abnormalities of SZ were mainly distributed in the cerebellum, frontal cortex, thalamus and temporal cortex [4]. With fMRI data of SZ, Shine et al. showed that dynamic changes of functional connectivity are essential for cognitive processing [5]. Rosenberg et al. demonstrated that whole-brain functional connectivity strength might serve as a biomarker of sustained attention for both healthy and disease assessments [6]. It was also shown that functional connectivity profiles can predict levels of fluid intelligence [7]. By a supervised learning strategy that fuses sMRI and fMRI data, some modality-specific biomarkers of generalized cognition in SZ were identified [1]. Based on sMRI data, Palaniyappan et al. suggested that concomitant increase and decrease in grey matter occur in association with persistent negative thought disorder in clinically stable individuals with SZ [8]. These studies on developing biomarkers allow the field of imaging analysis and psychiatry to move forward.

Given that SZ is often accompanied by cognitive decline, a thorough investigation of brain dynamics as well as brain structure in SZ is important for better understanding the underlying neural mechanism. However, MRI data of SZ usually have a small sample size but a large number of features, i.e., $n \ll p$, where n is the sample size and p is the number of features [9]. A systematic methodology for studying such data is still lacking, because it is difficult to discover the potential information contained in the data, and hence to form a cognitive concept of the data or complete an identification task, from a limited number of observations [10]. To deal with data whose dimension is much larger than the sample size, the generally used approach is dimensionality reduction, for which feature selection and feature extraction are the common methods. Feature selection identifies those essential features of the raw data that contribute most to distinguishing different objects. It thereby enhances the interpretability of learning, which is crucial for exploring the mechanisms of why things are different. Mathematically, consider any raw data as an N-dimensional vector $X = (x_{1}, x_{2}, \dots, x_{N})^{T}$, from which we can select M features $\tilde{X} = (x_{R_{1}}, x_{R_{2}}, \dots, x_{R_{M}})^{T}$ as required, where $x_{R_{i}}$, $i = 1, 2, \dots, M$, are features chosen from $\{x_{1}, x_{2}, \dots, x_{N}\}$ based on some rule R. The rule could be any of the following: $\tilde{X}$ is the optimal choice with respect to some evaluation index for classifiers; the feature subset has the lowest dimension for a given accuracy; the conditional probability distribution function of the data and that of the selected features remain the same; or the error rate of the classifier cannot be reduced by increasing or decreasing the number of features. By such a selection process, we can get rid of redundant or irrelevant features without incurring much loss of information. The distinguishing features can be found, and in this way, the dimension of the data space declines, the complexity of the data is reduced and, especially, the performance of classification and prediction can be improved. Because of the direct interpretability of the data, feature selection is widely used in many fields such as genomics, medical image analysis, computer vision, speech recognition, information retrieval and time series prediction [11,12,13].

According to the way the evaluation criteria and classifiers are combined, feature selection methods can be divided into five types: filter, wrapper, embedded, ensemble and hybrid methods [14]. Filter methods mainly depend on the attributes of the features, and their evaluation criteria depend only on the original data, not on classifiers [15]. Wrapper methods directly take the performance of the classifiers as the evaluation criterion for selecting feature subsets; thus, the results of wrapper methods are tied to specific classifiers [16]. Methods that embed filter methods and wrapper methods are called embedded methods, and they are usually composed of two stages. First, filter methods are used to eliminate most of the irrelevant and noisy features, so as to reduce effectively the data dimension of the subsequent search process. The second stage adopts wrapper methods to carry out the further feature selection process [17]. Ensemble methods use different sampling strategies to extract multiple sample sets and then apply a specific feature selection algorithm to obtain multiple feature subsets, which are further integrated into a more stable feature subset [18]. Compared with the above three methods, the performance of ensemble methods no longer depends merely on a single selected subset, but it is still limited since only one specific feature learner is used. Hybrid methods combine two or more well-studied feature selection algorithms to form a new strategy and exploit the complementary advantages of the different algorithms to solve a particular problem [19,20]. A hybrid approach usually capitalizes on the advantages of its sub-algorithms and is therefore more robust than a single approach. The feature selection techniques mentioned above have been applied to many fields of dimensionality reduction analysis [21,22,23]. In addition to the above five types of feature selection methods, some traditional statistical methods, such as hypothesis testing and correlation coefficients, can also be used to reduce dimensionality. These methods can obtain features with higher distinguishing ability, so as to improve the discriminative capacity between classes [24,25].

Motivated by identifying biomarkers of SZ that are associated with cognitive composite ability and specific cognitive domains such as attention, working memory and verbal learning, in this paper we explore the brain abnormalities of SZ by proposing a hybrid feature selection method that combines machine learning and traditional statistical approaches. The data have 410 features from both functional and structural MRI, i.e., functional network connectivity (FNC) and source-based morphometry (SBM), for 40 patients with SZ and 46 healthy controls (HCs). Applying our method to these two datasets, the results show that there exist six aberrant brain regions and 17 abnormal functional connections between the SZ group and the HC group. Among our findings, there were both decreases and increases in grey matter volume and in the connectivity of brain regions. The decreases mainly appeared in the default mode network (DMN) and salience network (SN), e.g., the grey matter volumes of the precuneus (PCUN) and caudate (CAU) and the connectivity between these two brain regions, as well as between the insula (INS) and CAU, were significantly reduced. Moreover, all connectivity involving the rolandic operculum and insula was significantly reduced [26,27,28,29,30,31]. The significantly increased grey matter volume was mainly distributed in the frontal gyrus (FG) and supramarginal gyrus (SMG), and there were also four connections with significantly increased connectivity, such as that between the middle frontal gyrus and superior occipital gyrus and that between the middle occipital gyrus and fusiform gyrus; the corresponding increases are also discussed [28,29,32]. To further confirm the significance of the selected abnormal functional connections, we also used complex network analysis. Since the level of response activity in brain regions and the strength of functional connectivity between different brain regions can reflect the degree of brain disorders, the results have the potential to provide evidence for accurate diagnosis and, further, for precision medicine in such kinds of psychiatric diseases.

2. Methodology

There are many feature selection methods based on machine learning, as well as on traditional statistics. Combining the two, especially by developing a hybrid feature selection method, is still worthy of study. In this section, we introduce a hybrid feature selection method combining three machine learning methods and three statistical methods. In addition, some graph theory is presented to verify the validity of the features selected by the proposed hybrid feature selection method.

2.1. Feature Selection Methods Based on Machine Learning

2.1.1. Feature Selection with Support Vector Machine

Support vector machine based on recursive feature elimination (SVMRFE) is a multi-variable wrapper feature selection algorithm, and it can keep relevant features and remove relatively insignificant feature variables in order to achieve higher classification performance. SVMRFE was first proposed for gene selection [33], and it has been widely applied to MRI data research, text analysis and biological information processing [34,35,36].

For SVMRFE, the scoring function for each feature i is defined as:

$$\mathrm{Score}(i) = |\omega_{i}| \quad \text{or} \quad \mathrm{Score}(i) = \omega_{i}^{2} \tag{1}$$

where $\omega_{i}$ is the weight for feature i as obtained from the SVM training. Thus, features that contribute the most to discriminating the two classes are represented by $|\omega|$ with the highest values, and features with small scores are generally considered as noise, redundant or irrelevant to the problem. Therefore, eliminating features with smaller scores does not bring about great changes of the optimization problem, which is the essence of the algorithm [37,38]. The SVMRFE algorithm is briefly described as below.

Algorithm 1: Support vector machine based on recursive feature elimination (SVMRFE)

Input: Dataset D

Process:

1. Initialization: let the current feature subset $Current\_D$ contain all features, and the optimal feature subset $Best\_D = \emptyset$;

2. Training the classifier: train an SVM on the training set with $Current\_D$, and evaluate the classification accuracy on the test set;

3. Updating $Current\_D$: calculate the importance of each feature in $Current\_D$ by the scoring function (1), and eliminate features with the smallest score;

4. Updating $Best\_D$: if the accuracy rate of $Current\_D$ is greater than that of $Best\_D$, then let $Best\_D = Current\_D$;

5. Repeat Steps 2–4 until the stop condition is satisfied.

Output: The optimal feature subset $Best\_D$

The stopping criterion can be a desired dimensionality, a pre-specified number of iterations or a generalization of the performances, etc.
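
As a rough illustration of Algorithm 1, the following Python sketch performs recursive feature elimination with the score $\omega_{i}^{2}$ from (1). It is a sketch under stated assumptions, not the implementation used in this study: the linear SVM is trained by a plain full-batch subgradient descent on the hinge loss, the stopping criterion is a desired dimensionality (`n_keep`), and the accuracy-based bookkeeping of $Best\_D$ is omitted. All function names, hyperparameters and the toy data are hypothetical.

```python
def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Full-batch subgradient descent on the L2-regularised hinge loss.

    X is a list of feature rows, y a list of labels in {-1, +1}.
    This simple trainer is an assumption made for illustration only.
    """
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]  # gradient of the regulariser
        grad_b = 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:  # sample violates the margin
                for j in range(d):
                    grad_w[j] -= yi * xi[j] / n
                grad_b -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def svm_rfe(X, y, n_keep):
    """Drop the feature with the smallest squared weight until n_keep remain."""
    current = list(range(len(X[0])))  # indices of the surviving features
    eliminated = []                   # features in the order they were dropped
    while len(current) > n_keep:
        X_current = [[row[j] for j in current] for row in X]
        w, _ = train_linear_svm(X_current, y)
        scores = [wj ** 2 for wj in w]       # scoring function (1)
        worst = scores.index(min(scores))
        eliminated.append(current.pop(worst))
    return current, eliminated


# Toy data: feature 0 separates the classes, feature 1 is a weak copy of it,
# and feature 2 is constant and therefore uninformative.
X_toy = [[2.0, 0.1, 0.0], [1.5, 0.1, 0.0], [-2.0, -0.1, 0.0], [-1.5, -0.1, 0.0]]
y_toy = [1, 1, -1, -1]
kept, order = svm_rfe(X_toy, y_toy, n_keep=1)
print(kept, order)
```

On the toy data the constant feature is dropped first and the strongly separating feature survives. Eliminating one feature per iteration is the simplest choice; removing several low-scoring features at once is a common speed-up when the number of features is large.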

2.1.2. Feature Selection with Random Forest

Random forest (RF) is an ensemble machine learning method using tree-type classifiers. It is built by bootstrap sampling technology and random splitting technology, and the final classification result is made by a majority vote of the trees [39,40]. Because of its excellent generalization performance, RF is also further used for feature selection [41,42].

For a given tree, let $S_{0}$ denote the set of input predictor data vectors and $S_{j}$ be the subset of the predictor data reaching node j in the binary split tree. According to the performance of the current feature on node j, $S_{j}$ can be divided into two subsets, $S_{j}^{L}$ and $S_{j}^{R}$, with $S_{j}^{L} \cup S_{j}^{R} = S_{j}$ and $S_{j}^{L} \cap S_{j}^{R} = \emptyset$. The best split is chosen according to the mean decrease of the Gini index, which is defined as:

$$\Delta \mathrm{Gini}_{i}(j) = \mathrm{Gini}(j) - \left( \frac{|S_{j}^{L}|}{|S_{j}|} \mathrm{Gini}(j^{L}) + \frac{|S_{j}^{R}|}{|S_{j}|} \mathrm{Gini}(j^{R}) \right) \tag{2}$$

where $\mathrm{Gini}(j) = 1 - \sum_{c \in C} P_{c}^{2}$ is the Gini index at node j. This metric reflects the contribution of each feature to node j; therefore, we can estimate the importance of feature i by the Gini importance:

$$\mathrm{Score}_{Gini}(i) = \frac{1}{n_{tree}} \sum_{t = 1}^{n_{tree}} \sum_{j} \Delta \mathrm{Gini}_{i}(j, t) \tag{3}$$

where $\Delta \mathrm{Gini}_{i}(j, t)$ is the value of $\Delta \mathrm{Gini}_{i}(j)$ on tree t. The Gini importance indicates how large the overall discriminative value of a feature is for the classification task. We randomly chose a feature i, calculated its Gini importance defined in (3) and removed the features with Gini importance below that of feature i. The algorithm for feature selection with random forest by Gini importance (RFFS-GI) is briefly described below.

Algorithm 2: Feature selection with random forest by Gini importance (RFFS-GI)

Input: Dataset D

Process:

1. Randomly choose a feature i into the feature set;

2. Calculate the Gini importance of all features in the feature set with the scoring function (3);

3. Keep features with Gini importance above that of feature i;

Output: Optimal feature subset
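
The quantities in (2) and (3) can be illustrated with a few lines of Python. This is only a minimal sketch: the helper names are hypothetical, class labels are passed as plain lists, and the per-tree decreases fed to `gini_importance` are assumed to have been collected elsewhere from the nodes that split on the feature of interest.

```python
from collections import Counter


def gini(labels):
    """Gini index 1 - sum_c P_c^2 of the class labels reaching a node."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def gini_decrease(parent, left, right):
    """Equation (2): Gini index of the parent minus the weighted child indices."""
    n = len(parent)
    weighted = len(left) / n * gini(left) + len(right) / n * gini(right)
    return gini(parent) - weighted


def gini_importance(decreases_by_tree):
    """Equation (3): sum the decreases within each tree, then average over trees."""
    return sum(sum(tree) for tree in decreases_by_tree) / len(decreases_by_tree)


node = ["SZ", "SZ", "HC", "HC"]
print(gini_decrease(node, ["SZ", "SZ"], ["HC", "HC"]))  # a split that separates the classes
print(gini_decrease(node, ["SZ", "HC"], ["SZ", "HC"]))  # a split that separates nothing
```

A split that separates the two classes perfectly removes all of the parent impurity, whereas a split that leaves both children as mixed as the parent yields no decrease.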

In addition, with bootstrap sampling, about 1/3 of the samples are not drawn for a given tree; these are called the out-of-bag (OOB) data [43]. The OOB data can be regarded as equivalent to test data. Therefore, we can also use the classification accuracy of the random forest classifier on the OOB data as the feature separability criterion, so as to calculate the importance of each feature:

$$\mathrm{Score}_{OOB}(i) = \sum \frac{\mathrm{ooberr2} - \mathrm{ooberr1}}{N} \tag{4}$$

where ooberr1 is the classification error on the OOB data, ooberr2 is the classification error on the OOB data after adding noise to feature i, and N is the number of trees in the random forest. If randomly disturbing a feature greatly increases the classification error on the OOB data, the feature can be considered to have a great influence on the classification result. The algorithm for feature selection with random forest by the classification accuracy on the OOB data (RFFS-OOB) is briefly described below.

Algorithm 3: Feature selection with random forest by the classification accuracy on the OOB data (RFFS-OOB)

Input: Dataset D

Process:

1. Generate random forest;

2. Calculate feature importance based on the scoring function (4), and sort the scores;

3. The top ranked features are selected as the optimal feature subset.

Output: Optimal feature subset.
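
Steps 2 and 3 of Algorithm 3 can be sketched as follows. The sketch assumes that the sum in (4) runs over the N trees and that the per-tree OOB errors before and after perturbing a feature are already available; training the forest itself is not shown, and the function names and numbers are purely illustrative.

```python
def oob_importance(err_before, err_after):
    """Equation (4), assuming one (ooberr1, ooberr2) pair per tree."""
    if len(err_before) != len(err_after):
        raise ValueError("need one error pair per tree")
    n_trees = len(err_before)
    return sum(e2 - e1 for e1, e2 in zip(err_before, err_after)) / n_trees


def top_ranked(scores, k):
    """Return the k feature names with the largest importance scores."""
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Hypothetical OOB errors for a two-tree forest and three features.
baseline = [0.2, 0.3]
scores = {
    "f1": oob_importance(baseline, [0.4, 0.5]),  # error rises in both trees
    "f2": oob_importance(baseline, [0.2, 0.3]),  # error unchanged
    "f3": oob_importance(baseline, [0.3, 0.3]),  # error rises in one tree
}
print(top_ranked(scores, 2))
```

A feature whose perturbation leaves the OOB error unchanged receives a score of zero and ends up at the bottom of the ranking.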

In order to improve the accuracy of feature selection results for the SBM and FNC data, we used SVMRFE, RFFS-GI and RFFS-OOB, and repeated them 20 times separately, counted the frequency of the selected features by each feature selection method and integrated the optimal feature subsets.
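
The frequency counting described above can be sketched with a `Counter`. The layout of the input (one list of selected feature indexes per run, pooled over the three methods) and the example indexes are assumptions made for illustration.

```python
from collections import Counter


def selection_frequency(runs):
    """Count in how many runs each feature index was selected."""
    freq = Counter()
    for selected in runs:
        freq.update(set(selected))  # a feature counts at most once per run
    return freq


# Hypothetical outcome of three runs.
runs = [[3, 7, 11], [7, 11], [7, 24]]
freq = selection_frequency(runs)
print(freq.most_common())
```

With three methods repeated 20 times each, the largest possible total frequency of a feature is 60.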

2.2. Feature Selection Based on Statistical Methods

For classical statistical methods, the discriminative ability of a feature can be quantitatively measured by its contribution on distinguishing different classes [25,44].

The Kendall tau correlation coefficient provides a distribution-free test of independence between two variables. The Kendall tau correlation coefficient of feature j can be defined as:

$$\tau_{j} = \frac{n_{c} - n_{d}}{n_{1} \times n_{2}} \tag{5}$$

where $n_{c}$ and $n_{d}$ are the numbers of concordant and discordant pairs, respectively, and $n_{1}$ and $n_{2}$ are the numbers of samples in the two classes. For a pair of data $(x_{ij}, y_{i})$ and $(x_{kj}, y_{k})$ of feature j, it is a concordant pair when $\mathrm{sgn}(x_{ij} - x_{kj}) = \mathrm{sgn}(y_{i} - y_{k})$, where $\mathrm{sgn}(\cdot)$ is the signum function (i.e., $\mathrm{sgn}(x) = -1$ for $x < 0$, $\mathrm{sgn}(x) = 0$ for $x = 0$ and $\mathrm{sgn}(x) = 1$ for $x > 0$). Correspondingly, it is a discordant pair when $\mathrm{sgn}(x_{ij} - x_{kj}) = -\mathrm{sgn}(y_{i} - y_{k})$. The discriminative power of each feature j is defined as the absolute value of its Kendall tau correlation coefficient.
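
A direct transcription of (5) into Python might look as follows. The sketch assumes binary numeric labels and counts only pairs of samples from different classes, which is what the denominator $n_{1} \times n_{2}$ suggests; pairs with tied feature values are counted as neither concordant nor discordant. The function names are hypothetical.

```python
def sgn(v):
    """Signum function: -1, 0 or 1."""
    return (v > 0) - (v < 0)


def kendall_tau(x, y):
    """Equation (5) for one feature x and binary labels y (e.g. 0 = HC, 1 = SZ)."""
    classes = sorted(set(y))
    if len(classes) != 2:
        raise ValueError("expected exactly two classes")
    n1 = sum(1 for label in y if label == classes[0])
    n2 = len(y) - n1
    n_c = n_d = 0
    for i in range(len(x)):
        for k in range(i + 1, len(x)):
            s_y = sgn(y[i] - y[k])
            if s_y == 0:
                continue  # same class: not one of the n1 * n2 cross-class pairs
            s_x = sgn(x[i] - x[k])
            if s_x == s_y:
                n_c += 1
            elif s_x == -s_y:
                n_d += 1
    return (n_c - n_d) / (n1 * n2)


labels = [0, 0, 1, 1]
print(kendall_tau([1, 2, 3, 4], labels))  # feature ordered with the labels
print(kendall_tau([1, 4, 2, 3], labels))  # feature unrelated to the labels
```

The double loop is quadratic in the number of samples, which is unproblematic for a dataset with fewer than one hundred subjects.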

The permutation test is a non-parametric test method, which is suitable for the case of a small sample size and unknown sample distribution. Assume that there are two samples $x_{A}$ and $x_{B}$, with sample means $\bar{x}_{A}$ and $\bar{x}_{B}$ and sample sizes $n_{A}$ and $n_{B}$, respectively. First, we calculate the observed test statistic $T_{obs} = \bar{x}_{A} - \bar{x}_{B}$. Then, the two samples are merged and divided into two groups of sizes $n_{A}$ and $n_{B}$. For each division, the difference between the mean values of the two groups is calculated and recorded. The set of calculated differences is the exact distribution of the difference under the null hypothesis. Finally, the proportion of divisions for which the absolute value of the calculated difference is greater than or equal to the absolute value of $T_{obs}$ is the p-value of the two-sided test.
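
The procedure can be sketched in Python by enumerating every division of the pooled sample. This exact enumeration is only feasible for very small samples; for group sizes such as 40 and 46, a random subset of divisions would normally be drawn instead, which is not shown here. The function name and the small tolerance used for the floating-point comparison are assumptions of this sketch.

```python
from itertools import combinations


def permutation_test(sample_a, sample_b):
    """Two-sided exact permutation test on the difference of sample means."""
    pooled = list(sample_a) + list(sample_b)
    n_a, n_b = len(sample_a), len(sample_b)
    total = sum(pooled)
    t_obs = sum(sample_a) / n_a - sum(sample_b) / n_b
    extreme = divisions = 0
    for group_a in combinations(range(len(pooled)), n_a):
        sum_a = sum(pooled[i] for i in group_a)
        diff = sum_a / n_a - (total - sum_a) / n_b
        if abs(diff) >= abs(t_obs) - 1e-12:  # tolerance for rounding error
            extreme += 1
        divisions += 1
    return extreme / divisions


print(permutation_test([1, 2, 3], [4, 5, 6]))
```

For the two toy samples there are 20 possible divisions, and only the observed one and its mirror image reach the observed absolute difference of 3, so the p-value is 2/20.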

By the two-sample t-test, we can also determine whether there are significant differences in each feature. The t-value of feature j can be defined as:

$$t_{j} = \frac{|\bar{x}_{1} - \bar{x}_{2}|}{\sqrt{\frac{(n_{1} - 1) s_{1}^{2} + (n_{2} - 1) s_{2}^{2}}{n_{1} + n_{2} - 2} \cdot \left( \frac{1}{n_{1}} + \frac{1}{n_{2}} \right)}} \tag{6}$$

where $\bar{x}_{1}$ and $\bar{x}_{2}$ are the means of feature j for patients and healthy controls (HCs), and $s_{1}$ and $s_{2}$ represent the corresponding standard deviations. With the Kendall tau correlation coefficient, permutation test and two-sample t-test, we can identify features with significant differences.
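
Equation (6) is the pooled-variance t statistic and can be computed with the Python standard library alone. In this minimal sketch the function name is hypothetical, and converting the statistic into a p-value requires the t distribution with $n_{1} + n_{2} - 2$ degrees of freedom, which is not implemented here.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t(x1, x2):
    """Absolute pooled-variance t statistic of equation (6)."""
    n1, n2 = len(x1), len(x2)
    # statistics.variance returns the sample variance s^2 (denominator n - 1)
    pooled_var = ((n1 - 1) * variance(x1) + (n2 - 1) * variance(x2)) / (n1 + n2 - 2)
    return abs(mean(x1) - mean(x2)) / sqrt(pooled_var * (1 / n1 + 1 / n2))


print(pooled_t([1.0, 2.0, 3.0], [4.0, 5.0, 6.0]))
```

For these toy samples both variances equal 1, so the statistic reduces to $3 / \sqrt{2/3}$.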

2.3. Hybrid Feature Selection Based on Both Machine Learning and Statistical Methods

By combining the above machine learning methods and statistical methods, we propose a hybrid feature selection approach. In more detail, for the machine learning methods, we summed the selection frequencies from SVMRFE, RFFS-GI and RFFS-OOB and then selected the features with total frequency greater than a given value b to obtain the significant feature subset. At the same time, for the statistical methods, we selected the features whose absolute Kendall correlation coefficient was greater than a given value c and whose p-values for the two-sample t-test and the permutation test were less than 0.05 as the significant feature subset. Finally, we integrated the significant feature subsets from the machine learning and statistical methods to obtain the optimal feature subset. The above process is the hybrid feature selection procedure, and its flowchart is shown in Figure 1. The experimental results will show that the proposed hybrid feature selection method is an effective attempt to combine machine learning and statistical methods.
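
The combination step can be sketched with set operations. Two assumptions are made: the three statistical criteria are applied jointly, and the two subsets are integrated by intersection, as described in Section 3.2.3. The function name, the dictionary layout (feature index mapped to value) and the toy numbers are hypothetical.

```python
def hybrid_select(freq, tau, p_ttest, p_perm, b, c, alpha=0.05):
    """Intersect the machine learning subset with the statistical subset."""
    # Machine learning side: total selection frequency greater than b.
    ml_subset = {j for j, f in freq.items() if f > b}
    # Statistical side: |tau| greater than c and both p-values below alpha.
    stat_subset = {
        j for j in tau
        if abs(tau[j]) > c and p_ttest[j] < alpha and p_perm[j] < alpha
    }
    return ml_subset & stat_subset


# Toy values for three features.
freq = {1: 58, 2: 40, 3: 60}
tau = {1: 0.30, 2: -0.40, 3: 0.10}
p_ttest = {1: 0.01, 2: 0.01, 3: 0.20}
p_perm = {1: 0.02, 2: 0.01, 3: 0.30}
print(hybrid_select(freq, tau, p_ttest, p_perm, b=55, c=0.26))
```

In the toy example, feature 2 is rejected by the frequency criterion and feature 3 by the statistical criteria, so only feature 1 remains.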

2.4. Complex Network Analysis Based on Graph Theory

The data we used here are MRI data that contain information on both brain regions and functional connections. The hybrid feature selection method can be directly used to explore the disease-related abnormal brain regions and abnormal functional connections. Furthermore, since the brain completes its various tasks through coordination and cooperation between brain regions, it is necessary to study the connection networks of the brain in depth.

The analysis of complex network properties by several indexes (see Figure A1) can characterize the topological attributes of the network. For example, the clustering coefficient quantifies the functional segregation of the brain network, which reflects the ability of specialized processing to occur within densely interconnected groups of brain regions. The characteristic path length quantifies the functional integration of the brain network, which reflects the ability to combine rapidly specialized information from distributed brain regions [45]. Both the global and local network efficiencies quantify the transmission capability of the brain network, i.e., the ability to transmit information between different brain regions. The main difference is that the global network efficiency concerns the whole brain network, whereas the local network efficiency concerns only local parts of it. Thus, by complex network analysis, we can confirm the significance of the selected abnormal connection features and further explore the mechanism of SZ.
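
To make three of these indexes concrete, the following Python sketch computes the clustering coefficient, the characteristic path length and the global efficiency of a small graph. It is a simplification under stated assumptions: the graph is binary, undirected and connected, whereas the networks analysed in this paper are weighted, and the exact definitions used by the authors are those of Figure A1. Local efficiency is omitted, and all names are hypothetical.

```python
from collections import deque


def bfs_distances(adj, source):
    """Shortest path lengths (in edges) from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def clustering_coefficient(adj):
    """Mean over nodes of the fraction of neighbour pairs that are linked."""
    total = 0.0
    for u, neighbours in adj.items():
        k = len(neighbours)
        if k < 2:
            continue  # nodes with fewer than two neighbours contribute 0
        links = sum(1 for a in neighbours for b in neighbours
                    if a < b and b in adj[a])
        total += 2 * links / (k * (k - 1))
    return total / len(adj)


def path_length_and_global_efficiency(adj):
    """Mean shortest path length and mean inverse shortest path length."""
    n = len(adj)
    sum_dist = sum_inverse = 0.0
    for source in adj:
        for target, d in bfs_distances(adj, source).items():
            if target != source:
                sum_dist += d
                sum_inverse += 1 / d
    n_pairs = n * (n - 1)
    return sum_dist / n_pairs, sum_inverse / n_pairs


# Toy graph: a triangle (0, 1, 2) with one extra node 3 attached to node 2.
toy_graph = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}
print(clustering_coefficient(toy_graph))
print(path_length_and_global_efficiency(toy_graph))
```

For the toy graph, the clustering coefficient is 7/12, the characteristic path length is 4/3 and the global efficiency is 5/6.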

3. Experiments

In this section, based on the hybrid feature selection method and network topological analysis, we located the brain abnormalities of both regions and connections with SZ. Firstly, by the SVMRFE, RFFS-GI, RFFS-OOB, correlation coefficient and hypothesis test, the candidates of brain regions and connections associated with SZ were selected separately, and then, by the hybrid method, we could confirm the significant regions and connections of SZ. Furthermore, the complex network analysis based on graph theory was used to verify the selected abnormal connections. Ultimately, we could locate some of the abnormal brain regions and abnormal connections with SZ, which provided theoretical guidance for the rapid and accurate diagnosis of psychiatric diseases and adjuvant therapy.

3.1. Data Collection and Preprocessing

In this study, the Machine Learning for Signal Processing (MLSP) 2014 Schizophrenia classification challenge data were used. The data can be downloaded from https://www.kaggle.com/c/mlsp-2014-mri. They were collected on a 3T MRI scanner at the Mind Research Network and funded by the Centers of Biomedical Research Excellence. Image preprocessing was performed using statistical parametric mapping software (SPM, http://www.fil.ion.ucl.ac.uk/spm). Further feature extraction was done with the GIFT Toolbox (http://mialab.mrn.org/software/gift/), yielding two imaging modalities, i.e., SBM features for structural MRI and FNC features for resting-state functional MRI.

The data consisted of 40 patients with SZ and 46 HCs. A diagnosis of SZ was made by using the Structured Clinical Interview for DSM-IV (SCID; Diagnostic and Statistical Manual of Mental Disorders, DSM) [46]. Each sample had 410 features (32 for SBM and 378 for FNC). SBM features were weights of brain regions, and they indicated the concentration of grey matter in different regions of the subject’s brain [47]. FNC features were the pair-wise correlation values between the time-courses of 28 brain regions and can be seen as a functional modality feature describing the subjects’ overall level of synchronicity between brain areas [48]. These 28 brain regions were selected according to the anatomical automatic labeling (AAL) template, and they are shown in Figure A2, while the connections between the brain regions corresponding to these FNC features are shown in Figure A3.
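
The feature counts are consistent with one another: 28 brain regions give 28 × 27 / 2 = 378 unordered region pairs, and 32 + 378 = 410. The short check below enumerates the pairs in lexicographic order, which is a hypothetical ordering chosen for illustration; the actual correspondence between FNC feature indexes and region pairs is the one shown in Figure A3.

```python
from itertools import combinations

N_REGIONS = 28  # brain regions used for the FNC features
N_SBM = 32      # SBM features

# Hypothetical ordering: pairs (a, b) with a < b in lexicographic order.
region_pairs = list(combinations(range(1, N_REGIONS + 1), 2))
n_fnc = len(region_pairs)
n_total = N_SBM + n_fnc
print(n_fnc, n_total)
```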

3.2. Locating the Abnormalities in Brains for SZ

For both the FNC and SBM data, we performed feature selection methods based on machine learning and statistical approaches, respectively. By the hybrid process, the key features can be selected; namely for SBM data, we obtained the abnormal brain region, and for FNC data, the abnormal connectivities were achieved. Further, we used the brain network based on graph theory to analyse the selected abnormal connections. The following Figure 2 shows the whole flowchart of the procedure.

3.2.1. Feature Selection Results Based on Machine Learning Methods

SVMRFE, RFFS-GI and RFFS-OOB were each applied to perform feature selection on the MRI data, with each method being repeated 20 times. Since these three methods were implemented based on the classification results and the SBM and FNC data had different classification performance, we selected the features of the two types of data separately in order to obtain their key features more clearly. The frequency with which each feature was selected by the three feature selection methods is shown in Figure 3 and Figure 4 and Figure A4, Figure A5, Figure A6 and Figure A7.

It is generally believed that if the frequency of occurrence of a feature is too low, then the feature is not significant. Therefore, we only considered features with a higher frequency to obtain the significant feature subset. In Figure A8, the corresponding characteristic frequency distribution with a frequency greater than or equal to 50 is shown. Each point in this figure corresponds to the number of features with a frequency of occurrence greater than or equal to x. Further, we selected features with a frequency greater than or equal to 55, which is a balance between the numbers of features and the frequency (the details can be found in the illustration of Figure A8). From Figure 3 and Figure 4 and Figure A4 and Figure A7, we can obtain the features of SBM data that are significant for distinguishing the HCs and SZ, and the corresponding indexes were 3, 7, 11, 24, 26, 30 and 32. We can also obtain the discriminative features of FNC data with indexes 244, 295, 183, 243, 33, 37, 40, 189, 220, 48, 78, 279, 353, 13, 185, 211, 265, 292, 328, 337 and 165.

3.2.2. Feature Selection Results Based on Statistical Methods

Statistical methods were utilized to screen out features with significant differences. The results of the Kendall correlation coefficient are shown in Figure 5, and the hypothetical test results are shown in Figure 6.

We selected features with hypothesis-test p-values less than 0.05 and an absolute Kendall correlation coefficient greater than 0.26, which is a balance between the size of the selected feature subsets and their ability to distinguish SZ. The results are shown in Figure 7, where $\tau$ is the Kendall correlation coefficient and $p_{1}$ and $p_{2}$ are the p-values of the two-sample t-test and the permutation test, respectively.

3.2.3. Feature Selection Results Based on a Hybrid Method

By both machine learning and statistical methods, the key candidate features for SZ were selected, and the two resulting feature subsets were quite similar. We adopted their intersection as the final selected feature subset, and thus obtained the abnormal brain regions from the SBM data (see Figure 8) and the abnormal functional connectivity from the FNC data (see Figure 9).

Figure 8 shows the brain regions selected by our method that differed from healthy controls in SZ, and these abnormal brain regions were mainly distributed in supramarginal gyrus (SMG), cingulate gyrus (CG), middle frontal gyrus (MFG), precuneus (PCUN), superior frontal gyrus (SFG) and caudate (CAU). Compared with the HC group, the SZ group had significantly reduced grey matter volumes in the CG, PCUN and CAU and significantly increased grey matter volume of brain regions including SMG, MFG and SFG.

Figure 9 shows that the hybrid feature selection method proposed here discovers 17 abnormal functional connections between the SZ group and the HC group. Furthermore, by combining with the relationship between the connections and the regions shown in Figure A2, six connections are related to the caudate nucleus (CAU), linking it to the rolandic operculum (ROL), insula (INS), supramarginal gyrus (SMG), superior occipital gyrus (SOG), precuneus (PCUN) and median cingulate and paracingulate gyri. In addition, there were three abnormal functional connections related to the insula (with ROL, amygdala and CAU) and four aberrant functional connections related to the rolandic operculum (with insula, lingual gyrus, superior parietal gyrus and caudate). Among these abnormal connections, all connectivities involving the rolandic operculum and insula were significantly reduced, and the connectivities related to the caudate nucleus were significantly decreased except for that with the median cingulate and paracingulate gyri. Other than that, we also observed significantly increased connectivity between the middle frontal gyrus and superior occipital gyrus, between the middle occipital gyrus and fusiform gyrus, and between the left and right superior parietal gyrus. In conclusion, the brain connectivity in SZ generally decreased, with a few connections showing increased connectivity. To show these abnormal connections more vividly, in Figure 9, we used the BrainNet Viewer toolbox to draw the precise locations of the pairs of brain regions with aberrant connections and to show the aberrant brain connectivity network in SZ [49].

3.3. Network Evaluation

Further, to support the validity of the connectivity findings obtained by the above hybrid feature selection method, we constructed a brain network based on these connections and explored its topological properties [50,51]. More specifically, we first chose the clustering coefficient ($C$), characteristic path length ($L$), global network efficiency ($E_g$) and local network efficiency ($E_{loc}$) as the evaluation indices for each network. Then, we constructed weighted networks with a threshold of one for the original and the selected FNC data. Finally, these four parameters were calculated for both the SZ and HC groups and compared by a two-sample t-test. The p-values for $C$, $L$, $E_g$ and $E_{loc}$ were $1.70 \times 10^{-1}$, $5.02 \times 10^{-3}$, $2.99 \times 10^{-2}$ and $4.27 \times 10^{-2}$, respectively, for the original FNC data and $6.64 \times 10^{-2}$, $3.41 \times 10^{-6}$, $5.40 \times 10^{-6}$ and $1.90 \times 10^{-2}$ after feature selection by our method. The p-values of all four parameters decreased after feature selection, which means that the differences in these parameters between the HCs and SZ became more apparent, especially for the characteristic path length and the global network efficiency. This indicates that the HCs and SZ become more clearly distinguishable after hybrid feature selection and supports the validity of our method.
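As an illustration of the four indices used above, the following is a minimal sketch that computes them for a binary, undirected graph using only the Python standard library. It is not the authors' implementation: the function names, the binarisation rule (keep an edge when the absolute weight is at least the threshold) and the averaging of path length over connected pairs are assumptions made for this example.

```python
from collections import deque
from itertools import combinations


def threshold_graph(weights, threshold):
    """Adjacency sets; ASSUMPTION: edge (i, j) is kept when |w_ij| >= threshold."""
    n = len(weights)
    adj = {i: set() for i in range(n)}
    for i, j in combinations(range(n), 2):
        if abs(weights[i][j]) >= threshold:
            adj[i].add(j)
            adj[j].add(i)
    return adj


def bfs_distances(adj, source):
    """Shortest path lengths (in edges) from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def clustering_coefficient(adj):
    """Mean over nodes of 2 * t_i / (k_i * (k_i - 1)); nodes with k_i < 2 count as 0."""
    total = 0.0
    for nbrs in adj.values():
        k = len(nbrs)
        if k >= 2:
            triangles = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
            total += 2.0 * triangles / (k * (k - 1))
    return total / len(adj)


def characteristic_path_length(adj):
    """Mean shortest path length; ASSUMPTION: averaged over connected pairs only."""
    total, pairs = 0, 0
    for i in adj:
        for j, d in bfs_distances(adj, i).items():
            if j != i:
                total += d
                pairs += 1
    return total / pairs if pairs else float("inf")


def global_efficiency(adj):
    """Mean inverse shortest path length; unreachable pairs contribute 0."""
    n = len(adj)
    if n < 2:
        return 0.0
    total = 0.0
    for i in adj:
        total += sum(1.0 / d for j, d in bfs_distances(adj, i).items() if j != i)
    return total / (n * (n - 1))


def local_efficiency(adj):
    """Mean global efficiency of the subgraph induced by each node's neighbours."""
    total = 0.0
    for nbrs in adj.values():
        sub = {u: adj[u] & nbrs for u in nbrs}
        total += global_efficiency(sub)
    return total / len(adj)


# Toy example: a 3-node path graph 0 - 1 - 2 (hypothetical weights).
path = threshold_graph([[0, 1, 0], [1, 0, 1], [0, 1, 0]], threshold=1)
print(clustering_coefficient(path), characteristic_path_length(path),
      global_efficiency(path), local_efficiency(path))
```

In a group comparison such as the one described above, each index would be computed per subject and the two groups would then be compared with a two-sample t-test, for example with `scipy.stats.ttest_ind`.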

4. Discussion

Methods based on machine learning pay more attention to classification accuracy, whereas statistical methods emphasize the correlation between feature and label; this is the essential difference between the two approaches. Comparing the significant subsets selected by the two approaches, most of the biomarkers in the two subsets were the same, which means that although the emphasis of the two approaches differs, both of them did find the significant features. Further, by integrating the significant subsets of the two approaches, the significant features can be double checked and finally obtained by the hybrid method proposed in this article. For example, for the data before feature selection, the p-value of the characteristic path length, referred to as $L$ in the above section, was $5.02 \times 10^{-3}$. The p-values of $L$ for optimal subset I, which was obtained by the machine learning methods, for optimal subset II, which was obtained by the statistical methods, and for the optimal subset obtained by the proposed hybrid method were $9.40 \times 10^{-6}$, $2.82 \times 10^{-5}$ and $3.41 \times 10^{-6}$, respectively. The results show that the HCs and SZ became clearly distinguishable after feature selection; in particular, the hybrid subset gave a smaller p-value than either the machine learning subset or the statistical subset. In summary, the hybrid method can combine the strengths of both machine learning and statistical methods to improve the accuracy of the results, and the results of the network evaluation also confirmed this point.
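The idea of double checking features across the two approaches can be sketched as follows. This is a hedged illustration rather than the rule actually used in the paper: it assumes that the machine learning subset is formed from features that recur in repeated selection runs, that the statistical subset keeps features with a large absolute Kendall coefficient and a small p-value (thresholds `b` and `c`), and that integration is modelled as a set intersection. All function names and the toy numbers are hypothetical.

```python
from collections import Counter


def frequent_features(runs, min_freq):
    """Features selected in at least `min_freq` of the repeated runs."""
    counts = Counter(f for run in runs for f in set(run))
    return {f for f, n in counts.items() if n >= min_freq}


def statistical_features(tau, p_values, b, c):
    """ASSUMPTION: keep features with |tau| >= b and p-value <= c."""
    return {f for f in tau if abs(tau[f]) >= b and p_values[f] <= c}


def hybrid_features(ml_subset, stat_subset):
    """ASSUMPTION: 'double checking' is modelled as a set intersection."""
    return ml_subset & stat_subset


# Toy data (hypothetical): three selection runs over features 0..4.
runs = [[0, 1, 2], [0, 2, 3], [0, 2, 4]]
ml_subset = frequent_features(runs, min_freq=3)

tau = {0: 0.42, 1: 0.05, 2: -0.31, 3: 0.38, 4: 0.02}
p_values = {0: 0.001, 1: 0.60, 2: 0.20, 3: 0.004, 4: 0.90}
stat_subset = statistical_features(tau, p_values, b=0.3, c=0.01)

selected = hybrid_features(ml_subset, stat_subset)
print(sorted(ml_subset), sorted(stat_subset), sorted(selected))
```

With an intersection, a feature survives only if both views agree on it, which trades some sensitivity for fewer spurious selections; a union or a weighted vote would be an equally plausible reading of "integrating" and would behave differently.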

Our findings are consistent with reports that the grey matter volume of the CG, PCUN and CAU is significantly reduced in SZ [26,27,52,53]. The CG is considered to be a brain region closely related to task attention, memory and affect, and it has been reported to be impaired in SZ [54]. The PCUN is the portion of the superior parietal lobule on the medial surface of each brain hemisphere, and it is often considered to play an important role in the pathogenesis of SZ [55]. Given that Behavioural Inhibition System (BIS) activity and Cloninger’s temperament dimension Harm Avoidance (HA) are mainly associated with trait anxiety [56,57], and that BIS sensitivity and HA have been shown to be negatively correlated with regional gray matter volume in the CG and PCUN, SZ may be accompanied by trait anxiety owing to the reduced gray matter volume in these two regions [58]. The CAU is one of the structures that make up the dorsal striatum, which is a component of the basal ganglia. It can affect the cognitive function of patients, resulting in decreased memory ability, and may be a cause of cognitive dysfunction in SZ [59].

In our findings, most of the brain connectivity in SZ was significantly reduced, in line with the generally accepted view that functional connectivity is significantly reduced in SZ and that this reduction may impair information integration [60]. Among the abnormal connections, the CAU, INS and ROL were the most frequently involved regions. The INS mainly participates in the formation of aversion, the regulation of pain, the production of depression, the regulation of cardiac activity and the planning of language [61], and these functions may underlie the affective symptoms in SZ. Moreover, many studies have found that connectivity in the INS is decreased, which may disrupt the functional integration of the brain [30]. The ROL is mainly involved in language, and Wu et al. suggested that reduced connectivity of the ROL increases the vulnerability of speech recognition to speech masking [62]. The ROL has also been associated with hallucination [63]. SZ is often accompanied by motor abnormalities, and abnormalities of the motor system have been related to abnormal functional connectivity of the CAU and CG [64]. In addition, the DMN, which includes the posterior cingulate cortex and lateral temporal cortex, and the SN, which includes the INS and CAU, have been shown to have abnormal connectivity in SZ [65]. The DMN is mainly related to oriented attention and self-monitoring [66], and the SN is implicated in orienting toward salient external stimuli and internal events [67]. These results suggest that the abnormal connectivity of the CAU and INS may result in cognitive deficits.

In addition to the reduced regions and connections described above, we also found increased grey matter volume in the SMG, MFG and SFG and increased connectivity between the MFG and the superior occipital gyrus, between the median cingulate and paracingulate gyri and the CAU, between the left and right superior parietal gyri, and between the middle occipital gyrus and the fusiform gyrus. Similar findings have been reported in the literature [28,29,32]. Research has shown that the connectivity of the frontoparietal network (FPN) and DMN is significantly increased [65]. The FPN, which includes the dorsolateral prefrontal cortex and dorsolateral parietal cortex, is implicated in executive control [68], which suggests that executive control in SZ differs from that in HCs. In conclusion, we found that most abnormal brain regions and connectivity discovered by our method were mainly related to cognition and hallucination. These abnormalities may be the reason for the cognitive deficits and autistic thinking in SZ. Moreover, our results show that, compared with HCs, the brain network of SZ does not show a uniform decline or rise, but a mix of both. Most of the abnormal connectivity may impair information integration and transmission. Thus, with our method, we found abnormal regions and brain connectivity that were strongly related to SZ, and the results also support the effectiveness of using functional disconnectivity from neuroimaging as a biomarker for the diagnosis of mental disorders [69].

5. Conclusions

Using the proposed hybrid feature selection approach, which combines machine learning and traditional statistical methods, we identified abnormal brain regions and abnormal connections in the brains of SZ. The results on the SBM data showed that the abnormal brain regions of SZ were mainly distributed in the supramarginal gyrus, cingulate gyrus, middle frontal gyrus, superior frontal gyrus, precuneus and caudate. These brain regions are reported to have a strong association with SZ, and they are mainly involved in perception, thinking, emotion and mental activity. The results on the FNC data showed that most of the abnormal functional connections in the brains of SZ were related to the FPN, DMN and SN. These three networks are closely related to cognitive deficits, especially in executive control and salience processing. All of the results suggest that the brain regions and connectivity in SZ are impaired compared with HCs, and the abnormal activity may cause the cognitive deficits and autistic thinking in SZ. In addition, the complex network analysis further verified the significance of the selected abnormal functional connections. All findings support the validity of the proposed hybrid feature selection method, and thus such a method is promising for the analysis of other kinds of medical data to enhance diagnostic ability.

Author Contributions

Conceptualization, C.Q.; methodology, C.Q. and L.L.; writing, original draft preparation, L.L. and L.Y.; writing, review and editing, C.Q. and P.J.K.; funding acquisition, C.Q.

Funding

This research was funded by NSFC Nos. 11471006 and 11101327, the Fundamental Research Funds for the Central Universities (No. xjj2017126), the Science and Technology Project of Xi’an (No. 201809164CX5JC6) and the HPC Platform of Xi’an Jiaotong University.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Figure A1. Different measuring parameters of the global and local network properties, where $t_i$ is the number of triangles around node $i$, $d_{ij}$ is the shortest path length between node $i$ and node $j$, $C_{rand}$ and $L_{rand}$ refer to the average clustering coefficient and characteristic path length values obtained from 100 random networks with the same number of nodes and edges and the same degree distribution as the original network, $\sigma_{jk}$ is the number of shortest paths between $j$ and $k$ and $\sigma_{jk}(i)$ is the number of shortest paths between $j$ and $k$ that pass through $i$.
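For readers without access to the figure, the quantities named in this caption are commonly written in the following forms for a binary, undirected network. This is a sketch of the standard definitions, not a transcription of Figure A1: the symbols $n$ (number of nodes), $k_i$ (degree of node $i$), $G_i$ (subgraph induced by the neighbours of node $i$), $S$ (small-worldness) and $b_i$ (betweenness centrality) are assumed here, and the exact normalizations in the figure may differ.

```latex
\begin{align}
C &= \frac{1}{n} \sum_{i} \frac{2\, t_{i}}{k_{i}\,(k_{i} - 1)}, &
L &= \frac{1}{n} \sum_{i} \frac{\sum_{j \neq i} d_{ij}}{n - 1}, \\
E_{g} &= \frac{1}{n} \sum_{i} \frac{\sum_{j \neq i} d_{ij}^{-1}}{n - 1}, &
E_{loc} &= \frac{1}{n} \sum_{i} E_{g}(G_{i}), \\
S &= \frac{C / C_{rand}}{L / L_{rand}}, &
b_{i} &= \frac{1}{(n - 1)(n - 2)} \sum_{\substack{j \neq k \\ j, k \neq i}} \frac{\sigma_{jk}(i)}{\sigma_{jk}}.
\end{align}
```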

Figure A2. Twenty eight brain regions selected for the experiment according to the AAL template.

Figure A3. The connections between the brain regions R1 and R2 corresponding to FNC features.

Figure A4. SVMRFE and RFFS results of SBM data, where Fea represents the feature number and Fre represents the frequency at which the feature appears in 20 experiments.

Figure A5. SVMRFE and RFFS results of FNC data, Part 1.

Figure A6. SVMRFE and RFFS results of FNC data, Part 2.

Figure A7. SVMRFE and RFFS results of FNC data, Part 3.

Figure A8. The characteristic frequency distribution with a frequency greater than or equal to 50. The x axis corresponds to the frequency of occurrence, and the y axis is the number of features. We can find that when the frequency is in the red range, i.e., greater than or equal to 52 and less than or equal to 56, the number of features is quite stable. Compared with other ranges, in the red range, there exists a balance between the number of features and the frequency of occurrence, which facilitates the abnormal analysis of brain function connections and structures corresponding to diseases. Therefore, we selected features with a frequency greater than or equal to 55.

References

1. Sui, J.; Qi, S.; van Erp, T.G.M.; Bustillo, J.; Jiang, R.; Lin, D.; Turner, J.A.; Damaraju, E.; Mayer, A.R.; Cui, Y.; et al. Multimodal neuromarkers in schizophrenia via cognition-guided MRI fusion. Nat. Commun. 2018, 9, 3028.
2. van den Heuvel, M.P.; Fornito, A. Brain networks in schizophrenia. Neuropsychol. Rev. 2014, 24, 32–48.
3. Woo, C.W.; Chang, L.J.; Lindquist, M.A.; Wager, T.D. Building better biomarkers: Brain models in translational neuroimaging. Nat. Neurosci. 2017, 20, 365–377.
4. Du, Y.; Fryer, S.L.; Fu, Z.; Lin, D.; Sui, J.; Chen, J.; Damaraju, E.; Mennigen, E.; Stuart, B.; Mathalon, D.H.J.N. Dynamic functional connectivity impairments in early schizophrenia and clinical high-risk for psychosis. NeuroImage 2018, 180, 632–645.
5. Shine, J.M.; Bissett, P.G.; Bell, P.T.; Koyejo, O.; Balsters, J.H.; Gorgolewski, K.J.; Moodie, C.A.; Poldrack, R.A. The dynamics of functional brain networks: Integrated network states during cognitive task performance. Neuron 2016, 92, 544–554.
6. Rosenberg, M.D.; Finn, E.S.; Scheinost, D.; Papademetris, X.; Shen, X.; Constable, R.T.; Chun, M.M. A neuromarker of sustained attention from wholebrain functional connectivity. Nat. Neurosci. 2016, 19, 165–171.
7. Finn, E.S.; Shen, X.; Scheinost, D.; Rosenberg, M.D.; Huang, J.; Chun, M.M.; Papademetris, X.; Constable, R.T. Functional connectome fingerprinting: Identifying individuals using patterns of brain connectivity. Nat. Neurosci. 2015, 18, 1664–1671.
8. Palaniyappan, L.; Mahmood, J.; Balain, V.; Mougin, O.; Gowland, P.A.; Liddle, P.F. Structural correlates of formal thought disorder in schizophrenia: An ultra-high field multivariate morphometry study. Schizophr. Res. 2015, 168, 305–312.
9. Kong, Y.; Yu, T. A graph-embedded deep feedforward network for disease outcome classification and feature selection using gene expression data. Bioinformatics 2018, 34, 3727–3737.
10. Suk, H.I.; Lee, S.W.; Shen, D. Deep ensemble learning of sparse regression models for brain disease diagnosis. Med. Image Anal. 2017, 37, 101–113.
11. Demirhan, A. The effect of feature selection on multivariate pattern analysis of structural brain MR images. Phys. Med. 2018, 47, 103–111.
12. Cao, F.; Liu, Y.; Wang, D. Efficient Saliency Detection Using Convolutional Neural Networks with Feature Selection. Inf. Sci. 2018, 456, 34–49.
13. Liu, Z.T.; Wu, M.; Cao, W.H.; Mao, J.W.; Tan, G.Z. Speech emotion recognition based on feature selection and extreme learning machine decision tree. Neurocomputing 2018, 273, 271–280.
14. Chandrashekar, G.; Sahin, F. A survey on feature selection methods. Comput. Electr. Eng. 2014, 40, 16–28.
15. Lazar, C.; Taminau, J.; Meganck, S.; Steenhoff, D.; Coletta, A.; Molter, C.; De, S.V.; Duque, R.; Bersini, H.; Nowé, A. A survey on filter techniques for feature selection in gene expression microarray analysis. IEEE/ACM Trans. Comput. Biol. Bioinform. 2012, 9, 1106–1119.
16. Foithong, S.; Pinngern, O.; Attachoo, B. Feature subset selection wrapper based on mutual information and rough sets. Expert Syst. Appl. 2012, 39, 574–584.
17. Cadenas, J.M.; Garrido, M.C.; Martínez, R. Feature subset selection Filter-Wrapper based on low quality data. Expert Syst. Appl. 2013, 40, 6241–6252.
18. Shen, Q.; Diao, R.; Su, P. Feature Selection Ensemble. Turing 100 2012, 10, 289–306.
19. Lu, H.; Chen, J.; Yan, K.; Jin, Q.; Xue, Y.; Gao, Z. A hybrid feature selection algorithm for gene expression data classification. Neurocomputing 2017, 256, 56–62.
20. Zhe, F.L. A Novel Hybrid Feature Selection Methods and Prediction for Ready Biodegradibility of Chemicals Using Random Forests and Boruta. In Proceedings of the 8th International Conference on Researches in Engineering, Technology and Sciences (ICRETS), Istanbul, Turkey, 13–14 August 2015.
21. Lyu, H.; Wan, M.; Han, J.; Liu, R.; Wang, C. A filter feature selection method based on the Maximal Information Coefficient and Gram-Schmidt Orthogonalization for biomedical data mining. Comput. Biol. Med. 2017, 89, 264–274.
22. Zhang, X.; Zhang, Q.; Miao, C.; Sun, Y.; Qin, X.; Li, H. A two-stage feature selection and intelligent fault diagnosis method for rotating machinery using hybrid Filter and Wrapper method. Neurocomputing 2017, 275, 2426–2439.
23. Moon, M.; Nakai, K. Stable feature selection based on the ensemble L1-norm support vector machine for biomarker discovery. BMC Genom. 2016, 17, 1026.
24. Chen, Y.; Yang, W.; Long, J.; Zhang, Y.; Feng, J.; Li, Y.; Huang, B. Discriminative analysis of Parkinson’s disease based on whole-brain functional connectivity. PLoS ONE 2015, 10, e0124153.
25. Zeng, L.L.; Shen, H.; Liu, L.; Wang, L.; Li, B.; Fang, P.; Zhou, Z.; Li, Y.; Hu, D. Identifying major depression using whole-brain functional connectivity: A multivariate pattern analysis. Brain J. Neurol. 2012, 135, 1498–1507.
26. Haznedar, M.M.; Buchsbaum, M.S.; Hazlett, E.A.; Shihabuddin, L.; New, A.; Siever, L.J. Cingulate gyrus volume and metabolism in the schizophrenia spectrum. Schizophr. Res. 2004, 71, 249–262.
27. Calabrese, D.R.; Wang, L.; Harms, M.P.; Ratnanather, J.T.; Barch, D.M.; Cloninger, C.R.; Thompson, P.A.; Miller, M.I.; Csernansky, J.G. Cingulate gyrus neuroanatomy in schizophrenia subjects and their non-psychotic siblings. Schizophr. Res. 2008, 104, 61–70.
28. Shah, C.; Zhang, W.; Xiao, Y.; Yao, L.; Zhao, Y.; Gao, X.; Liu, L.; Liu, J.; Li, S.; Tao, B. Common pattern of gray-matter abnormalities in drug-naive and medicated first-episode schizophrenia: A multimodal meta-analysis. Psychol. Med. 2016, 47, 401–413.
29. Chang, M.; Womer, F.Y.; Bai, C.; Zhou, Q.; Wei, S.; Jiang, X.; Geng, H.; Zhou, Y.; Tang, Y.; Wang, F. Voxel-Based Morphometry in Individuals at Genetic High Risk for Schizophrenia and Patients with Schizophrenia during Their First Episode of Psychosis. PLoS ONE 2016, 11, e0163749.
30. Liang, M.; Zhou, Y.; Jiang, T.; Liu, Z.; Tian, L.; Liu, H.; Hao, Y. Widespread functional disconnectivity in schizophrenia with resting-state functional magnetic resonance imaging. Neuroreport 2006, 17, 209–213.
31. Xu, Y.; Qin, W.; Zhuo, C.; Xu, L.; Zhu, J.; Liu, X.; Yu, C. Selective functional disconnection of the orbitofrontal subregions in schizophrenia. Psychol. Med. 2017, 47, 1637–1646.
32. Zhang, D.; Guo, L.; Hu, X.; Li, K.; Zhao, Q.; Liu, T. Increased cortico-subcortical functional connectivity in schizophrenia. Brain Imaging Behav. 2012, 6, 27–35.
33. Guyon, I.; Weston, J.; Barnhill, S.; Vapnik, V. Gene Selection for Cancer Classification using Support Vector Machines. Mach. Learn. 2002, 46, 389–422.
34. Martino, F.D.; Valente, G.; Staeren, N.; Ashburner, J.; Goebel, R.; Formisano, E. Combining multivariate voxel selection and support vector machines for mapping and classification of fMRI spatial patterns. NeuroImage 2008, 43, 44–58.
35. You, W.; Yang, Z.; Ji, G. PLS-based recursive feature elimination for high-dimensional small sample. Knowl.-Based Syst. 2014, 55, 15–28.
36. Yan, K.; Zhang, D. Feature selection and analysis on correlated gas sensor data with recursive feature elimination. Sens. Actuators B Chem. 2015, 212, 353–363.
37. Huang, M.L.; Hung, Y.H.; Lee, W.M.; Li, R.K.; Jiang, B.R. SVM-RFE based feature selection and Taguchi parameters optimization for multiclass SVM classifier. Sci. World J. 2014, 2014, 795624.
38. Kumar, A.; Sharmila, D.J.S.; Singh, S. SVMRFE based approach for prediction of most discriminatory gene target for type II diabetes. Genom. Data 2017, 12, 28–37.
39. Ho, T.K. Random Decision Forests. In Encyclopedia of Machine Learning; Sammut, C., Webb, G.I., Eds.; Springer: Boston, MA, USA, 2010; p. 827.
40. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32.
41. Rahman, M.S.; Rahman, M.K.; Kaykobad, M.; Rahman, M.S. isGPT: An optimized model to identify sub-Golgi protein types using SVM and Random Forest based feature selection. Artif. Intell. Med. 2017, 84, 90–100.
42. Zhou, Q.; Hao, Z.; Zhou, Q.; Fan, Y.; Luo, L. Structure damage detection based on random forest recursive feature elimination. Mech. Syst. Signal Process. 2014, 46, 82–90.
43. Yao, D.J.; Yang, J.; Zhan, X.J. Feature selection algorithm based on random forest. J. Jilin Univ. 2014, 44, 137–141.
44. Nanthagopal, A.P.; Sukanesh, R. Wavelet statistical texture features-based segmentation and classification of brain computed tomography images. IET Image Process. 2013, 7, 25–32.
45. Mehlhorn, H.; Schreiber, F. Small-World Property. In Encyclopedia of Systems Biology; Dubitzky, W., Wolkenhauer, O., Cho, K.-H., Yokota, H., Eds.; Springer: New York, NY, USA, 2013; pp. 1957–1959.
46. Mittal, V.A.; Walker, E.F. Diagnostic and Statistical Manual of Mental Disorders. Psychiatry Res. 2011, 189, 158–159.
47. Segall, J.M.; Allen, E.A.; Jung, R.E.; Erhardt, E.B.; Arja, S.K.; Kiehl, K.; Calhoun, V.D. Correspondence between structure and function in the human brain at rest. Front. Neuroinform. 2012, 6, 10.
48. Allen, E.A.; Erhardt, E.B.; Damaraju, E.; Gruner, W.; Segall, J.M.; Silva, R.F.; Havlicek, M.; Rachakonda, S.; Fries, J.; Kalyanam, R.; et al. A baseline for the multivariate comparison of resting-state networks. Front. Syst. Neurosci. 2011, 5, 2.
49. Xia, M.; Wang, J.; He, Y. BrainNet Viewer: A network visualization tool for human brain connectomics. PLoS ONE 2013, 8, e68910.
50. Xia, M.; Womer, F.Y.; Chang, M.; Zhu, Y.; Zhou, Q.; Edmiston, E.K.; Jiang, X.; Wei, S.; Duan, J.; Xu, K. Shared and Distinct Functional Architectures of Brain Networks Across Psychiatric Disorders. Schizophr. Bull. 2018, 45, 450–463.
51. Yong, L.; Meng, L.; Yuan, Z.; Yong, H.; Yihui, H.; Ming, S.; Chunshui, Y.; Haihong, L.; Zhening, L.; Tianzi, J. Disrupted small-world networks in schizophrenia. Brain 2008, 131, 945–961.
52. Benes, F.M. Evidence for neurodevelopment disturbances in anterior cingulate cortex of post-mortem schizophrenic brain. Schizophr. Res. 1991, 5, 187–188.
53. Mirjalili, M.; Hossein-Zadeh, G.-A. Characterization of schizophrenia by linear kernel canonical correlation analysis of resting-state functional MRI and structural MRI. In Proceedings of the 2017 7th International Conference on Computer and Knowledge Engineering (ICCKE), Mashhad, Iran, 26–27 October 2017; pp. 37–41.
54. Calderone, D.J.; Hoptman, M.J.; Antígona, M.; Sangeeta, N.C.; Mauro, C.J.; Moshe, B.; Javitt, D.C.; Butler, P.D. Contributions of low and high spatial frequency processing to impaired object recognition circuitry in schizophrenia. Cerebr. Cortex 2013, 23, 1849–1858.
55. Susan, W.G.; Thermenos, H.W.; Snezana, M.; Tsuang, M.T.; Faraone, S.V.; Mccarley, R.W.; Shenton, M.E.; Green, A.I.; Alfonso, N.C.; Peter, L.V.; et al. Hyperactivity and hyperconnectivity of the default network in schizophrenia and in first-degree relatives of persons with schizophrenia. Proc. Natl. Acad. Sci. USA 2009, 106, 1279–1284.
56. Corr, P.J. Reinforcement sensitivity theory and personality. Neurosci. Biobehav. Rev. 2004, 28, 317–332.
57. Jylhä, P.; Isometsä, E. Temperament, character and symptoms of anxiety and depression in the general population. Eur. Psychiatry 2006, 21, 389–395.
58. Van Schuerbeek, P.; Baeken, C.; De Raedt, R.; De Mey, J.; Luypaert, R. Individual differences in local gray and white matter volumes reflect differences in temperament and character: A voxel-based morphometry study in healthy young females. Brain Res. 2011, 1371, 32–42.
59. Trimble, M. Molecular neuropharmacology, a foundation for clinical neuroscience. Psychiatry 2002, 73, 210.
60. Qingbao, Y.; Allen, E.A.; Jing, S.; Arbabshirani, M.R.; Godfrey, P.; Calhoun, V.D. Brain connectivity networks in schizophrenia underlying resting state functional magnetic resonance imaging. Curr. Top. Med. Chem. 2012, 12, 2415–2425.
61. Gaudio, S.; Wiemerslage, L.; Brooks, S.J.; Schiöth, H.B. A systematic review of resting-state functional-MRI studies in anorexia nervosa: Evidence for functional connectivity impairment in cognitive control and visuospatial and body-signal integration. Neurosci. Biobehav. Rev. 2016, 71, 578–589.
62. Wu, C.; Zheng, Y.; Li, J.; Wu, H.; She, S.; Liu, S.; Ning, Y.; Li, L. Brain substrates underlying auditory speech priming in healthy listeners and listeners with schizophrenia. Psychol. Med. 2016, 47, 837–852.
63. Qiu, L.; Yan, H.; Zhu, R.; Yan, J.; Yuan, H.; Han, Y.; Yue, W.; Tian, L.; Zhang, D. Correlations between exploratory eye movement, hallucination, and cortical gray matter volume in people with schizophrenia. BMC Psychiatry 2018, 18, 226.
64. Viher, P.; Walther, S. SU67. Aberrant Resting-State Functional Connectivity in the Motor System and Motor Abnormalities in Schizophrenia. Schizophr. Bull. 2017, 43, S185–S186.
65. Sha, Z.; Wager, T.D.; Mechelli, A.; He, Y. Common Dysfunction of Large-Scale Neurocognitive Networks Across Psychiatric Disorders. Biol. Psychiatry 2019, 85, 379–388.
66. Anticevic, A.; Cole, M.W.; Murray, J.D.; Corlett, P.R.; Wang, X.-J.; Krystal, J.H. The role of default network deactivation in cognition and disease. Trends Cogn. Sci. 2012, 16, 584–592.
67. Menon, V. Large-scale brain networks and psychopathology: A unifying triple network model. Trends Cogn. Sci. 2011, 15, 483–506.
68. Wager, T.D.; Smith, E.E. Neuroimaging studies of working memory. Cogn. Affect. Behav. Neurosci. 2003, 3, 255–274.
69. Wu, L.; Caprihan, A.; Bustillo, J.; Mayer, A.; Calhoun, V. An approach to directly link ICA and seed-based functional connectivity: Application to schizophrenia. NeuroImage 2018, 179, 448–470.

Figure 1. The flowchart of the hybrid feature selection method. Fre denotes frequency, $\tau$ the Kendall correlation coefficient, p the p-values of the tests and b and c the given constants. SVMRFE refers to support vector machine based on recursive feature elimination, RFFS-GI refers to feature selection with random forest by Gini importance and RFFS-OOB refers to feature selection with random forest by the classification accuracy on the OOB data.

Figure 2. The flowchart of locating the abnormalities in brains for SZ, where SBM refers to source-based morphometry, FNC refers to functional network connectivity and FS refers to feature selection.

Figure 3. SVMRFE, RFFS-GI and RFFS-OOB results of SBM data.

Figure 4. SVMRFE, RFFS-GI and RFFS-OOB results of FNC data.

Figure 5. The results obtained by the Kendall correlation coefficient. The x axis corresponds to the features, and the y axis is the absolute value of the Kendall tau correlation coefficient.

Figure 6. The results of the hypothesis tests for both the two-sample t-test and the permutation test. The x axis corresponds to the features, and the y axis is the significance level ($-\log_{2} P$). The red and green lines show the significance levels of 0.05 and 0.01, respectively. The features with $-\log_{2} P$ values above the lines have significant differences, and they are the candidates for abnormal regions or connections.

Figure 7. Feature selection results based on statistical methods.

Figure 8. The selected abnormal brain regions of SZ by the hybrid method. Segall et al. presented the relationships between the cortical maps and the brain regions described by the SBM features [47].

Figure 9. The abnormal functional connections of brains with SZ. In this figure, the left table lists the selected abnormal functional connections of the regions of interest (the relationships of the regions and the labels are shown in Figure A2), in which ML refers to machine learning methods and SM refers to statistical methods. The circular connectivity graph in the middle is a schematic map of the selected functional connections, which are listed in the fourth column of the left table. The labels in this graph correspond to the regions of interest, and the corresponding spatial maps of these regions (see [48]) are also shown in this graph. The right graph depicts the locations and their connections of the selected brain regions by the BrainNet Viewer toolbox [49].

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Qiao, C.; Lu, L.; Yang, L.; Kennedy, P.J. Identifying Brain Abnormalities with Schizophrenia Based on a Hybrid Feature Selection Technology. Appl. Sci. 2019, 9, 2148. https://doi.org/10.3390/app9102148
