Quantitative Brain MRI Metrics Distinguish Four Different ALS Phenotypes: A Machine Learning Based Study

Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease whose diagnosis depends on the presence of combined lower motor neuron (LMN) and upper motor neuron (UMN) degeneration. LMN degeneration assessment is aided by electromyography, whereas no equivalent exists to assess UMN dysfunction. Magnetic resonance imaging (MRI) is primarily used to exclude conditions that mimic ALS. We have identified four different clinical/radiological phenotypes of ALS patients. We hypothesize that these ALS phenotypes arise from distinct pathologic processes that result in unique MRI signatures. To our knowledge, no machine learning (ML)-based data analyses have been performed to stratify different ALS phenotypes using MRI measures. During routine clinical evaluation, we obtained T1-, T2-, PD-weighted, diffusion tensor (DT) brain MRI of 15 neurological controls and 91 ALS patients (UMN-predominant ALS with corticospinal tract CST) hyperintensity, n = 21; UMN-predominant ALS without CST hyperintensity, n = 26; classic ALS, n = 23; and ALS patients with frontotemporal dementia, n = 21). From these images, we obtained 101 white matter (WM) attributes (including DT measures, graph theory measures from DT and fractal dimension (FD) measures using T1-weighted), 10 grey matter (GM) attributes (including FD based measures from T1-weighted), and 10 non-imaging attributes (2 demographic and 8 clinical measures of ALS). We employed classification and regression tree, Random Forest (RF) and also artificial neural network for the classifications. RF algorithm provided the best accuracy (70–94%) in classifying four different phenotypes of ALS patients. WM metrics played a dominant role in classifying different phenotypes when compared to GM or clinical measures. Although WM measures from both right and left hemispheres need to be considered to identify ALS phenotypes, they appear to be differentially affected by the degenerative process. Longitudinal studies can confirm and extend our findings.


Introduction
Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease whose diagnosis depends on the presence of degenerating motor neurons in both peripheral nervous system (PNS, or lower motor neuron (LMN)) and central nervous system (CNS, or upper motor neuron (UMN)) regions. Assessment of LMN degeneration is aided by widely used electromyography of peripheral nerve and muscle, while similar quantitative methods to assess CNS dysfunction are not easily accessible. Qualitative magnetic resonance imaging (MRI) is primarily used clinically to exclude conditions mimicking ALS. Although ALS patients eventually develop clinical evidence of both UMN and LMN dysfunction, some present with varying proportions of the two, and in a minority, demonstrate prominent of feature sets; and in Discussion, the findings are analyzed and their implications in each of ALS phenotypes.

Imaging Data
Patients with ALS and neurological controls underwent brain MRI on a 1.5T Siemens Symphony scanner (Erlangen, Germany) as part of routine clinical evaluation. Highresolution 3D T1-weighted axial images were obtained using magnetization-prepared rapid gradient echo (M-PRAGE) sequence with the following parameters: repetition time (TR) = 1800 ms, echo time (TE) = 4.38 ms, flip angle = 10 • , inversion time (TI) = 1100 ms, slice thickness = 1 mm, in-plane resolution = 0.9 × 0.9 mm 2 , and number of slices = 160. T2and PD-weighted images were used to identify WM pathology, including the presence of CST hyperintensity. T2-and PD-weighted images were acquired using a dual-echo fast spin-echo sequence with the following parameters: TR = 3900 ms, TE = 26 ms and 104 ms, echo train length or turbo factor = 7, number of averages = 1, slice thickness = 4 mm, and in-plane resolution = 0.9 × 0.9 mm. Diffusion tensor imaging (DTI) was performed using single-shot echo planar imaging (SS-EPI) in 12 diffusion-weighted directions with b = 1000 s/mm 2 and a non-diffusion-weighted image with b = 0 s/mm 2 . Other parameters included: resolution = 1.9 × 1.9 × 4 mm 3 , TR = 6000 ms, TE = 121 ms, and EPI factor = 128.
Brain MRI of 91 ALS patients (ALS-CST+, n = 21; ALS-CST−, n = 26; ALS-Cl, n = 23; and ALS-FTD, n = 21) and 15 neurological controls were identified for storage, and analysis of deidentified images after patient verbal consent, as approved by the Cleveland Clinic Institutional Review Board.

Data Processing
We obtained 111 quantitative measures (attributes) from the MRI data and 10 nonimaging attributes (2 demographic and 8 clinical measures of ALS), as listed in Table 1. Although details of the image processing methods and quantitative metrics to obtain these attributes have been previously reported [17][18][19][20], they are briefly described below. A flowchart outlining our present study is shown in Figure 1.  fraction, CP-cerebral peduncle, DTI-Diffusion tensor imaging, Dur_Symp-Duration of symptoms, FD-Fractal dimension, FA-Fractional anisotropy, Gen-Brain general structure, GM-Gray matter, Left and L-Left hemisphere, MD-Mean diffusivity, PLIC-Posterior limb of the internal capsule, Prog_Rate-Disease progression rate, RD-Radial diffusivity, Resp-Respiratory rate, Right and R-Right hemisphere, Skel-Skeleton of the brain, SubPMC-Subcortical primary motor cortex, Surf-Brain surface, CSoLV-Centrum semiovale at top of lateral ventricle, WM-White matter, W-Whole brain.

Graph Network Features
Graph theory metrics were obtained from DTI data as follows: (a) preprocessing of DTI data using ExploreDTI (http://www.exploredti.com/ access data 10 January 2019) openware [21]; (b) fitting of diffusion data to the DTI model after correcting for motion artifact by using robust diffusion tensor estimation; (c) performance of brain tractography using deterministic streamline approach with a fractional anisotropy (FA) starting threshold of 0.2 and a stopping threshold of 1.0; (d) obtainment of connectivity matrices between cortical regions based on the AAL atlas [22], which were registered to the subject's space; (e) construction of connectivity matrices for DTI metrics of FA, axial diffusivity (AD), radial diffusivity (RD) and mean diffusivity (MD); (f) performance of whole-brain WM network analysis using Graph Analysis toolbox (GAT) software (https://www.nitrc.org/projects/ gat/ access data 10 January 2019) [23]. Microarchitectural tissues correlates of these DTI measures are believed to be: FA, both myelin and axonal integrity; RD, myelin integrity; AD, axonal integrity; and MD, mobility of water molecules.
The connectivity matrix obtained using ExploreDTI was used in GAT for further processing. We obtained network measures for each subject using the "network measures" module in GAT. Graph measures studied, as listed in Table 1, included: assortativity, density, mean clustering coefficient, transitivity, global efficiency, mean local efficiency, modularity, Louvian modularity, characteristic path length, mean nodal betweenness, mean edge betweenness, normalized path length, normalized clustering, small world index, and mean degree. Further details have been previously published [2,24]. DTI-derived WM measures from graph theory network analyses are shown in Table 1, lines 1-59.

DTI Corticospinal Tract Features
Diffusion-weighted images were first corrected for susceptibility artifacts and eddycurrent distortion using FSL tools with b-matrix rotation after oblique-angle correction to preserve correct orientation information [20]. These images were then processed using DTI Studio open software in the following manner: (a) diffusion tensor fitting using a multivariate linear least square fit; (b) FA, MD, AD and RD maps of whole brain obtained [20]; (c) fiber tracking using the fiber assignment by continuous tracking (FACT) algorithm; (d) reconstruction of bilateral CST virtual fibers after placing a region of interest (ROI) caudally in the cerebral peduncle (CP) and another ROI rostrally just beneath the primary motor cortex (subPMC); (e) identification of four ROIs at specific CST levels: CP, posterior limb of internal capsule (PLIC), centrum semiovale at top of lateral ventricle (CSoLV), and subPMC, as shown in Figure 2. The aforementioned DTI measures in each ROI bilaterally were measured in ALS patients and neurological controls, as described previously [20]. DTI-derived WM measures along the CST are shown in Table 1, lines 60-91. sity, mean clustering coefficient, transitivity, global efficiency, mean local efficienc ularity, Louvian modularity, characteristic path length, mean nodal betweennes edge betweenness, normalized path length, normalized clustering, small world in mean degree. Further details have been previously published [2,24]. DTI-deriv measures from graph theory network analyses are shown in Table 1, lines 1-59.

DTI Corticospinal Tract Features
Diffusion-weighted images were first corrected for susceptibility artifacts an current distortion using FSL tools with b-matrix rotation after oblique-angle corr preserve correct orientation information [20]. These images were then processe DTI Studio open software in the following manner: (a) diffusion tensor fitting multivariate linear least square fit; (b) FA, MD, AD and RD maps of whole brain o [20]; (c) fiber tracking using the fiber assignment by continuous tracking (FAC rithm; (d) reconstruction of bilateral CST virtual fibers after placing a region of (ROI) caudally in the cerebral peduncle (CP) and another ROI rostrally just ben primary motor cortex (subPMC); (e) identification of four ROIs at specific CST lev posterior limb of internal capsule (IC), centrum semiovale at top of lateral v (CSoLV), and subPMC, as shown in Figure 2. The aforementioned DTI measures ROI bilaterally were measured in ALS patients and neurological controls, as d previously [20]. DTI-derived WM measures along the CST are shown in Table 1, l 91.

Fractal Dimension WM and GM Features
FD analysis was performed using customized in-house routines, as detailed previously [18], and included: (a) extraction of brain precedes its segmention into WM and GM probability maps using FSL tools; (b) binarization of WM and GM probability maps using a threshold value of 0.5; (c) application of 3D thinning to WM and GM binary images to produce corresponding 3D skeleton images [18]; generation of skeleton and general structure of left and right hemispheres after applying hemisphere-specific masks; (d) estimation of FD values using 3D box-counting method.
FD values of skeleton, surface, and general structure shape representations were estimated by counting boxes required to cover: (a) skeleton foreground voxels for skeleton FD value; (b) WM and GM interface boundary for surface FD; and (c) all WM and GM foreground voxels (which included skeleton and surface) for general structure FD. The skeleton, which preserves topological and geometric information of WM and GM, represents interior structure complexity of brain WM and GM. The surface structure reflects the shape of gyral and sulcal convolutions at the WM-GM interface. The general structure represents volume changes. Because the skeleton, surface and general structure represent three different aspects of brain WM and GM integrity, they may provide insights into Diagnostics 2023, 13, 1521 9 of 25 brain shape and structural changes occurring during the neurodegeneration of ALS. T1-w imaging-derived FD-based WM and GM measures are shown in Table 1, lines 92-109.

Demographic and Clinical Features
Demographic features of age and gender, as well as clinical characteristics in ALS subgroups of ALSFRS-R, disease duration, El Escorial score, and disease progression rate are shown in Table 1, lines 112-121.

Machine Learning Methods
Based on the article by Grollemund and colleagues [15] that discussed the pros and cons of different ML algorithms employed in ALS, we chose to apply Random Forest (RF) and neural networks (NN) methods. We wrote custom ML codes in MATLAB (Mathworks https://www.mathworks.com/ access date 1 October 2019) version 2016 and also in Python (https://www.python.org/downloads/release/python-390/ access date 3 November 2022) version 3.9 for classification of our ALS phenotypes and controls. Briefly, RF is an ensemble of decision trees, of which a sample of records/data is chosen for each. Gini impurity values were then used to select the best from the attributes and samples for expansion at every node in the decision tree. "Out of bag error" (OOB), which should be as low as possible for accurate prediction, was calculated for each decision tree in RF. Because CST truncation of virtual tracts occurred from DTI calculations in some UMN-predominant ALS patients [5], we filled in the missing data points with the average value of the dataset. Further, we performed feature selection, as suggested in [15], using Waikato Environment for Knowledge (WEKA), which is an ML software developed at the University of Waikato, New Zealand (https://www.cs.waikato.ac.nz/ml/weka/ access date 23 October 2019). Feature selection in WEKA is divided into two parts: attribute evaluator and search method. The feature selection method, CfsSubsetEval, evaluates worthiness of a subset of attributes depending on their individual output predictive power and the level of redundancy between them. We then used the BestFitsearch option to obtain attributes.
When first employing the classification and regression tree (CART) and pruning algorithms to a data sample size of 106 and 121 attributes, accuracy of classification was only 55%; however, after applying the RF algorithm to this sample, accuracy increased to 71%. For this reason, we used the RF algorithm to analyze our data. Attributes selected by WEKA were then classified by considering each as separate category or output class variables: (1) neurologic controls, ALS-CST+ and ALS-CST− subgroups; and (2) neurologic controls, ALS-Cl and ALS-FTD subgroups.

Neural Networks
We employed neural networks for classification after a custom code was written using MATLAB (Mathworks https://www.mathworks.com/ access date 1 October 2019) version 2016. Briefly, a neural network is a computing model comprising of neurons in different layers. After a literature review, we elected to use a two-layered architecture. The number of neurons in each of the two hidden layers was decided on a case-by-case basis (i.e., when WM or GM attributes were considered separately, clinical and demographic measures were considered, or all of these measures were taken together) using the geometric pyramidal rule proposed by Masters [25] calculations given below: If we define r = Number of input data Number of output data 1/3 , then the number of neurons in the first hidden layer = (Number of output data) × r 2 -second hidden layer = (Number of output data) × r Gradient descent with momentum was used for parameter characterization. The value of momentum was fixed at 0.9, training time at 5000, and number of folds for crossvalidation was 5.

Random-Forest-Based Classification
The RF algorithm was used to perform the above classification between ALS clinical phenotypes, and average values were used to fill the missing data values. All 121 attributes were considered for this classification and output classes were 0 for control group, 1 for ALS-CST+ subgroup, 2 for ALS-CST− subgroup, 3 for ALS-Cl subgroup, and 4 for ALS-FTD subgroup. Accuracy with RF was 71% and RF classification after attribute selection using the attributes given by WEKA are shown in Table 2. RF classified the ALS clinical phenotypes from controls as well as between the ALS subgroups with 71% accuracy. Accuracy remained at 71% with the reduced attributes when compared to classifying the ALS phenotypes using all 121 attributes. The confusion matrix, statistical parameters and variable of importance when considering all the attributes, and the WEKA-selected attributes are shown in Figures 3 and 4. A comparison between the statistical measures in Figures 3 and 4 shows that attribute selection using WEKA shows some good improvement in the ALS-CST+ and ALS-CST− groups, suggesting that selected features may be better when compared to considering all 121 features for classification. Accuracy increased to 89.5%, however, when the second category for classification comprised of 0 for controls, 1 for ALS CST+, and 2 for ALS CST−, when the attributes given by WEKA (Table 3) were used and average values to fill the missing data points were considered. Accuracy dropped to 78.9% when all 121 attributes were used. The confusion matrix, statistical parameters and variable of importance when considering all the attributes, and the WEKA-selected attributes are shown in Figures 5 and 6. Since the classification of ALS-CST+ and ALS-CST− groups was based on MRI, in order to understand the role played by imaging attributes in classifying the 0, 1, and 2 groups, we performed an analysis where we considered the imaging measures independently from clinical measures. GM measures, when considered independently, resulted in an accuracy of 36%, whereas WM measures alone yielded 73% accuracy, and 76% accuracy was obtained when clinical and demographic measures were considered alone. As mentioned above, when all the measures were combined, the accuracy of the classification between controls, ALS-CST+, and ALS-CST− groups was 88%.      Similarly, when we considered 0 for controls, 3 for ALS-Cl, and 4 for ALS-FTD as well as all 121 attributes with average values to fill the missing data points, accuracy was 88.9%. Accuracy improved to 94.44% when attributes (given by WEKA in Table 4) were used. The confusion matrix, statistical parameters and variable of importance when considering all the attributes, and the WEKA-selected attributes are shown in Figures 7 and 8. We then considered clinical, GM, and WM measures independently. When clinical measures were used to classify all ALS clinical phenotypes, an accuracy of 64% was obtained. The classification of ALS CST+ and ALS CST− patients resulted in accuracy of 75%, and ALS-Cl and ALS-FTD patients resulted in accuracy of 78%. When GM attributes were used to classify the ALS clinical phenotypes and controls, classification accuracy remained very poor (<48%). On the other hand, using WM measures resulted in an accuracy of 73% when comparing between controls, ALS CST+, and ALS CST− patients, and 75% when comparing between controls, ALS-Cl, and ALS-FTD patients; when all ALS clinical phenotypes combined and controls were considered, accuracy dropped to 58%.

Neural-Network-Based Classification
NN-based classification resulted in a low accuracy of 61% when all 121 attributes were used to stratify the 4 ALS patient phenotypes and control group. Selection of features shown in Table 2 slightly improved classification accuracy to 71%. Classifying the second category of the controls, ALS CST+, and ALS CST− subgroups by incorporating all 121 attributes resulted in 75% accuracy; after selecting features shown in Table 3, this increased to 83%. Similarly, when the controls, ALS-Cl, and ALS-FTD subgroups were classified incorporating all 121 attributes, accuracy was 75% but also increased to 83% after using the selected features shown in Table 4. Considering clinical, GM, and WM features independently, the following accuracies were observed: (1) clinical measures alone provided an accuracy of 55% when considering all ALS phenotypes, 88% when classifying ALS-CST+ from ALS-CST− subgroups, and 77% when classifying ALS-Cl from ALS-FTD subgroups; (2) GM measures alone gave an accuracy of only 28% when stratifying controls and all ALS phenotypes; 57% when classifying controls, ALS-CST+, and ALS-CST− subgroups; and 50% when classifying controls, ALS-Cl, and ALS-FTD subgroups; and (3) WM measures alone produced an accuracy of 66% when classifying between controls and all ALS phenotypes; 58% when classifying controls, ALS-CST+, and ALS-CST subgroups; and also 58% when classifying controls, ALS-Cl, and ALS-FTD subgroups.

Discussion
In this study, we demonstrated the: (1)  In the classification of all ALS phenotypes and controls, assortativity, global and local efficiency, mean edge betweenness, normalized path length, and density played important roles in patient stratification among different graph metrics. The assortativity metric reflects the tendency of nodes to connect with other nodes of similar properties (e.g., high-degree nodes connect with other high-degree nodes), whereas negative assortativity (or disassortativity) indicates nodes of dissimilar properties connecting (e.g., high-degree nodes connect with lower-degree nodes). Mean assortativity values of controls and ALS subgroups for AD and RD graph networks, as shown in Supplementary Table S1, revealed differing machine-learning-selected attributes between patient subgroups. Specifically, values among ALS-CST+, ALS-CST−, and ALS-Cl subgroups showed positive assortativity but slightly reduced values when compared to controls, which may reflect compensatory mechanisms by the disrupted brain network. AD-weighted graph-network disassortativity in ALS-FTD patients may be compensatory for functional deficits occurring during loss of WM connections in the frontal and temporal lobes [4]. In a longitudinal ALS study using functional magnetoencephalography, Sorrentino et al. [26] observed high disassortativity with disease progression, which suggested that more peripheral nodes connect to higherdegree nodes to compensate for dysfunctional degenerating brain areas.
Differences in certain DTI-metric attributes at various CST-rostrocaudal levels (see Figure 2) were important in classifying ALS-CST+, ALS-CST−, ALS-FTD, and ALS-Cl subgroups, including RD and MD at CP, CSoLV, and subPMC levels, and AD at the CSoLV level. The ability of these metrics to differentiate the ALS phenotypes infers that distinct patterns of CST degeneration occur in these patient subgroups. In addition, dominance of right hemisphere FA, AD, RD, and MD values in classifying the disease process suggests that CST fibers are affected differentially between hemispheres, more so on the right.
Graph theory measures involved in classifying controls, ALS-CST+, and ALS-CST− groups (as shown in Table 3) included assortativity_AD, transitivity_FA, characteristic path length_FA, mean degree_MD, normalized clustering coefficient_MD, and normalized path length_RD. Transitivity and clustering coefficients reflect local structural (topologywise) segregation of nodes for functional integration, whereas path length reflects global integration of information in a network. Mean value differences of the above measures were greater between controls and ALS-CST− patients than between controls and ALS-CST+ patients, as seen in Supplementary Table S2. In the ALS-CST− subgroup, increases of characteristic and normalized path length values suggests compromised global integration, whereas decreased assortativity values imply greater interconnection between low-degree and high-degree nodes.
Interestingly, FA, AD, RD, and MD values at only rostral levels of the CST (in CSoLV and subPMC) were used by the feature selection algorithm to classify controls, ALS-CST−, and ALS-CST+ subgroups, while those at caudal levels (in PLIC and CP) were not. Fractal dimension (FD) attributes of WM or GM were not involved in classification. The DTI metrics along the CST revealed abnormal values in both ALS-CST+ and ALS-CST− subgroups when compared to controls and between these two patient subgroups. Mean values of the chosen DTI attributes for control group and UMN-predominant ALS subgroups are shown in Supplementary Table S3. The observation that essentially all DTI metrics along the CST chosen for classification by the feature selection algorithm were from the right hemisphere (except for MD in the left CP) emphasizes the asymmetry of hemispheric involvement, as mentioned above. In addition, lack of GM attributes being chosen for classification and no significant difference of GM FD values between controls, ALS-CST+, and ALS-CST− subgroups suggests either a primary 'axonopathy' (and not a 'neuronopathy') or insensitivity of GM measures in these UMN-predominant ALS patients. Clinical attributes that played a key role in classifying ALS-CST+ and ALS-CST− subgroups include symptom duration, El Escorial criteria score, and ALSFRS-R subscores of bulbar and lumbosacral function; mean values of these measures are shown in Supplementary Table S4.
Graph theory measures used to classify controls, ALS-Cl, and ALS-FTD subgroups (as shown in Table 4), resulted in accuracies of 88% for RF and 83% for NN. Except for attribute #7, these were all different from those used to classify controls, ALS-CST+, and ALS-CST− subgroups (discussed above), and included: (1) density of the AD network (den-sity_AD), (2) mean local efficiency (mean local efficiency_AD), (3) mean edge betweenness (mean edge betweenness_AD), (4) density based on MD network (density_MD), (5) mean clustering coefficient (mean clustering coefficient_MD), (6) mean local efficiency (mean local efficiency_MD), and (7) normalized path length of the RD network (normalized path length_RD). Network density measures the proportion of connections relative to the total number of possible connections. Mean value differences of the above measures were lower in ALS-FTD patients than in controls and ALS-Cl patients, as shown in Supplementary  Table S5. Edge betweenness, local efficiency, and density metrics were important in classifying ALS-FTD from ALS-Cl and controls. Globally lower graph theory metric values in ALS-FTD patients compared to controls and ALS-Cl patients indicate more prominent WM abnormalities in the former patient subgroup. Attributes from DTI measures along the CST involved in classifying ALS-Cl and ALS-FTD subgroups, and controls were not restricted to its caudal level (as when classifying ALS-CST+ and ALS-CST− subgroups) but involved its entire rostrocaudal extent, as shown in Supplementary Table S6. FD WM and GM metrics were important in classifying ALS-Cl and ALS-FTD patients and controls with their mean values, as shown in Supplementary Table S7. Interestingly, WM fractal values were reduced in ALS-FTD patients compared to ALS-Cl patients and controls, whereas GM fractal values were increased in ALS-Cl and ALS-FTD patients when compared to controls. Increased FD GM values suggest an amorphous change in GM structures, as could occur with loss of neurons and their processes. Importance of GM features in classifying ALS-Cl and ALS-FTD patients and controls suggests that neurodegeneration in ALS-FTD (and possibly ALS-Cl) patients may be primarily a 'neuronopathy'. The mean values of clinical attributes that played a key role in classifying ALS-Cl and ALS-FTD subgroups including symptom duration, El Escorial score, and ALSFRS-R are shown in Supplementary Table S8.

Conclusions
The RF algorithm was the most accurate in classifying the four different ALS phenotypes. WM metrics played a major role in classifying these phenotypes compared to GM or clinical measures. Although WM measures from both right and left hemispheres should be considered when identifying ALS phenotypes (consistent with bihemispheric degeneration occurring in ALS), they appear to be differentially affected by the disease process. Longitudinal studies of multiple time-point data can confirm and expand on our findings.
Increasing patient numbers in each subgroup, potentially through a multisite study will strengthen the statistical validity of our present findings. Time constraints in obtaining MRI under a clinical protocol prevented the use of smaller voxel sizes and higher-directionnumber DTI acquisitions. Follow-up studies using higher-resolution imaging parameters at 3T will also be useful to confirm our findings. As a biomarker, neuroimaging is useful not only in cross-sectional studies (as ours here) for diagnosis and identification of abnormalities present at a single timepoint, but also in multiple timepoint analyses. Therefore, future longitudinal studies of various patient subgroups beginning even earlier in the course of ALS will provide insights into the evolution and spread of disease throughout the brain.
Supplementary Materials: The following supporting information can be downloaded at: https: //www.mdpi.com/article/10.3390/diagnostics13091521/s1, Table S1: Mean assortativity values of the control group and ALS subgroups; Table S2: Mean graph measure values in the control group and UMN-predominant ALS subgroups; Table S3: Mean DTI metrics along the CST in the control group and UMN-predominant ALS subgroups; Table S4: Clinical measures influencing classification of UMN-predominant ALS subgroups; Table S5: Mean graph measure values in controls and patients with classic ALS and ALS-FTD; Table S6: Mean DTI metrics along the CST in controls and patients with classic ALS and ALS-FTD; Table S7: Mean FD attributes influencing classification of controls and patients with classic ALS and ALS-FTD; Table S8: Clinical measures influencing classification of patients with classic ALS and ALS-FTD.
Author Contributions: V.R.-acquired the MRI data, processed the MRI data, guided the machinelearning-based data analysis, analyzed the results, and wrote the manuscript. K.G.C.-wrote machine learning programs, processed the data using machine learning methods, analyzed the results, and wrote portions of the methods section. E.P.P.-designed the study, acquired the MRI data, analyzed the results, and made extensive revisions in the manuscript. All authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.

Institutional Review Board Statement:
The study was conducted in accordance with the Declaration of Helsinki, and was approved by the Institutional Review Board (IRB) at Cleveland Clinic (IRB Nos 07-592 and 10-084). Because IRB approval was for a Registry Study to store and analyze MRI and clinical data, only verbal consent was required from patients and controls.
Informed Consent Statement: Verbal consent was obtained from patients and controls as per IRB's requirements.

Data Availability Statement:
The data used in this study are property of Cleveland Clinic and therefore cannot be shared.