Abstract
Amyotrophic lateral sclerosis (ALS) consists of the progressive degeneration of motor neurons, caused by poorly understood mechanisms for which there is no cure. Some of the cellular perturbations associated with ALS can be detected in peripheral cells, including lymphocytes from blood. A related cell system that is very suitable for research consists of human lymphoblastoid cell lines (LCLs), which are immortalized lymphocytes. LCLs that can be easily expanded in culture and can be maintained for long periods as stable cultures. We investigated, on a small set of LCLs, if a proteomics analysis using liquid chromatography followed by tandem mass spectrometry reveals proteins that are differentially present in ALS versus healthy controls. We found that individual proteins, the cellular and molecular pathways in which these proteins participate, are detected as differentially present in the ALS samples. Some of these proteins and pathways are already known to be perturbed in ALS, while others are new and present interest for further investigations. These observations suggest that a more detailed proteomics analysis of LCLs, using a larger number of samples, represents a promising approach for investigating ALS mechanisms and to search for therapeutic agents. Proteomics data are available via ProteomeXchange with identifier PXD040240.
1. Introduction
Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease also known as Lou Gehrig’s disease [,]. It manifests by the gradual loss of upper and lower motor neurons in the motor cortex, the brain stem nuclei, and the anterior horn of the spinal cord [,]. The incidence of ALS is 1–2.6 per 100,000 people per year, the point of prevalence is 3–6 per 100,000 people in Europe and the United States, and the lifetime risk is approx. 1 in 300 [,]. The prognosis for survival is 2 to 5 years []. There are two types of ALS, differentiated by the genetic background: inherited (familial) and sporadic. The sporadic form represents approx. 90% of all cases [,]. The causal factors of sporadic ALS are currently not known. There is no cure or effective treatment for ALS. The treatment consists of multidisciplinary care, including nutritional and respiratory support and symptom management [,].
Studying the pathological processes of ALS in the affected cells of alive patients is difficult. The pathological changes in ALS affect the motor neurons, which are the neurons that control the muscles. The death of these neurons leads to general progressive paralysis, which is invariably fatal. Human motor neurons are not accessible for biochemical or cellular investigations, except from deceased patients. Post-mortem neurons display processes that occur in the advanced stage of the disease, which is not very informative about how the pathology is initiated. Moreover, these neurons are affected by the delay between death and the time when the cells become available for analysis. This delay produces damages that make the disease-specific changes more difficult to detect.
As an alternative, model systems are used, based on easily accessible peripheral human cells, either freshly collected or maintained in cell culture. Such systems are used for attempts to discover causal mechanisms and potential therapeutic agents. The relevance of this strategy is supported by numerous observations, which showed that consistent ALS-associated differences exist in peripheral cells unrelated to the nervous systems, e.g., in peripheral blood mononuclear cells or fibroblasts in [,].
A particular model system based on blood cells consists of lymphoblastoid cell lines (LCL), which are lymphocytes immortalized using the Epstein–Barr virus [,]. Compared to the primary cells, these cells can be stably propagated and multiplied in culture to high numbers and for long periods. This characteristic allows for consistent comparative investigations of various features and interventions, which is favorable for both mechanistic studies and for attempts to find therapeutic agents, e.g., by screening large libraries of small molecules. Most importantly, LCL from ALS donors present clear differences in comparison with non-ALS control cells [,].
Genome-wide analyses of differences between ALS and control LCL could reveal cellular networks that are dysregulated in ALS, which could in turn shed light on the mechanisms of the disease and reveal potential biomarkers. Such comparative genome-wide analyses for LCL has been done only at the RNA level and only for the most common variant of familial ALS [], which represents only 7% of the total ALS cases [,].
We present here the comparison of the gene expression at the protein level, for LCL from sporadic ALS vs. control cells from non-diseased donors.
With few exceptions, the entities that carry out the cellular functions are proteins, not RNAs. However, when attempting to analyze cellular functions, RNA measurements are frequently preferred as surrogates for the abundance of corresponding proteins, although there is only an approximate proportionality between the RNA and the corresponding protein levels. The reason is that the amount of input material necessary for RNA analysis is, as a cell number, almost two orders of magnitude lower for RNA-Seq than for proteomics. Moreover, RNA can be amplified if necessary. A key advantage of using LCL is that the obtainable cell numbers are not limiting factors: the immortalized LCL can be easily multiplied to large numbers in a cell culture, to obtain sufficient material for proteomics studies.
Most importantly, compared to the RNA level, proteomics can reveal new information about the disease. RNA analysis can leave undiscovered essential pathological differences because RNA could reflect imprecisely variations in some key proteins perturbed in the disease, due, e.g., to differences in the protein synthesis or turnover rates. As shown, for instance, by a recent study on Alzheimer’s disease, RNA-seq analysis missed almost half of the differences detected at the protein level [].
Proteomics is used not only in the identification of proteins [,,], but also in the identification of protein post-translational modifications (PTMs) [,], of stable and transient protein–protein interactions (PPIs) and on the PTMs of these stable and transient PPIs [,,]. In addition, proteomics is used to identify differentially expressed/dysregulated proteins using label-based or label-free proteomics [,]. Here, we used label-free proteomics of LCL derived from ALS vs. healthy donors to identify differentially expressed (upregulated and downregulated) proteins that are related to ALS. Some of these proteins, filtered and then validated through a series of statistical analysis and pathway analysis software, confirmed that they are involved in ALS. These and other dysregulated proteins are discussed in relation to previously published studies and the pathogenesis of ALS.
2. Results and Discussion
Differentially present proteins. The presence of proteins differentially present in the ALS vs. the healthy donors was investigated using the statistical packages for proteomics: Scaffold, NormalizeR [], ProDA [], and ROTS [].
The Scaffold package includes two tests: Student’s t-test and Fisher’s exact test. Fisher’s exact test performed with the Benjamini–Hochberg adjustment for multiple testing and a threshold of p < 0.05 identified 223 proteins that had p values below the critical value reported by the test, 0.0178. We excluded from this list the proteins that had less than two non-zero value in the eight samples, and the proteins that showed a difference between the average of ALS and the control sets of less than 150%, corresponding to a log2 ratio smaller than +/− 0.854. Under these conditions, 120 proteins were differentially present, of which 52 were downregulated in ALS and 68 were upregulated (Table 1).

Table 1.
Dysregulated proteins identified in our study.
Some of these proteins have been reported to be dysregulated in ALS. For instance, among the most 20 downregulated proteins, 4 are known to be downregulated in ALS: the drebrin-like protein DBNL [], the calcyclin-binding protein (CacyBP) [], and the UV excision repair protein RAD23 homolog B (R23B) []. The fructose-bisphosphate aldolase A (ALDO A) is downregulated in rapidly progressing ALS [].
Among the 20 most upregulated proteins in ALS, the trifunctional purine biosynthetic protein adenosine-3 (glycyl-tRNA synthetase (GART) enzyme) has a potential connection with ALS: a dominant mutation in this protein causes a toxic gain-of-function, which causes peripheral neuropathy that principally affects the upper limbs [].
If the proteins detected by us as differentially expressed are indeed perturbed in ALS, and are not just statistical noise, proteins known to be perturbed in ALS should be much less frequent among the proteins that are not differentially expressed in our samples. This is indeed the case: among 25 proteins that are not differentially expressed (having ratios of ALS/healthy between 0.7 and 1.3), none was reported to have differential expression in ALS. Compared to this frequency (0/25), the frequency of ALS-associated proteins among the downregulated proteins is 4/20 = 20%. The difference is statistically significant: as the test cannot compare zero frequency, we compare the immediate higher frequency, 1/25, with 4/20, which leads to p(Exp-Obs>=3) = 0.007 by the binomial test, two-sided.
Other strategies frequently used for detecting differentially present proteins in proteomics data are based on the statistical package limma (“linear models for microarray data”). Limma was initially developed for DNA microarray analysis and subsequently used extensively for the analysis of RNA-seq data []. More recently, limma was applied to the analysis of proteomics data [,]. We analyzed our data using three packages based on limma. We submitted to these packages the same set of 120 differentially expressed proteins mentioned previously.
NormalizerDE allows the differential expression analysis of liquid chromatography-mass spectrometry data using the empirical Bayes limma approach []. The software was accessed online via a web server. The proteomics data across the eight samples were normalized using the CycLoess procedure available in this package. Subsequently the data were log2 transformed using the option available in the package. The analysis revealed three differentially expressed proteins (adj p < 0.1): transaldolase (ALDOA), phospho-ribosyl-formyl-glycinamidine synthase (PFAS), and the ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit 1 (ATP5F1A or ATP5A1). Two of these proteins, ALDOA and PFAS, were also detected as significantly differentially present by the Scaffold Fisher’s exact test (Table 1). Among the three differentially expressed proteins, two are known to be downregulated in ALS: ALDOA [] and ATP5F1A [].
ROTs (reproducibility-optimized test statistic) is a bioconductor R package, which adjusts a modified t-statistic according to the inherent properties of the data and provides a ranking of the features based on the differential expression between the two groups []. ROTS has been applied successfully in a range of different studies from transcriptomics to proteomics, showing a competitive performance against other state-of-the-art methods. The file generated by Scaffold was normalized using the R function “normalize.quantiles” from BioConductor. The value 0.3 was added to all values to allow for the application of the log function to the initial zero values and the data were subjected to a log2 transformation, to correct the high skewness of the primary data. One protein was detected as significantly downregulated (FDR < 0.1), transaldolase (ALDOA) (average 36.6 in controls versus 0 in ALS). As mentioned, ALDOA is known to be downregulated in ALS.
proDA (inference of protein differential abundance by probabilistic dropout analysis) is a recently introduced method for identifying differentially abundant proteins in label-free mass spectrometry, which boosts statistical power for small sample sizes by using variance moderation []. The method is implemented as an open-source R package [] as the method requires log2 transformed data, 0.3 was added to all the values. Two proteins are detected as significantly differentially expressed (p < 0.1). The first is gamma enolase (ENO2), p-adj = 0.0032, average 23.9 in controls vs. 12.5 in ALS. ENO2 is detected as significantly downregulated in ALS by the Fisher’s exact test in Scaffold. The second is HSPC108, downregulated in ALS (0.25 in ALS vs. 2.65 in controls), also detected as differentially expressed in Scaffold by Fisher’s exact test. ENO2 levels are known to be elevated in the cerebrospinal fluid (CSF) of ALS patients [], in the CSF, and serum in patients with a cervico-thoracic form of ALS and in patients with a disease duration from 1 to 4 years [].
Pathway analysis. Ingenuity pathway analysis (IPA). The list of differentially expressed proteins (Table 1) was subjected to the QIAGEN IPA package (https://digitalinsights.qiagen.com/IPA [], accessed on 15 September 2022. The top 20 most statistically enriched pathways are listed in Table 2. Perturbations in 6 of these 20 pathways (35%) have been reported to occur in ALS, as detailed below.

Table 2.
Signaling pathways found dysregulated in our study.
The pathway that presents the most statistically significant enrichment is the “EIF2 pathway”. EIF2α is key factor for the initiation of protein translation. Recent studies suggest that endoplasmic reticulum stress may play a critical role in ALS pathogenesis through an altered regulation of the proteostasis, the cellular pathway-balancing protein synthesis, and degradation. EIF2α is a key factor in this process []. In response to proteotoxic stress, EIF2α is phosphorylated by MARK2, which is activated via phosphorylation in human patients with ALS []. Our differentially expressed protein set is enriched in components of the “Ran signaling pathway”. Ran is a key effector of nucleocytoplasmic-transport and its deficit could play an important role in the pathology of ALS [,,]. The differential protein set is also enriched in members of the “Protein Ubiquitination pathway”, the disruption of which is a widely accepted factor in ALS []. Another enriched pathway is the “Antigen presenting pathway”, which was reported to modulate the age of onset of ALS []. The blood levels of the “Interferon-gamma (IFN-γ) pathway” present significant differences in ALS patients [] and the components of this pathway are enriched in our differential protein set. Another pathway that presents enrichment is the “Actin Cytoskeleton Signaling”, which contributes to motor neuron degeneration []. The modulation of actin polymerization affects nucleocytoplasmic transport in multiple forms of ALS [].
Two other pathways associated with neuronal pathology are also in the top 20 of the enriched pathways: “Multiple Sclerosis Signaling Pathway” and “Neuro-inflammation Signaling Pathway”. This observation supports the assumption that although LCLs are peripheral cells, some processes in these cells correlate with processes in neuronal cells, and therefore LCLs can be used to derive information about the pathology of the neuronal cells affected by the disease.
The last column in Table 2 (“Ratio”) represents the ratio between the proteins that belong to the specified pathway and are differentially present in ALS, and the total number of the proteins in the pathway. The average ratio is 6.7% (range 2.1–21%). These percentages seem low, but it should be taken into account that only approx. 1300 proteins have been detected by us, which represent probably only approx. 10% of the total number of proteins expressed in LCLs. Based on this assumption, it can be extrapolated that if all the proteins would be detected, the “coverage” of the differential pathways would be on average approx. 67%, the majority of the proteins participating in the top 20 pathways.
pathfindR is an R package for the identification of enriched pathways based on protein–protein interaction networks []. pathfindR can identify relevant pathways which cannot be identified by other tools []. The package can use five protein interaction networks (Biogrid, STRING, GeneMania, IntAct, KEGG) and seven gene sets (GO-CC (cellular component), GO-BP (biological process), GO-MF (molecular function), GO-All (all combined), and KEGG, Biocarta, Reactome). These combinations detect a total number of 223 significantly enriched pathways (p < 0.05). The combination Biogrid–Reactome detects the highest number (119) of significantly enriched pathways.
The most frequent significantly enriched pathways are related to rRNA, ribosomes, protein translation (15 occurrences), antigen presentation (9), protein ubiquitination–protein degradation (6), and apoptosis regulation (9). Some of these perturbations have been detected previously in ALS. Relevant for the first category, the following perturbations in ALS has been reported: a profound destabilization of ribosomal and mitochondrial RNAs [], perturbation in the ribosomal function [], and increased ribosome numbers in axons as an early event []. Regarding the second category, protein homeostasis, ALS is associated with proteostasis collapse []. Central to the maintenance of proteostasis are the predominant protein degradation pathways, the ubiquitin–proteasome system (UPS), and the autophagy system.
Gene set enrichment analysis (GSEA) is a computational method that determines whether a set of genes shows statistically significant similarity with gene sets from a large collection that have certain characteristics, e.g., participate in specific cellular or molecular processes, or are perturbed in diseases or by various interventions, e.g., specific gene modifications or treatment with chemical agents []. GSEA was initially introduced for analysis of cDNA microarrays and was subsequently adapted for RNA-Seq data. Based on the correlation between the RNA and protein levels, the method could also provide information about protein sets, as illustrated below. Compared to other pathway identification methods, many of the gene sets available in GSEA sets are not pieced together from interactions between protein pairs or protein subsets but are the direct results of RNA-seq or microarray experiments, with no theoretical deductions regarding the pathway that may or may not be actually functioning in cells.
From the list of 614 proteins present in at least 2 samples, as generated by Scaffold and having associated NCBI gi tags, GSEA identified 484 proteins. The parameters used for analysis were: 1000 permutations, no collapse (i.e., data are used in the original format), and enrichment statistics-weighted, max size 500, min size 8, normalization mode: none. Eight types of cumulative gene sets are available in GSEA for comparison with the submitted set. The significance threshold used was p < 0.05 and the FDR (false discovery rate) was 0.25. Across all 8 sets, GSEA identifies 147 sets that are significantly enriched in genes present in our set of ALS differentially expressed genes. Six of these are clearly relevant for ALS, based on published information about processes perturbed in ALS, or interventions that could alleviate ALS. These enrichments are shown in Figure 1).

Figure 1.
Proteins downregulated in the ALS samples are enriched in proteins of GSEA sets known to be perturbed in ALS: (a) the “E2F1_UP.V1_UP” set. The lower histogram shows the ratio in ALS vs. control for each protein differentially present in our samples, ranked from the most upregulated (on the left) to the most downregulated (to the right). The vertical bars in the upper panel show the positions of the proteins from the GSEA set present in the proteomics set; (b–g): similar upper histograms for other GSEA pathways enriched in ALS differentially expressed proteins; (b) “Rutella response to HGF_DN” (genes induced by HGF); (c) “KEGG spliceosome”; (d) “module 183 cancer RNA splicing”; (e) “QI_Hypoxia”, (HIF-induced genes); (f) “GOBP positive regulation of protein localization to nucleus”; (g) “GOBP regulation of protein localization to nucleus”.
(a) “E2F1_UP.V1_UP” set, which contains genes up-regulated by everolimus. Everolimus, an inhibitor of the TORC pathway, is a derivative of rapamycin, which is an mTOR inhibitor. This signaling pathway is upregulated in ALS and reduces autophagy. Rapamycin has been evaluated for ALS therapy by the restoration of autophagy [].
(b) “Rutella response to HGF_DN”, this set contains genes induced by HGF (hepatocyte growth factor), which is currently explored as a potential therapy for ALS []. The implication is that the LCLs from ALS patients present a reduction in proteins whose restoration by HGF could treat ALS.
(c) and (d) “KEGG spliceosome” and “module 183 cancer RNA splicing”. The RNA splicing process is known to be defective in ALS [].
(e) “QI_Hypoxia”, which contains HIF-induced genes, as detected by applying hypoxia to a prostate cancer cell line. Deficiency in HIF1-alpha signaling is known to occur in ALS, due to defective import of HIF1-alpha from cytoplasm in to the nucleus, which is necessary to trigger the induction of hypoxia-responsive genes [,].
(f) and (g) “GOBP positive regulation of protein localization to nucleus” and “GOBP regulation of protein localization to nucleus”; there is ample evidence that nucleocytoplasmic-transport deficits could play an important role in the pathology of ALS [].
In all the above cases, the enrichment occurs for genes that are downregulated in the ALS LCLs, and the direction of the perturbation is consistent with the pathological significance of the GSEA pathways. This observation further indicates that the perturbations detected in LCLs in our proteomics study correctly reflect the pathological processes that occur in ALS in human patients.
An interesting observation is that the proteins upregulated in our data do not appear to belong to any functional pathways. The explanation could be that the common feature of these proteins is not participation in some biological pathways, but common physical characteristics. For instance, these proteins could be more prone to folding errors, or more resistant to degradation by the unfolding protein response machinery, leading to the accumulation of these proteins, which confusingly appearing to be upregulated. Notably, such defect cannot be detected by RNA-seq or microarray studies, but can be revealed by proteomics, which detects protein fragments and does not depend on the folding state of the proteins.
In conclusion, the proteomics analysis LCLs derived from ALS donors versus controls revealed statistically significant differences in some proteins and pathways. A subset of these pathways has been previously reported to be perturbed in ALS, while others have not been previously linked to ALS. The latter observation suggests that proteomics of LCLs from ALS donors versus controls could be a useful strategy to derive new information about the ALS pathological processes and identify new markers that could be used for drug discovery.
Compared to primary cells freshly obtained from donors, LCLs offer a stable platform, which can be used repeatedly over long periods, to compare the effects of various interventions. Moreover, the cells can be expanded to large numbers, sufficient for drug discovery by high content screening of thousands of small compounds, or for genome-wide gene knockout of overexpression.
Apart from the statistical significance included in the software packages analyzed, the fact that proteins known to be perturbed in ALS are significantly more frequent among the proteins detected by the proteomics study to be differential, which increases the confidence that the proteomics analysis, although performed on a small number of samples, indeed detects proteins differentially expressed in ALS.
A new hypothesis supported by this study is that many proteins upregulated in ALS do not belong to specific upregulated pathways, but rather have similarities in either a higher propensity of failing to fold properly, or, if unfolded, they have a higher resistance to degradation by the unfolded protein response (UPR) system. Investigations aimed at finding such common characteristics, such as common motifs in the protein structures, could shed a new mechanistic light on ALS, and could suggest new therapies.
Overall, the study yielded promising results, which supports the utility of LCLs for ALS studies, and, in particular, the proteomics approach. Further studies using higher numbers of samples and other proteomics methods could be used to confirm and extend the reported observations.
3. Materials and Methods
Cell Lines
ALS and control lymphoblastoid cell lines (B-lymphocytes immortalized with the Epstein–Barr virus) [,] were obtained from NINDS Repository at Coriell Cell Biorepositories, Camden, New Jersey. In total, 4 cell lines originated from ALS donors (2 females and 2 males, 44 years old) and 4 lines were derived from healthy donors (2 females and 2 males, 41 to 47 years old). The cells were maintained in T25 flasks in a vertical position in 10 mL of RPMI 1640 medium containing 10% heat-inactivated fetal bovine serum (FBS), penicillin/streptomycin at 100 U/mL and 100 μg/mL in an incubator in 5% CO2 at 37 °C.
Protein extraction. Each cell line was grown to 500,000 cells/mL in 2 T25 flasks with 10 mL of medium. The proteins from FBS were removed by placing the cells from each flask in 50 mL conical tubes, centrifugation at 1410 RPM for 10 min at 4 °C, followed by resuspension of the pellets in 40 mL of washing buffer (0.29 M mannitol, 10 mM Tris pH 7.4). The centrifugations were repeated 3 times. The last pellet was resuspended in 1 mL of cold lysis buffer (1 M NaOH) after adding to the pellet 10 µL of Protease and Phosphatase Inhibitors Cocktail (MSSAFE-1VL, Sigma Aldrich St. Louis, MO, USA)). The pellet was resuspended by repeated pipetting and incubated on ice for 30 min, followed by microfugation at 14,000 RPM for 20 min at 4 °C. The supernatant was collected and the protein concentration was measured using the BCA colorimetric assay kit (Pierce Cat. No. 23227). Samples of 250–300 ug of protein were flash-frozen in liquid nitrogen and shipped for mass spectrometry analysis.
Mass spectrometry. Sample pre-processing. The samples were subjected to desalting and buffer exchange using Amicon Ultra 0.5 mL 3 kDa centrifugal filters (Millipore, Burlington, MA, USA), as previously described []. The total volume was brought to 200 µL using HPLC-grade water and centrifuged for 10 min at 13,000 RPM. HPLC-grade water (100 µL) was added to each filter and centrifuged for another 5 min at 13,000 RPM. The volume left in the filter (70–100 µL) was pipetted repeatedly up and down. The filter was then placed upside down in a new 1.5 mL tube and centrifuged for 2 min at 3500 RPM. The proteins left in the filter were collected by placing 100 µL of HPLC water in the filter and repeating the transfer to the new 1.5 mL tube. The protein concentration was then measured using a Bradford assay. Volumes of solution containing 200 ug of protein were dried down completely in a SpeedVac and resuspended in 20 µL of 6 M urea, 100 mM of Tris buffer pH 7.8, and sonicated for 30 min. Each sample was reduced by adding 1 µL of a solution containing 200 mM of dithiothreitol (DTT) and 100 mM of Tris buffer, gently vortexed and incubated at room temperature (RT) for one hour. Subsequently, the samples were alkylated using 4 µL of the alkylating agent, 200 mM of iodoacetamide (IAA), and 100 mM of Tris buffer, then gently vortexed and incubated in the dark for an hour at RT. Each sample was reduced again by adding 4 µL of the same reducing agent as before. HPLC water (155 µL) was added to each sample to reduce the urea concentration and then 20 ug trypsin in 100 µL of 20 mM Tris was added and the samples were incubated for 16–18 h at 37 °C. The reaction was stopped by adding a drop to glacial acetic acid to bring the pH < 6, and the samples were dried completely in a SpeedVac. The dried samples are then resolubilized in 100 µL of 0.1% formic acid and sonicated for 15 min. For each sample, a volume of 50 µL of 50% acetonitrile and 0.1% formic acid was pushed through 1 mg top-tips (Glygen) 3 times into waste, followed by 3 washes using 0.1% formic acid. The samples were added to the top-tips, pushed into the same 1.5 mL tube, reapplied to the same top-tip, and pushed into the same tube. Then, 50 µL of 0.1% formic acid was pushed through the tip into waste twice. After that, 50 µL of elution buffer (50% acetonitrile with 0.1% formic acid) was added to the tip and pushed into a new 0.6 mL tube. The same 50 µL volume was then pushed through the tip a second time. A new volume of 50 µL elution buffer was added to the tip and pushed into the same 0.6 mL tube, and then pushed again through the tip. The total volume in the new 0.6 mL tube was 100 µL. The samples were then dried down completely and re-solubilized in 50 µL 2% acetonitrile and 0.1% formic acid.
Proteomics analysis. The samples were analyzed using a NanoAcquity UPLC coupled with a QTOF Xevo G2-XS mass spectrometer (both from Waters Corp, Milford, MA, USA). The peptides were loaded on a NanoAcquity BEH130 C18 1.7 µm reversed phase chromatography UPLC column (Waters, Milford, MA, USA), that was coupled to a fused silica nano-ESI emitter (363 µm OD × 20 µm ID × 6.25 cm length, New Objective, Littleton, MA, USA). The samples (3 µL, 12 µg protein) were injected onto the column followed by a linear gradient with a flow rate of 0.6 µL/min: 1–7% organic solvent B (acetonitrile with 0.1% formic acid) for 1–9 min, 7% B (9–15 min), 7–40% B (15–350 min), 40% B (350–359 min), and back to the initial conditions of 1% B (359–360). Each sample was followed by 3 20 min washes, 2 30 min washes, 1 130 min wash, and 1 35 min external standard Glu-Fib. The MS instrument parameters were: capillary voltage: 2.8 kV, sampling cone: 40 V, source temperature: 80 °C, nano flow gas: 0.30 bar, purge gas: 50 L/h, gas source: nitrogen generator (Peak Scientific, Inchinnan, UK). The acquisition mode is positive polarity in sensitivity mode. MS survey scans from m/z of 100–2000 and automatic data-dependent analysis (DDA) of the top 8 ions with a high intensity with charges of 2+ to 6+. The MS/MS acquisition was then triggered when the MS signal rose above 1000 counts/second. The top 8 ions selected for MS/MS from the MS survey were subjected to further fragmentation until the MS/MS ion count reached 10,000/sec or after 0.4 s have elapsed. The charge state recognition was used for collision energy with a collision gas being argon. The washing runs (20 min runs for short washing steps and 130 min runs for long, extensive washing steps) were acquired under MSe conditions scanning a range m/z of 50–2000. Each of the 4 ALS and 4 control samples were run in triplicate.
Protein identification. The raw data were submitted ProteoWizard MS Convert to convert the input raw data to mzML readable data, and then the mzML data were further submitted to our in-house Mascot Daemon server (Matrix Science, London, UK) for database search. The following parameters were used: NCBI database (Homo sapiens) with a peptide tolerance of 1.3 Da and zero 13C, a MS/MS tolerance of 0.8 Da, trypsin enzyme, 3 maximum missed cleavages, a fixed modification of carbamidomethyl on a cysteine and a variable modification of oxidation on methionine.
The output of the database search (.dat files) were further analyzed by the software package Scaffold v. 4.3.4 (Proteome Software, Portland, OR, USA) for protein identification, label-free quantitation, and statistical analysis. The data files were then loaded and analyzed with the following options: protein threshold 90%, minimum number of peptides = 1; peptide threshold 80%; total spectrum count, no filter. The threshold for the minimum number of peptides was selected as 1, to avoid missing true positives. The caveat is obviously that the number of false positives is higher than for more stringent thresholds.
To determine the integrity of the data and proteins identified, we narrowed the thresholds for protein identification. The threshold used in this publication was a peptide threshold of 90%, minimum number of peptides as 1, and a peptide threshold of 80%. This had an FDR of 1.3% for proteins, and an FDR of 2.68% for spectra. To narrow this, we increased both the protein and peptide threshold as well as increased the minimum number of peptides identified. We did 3 different trials, each with a narrower threshold. The thresholds used were as follows: protein threshold of 90, min peptide of 1, and peptide threshold of 90 (90-1-90), a second with protein threshold of 90, min peptide of 2, and peptide threshold of 90 (90-2-90), and, finally, a protein threshold of 90, min peptide of 2, and peptide threshold of 95 (90-2-95). The first variation, 90-1-90, had an FDR of 1.3% for proteins and an FDR of 1.49% for spectra. The second, 90-2-90, had an FDR for proteins of 0.4% and an FDR for spectra of 1.46%. The final trial, 90-2-95, had and FDR of 0.3% for proteins and an FDR of 0.82% for spectra. When narrowed, we were able to identify almost all of the proteins in our tightest constraint, and others were still found, but were considered to be insignificant (p-value ≤ 0.05) under the specific conditions. Doing this confirmed that the proteins identified in this study are most likely true positives.
Proteins differentially expressed between the ALS and the control cells were assessed using Scaffold and the software packages NormalizeR [], ProDA [], and ROTS [].
Pathway analysis was performed using the software packages Ingenuity [], PathfindR [], and GSEA [].
Author Contributions
A.R., C.C.D. and D.W. performed the experimental design; A.R. selected and prepared the cell lysates; D.W. and C.C.D. performed the mass spectrometry experiments and the primary data analysis; A.R. and E.B. computed the differential expression, the pathway analysis and the biological significance; A.R., C.C.D. and D.W. wrote the manuscript. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Proteomics data are available via ProteomeXchange with identifier PXD040240.
Acknowledgments
We thank the members of the Biochemistry and Proteomics Laboratories for the pleasant working environment.
Conflicts of Interest
The authors declare no conflict of interest.
Sample Availability
Not applicable.
References
- Van Es, M.A.; Hardiman, O.; Chio, A.; Al-Chalabi, A.; Pasterkamp, R.J.; Veldink, J.H.; Van den Berg, L.H. Amyotrophic lateral sclerosis. Lancet 2017, 390, 2084–2098. [Google Scholar] [CrossRef]
- Masrori, P.; Van Damme, P. Amyotrophic lateral sclerosis: A clinical review. Eur. J. Neurol. 2020, 27, 1918–1929. [Google Scholar] [CrossRef]
- Talbott, E.O.; Malek, A.M.; Lacomis, D. The epidemiology of amyotrophic lateral sclerosis. Handb. Clin. Neurol. 2016, 138, 225–238. [Google Scholar]
- Iacoangeli, A.; Al Khleifat, A.; Jones, A.R.; Sproviero, W.; Shatunov, A.; Opie-Martin, S.; Morrison, K.E.; Shaw, P.J.; Shaw, C.E.; Fogh, I. C9orf72 intermediate expansions of 24–30 repeats are associated with ALS. Acta Neuropathol. Commun. 2019, 7, 1–7. [Google Scholar] [CrossRef] [PubMed]
- Kwiatkowski Jr, T.; Bosco, D.; Leclerc, A.; Tamrazian, E.; Vanderburg, C.; Russ, C.; Davis, A.; Gilchrist, J.; Kasarskis, E.; Munsat, T. Mutations in the FUS/TLS gene on chromosome 16 cause familial amyotrophic lateral sclerosis. Science 2009, 323, 1205–1208. [Google Scholar] [CrossRef] [PubMed]
- Hulisz, D. Disease state overview. Am. J. Manag. Care 2018, 24, S320–S326. [Google Scholar] [PubMed]
- Pansarasa, O.; Garofalo, M.; Scarian, E.; Dragoni, F.; Garau, J.; Di Gerlando, R.; Diamanti, L.; Bordoni, M.; Gagliardi, S. Biomarkers in Human Peripheral Blood Mononuclear Cells: The State of the Art in Amyotrophic Lateral Sclerosis. Int. J. Mol. Sci. 2022, 23, 2580. [Google Scholar] [CrossRef]
- Riancho, J.; Arozamena, S.; De Munain, A. Dermic-derived fibroblasts for the study of amyotrophic lateral sclerosis. Neural Regen. Res. 2020, 15, 2043. [Google Scholar] [CrossRef]
- Hussain, T.; Mulherkar, R. Lymphoblastoid cell lines: A continuous in vitro source of cells to study carcinogen sensitivity and DNA repair. Int. J. Mol. Cell. Med. 2012, 1, 75. [Google Scholar]
- Hui-Yuen, J.; McAllister, S.; Koganti, S.; Hill, E.; Bhaduri-McIntosh, S. Establishment of Epstein-Barr virus growth-transformed lymphoblastoid cell lines. JoVE (J. Vis. Exp.) 2011, 57, e3321. [Google Scholar]
- Pansarasa, O.; Bordoni, M.; Drufuca, L.; Diamanti, L.; Sproviero, D.; Trotti, R.; Bernuzzi, S.; La Salvia, S.; Gagliardi, S.; Ceroni, M. Lymphoblastoid cell lines as a model to understand amyotrophic lateral sclerosis disease mechanisms. Dis. Model. Mech. 2018, 11, dmm031625. [Google Scholar] [CrossRef]
- Annesley, S.J.; Fisher, P. Lymphoblastoid cell lines as models to study mitochondrial function in neurological disorders. Int. J. Mol. Sci. 2021, 22, 4536. [Google Scholar] [CrossRef]
- Cooper-Knock, J.; Bury, J.J.; Heath, P.R.; Wyles, M.; Higginbottom, A.; Gelsthorpe, C.; Highley, J.R.; Hautbergue, G.; Rattray, M.; Kirby, J. C9ORF72 GGGGCC expanded repeats produce splicing dysregulation which correlates with disease severity in amyotrophic lateral sclerosis. PLoS ONE 2015, 10, e0127376. [Google Scholar] [CrossRef] [PubMed]
- Johnson, E.C.; Carter, E.K.; Dammer, E.B.; Duong, D.M.; Gerasimov, E.S.; Liu, Y.; Liu, J.; Betarbet, R.; Ping, L.; Yin, L. Large-scale deep multi-layer analysis of Alzheimer’s disease brain reveals strong proteomic disease-related changes not observed at the RNA level. Nat. Neurosci. 2022, 25, 213–225. [Google Scholar] [CrossRef] [PubMed]
- Channaveerappa, D.; Lux, J.C.; Wormwood, K.L.; Heintz, T.A.; McLerie, M.; Treat, J.A.; King, H.; Alnasser, D.; Goodrow, R.J.; Ballard, G.; et al. Atrial electrophysiological and molecular remodelling induced by obstructive sleep apnoea. J. Cell Mol. Med. 2017, 21, 2223–2235. [Google Scholar] [CrossRef]
- Dupree, E.J.; Crimmins, B.S.; Holsen, T.M.; Darie, C.C. Proteomic Analysis of the Lake Trout (Salvelinus namaycush) Liver Identifies Proteins from Evolutionarily Close and -Distant Fish Relatives. Proteomics 2019, 19, e1800429. [Google Scholar] [CrossRef]
- Dupree, E.J.; Crimmins, B.S.; Holsen, T.M.; Darie, C.C. Developing Well-Annotated Species-Specific Protein Databases Using Comparative Proteogenomics. Adv. Exp. Med. Biol. 2019, 1140, 389–400. [Google Scholar] [PubMed]
- Neagu, A.-N.; Jayathirtha, M.; Baxter, E.; Donnelly, M.; Petre, B.A.; Darie, C.C. Applications of Tandem Mass Spectrometry (MS/MS) in Protein Analysis for Biomedical Research. Molecules 2022, 27, 2411. [Google Scholar] [CrossRef] [PubMed]
- Neagu, A.-N.; Whitham, D.; Buonanno, E.; Jenkins, A.; Alexa-Stratulat, T.; Tamba, B.I.; Darie, C.C. Proteomics and its applications in breast cancer. Am. J. Cancer Res. 2021, 11, 4006–4049. [Google Scholar]
- Darie, C.C.; Deinhardt, K.; Zhang, G.; Cardasis, H.S.; Chao, M.V.; Neubert, T.A. Identifying transient protein-protein interactions in EphB2 signaling by blue native PAGE and mass spectrometry. Proteomics 2011, 11, 4514–4528. [Google Scholar] [CrossRef]
- Ngounou Wetie, A.G.; Sokolowska, I.; Channaveerappa, D.; Dupree, E.J.; Jayathirtha, M.; Woods, A.G.; Darie, C.C. Proteomics and Non-proteomics Approaches to Study Stable and Transient Protein-Protein Interactions. Adv. Exp. Med. Biol. 2019, 1140, 121–142. [Google Scholar] [PubMed]
- Ngounou Wetie, A.G.; Sokolowska, I.; Woods, A.G.; Roy, U.; Deinhardt, K.; Darie, C.C. Protein-protein interactions: Switch from classical methods to proteomics and bioinformatics-based approaches. Cell. Mol. Life Sci. 2014, 71, 205–228. [Google Scholar] [CrossRef]
- Jayathirtha, M.; Neagu, A.N.; Whitham, D.; Alwine, S.; Darie, C.C. Investigation of the effects of downregulation of jumping translocation breakpoint (JTB) protein expression in MCF7 cells for potential use as a biomarker in breast cancer. Am. J. Cancer Res. 2022, 12, 4373–4398. [Google Scholar] [PubMed]
- Jayathirtha, M.; Neagu, A.N.; Whitham, D.; Alwine, S.; Darie, C.C. Investigation of the effects of overexpression of jumping translocation breakpoint (JTB) protein in MCF7 cells for potential use as a biomarker in breast cancer. Am. J. Cancer Res. 2022, 12, 1784–1823. [Google Scholar]
- Willforss, J.; Chawade, A.; Levander, F. NormalyzerDE: Online tool for improved normalization of omics expression data and high-sensitivity differential expression analysis. J. Proteome Res. 2018, 18, 732–740. [Google Scholar] [CrossRef] [PubMed]
- Ahlmann-Eltze, C.; Anders, S. proDA: Probabilistic dropout analysis for identifying differentially abundant proteins in label-free mass spectrometry. Biorxiv 2020, 661496. [Google Scholar]
- Suomi, T.; Seyednasrollah, F.; Jaakkola, M.K.; Faux, T.; Elo, L.L. ROTS: An R package for reproducibility-optimized statistical testing. PLoS Comput. Biol. 2017, 13, e1005562. [Google Scholar] [CrossRef]
- Cacabelos, D.; Ayala, V.; Granado-Serrano, A.B.; Jove, M.; Torres, P.; Boada, J.; Cabre, R.; Ramirez-Nunez, O.; Gonzalo, H.; Soler-Cantero, A.; et al. Interplay between TDP-43 and docosahexaenoic acid-related processes in amyotrophic lateral sclerosis. Neurobiol. Dis. 2016, 88, 148–160. [Google Scholar] [CrossRef]
- Iridoy, M.O.; Zubiri, I.; Zelaya, M.V.; Martinez, L.; Ausín, K.; Lachen-Montes, M.; Santamaría, E.; Fernandez-Irigoyen, J.; Jericó, I. Neuroanatomical Quantitative Proteomics Reveals Common Pathogenic Biological Routes between Amyotrophic Lateral Sclerosis (ALS) and Frontotemporal Dementia (FTD). Int. J. Mol. Sci. 2019, 20, 4. [Google Scholar] [CrossRef] [PubMed]
- Wang, H.; Rangaswamy, S.; Kodavati, M.; Mitra, J.; Guo, W.; Guerrero, E.N.; Van Den Bosch, L.; Hegde, M.L. RT2 PCR array screening reveals distinct perturbations in DNA damage response signaling in FUS-associated motor neuron disease. Mol. Brain 2019, 12, 1–5. [Google Scholar] [CrossRef]
- Leoni, E.; Bremang, M.; Mitra, V.; Zubiri, I.; Jung, S.; Lu, C.H.; Adiutori, R.; Lombardi, V.; Russell, C.; Koncarevic, S.; et al. Combined tissue-fluid proteomics to unravel phenotypic variability in amyotrophic lateral sclerosis. Sci. Rep. 2019, 9, 1–16. [Google Scholar] [CrossRef]
- Sleigh, J.N.; Mech, A.M.; Aktar, T.; Zhang, Y.; Schiavo, G. Altered sensory neuron development in CMT2D mice is site-specific and linked to increased GlyRS levels. Front. Cell. Neurosci. 2020, 14, 232. [Google Scholar] [CrossRef]
- Ritchie, M.E.; Phipson, B.; Wu, D.I.; Hu, Y.; Law, C.W.; Shi, W.; Smyth, G.K. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015, 43, e47. [Google Scholar] [CrossRef]
- Dowell, J.A.; Wright, L.J.; Armstrong, E.A.; Denu, J.M. Benchmarking quantitative performance in label-free proteomics. ACS Omega 2021, 6, 2494–2504. [Google Scholar] [CrossRef] [PubMed]
- Choi, S.Y.; Lopez-Gonzalez, R.; Krishnan, G.; Phillips, H.L.; Li, A.N.; Seeley, W.W.; Yao, W.D.; Almeida, S.; Gao, F.B. C9ORF72-ALS/FTD-associated poly (GR) binds Atp5a1 and compromises mitochondrial function in vivo. Nat. Neurosci. 2019, 22, 851–862. [Google Scholar] [CrossRef]
- Tsukahara, A.; Hosokawa, T.; Nishioka, D.; Kotani, T.; Ishida, S.; Takeuchi, T.; Kimura, F.; Arawaka, S. Neuron-specific enolase level is a useful biomarker for distinguishing amyotrophic lateral sclerosis from cervical spondylotic myelopathy. Sci. Rep. 2021, 11, 1–9. [Google Scholar]
- Rushkevich, Y.N.; Pashkouskaya, I.; Likhachev, S. Neurospecific proteins in cerebrospinal fluid and in the bloodserum of patients with amyotrophic lateral sclerosis. Zhurnal Nevrol. I Psikhiatrii Im. SS Korsakova 2018, 118, 75–80. [Google Scholar] [CrossRef]
- Krämer, A.; Green, J.; Pollard Jr, J.; Tugendreich, S. Causal analysis approaches in ingenuity pathway analysis. Bioinformatics 2014, 30, 523–530. [Google Scholar] [CrossRef]
- Dalla Bella, E.; Tramacere, I.; Antonini, G.; Borghero, G.; Capasso, M.; Caponnetto, C.; Chiò, A.; Corbo, M.; Eleopra, R.; Filosto, M.; et al. Protein misfolding, amyotrophic lateral sclerosis and guanabenz: Protocol for a phase II RCT with futility design (ProMISe trial). BMJ Open 2017, 7, e015434. [Google Scholar] [CrossRef] [PubMed]
- Vanneste, J.; Van Den Bosch, L. The Role of Nucleocytoplasmic Transport Defects in Amyotrophic Lateral Sclerosis. Int. J. Mol. Sci. 2021, 22, 12175. [Google Scholar] [CrossRef]
- Ferreira, P.A. The coming-of-age of nucleocytoplasmic transport in motor neuron disease and neurodegeneration. Cell. Mol. Life Sci. 2019, 76, 2247–2273. [Google Scholar] [CrossRef] [PubMed]
- Boeynaems, S.; Bogaert, E.; Van Damme, P.; Van Den Bosch, L. Inside out: The role of nucleocytoplasmic transport in ALS and FTLD. Acta Neuropathol. 2016, 132, 159–173. [Google Scholar] [CrossRef] [PubMed]
- Chisholm, C.G.; Lum, J.S.; Farrawell, N.E.; Yerbury, J.J. Ubiquitin homeostasis disruption, a common cause of proteostasis collapse in amyotrophic lateral sclerosis? Neural Regen. Res. 2022, 17, 2218. [Google Scholar] [PubMed]
- Zhang, M.; Xi, Z.; Saez-Atienzar, S.; Chia, R.; Moreno, D.; Sato, C.; Montazer Haghighi, M.; Traynor, B.J.; Zinman, L.; Rogaeva, E. Combined epigenetic/genetic study identified an ALS age of onset modifier. Acta Neuropathol. Commun. 2021, 9, 1–9. [Google Scholar] [CrossRef]
- Polverino, A.; Rucco, R.; Stillitano, I.; Bonavita, S.; Grimaldi, M.; Minino, R.; Pesoli, M.; Trojsi, F.; d’Ursi, A.M.; Sorrentino, G.; et al. In amyotrophic lateral sclerosis blood cytokines are altered, but do not correlate with changes in brain topology. Brain Connect. 2020, 10, 411–421. [Google Scholar] [CrossRef] [PubMed]
- Hensel, N.; Claus, P. The actin cytoskeleton in SMA and ALS: How does it contribute to motoneuron degeneration? Neuroscientist 2018, 24, 54–72. [Google Scholar] [CrossRef]
- Giampetruzzi, A.; Danielson, E.W.; Gumina, V.; Jeon, M.; Boopathy, S.; Brown, R.H.; Ratti, A.; Landers, J.E.; Fallini, C. Modulation of actin polymerization affects nucleocytoplasmic transport in multiple forms of amyotrophic lateral sclerosis. Nat. Commun. 2019, 10, 1–15. [Google Scholar] [CrossRef]
- Ulgen, E.; Ozisik, O.; Sezerman, O. pathfindR: An R package for comprehensive identification of enriched pathways in omics data through active subnetworks. Front. Genet. 2019, 10, 858. [Google Scholar] [CrossRef]
- Tank, E.M.; Figueroa-Romero, C.; Hinder, L.M.; Bedi, K.; Archbold, H.C.; Li, X.; Weskamp, K.; Safren, N.; Paez-Colasante, X.; Pacut, C.; et al. Abnormal RNA stability in amyotrophic lateral sclerosis. Nat. Commun. 2018, 9, 1–16. [Google Scholar] [CrossRef] [PubMed]
- Wang, J.C.; Ramaswami, G.; Geschwind, D. Gene co-expression network analysis in human spinal cord highlights mechanisms underlying amyotrophic lateral sclerosis susceptibility. Sci. Rep. 2021, 11, 1–14. [Google Scholar] [CrossRef]
- Verheijen, M.H.; Peviani, M.; Hendricusdottir, R.; Bell, E.M.; Lammens, M.; Smit, A.B.; Bendotti, C.; Van Minnen, J. Increased axonal ribosome numbers is an early event in the pathogenesis of amyotrophic lateral sclerosis. PLoS ONE 2014, 9, e87255. [Google Scholar] [CrossRef] [PubMed]
- Subramanian, A.; Tamayo, P.; Mootha, V.K.; Mukherjee, S.; Ebert, B.L.; Gillette, M.A.; Paulovich, A.; Pomeroy, S.L.; Golub, T.R.; Lander, E.S.; et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 2005, 102, 15545–15550. [Google Scholar] [CrossRef]
- Mandrioli, J.; D’Amico, R.; Zucchi, E.; Gessani, A.; Fini, N.; Fasano, A.; Caponnetto, C.; Chiò, A.; Dalla Bella, E.; Lunetta, C.; et al. Rapamycin treatment for amyotrophic lateral sclerosis: Protocol for a phase II randomized, double-blind, placebo-controlled, multicenter, clinical trial (RAP-ALS trial). Medicine 2018, 97, e11119. [Google Scholar] [CrossRef]
- Warita, H.; Kato, M.; Asada, R.; Yamashita, A.; Hayata, D.; Adachi, K.; Aoki, M. Safety, tolerability, and pharmacodynamics of intrathecal injection of recombinant human HGF (KP-100) in subjects with amyotrophic lateral sclerosis: A phase I trial. J. Clin. Pharmacol. 2019, 59, 677–687. [Google Scholar] [CrossRef] [PubMed]
- La Cognata, V.; Gentile, G.; Aronica, E.; Cavallaro, S. Splicing Players Are Differently Expressed in Sporadic Amyotrophic Lateral Sclerosis Molecular Clusters and Brain Regions. Cells 2020, 9, 159. [Google Scholar] [CrossRef]
- Nagara, Y.; Tateishi, T.; Yamasaki, R.; Hayashi, S.; Kawamura, M.; Kikuchi, H.; Iinuma, K.M.; Tanaka, M.; Iwaki, T.; Matsushita, T.; et al. Impaired Cytoplasmic–Nuclear Transport of Hypoxia-Inducible Factor-1α in Amyotrophic Lateral Sclerosis. Brain Pathol. 2013, 23, 534–546. [Google Scholar] [CrossRef] [PubMed]
- Moreau, C.; Gosset, P.; Kluza, J.; Brunaud-Danel, V.; Lassalle, P.; Marchetti, P.; Defebvre, L.; Destée, A.; Devos, D. Deregulation of the hypoxia inducible factor-1α pathway in monocytes from sporadic amyotrophic lateral sclerosis patients. Neuroscience 2011, 172, 110–117. [Google Scholar] [CrossRef]
- Dupree, E.J.; Goodwin, A.; Darie, C.C.; Boolani, A. A Pilot Exploratory Proteomics Investigation of Mental Fatigue and Mental Energy. Adv. Exp. Med. Biol. 2019, 1140, 601–611. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).