Next Article in Journal
Expression of Endogenous Putative TSH Binding Protein in Orbit
Previous Article in Journal
Functional Screenings Identify Regulatory Variants Associated with Breast Cancer Susceptibility
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Homozygosity Haplotype and Whole-Exome Sequencing Analysis to Identify Potentially Functional Rare Variants Involved in Multiple Sclerosis among Sardinian Families

1
Department of Brain and Behavioral Sciences, University of Pavia, 27100 Pavia, Italy
2
John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, FL 33136, USA
3
Dr. John T. Macdonald Foundation Department of Human Genetics, Miller School of Medicine, Miami, FL 33136, USA
4
Divisione di Neurologia, Presidio Ospedaliero S. Francesco, ASL Numero 3 Nuoro, 08100 Nuoro, Italy
5
Bioinformatics and Statistical Genomics Unit, Istituto Auxologico Italiano IRCCS, 20095 Cusano Milanino, Italy
*
Author to whom correspondence should be addressed.
Curr. Issues Mol. Biol. 2021, 43(3), 1778-1793; https://doi.org/10.3390/cimb43030125
Submission received: 22 September 2021 / Revised: 13 October 2021 / Accepted: 23 October 2021 / Published: 27 October 2021
(This article belongs to the Section Bioinformatics and Systems Biology)

Abstract

:
Multiple Sclerosis (MS) is a complex multifactorial autoimmune disease, whose sex- and age-adjusted prevalence in Sardinia (Italy) is among the highest worldwide. To date, 233 loci were associated with MS and almost 20% of risk heritability is attributable to common genetic variants, but many low-frequency and rare variants remain to be discovered. Here, we aimed to contribute to the understanding of the genetic basis of MS by investigating potentially functional rare variants. To this end, we analyzed thirteen multiplex Sardinian families with Immunochip genotyping data. For five families, Whole Exome Sequencing (WES) data were also available. Firstly, we performed a non-parametric Homozygosity Haplotype analysis for identifying the Region from Common Ancestor (RCA). Then, on these potential disease-linked RCA, we searched for the presence of rare variants shared by the affected individuals by analyzing WES data. We found: (i) a variant (43181034 T > G) in the splicing region on exon 27 of CUL9; (ii) a variant (50245517 A > C) in the splicing region on exon 16 of ATP9A; (iii) a non-synonymous variant (43223539 A > C), on exon 9 of TTBK1; (iv) a non-synonymous variant (42976917 A > C) on exon 9 of PPP2R5D; and v) a variant (109859349-109859354) in 3′UTR of MYO16.

1. Introduction

Multiple Sclerosis (MS) is a complex neurological autoimmune disease, which mainly affects people in early adulthood; for this reason, it is considered as the most common cause of neurologic disability in young adults [1,2].
The prevalence of the disease is different across the different countries: it has a high prevalence in Europe, with a north to south gradient, and a lower prevalence in Asia and Africa [3]. In Italy, we observed a disease prevalence of 176 per 100,000 inhabitants [4], except in the Mediterranean island of Sardinia, where we found an age- and sex-adjusted prevalence of MS of 330 per 100,000 inhabitants, among the highest reported worldwide, ranging from 217 in the Olbia-Tempio district to 425 in the Ogliastra district [5], with the lowest risk areas being closer to the coast.
Although most MS cases occur sporadically, about 20% of the affected individuals are related by family, with first-degree relatives of MS patients at increased risk of disease, thus suggesting that the disease is moderately heritable, with a sibling relative recurrence risk of 6.35 in the Caucasian population [6] and of 31 in the founder population of the Sardinian province of Nuoro [7].
In line with other common, complex disorders, almost 20% of risk heritability is attributable to common genetic variants in the autosomal genome, including 233 unequivocally MS-associated loci identified over the last 15 years by GWAS (genome-wide association studies), comprising 32 loci within the Major Histocompatibility Complex (MHC) [8,9,10,11,12,13], each of which explain only a small fraction of risk [14]. A recent study by the International Multiple Sclerosis Genetics Consortium (IMSGC) [15] provides evidence that 11.34% of risk heritability is explained by low-frequency variants (Minor Allele Frequency (MAF) < 5%); of these rare variants, (MAF < 1%) alone explains 9%. Most low-frequency variants impact genes that are not detectable by common variants identified by genome-wide association studies (GWAS), and only a small portion of them is in Linkage Disequilibrium (LD) with variants highlighted by GWAS.
Many low frequency and rare variant associations, as important sources of unexplained heritability, remain to be discovered [16]. This investigation would require large sample sizes to reach an appropriate statistical power, or alternatively, the use of multiplex families from a founder population for which both genotyping and sequencing data are available.
Our study aims at understanding the genetic contribution to MS and suggesting new potential causative variants in families, by contributing to the discovery of new exonic and potentially functional low-frequency variants. To this end, we analyzed multiplex families originating from the genetically homogeneous and isolated population of the Nuoro province of Sardinia for which Immunochip genotyping and whole exome-sequencing (WES) data are available.
We followed a two-stage approach. In the first stage, we prioritized candidate regions to be further investigated via a non-parametric Homozygosity Haplotype (HH) analysis, which uses reduced haplotypes composed by homozygous single nucleotide polymorphisms (SNPs) only and deletes all the heterozygous ones. We performed this analysis on thirteen families by exploiting the co-segregation of the disease and genetic variants between affected and unaffected subjects for a genome-wide search of shared autosomal segments. In the second stage, on the promising candidate regions identified in the HH analysis, we searched for the presence of rare variants shared by the affected individuals by analyzing WES data that were available for five families only.

2. Results

2.1. Sample Description

Thirteen multiplex Sardinian pedigrees, containing from three to sixteen MS patients each, were selected for the analysis, for a total of 80 affected (63 with Immunochip genotyping data) and 655 unaffected (220 with Immunochip genotyping data) patients.
Table 1 reports the description of the family data available for the HH analysis. We analyzed a total of 129.448 Immunochip QC-filtered SNPs that had assigned dbSNP refIDs. WES data were available for five families only. Specifically, three cases and one control for family 61, 10 cases for family 2360, two cases for family 45, four cases for family 4, and five cases for family 5.

2.2. Identification of RCHHs

The HH statistical analysis was performed for all the 13 families using both the genotyped patients (n = 63) and controls (n = 220), so that the algorithm worked to treat the affected and unaffected members of a family as cases and controls, respectively. As we had Immunochip data [10], obtained with an Illumina Infinium HD custom array designed for the fine mapping of 184 established autoimmune loci, and not a high-density array, we used a cutoff value of 7 cM to search the candidate RCHHs, in order to reduce the risk of false positives and to increase specificity, and we used CEUAnnotation1MDuo (http://www.hhanalysis.com/ (accessed on 15 February 2021)) as an annotation file. We selected regions with a significance level of -log10 (p-value) 1.2, corresponding to a p-value of 0.06, to establish difference between patient and control pools. The choice of this liberal level of significance was driven by our research strategy in which HH analysis represents a step towards prioritizing candidate regions, and thus, towards reducing false negative probability, in order to enable further investigation in the second stage (WES analysis).
HH was run to identify Regions with a Conserved Homozygosity Haplotype (RCHHs) that were differently shared among cases and controls (see Table 2). Then, we searched for genes in each significant RCHH, and Figure 1 reports the upset plot in which the number of genes shared between families is graphically represented. The first five bar charts on the top of the figure represent the number of genes in the significant RCHHs found only in each individual family (whose identified number is specified in the left of the figure), e.g., 305 genes that are only in family 2360 but not in the other families, etc. The last three bar charts on the top of the figure report the number of genes shared between families. In particular, families 4 and 44 share one gene, i.e., AL833583, and families 3 and 2360 share DQ596042, while families 9 and 44 share the USP16 gene. Interestingly, increased USP16 levels, a deubiquitinase required for chromosomal segregation in mitosis [17], were observed in peripheral CD4+ T cells from patients with distinct autoimmune diseases and T cell-specific USP16 knockout mice showed a reduced severity of experimental autoimmune encephalitis [18]. Families that were not reported in the upset plot are those who did not share any gene between them.
The Phenolyzer (http://phenolyzer.wglab.org (accessed on 30 March 2021)) and WEbGestalt (http://www.webgestalt.org (accessed on 30 March 2021)) tools were used to verify if there were genes involved in MS-related diseases due to their involvement in the CNS or immune system. We found that: (i) CD244 (highlighted in family 44), CIITA (highlighted in family 44), HLA-DRB1 (highlighted in family 2360) and NFKBIL1 (highlighted in family 2360) are related to rheumatoid arthritis; (ii) BANK1, FCGR2A, FCGR2B and FCGR2C (highlighted in family 44) are related to systemic lupus erythematosus; and iv) HLA-G (highlighted in family 2360), HNMT (highlighted in family 44), and TNF (highlighted in family 2360) are related to ASMA.

2.3. Identification of Pathogenic Variants

WES analysis of available subjects on the regions prioritized by HH analysis was performed to identify putative causative rare variants for MS. Figure 2 reports the pedigree structure of the analyzed families for which we also had WES data, and the identified variants.
From the FASTQ files, we obtained the VCF files, using bphg19 as the reference genome. The VCF files were annotated using wANNOVAR (http://wannovar.wglab.org (accessed on 3 May 2021)), and we searched for variants on the resulting csv file. Calls with synonymous variants, and benign variants were excluded, while calls with MAF reported in the 1000 genomes (https://www.internationalgenome.org/ (accessed on 10 May 2021)) dataset and gnomAD (https://gnomad.broadinstitute.org (accessed on 10 May 2021)), with values of less than 0.001 or unknown, were included. The aim was to search for rare genetic variants shared between the affected within the genes prioritized via HH analysis.
For each family and for each RCHH, the variant, its function and the number of cases and control sharing the variant are reported in Table 3. Interestingly, three variants, i.e., 43181034 T > G and 50245517 A > C, located in splicing region on exon 19 of the CUL9 gene and on exon 16 of the ATAP9A gene, respectively, and a deletion (43557751-43557832) in the splicing site of the UMODL1 gene, were classified as photogenic in VarSome (https://varsome.com/ (accessed on 12 October 2021)). The other variants were classified as of uncertain significance.
In Table S1, for each variant, the frequency in the different populations, such as those in 1000 Genome, gnomAD, and in the Sardinian population [19,20], are reported. ST1 also reports the results obtained using different prediction algorithms, with the grade of pathogenicity for each variant.

3. Discussion

MS is characterized by a complex multi-factorial nature that involves the interplay of a still non-identified environmental exposure and a genetic predisposition. In recent studies, it was observed that extensive grey matter lesions in the cerebral and cerebellar cortex and hippocampus [21,22,23,24] are involved in the pathology of the disease. As with other multifactorial diseases, MS has been predominantly studied assuming the “common disease-common variant” paradigm via GWASs. The importance and success of the GWASs approach to identifying the loci underlying common disease cannot be overlooked [25]. These breakthroughs, along with both statistical and technological advances, have led to the identification and confirmed association of numerous genetic loci for MS susceptibility [10,11,12,26]. In spite of these advances, only a relatively small proportion of the genetic influences in MS have been uncovered, and much is yet to be understood. It is important to point out that there are alternative hypotheses concerning the genetic architecture of common diseases, including the multiple rare variant hypothesis, which may help to elucidate the so-called missing heritability associated with common complex diseases such as MS [27].
Our study aims at contributing to the understanding of the genetic risk component of MS and the contribution of rare variants to disease risk through a combination of HH analysis—with the aim of prioritizing candidate regions—and WES to deeply explore this candidate region in search of putative causative rare variants in the founder population of Nuoro province (Sardinia). The Sardinian population has a high prevalence of MS and autoimmune disease, and its genetic background is proven to be homogeneous and an outlier in Europe, having been under selective pressure due to malaria, which has been endemic there for centuries.
Exons are regions in the DNA that are particularly vulnerable in the presence of mutations in their sequence: an unexpected mutation may alter the structure and the function of the protein, leading to deleterious biological consequences that may contribute to—or cause—a specific disease. Thus, identifying the presence of an exonic genetic variant in a candidate region, previously highlighted by HH, will help us to better understand the biological processes involved in the disease.
Firstly, HH analysis performed on 13 multiplex Sardinian families made it possible to identify significant RCHHs (-log10 (p-value) 1.2). Secondly, in these regions, WES data that were available for 25 subjects belonging to five families were analyzed.
Interesting results were found in:
(i)
The RCHH region on chr6:42767957-43333769, shared in 5 cases and 7 controls of family 61, where we identified the variant 43181034 T > G in the splicing region on exon 27 of the CUL9 gene. CUL9 is highly expressed in the brain, particularly in the cerebral cortex [28]. A study [29] using a human cell-derived model to characterize CUL9 in human neuronal development showed that the deletion or depletion of the protein causes the aberrant formation of neural rosettes that are related to the early stage of neurodevelopment. Furthermore, the neuronal transcription factors CUX1 and SOX3 were significantly upregulated in CUL9 knockout neuroepithelial progenitor cells. Fisher et al. [30] analyzed the potential molecular pathways of tissue injury in active cortical MS lesions, and by identifying prominent changes in gene expression, they found genes that are involved in different steps of apoptosis, DNA damage, p53 function, and DNA repair, including CUL9. In the same RCHH region, 43106964 A > C, a non-synonymous variant on exon 9, and rs780764712, in PTK7, a gene involved in the Wnt/planar cell polarity pathway, were also found. It is important to note that the PTK7 mutant with a truncated protein perinatally caused severe defects in neural tube closure [31]. In this stidy, 43223539 A > C, a non-synonymous variant on exon 9 in the TTBK1 gene, and 42976917 A > C, a non-synonymous variant on exon 9 in the PPP2R5D gene, were also found in all the available cases. TTBK1 is a brain-specific tau kinase expressed in the entorhinal cortex and hippocampal regions. TTBK1 transgenic mice showed severe axonal degeneration in the perforant path, which is essential for many forms of memory [32]. TTBK1 is highly expressed in the entorhinal cortex and the perforant path region, two specific brain regions involved in the early stage of Alzheimer’s disease pathology [33], and thus, has a critical role in axonal degeneration. Collapsin response mediator protein-2 (CRMP2) is a downstream target of TTBK1 [32], whose expression induces the accumulation of phosphorylated CRMP2, and it was shown to be involved in the axonal degeneration pathology in MS [34]. PPP2R5D is a regulatory B subunit of Protein Phosphatase 2A (PP2A) and plays a crucial role in normal neuronal development and functioning. Variants of this gene were found to be associated with intellectual disability, autism, and other neurodevelopmental disorders [35]. Mutations in this gene were found in juvenile-onset parkinsonism [36].
(ii)
The RCHH region on chr13:108090996-108968251, shared in 11 cases and 17 controls of family 6, where we identified the variant 109859349-109859354 TGTGTT> in 3′UTR of the MYO16 gene. This variant is also present in 1 case of family 4, in 2 cases of family 45, and in 1 case of family 5. MYO16 is mainly expressed in the central nervous system and seems to be involved in the development and functioning of the nervous system also in adulthood; therefore, alterations in this gene, e.g., SNPs, deletions, or epigenetic modifications, are associated with neurodegenerative and neuropsychiatric disorders [37,38,39]. MYO16 is thus considered as an important regulator of neural cells’ functioning even if its specific role and molecular mechanisms remain to be elucidated. Interestingly, not far from MYO16, in the chr13:108090996-108968251 region highlighted by HH analysis, is located the TNFSF13B gene, encoding the cytokine and drug target B-cell activation factor (BAFF) whose overexpression is related to autoimmunity [40]. In particular, in [41], a TNFSF13B variant was found to be associated with MS and systemic lupus erythematosus (SLE) through a mechanism that led to an overexpression of BAFF, which, in turn, upregulated the humoral immunity.
(iii)
The RCHH region on chr20:49044993-50323395, shared in 5 cases and 7 controls of family 61, where we identified the variant 50245517 A > C in the splicing region on exon 16 of the ATP9A gene. ATP9A is a regulator of endosomal recycling and plays an inhibitory role in the release of extracellular vesicles (EV) [42], and many biological processes, such as the immune response, are modulated by proteins, DNA, miRNA, and mRNAs that could be controlled via EV-instigated intercellular communication [43].
Taken together, these results provide newer insights into the genetics of MS and a more thorough understanding of the disease biology that, in future functional studies of these highlighted specific gene variants, may provide hints towards the creation of new and more effective treatments in MS.

4. Materials and Methods

4.1. Sample Collection and Genotyping

MS patients were ascertained through the case register established in 1995 in the province of Nuoro, Sardinia, Italy. Cases were diagnosed according to Poser’s criteria [44]. Thirteen pedigrees, containing from three to sixteen MS patients each, were selected for the analysis. Genotyping data were obtained using Immunochip results obtained from a previous study [13], where the quality control-filtered dataset included 131.497 SNPs.

4.2. HH Analysis

HH analysis, proposed by Miyazawa et al. [45], is an efficient non-parametric tool to detect regions harboring either novel or known mutations; it makes it possible to identify patient’s shared chromosomal segments derived from a common ancestor, which are characterized by the distinct identity of their haplotype.
The analysis is based on the concept of identity-by-descent (IBD), in which a DNA segment is defined as IBD in two or more individuals if it is a direct copy of the same ancestral allele; thus, affected subjects who inherit the mutation from a common ancestor share IBD—the genetic portion around the mutation in which there should be no discordant homozygous calls for both dominant and recessive genes. HH are a reduced haplotype obtained by removing all the heterozygous SNPs from the sample dataset and leaving only the homozygous ones. The method works by comparing homozygous segments; in this way, there is no need to reconstruct haplotypes. This results in a reduced computational timing and a simplification of the analysis process, even for large numbers of SNPs. Specifically, the method makes it possible to compare the number of subjects in the patient pool and in the control pool who share an RCHH region. RCHH is a region with a given genetic length, chosen on the basis of the study, containing comparable SNPs (compSNP) in homozygosity in two subjects. The algorithm performs a pairwise comparison of individuals based on the presence of compSNP in the pair. A mismatched comparable SNP, also indicated as a discordant homozygous SNP (dhSNP), has discordant homozygous SNP genotypes in two subjects (e.g., AA vs. BB). The borders of an RCHH are defined by either dhSNPs or by the ends of the chromosome. RCHH regions shared by multiple subjects are used to predict the presence of a Region from a Common Ancestor (RCA) or an IBD [46,47]. Given m and n, the number of generations of two affected subjects descended from a common ancestor RCA is calculated as
            R C A   ( m , n : m n ) = { 2 m + 1                 m 1 , n = 0 3 4                         m = 1 , n = 1 2 m n + 2                 o t h e r w i s e }
In consanguineous families or in populations that are geographically isolated, such as our Sardinia multiplex families, patients suffering from a disease (in our study, MS) share a common ancestor, and the RCA represent candidate regions in which disease genes can be looked for. RCA is identified through RCHHs via the comparison of RCHHs shared between the patient and the control pool [45]. As explained in [46], the algorithm works firstly by dividing a genetic autosomal region into smaller ones and selecting, for each of them, the representative RCHHs shared by the largest number of affected subjects within the patient pool. The numbers of individuals sharing each representative RCHH pool are then counted in both the patient and the control groups and compared to each other. The p-value of the null hypothesis of no difference between these two proportions is calculated according to the standard normal distribution.
In the HH approach, genotyping error may lead to be causative gene being excluded mistakenly. In fact, an RCHH will be truncated when a non-dhSNP changes into a dhSNP due to a genotyping error; in this way, the resulting genomic segment will be smaller than the chosen cut-off length, and thus, the RCHH will be not identified. The algorithm, by means of Monte Carlo Chain Simulation (MCMC), makes it possible to calculate the probability of obtaining dhSNPs due to genotyping errors. If the probability of creating dhSNPs in the region is very low (p-value < 0.001), the HH results are reliable.
A critical step in the analysis is represented by the choice of cut-off value since there is not an optimal cut-off value. They need to be chosen on the basis of the population under study, taking into account that the decreases in the average genetic length of the RCAs over generations; therefore, when we analysed distantly related subjects who shared smaller RCA, it was preferable to use a small cut-off (e.g., 3 cM). Another aspect that guides our choice of the cut-off is the array used for genotyping (e.g., low-density vs. high-density). As we only had Immunochip data [10], which were obtained with the Illumina Infinium HD custom array that was designed for the fine mapping of 184 established autoimmune loci, we chose a conservative cutoff value of 7 cm to search for candidate RCHHs that represent our prioritized regions. This step was taken to reduce the risk of false positives and increase the specificity of the results.
In summary, the advantages of HH approach numerous are: (a) there is no need to reconstruct haplotypes since the homozygote haplotype for each chromosome is uniquely determined; (b) the chromosomal segments in which all polymorphic markers are homozygous are considered to be autozygous segments [48]; (c) if the coefficient of consanguinity for a patient is large as a result of belonging to an inbred family, and the disease is rare, then the probability that the disease-causing gene is located in the shared segment is very high; (d) since HH analysis looks for ancestral segments, both dominant and recessive genes can be detected; (e) the HH approach is robust to genotyping errors.
In our analysis, the dataset was prepared using R software [49], and the HH analysis was run using the HH program (http://www.hhanalysis.com (accessed on 15 February 2021)).

4.3. Screening of Known Causative Genes

We then scrutinized the segments highlighted in the HH analysis with the help of publicly available databases such as Phenolyzer (http://phenolyzer.wglab.org (accessed on 30 March 2021)) and WEbGestalt (http://www.webgestalt.org (accessed on 30 March 2021)), in order to search for genes located, within the specified region, whose function could be plausibly related to MS.

4.4. Whole-Exome Sequencing Data Generation

All samples were sequenced at the Center for Genome Technology within the University of Miami John P. Hussman Institute for Human Genomics.
Library preparation was conducted using the SureSelectXT Human All Exon V4 + UTR (Agilent Technologies Inc., Santa Clara, CA, USA). This protocol targets 99% of coding regions in addition to 5′ and 3′-untranslated region sequences. Pre-enrichment libraries were constructed using the SureSelect Low Input reagent kit, and exome enrichment of the DNA library was performed via a hybridization reaction with biotinylated baits from the SureSelect Human All Exon V4 + UTR Enrichment Kit. Sequencing of the prepared DNA libraries was undertaken using the Illumina HiSeq2000 instrument (Illumina Inc., San Diego, CA, USA) with an average coverage of 80× with 2 × 100 bps paired-end reads. Quality controls were applied at the lane and fastq levels. Specifically, the cutoff used for a successful lane was Pass Filter > 90%, with over 250 M reads for the high-output mode. The fraction of reads in each lane assigned to each sample (no set value) and the fraction of bases with a quality score > Q30 for read 1 and read 2 (above 80% expected for each) were also checked. Raw sequencing reads were demultiplexed using Illumina bcl2fastq. In addition, the FASTQC tool kit (www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 8 November 2020)) was used to review the base quality distribution, which provided representations of the four nucleotides of particular k-mer sequences (adaptor contamination). We used the Genome Analysis Software Kit (GATK) (version 4.1) best-practice pipeline to analyze our WES data. Reads were aligned with the human reference genome (hg19), using the Maximal Exact Matches algorithm in the Burrows–Wheeler Aligner (BWA) [50]. PCR duplicates were removed using the Picard tool (picard.sourceforge.net/). The GATK base quality score recalibrator was applied to correct the sequencing artifacts. Variants were called using the GATK haplotypeCaller algorithm, visually inspected using the Integrative Genomics Viewer (IGV, Broad Institute), and further annotated with ANNOVAR. Variants were categorized as follows: (1) non-synonymous; (2) synonymous; (3) frameshift deletion or insertion; (4) splicing; (5) stop gain or loss; or (6) functional intronic or promoter variants.

5. Conclusions

Our study, which was performed on multiplex Sardinian families and combined the HH approach with WES data analysis, first enabled the identification of disease-linked regions, and then the identification of specific rare variants located in these regions. Although our study was limited to the use of Immunochip data that, despite making it possible to scrutinize the entire genome, only allowed this within specific autoimmune candidate regions, the obtained results represent an important step in the comprehension of the genetics of MS.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/cimb43030125/s1.

Author Contributions

Conceptualization, L.B. and D.G.; methodology, D.G, L.B. and T.F.; formal analysis, D.M., T.F., D.G. and A.L.C.; investigation, V.S. and M.P.; data curation, A.B., A.H., J.L.M. and T.F.; writing—original draft preparation, T.F. and L.B.; writing—review and editing, all authors; supervision, D.G. and L.B.; project administration, L.B.; funding acquisition, L.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the MIMOmics European Collaborative Project (FP7-305280), as part of the HEALTH-2012-INNOVATION scheme; by FISM—Fondazione Italiana Sclerosi Multipla (Cod. 2009//R//2); and by Fondazione Cariplo (grant no. 2009-2528).

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the ethics committee of the Azienda Sanitaria of Nuoro.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data can be found here https://doi.org/10.5281/zenodo.5493638 (accessed on 8 September 2021).

Acknowledgments

We wish to thank the patients, their relatives, and all the volunteers who donated their samples.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kahana, E. Epidemiologic studies of multiple sclerosis: A review. Biomed. Pharmacother. 2000, 54, 100–102. [Google Scholar] [CrossRef]
  2. Compston, A.; Coles, A. Multiple sclerosis. Lancet 2008, 372, 1502–1517. [Google Scholar] [CrossRef]
  3. Pugliatti, M.; Sotgiu, S.; Rosati, G. The worldwide prevalence of multiple sclerosis. Clin. Neurol. Neurosurg 2002, 104, 182–191. [Google Scholar] [CrossRef]
  4. Bargagli, A.M.; Colais, P.; Agabiti, N.; Mayer, F.; Buttari, F.; Centonze, D.; Di Folco, M.; Filippini, G.; Francia, A.; Galgani, S.; et al. Prevalence of multiple sclerosis in the Lazio region, Italy: Use of an algorithm based on health information systems. J. Neurol. 2016, 263, 751–759. [Google Scholar] [CrossRef] [PubMed]
  5. Urru, S.A.M.; Antonelli, A.; Sechi, G.M. Prevalence of multiple sclerosis in Sardinia: A systematic cross-sectional multi-source survey. Mult. Scler. J. 2020, 26, 372–380. [Google Scholar] [CrossRef]
  6. Hemminki, K.; Li, X.; Sundquist, J.; Hillert, J.; Sundquist, K. Risk for multiple sclerosis in relatives and spouses of patients diagnosed with autoimmune and related conditions. Neurogenetics 2009, 10, 5–11. [Google Scholar] [CrossRef]
  7. Montomoli, C.; Prokopenko, I.; Caria, A.; Ferrai, R.; Mander, A.; Seaman, S.; Musu, L.; Piras, M.L.; Ticca, A.F.; Murgia, S.B.; et al. Multiple sclerosis recurrence risk for siblings in an isolated population of Central Sardinia, Italy. Genet. Epidemiol. 2002, 22, 265–271. [Google Scholar] [CrossRef] [PubMed]
  8. Bahlo, M.; Booth, D.R.; Simon, A.B.; Brown, M.A.; Foote, S.J.; Griffiths, L.R.; Kilpatrick, T.J.; Lechner-Scott, J.; Moscato, P.; Perreau, V.M.; et al. Genome-wide association study identifies new multiple sclerosis susceptibility loci on chromosomes 12 and 20. Nat. Genet. 2009, 41, 824–828. [Google Scholar] [CrossRef]
  9. Baranzini, S.E.; Galwey, N.W.; Wang, J.; Khankhanian, P.; Lindberg, R.; Pelletier, D.; Wu, W.; Uitdehaag, B.M.J.; Kappos, L.; Polman, C.H.; et al. Pathway and network-based analysis of genome-wide association studies in multiple sclerosis. Hum. Mol. Genet. 2009, 18, 2078–2090. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  10. Consortium, I.M.S.G.; Beecham, A.H.; Patsopoulos, N.A.; Xifara, D.K.; Davis, M.F.; Kemppinen, A.; Cotsapas, C.; Shah, T.S.; Spencer, C.; Booth, D.; et al. Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis. Nat. Genet 2013, 45, 1353–1360. [Google Scholar]
  11. Consortium, I.M.S.G.; Consortium, W.T.C.C.; Sawce, S.; Hellenthal, G.; Pirinen, M.; Spencer, C.C.; Patsopoulos, N.A.; Moutsianas, L.; Dilthey, A.; Su, Z.; et al. Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature 2011, 476, 214–219. [Google Scholar]
  12. Consortium, I.M.S.G. Multiple sclerosis genomic map implicates peripheral immune cells and microglia in susceptibility. Science 2019, 365, eaav7188. [Google Scholar] [CrossRef] [Green Version]
  13. Fazia, T.; Pastorino, R.; Foco, L.; Han, L.; Abney, M.; Beecham, A.; Hadjixenofontos, A.; Guo, H.; Gentilini, D.; Papachristou, C.; et al. Investigating multiple sclerosis genetic susceptibility on the founder population of east-central Sardinia via association and linkage analysis of immune-related loci. Mult. Scler. J. 2017. [Google Scholar] [CrossRef]
  14. Sawcer, S.; Franklin, R.J.M.; Ban, M. Multiple sclerosis genetics. Lancet Neurol. 2014, 13, 700–709. [Google Scholar] [CrossRef]
  15. Mitrovič, M.; Patsopoulos, N.A.; Beecham, A.H.; Dankowski, T.; Goris, A.; Dubois, B.; D’hooghe, M.B.; Lemmens, R.; Van Damme, P.; Søndergaard, H.B.; et al. Low-Frequency and Rare-Coding Variation Contributes to Multiple Sclerosis Risk. Cell 2018, 180, 403. [Google Scholar] [CrossRef] [Green Version]
  16. Manolio, T.A.; Collins, F.S.; Cox, N.J.; Goldstein, D.B.; Hindorff, L.A.; Hunter, D.J.; McCarthy, M.I.; Ramos, E.M.; Cardon, L.R.; Chakravarti, A.; et al. Finding the missing heritability of complex diseases. Nature 2009, 461, 747–753. [Google Scholar] [CrossRef] [Green Version]
  17. Zhuo, X.; Guo, X.; Zhang, X.; Jing, G.; Wang, Y.; Chen, Q.; Jiang, Q.; Liu, J.; Zhang, C. Usp16 regulates kinetochore localization of Plk1 to promote proper chromosome alignment in mitosis. J. Cell Biol. 2015, 210, 727–735. [Google Scholar] [CrossRef]
  18. Zhang, Y.; Liu, R.B.; Cao, Q.; Fan, K.Q.; Huang, L.J.; Yu, J.S.; Gao, Z.J.; Huang, T.; Zhong, J.Y.; Mao, X.T.; et al. USP16-mediated deubiquitination of calcineurin A controls peripheral T cell maintenance. J. Clin. Investig. 2019, 129, 2856–2871. [Google Scholar] [CrossRef] [Green Version]
  19. Sidore, C.; Busonero, F.; Maschio, A.; Porcu, E.; Naitza, S.; Zoledziewska, M.; Mulas, A.; Pistis, G.; Steri, M.; Danjou, F.; et al. Genome sequencing elucidates Sardinian genetic architecture and augments association analyses for lipid and blood inflammatory markers. Nat. Genet. 2015, 47, 1272–1281. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Chiang, C.W.K.; Marcus, J.H.; Sidore, C.; Biddanda, A.; Al-Asadi, H.; Zoledziewska, M.; Pitzalis, M.; Busonero, F.; Maschio, A.; Pistis, G.; et al. Genomic history of the Sardinian population. Nat. Genet. 2018, 50, 1426–1434. [Google Scholar] [CrossRef]
  21. Peterson, J.W.; Bö, L.; Mörk, S.; Chang, A.; Trapp, B.D. Transected neurites, apoptotic neurons, and reduced inflammation in cortical multiple sclerosis lesions. Ann. Neurol. 2001, 50, 389–400. [Google Scholar] [CrossRef]
  22. Kutzelnigg, A.; Lucchinetti, C.F.; Stadelmann, C.; Brück, W.; Rauschka, H.; Bergmann, M.; Schmidbauer, M.; Parisi, J.E.; Lassmann, H. Cortical demyelination and diffuse white matter injury in multiple sclerosis. Brain 2005, 128, 2705–2712. [Google Scholar] [CrossRef]
  23. Kutzelnigg, A.; Faber-Rod, J.C.; Bauer, J.; Lucchinetti, C.F.; Sorensen, P.S.; Laursen, H.; Stadelmann, C.; Brück, W.; Rauschka, H.; Schmidbauer, M.; et al. Widespread demyelination in the cerebellar cortex in multiple sclerosis. Brain Pathol. 2007, 17, 38–44. [Google Scholar] [CrossRef]
  24. Geurts, J.J.G.; Bö, L.; Roosendaal, S.D.; Hazes, T.; Daniëls, R.; Barkhof, F.; Witter, M.P.; Huitinga, I.; Van Der Valk, P. Extensive hippocampal demyelination in multiple sclerosis. J. Neuropathol. Exp. Neurol. 2007, 66, 819–827. [Google Scholar] [CrossRef] [Green Version]
  25. Tam, V.; Patel, N.; Turcotte, M.; Bossé, Y.; Paré, G.; Meyre, D. Benefits and limitations of genome-wide association studies. Nat. Rev. Genet. 2019, 20, 467–484. [Google Scholar] [CrossRef]
  26. Cotsapas, C.; Mitrovic, M. Genome-wide association studies of multiple sclerosis. Clin. Transl. Immunol. 2018, 7, e1018. [Google Scholar] [CrossRef] [PubMed]
  27. Eichler, E.E.; Flint, J.; Gibson, G.; Kong, A.; Leal, S.M.; Moore, J.H.; Nadeau, J.H. Missing heritability and strategies for finding the underlying causes of complex disease How should we solve the problem of “missing heritability” in complex diseases? Nat. Rev. Genet. 2010, 11, 446–450. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. van de Leemput, J.; Boles, N.C.; Kiehl, T.R.; Corneo, B.; Lederman, P.; Menon, V.; Lee, C.; Martinez, R.A.; Levi, B.P.; Thompson, C.L.; et al. CORTECON: A temporal transcriptome analysis of in vitro human cerebral cortex development from human embryonic stem cells. Neuron 2014, 83, 51–68. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Ortolano, N.A.; Romero-Morales, A.I.; Rasmussen, M.L.; Bodnya, C.; Kline, L.A.; Joshi, P.; Connelly, J.P.; Rose, K.L.; Pruett-Miller, S.M.; Gama, V. A proteomics approach for the identification of cullin-9 (CUL9) related signaling pathways in induced pluripotent stem cell models. PLoS ONE 2021, 16, e0248000. [Google Scholar] [CrossRef] [PubMed]
  30. Fischer, M.T.; Wimmer, I.; Höftberger, R.; Gerlach, S.; Haider, L.; Zrzavy, T.; Hametner, S.; Mahad, D.; Binder, C.J.; Krumbholz, M.; et al. Disease-specific molecular events in cortical multiple sclerosis lesions. Brain 2013, 136, 1799–1815. [Google Scholar] [CrossRef]
  31. Lu, X.; Borchers, A.G.M.; Jolicoeur, C.; Rayburn, H.; Baker, J.C.; Tessier-Lavigne, M. PTK7/CCK-4 is a novel regulator of planar cell polarity in vertebrates. Nature 2004, 430, 93–98. [Google Scholar] [CrossRef]
  32. Ikezu, S.; Ingraham Dixie, K.L.; Koro, L.; Watanabe, T.; Kaibuchi, K.; Ikezu, T. Tau-tubulin kinase 1 and amyloid-β peptide induce phosphorylation of collapsin response mediator protein-2 and enhance neurite degeneration in Alzheimer disease mouse models. Acta Neuropathol. Commun. 2020, 8, 12. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Lund, H.; Cowburn, R.F.; Gustafsson, E.; Strömberg, K.; Svensson, A.; Dahllund, L.; Malinowsky, D.; Sunnemark, D. Tau-tubulin kinase 1 expression, phosphorylation and co-localization with phospho-ser422 tau in the Alzheimer’s disease brain. Brain Pathol. 2013, 23, 378–389. [Google Scholar] [CrossRef] [PubMed]
  34. Petratos, S.; Ozturk, E.; Azari, M.F.; Kenny, R.; Young Lee, J.; Magee, K.A.; Harvey, A.R.; McDonald, C.; Taghian, K.; Moussa, L.; et al. Limiting multiple sclerosis related axonopathy by blocking Nogo receptor and CRMP-2 phosphorylation. Brain 2012, 135, 1794–1818. [Google Scholar] [CrossRef] [PubMed]
  35. Shang, L.; Henderson, L.B.; Cho, M.T.; Petrey, D.S.; Fong, C.T.; Haude, K.M.; Shur, N.; Lundberg, J.; Hauser, N.; Carmichael, J.; et al. De novo missense variants in PPP2R5D are associated with intellectual disability, macrocephaly, hypotonia, and autism. Neurogenetics 2016, 17, 43–49. [Google Scholar] [CrossRef] [PubMed]
  36. Walker, I.M.; Riboldi, G.M.; Drummond, P.; Saade-Lemus, S.; Martin-Saavedra, J.S.; Frucht, S.; Bardakjian, T.M.; Gonzalez-Alegre, P.; Deik, A. PPP2R5D Genetic Mutations and Early-Onset Parkinsonism. Ann. Neurol. 2021, 89, 194–195. [Google Scholar] [CrossRef]
  37. Telek, E.; Kengyel, A.; Bugyi, B. Myosin XVI in the Nervous System. Cells 2020, 9, 1903. [Google Scholar] [CrossRef]
  38. Brown, M.E.; Bridgman, P.C. Myosin Function in Nervous and Sensory Systems. J. Neurobiol. 2004, 58, 118–130. [Google Scholar] [CrossRef] [PubMed]
  39. Gallo, G. The Neuronal Actin Cytoskeleton and the Protrusion of Lamellipodia and Filopodia. In Functions of Myosin Motor Proteins in the Nervous System; Springer: New York, NY, USA, 2011; pp. 45–72. [Google Scholar]
  40. Mackay, F.; Schneider, P.; Rennert, P.; Browning, J. BAFF and APRIL: A tutorial on B cell survival. Annu. Rev. Immunol. 2003, 21, 231–264. [Google Scholar] [CrossRef]
  41. Steri, M.; Orrù, V.; Idda, M.L.; Pitzalis, M.; Pala, M.; Zara, I.; Sidore, C.; Faà, V.; Floris, M.; Deiana, M.; et al. Overexpression of the Cytokine BAFF and Autoimmunity Risk. N. Engl. J. Med. 2017, 376, 1615–1626. [Google Scholar] [CrossRef] [PubMed]
  42. Naik, J.; Hau, C.M.; ten Bloemendaal, L.; Mok, K.S.; Hajji, N.; Wehman, A.M.; Meisner, S.; Muncan, V.; Paauw, N.J.; de Vries, H.E.; et al. The P4-ATPase ATP9A is a novel determinant of exosome release. PLoS ONE 2019, 14, e0213069. [Google Scholar] [CrossRef] [PubMed]
  43. Xu, X.; Xu, L.; Zhang, P.; Ouyang, K.; Xiao, Y.; Xiong, J.; Wang, D.; Liang, Y.; Duan, L. Effects of ATP9A on extracellular vesicle release and exosomal lipid composition. Oxid. Med. Cell. Longev. 2020. [Google Scholar] [CrossRef]
  44. Poser, C.M.; Paty, D.W.; Scheinberg, L.; McDonald, W.I.; Davis, F.A.; Ebers, G.C.; Johnson, K.P.; Sibley, W.A.; Silberberg, D.H.; Tourtellotte, W.W. New diagnostic criteria for multiple sclerosis: Guidelines for research protocols. Ann. Neurol 1983, 13, 227–231. [Google Scholar] [CrossRef]
  45. Miyazawa, H.; Kato, M.; Awata, T.; Kohda, M.; Iwasa, H.; Koyama, N.; Tanaka, T.; Huqun; Kyo, S.; Okazaki, Y.; et al. Homozygosity haplotype allows a genomewide search for the autosomal segments shared among patients. Am. J. Hum. Genet. 2007, 80, 1090–1102. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Jiang, H.; Orr, A.; Guernsey, D.L.; Robitaille, J.; Asselin, G.; Samuels, M.E.; Dubé, M.P. Application of homozygosity haplotype analysis to genetic mapping with high-density SNP genotype data. PLoS ONE 2009, 4, e5280. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Hagiwara, K.; Morino, H.; Shiihara, J.; Tanaka, T.; Miyazawa, H.; Suzuki, T.; Kohda, M.; Okazaki, Y.; Seyama, K.; Kawakami, H. Homozygosity mapping on homozygosity haplotype analysis to detect recessive disease-causing genes from a small number of unrelated, outbred patients. PLoS ONE 2011, 6, e25059. [Google Scholar] [CrossRef]
  48. Lander, E.S.; Botstein, D. Homozygosity mapping: A way to map human recessive traits with the DNA of inbred children. Science 1987, 236, 1567–1570. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. R Development Core Team R: A Language and Environment for Statistical Computing; R Development Core Team R: Vienna, Austria, 2017.
  50. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Upset plot. In the plot, the intersections of genes between families are visualized. The first five bar charts on the top of the figure represent the number of genes in the significant RCHHs found only in each individual family. The last three bar charts on the top of the figure report the number of genes shared between families.
Figure 1. Upset plot. In the plot, the intersections of genes between families are visualized. The first five bar charts on the top of the figure represent the number of genes in the significant RCHHs found only in each individual family. The last three bar charts on the top of the figure report the number of genes shared between families.
Cimb 43 00125 g001
Figure 2. Pedigree of families in the study. Squares and circles indicate men and women, respectively. The symbols in black represent the affected members. The squares or circles with a line indicate a deceased individual. The asterisk represents subjects in the pedigree for which WES data were available, while the colored triangles indicate the subjects carrying the variant.
Figure 2. Pedigree of families in the study. Squares and circles indicate men and women, respectively. The symbols in black represent the affected members. The squares or circles with a line indicate a deceased individual. The asterisk represents subjects in the pedigree for which WES data were available, while the colored triangles indicate the subjects carrying the variant.
Cimb 43 00125 g002
Table 1. Description of the family data available for the HH analysis. For each family, the total numbers of affected and unaffected subjects are reported together with the availability of Immunochip genetic data.
Table 1. Description of the family data available for the HH analysis. For each family, the total numbers of affected and unaffected subjects are reported together with the availability of Immunochip genetic data.
FamilyTotal N. of
Affected
N. of Affected
with Genotyping Data
Total N. of
Unaffected
N. of Unaffected
with Genotyping Data
3658429
4553723
5857922
9986422
1233193
21555319
26634313
4433149
45643615
58322714
61764017
8133238
2360161113626
Table 2. List of RCHHs grouped by family. The table reports the number of subjects sharing an RCHH in both pools, its coordinate and the -log10 (p-value) for the difference between the pools. In our study, we set a significance level of -log10 (p-value) 1.2, corresponding to a p-value of 0.06.
Table 2. List of RCHHs grouped by family. The table reports the number of subjects sharing an RCHH in both pools, its coordinate and the -log10 (p-value) for the difference between the pools. In our study, we set a significance level of -log10 (p-value) 1.2, corresponding to a p-value of 0.06.
FamilyNo. Subjects Sharing
RCHH in Patient Pool
No. Subjects Sharing RCHH in Control PoolChrStart–End (bp *)Start–End (SNP)-log10(p-Value)
23606 out of 117 out of 26120024018-20110347rs12405947-rs104929991.20
23608 out of 1111 out of 263127665610-128219449rs1687462-rs98265261.23
23605 out of 115 out of 26626287459-32628250rs4458680-rs117571591.25
23607 out of 119 out of 261116548170-17019030rs4593976-rs79420851.20
236011 out of 1119 out of 2613106416156-106713791rs9555302-rs18192431.30
236011 out of 1118 out of 2613106716741-106978764rs3949948-rs169706231.47
236011 out of 1119 out of 2613106979630-106979630rs1830754-rs18307541.30
236011 out of 1118 out of 2613106979661-108046934rs2076766-rs79921491.47
236011 out of 1119 out of 2613108048315-108048315rs9583266-rs95832661.30
236011 out of 1118 out of 2613108050353-108090301rs12586075-rs169728491.47
236011 out of 1117 out of 2613108090996-108968251rs16972855-rs95214151.63
23607 out of 119 out of 262243217334-43366873rs133807-rs1386281.20
615 out of 67 out of 172221458760-223560597rs634813-rs14400631.22
616 out of 610 out of 174181036139-181598064rs7655585-rs27274261.23
616 out of 610 out of 17531060610-31060610rs1392428-rs13924281.23
615 out of 67 out of 17642767957-43333769rs394754-rs77521201.22
615 out of 67 out of 17645282619-46437363rs7762957-rs13725671.22
614 out of 64 out of 17662416537-67861137rs213824-rs92947361.38
614 out of 64 out of 176100591693-100883678rs9399393-rs26581321.38
615 out of 67 out of 1778552281-8768564rs1859275-rs102685801.22
616 out of 69 out of 1798991207-8991239rs10511519-rs108160281.41
616 out of 68 out of 1798991538-8994896rs10511520-rs108160291.61
616 out of 69 out of 1799000336-9395003rs7025315-rs107590641.41
616 out of 610 out of 1799395316-11328859rs1475680-rs177883701.23
616 out of 610 out of 17912662320-13015284rs1408801-rs105148221.23
615 out of 67 out of 179115676064-115712622rs6478042-rs70349291.22
616 out of 610 out of 179134997809-134997809rs626713-rs6267131.23
616 out of 610 out of 1712122479650-122743242rs1706477-rs71379461.23
616 out of 610 out of 1712127998742-128288581rs11060036-rs108478071.23
615 out of 67 out of 171684533519-84542101rs305059-rs9089881.22
615 out of 66 out of 172048202462-49044808rs6020298-rs61229911.43
615 out of 67 out of 172049044993-50323395rs11904901-rs60681171.22
615 out of 66 out of 172050326580-50563902rs6021835-rs20246501.43
454 out of 47 out of 15122014570-23032859rs2010397-rs46548211.22
453 out of 44 out of 15754280160-56646646rs13438238-rs26340811.21
454 out of 47 out of 151372551882-73411123rs4883922-rs7289261.22
454 out of 46 out of 151373411181-75116077rs17288193-rs10064121.40
443 out of 33 out of 912289487-2455004rs7545940-rs98037641.24
443 out of 33 out of 91156653770-160351933rs2188102-rs14152591.24
443 out of 33 out of 92138041133-138458405rs7563139-rs101995421.24
443 out of 33 out of 9496765575-103179634rs11941922-rs21292941.24
443 out of 33 out of 9573747-556484rs7709758-rs64200451.24
443 out of 33 out of 95150483977-150816773rs2303027-rs178028281.24
443 out of 33 out of 95154874578-156864954rs1295243-rs22770271.24
443 out of 33 out of 97142903019-142909027rs6963381-rs18805601.24
443 out of 33 out of 98602758-3038563rs9314595-rs132615501.24
443 out of 33 out of 91610909415-11580966rs2229321-rs80504611.24
263 out of 33 out of 131171619320-173691804rs6701066-rs8609051.65
263 out of 34 out of 131173695580-174106457rs1016815-rs107984181.40
263 out of 34 out of 131480069595-82872217rs1543918-rs176259291.40
215 out of 510 out of 19114226898-14226898rs4579751-rs45797511.27
215 out of 510 out of 19573478194-73598034rs2120729-rs14608121.27
94 out of 84 out of 221145356859-51450167rs717653-rs122915811.31
94 out of 84 out of 221244020977-44336606rs10785572-rs8781111.31
97 out of 810 out of 222127810551-29354016rs2830992-rs10640191.48
53 out of 54 out of 22558726008-59841224rs525099-rs405121.41
54 out of 58 out of 22985066453-85227053rs10867967-rs8717901.21
54 out of 57 out of 22985228118-85448505rs3860918-rs10526901.39
55 out of 512 out of 2213109106993-109251608rs9515092-rs79863461.22
55 out of 511 out of 2213109253434-109607383rs11069806-rs95216231.36
54 out of 58 out of 2213109945232-110218960rs9555712-rs128654651.21
45 out of 512 out of 231237070145-237394008rs869035-rs19800041.30
45 out of 511 out of 231237395275-238575190rs11808376-rs104954661.43
45 out of 512 out of 231238576784-238576784rs7552602-rs75526021.30
45 out of 511 out of 231238579605-238713399rs12137050-rs96621361.43
45 out of 511 out of 237142336895-143056687rs4236481-rs125401881.43
45 out of 512 out of 237143059971-143855577rs4640977-rs20578681.30
45 out of 511 out of 237143858588-144083589rs17169930-rs77932271.43
45 out of 512 out of 237144088382-145211344rs6954142-rs46012311.30
45 out of 512 out of 238126600646-126663920rs4006563-rs70168671.30
45 out of 512 out of 238126672376-128144872rs4870946-rs14563141.30
45 out of 5 12 out of 2313109370211-109416832rs7323507-rs79846461.30
45 out of 511 out of 2313109419064-109607383rs9583447-rs95216231.43
45 out of 511 out of 231492683815-94120061rs12589195-rs128808621.43
45 out of 512 out of 231494122073-94216494rs2069956-rs71482041.30
45 out of 512 out of 231767527147-67755206rs9914764-rs129392711.30
45 out of 512 out of 232142351204-42525479rs17114247-rs8813951.30
45 out of 511 out of 232142525851-42616108rs915846-rs6915671.43
45 out of 512 out of 232224549605-24559502rs4820658-rs48226611.30
45 out of 512 out of 232224561312-24579927rs17704912-rs27482341.30
33 out of 56 out of 291171432870-171774723rs1234313-rs14610191.30
31 out of 51 out of 29290959860-91680834rs10201040-rs43738031.23
35 out of 514 out of 29371749225-72031216rs864380-rs98615831.38
35 out of 515 out of 29372031489-72458673rs1995453-rs43038231.27
35 out of 514 out of 29372459498-74148704rs6790069-rs14053961.38
35 out of 514 out of 29374710393-74710393rs13073838-rs130738381.38
34 out of 510 out of 29375820822-76074141rs536575-rs40955461.27
31 out of 51 out of 29569782071-69967168rs169717-rs38714601.23
34 out of 510 out of 29652061483-52205031rs6906409-rs69134721.27
34 out of 59 out of 29652205660-52213626rs9395771-rs93820841.42
35 out of 510 out of 2974275511-4316475rs10272180-rs21078341.96
35 out of 511 out of 2974318943-4930528rs2097884-rs20899671.79
35 out of 512 out of 2974930906-4930906rs6947947-rs69479471.64
35 out of 511 out of 2974934189-5342068rs13224720-rs102347091.79
35 out of 512 out of 2975344404-5549582rs13238999-rs20982251.64
35 out of 513 out of 2975551125-6459404rs1725213-rs78105531.50
35 out of 514 out of 2976461542-7642706rs7792987-rs102801851.38
35 out of 515 out of 2977767212-7831276rs17137412-rs127026611.27
35 out of 515 out of 297150524562-150524562rs310586-rs3105861.27
35 out of 514 out of 297150526978-150611097rs7458773-rs69535521.38
35 out of 515 out of 297150613053-150619105rs7797007-rs2192451.27
31 out of 51 out of 29940059290-44362584rs375972-rs49290231.23
31 out of 51 out of 29944670536-46992793rs12006135-rs70490151.23
31 out of 51 out of 29965231255-65635106rs28533023-rs14803681.23
31 out of 51 out of 2913111553045-114123122rs12017986-rs128742901.23
31 out of 51 out of 292028039018-28259678rs7267880-rs65674651.23
* SNP Position hg18.
Table 3. Results of WES analysis. For each family and for each RCHH region, identified by HH analysis, the variant, its function, and the number of cases and controls sharing the variant are reported.
Table 3. Results of WES analysis. For each family and for each RCHH region, identified by HH analysis, the variant, its function, and the number of cases and controls sharing the variant are reported.
FamilyRCHH Region
(bph18)
ChrStartEndRefAltFunctionGeneNo. of
Affected
No. of
Unaffected
61chr6: 42767957-4333376964318103443181034TGsplicing region on exon 27CUL93 out of 31 out of 1
64310696443106964ACnon-synonymous variant on exon 9PTK73 out of 30 out of 1
64322353943223539ACnon-synonymous variant on exon 9TTBK13 out of 30 out of 1
64297691742976917ACnon-synonymous variant on exon 9PPP2R5D3 out of 30 out of 1
chr20:48202462-49044808204936693349366933GCnon-synonymous variant on exon 3PARD6B3 out of 30 out of 1
chr20:49044993-50323395205024551750245517ACsplicing region on exon 16ATP9A3 out of 31 out of 1
chr12:122479650-12274324212123943942123943942ACintronic variantSNRNP353 out of 30 out of 1
chr2:221458760-2235605972223554057223554057TGnon-synonymous variant on exon 3MOGAT13 out of 31 out of 1
2360chr6:26287459-3262825062985625729856257GCnon-coding RNAHLA-H6 out of 10NA
62985626129856261GAnon-coding RNAHLA-H6 out of 10NA
62985626329856263GAnon-coding RNAHLA-H6 out of 10NA
chr13:108090996-10896825113108519067108519070CTCT-5′UTRFAM155A (NLF-1)9 out of 10NA
13109859349109859354TGTGTT-3′UTRMYO167 out of 10NA
4chr21:42351204-42525479214355775143557832 deletion on slicing siteUMODL13 out of 4NA
5chr13:109945232-110218960131136487711364877-GGGinsertion in the upstream siteING13 out of 5NA
NA = No WES data were available for unaffected subjects in the family.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Fazia, T.; Marzanati, D.; Carotenuto, A.L.; Beecham, A.; Hadjixenofontos, A.; McCauley, J.L.; Saddi, V.; Piras, M.; Bernardinelli, L.; Gentilini, D. Homozygosity Haplotype and Whole-Exome Sequencing Analysis to Identify Potentially Functional Rare Variants Involved in Multiple Sclerosis among Sardinian Families. Curr. Issues Mol. Biol. 2021, 43, 1778-1793. https://doi.org/10.3390/cimb43030125

AMA Style

Fazia T, Marzanati D, Carotenuto AL, Beecham A, Hadjixenofontos A, McCauley JL, Saddi V, Piras M, Bernardinelli L, Gentilini D. Homozygosity Haplotype and Whole-Exome Sequencing Analysis to Identify Potentially Functional Rare Variants Involved in Multiple Sclerosis among Sardinian Families. Current Issues in Molecular Biology. 2021; 43(3):1778-1793. https://doi.org/10.3390/cimb43030125

Chicago/Turabian Style

Fazia, Teresa, Daria Marzanati, Anna Laura Carotenuto, Ashley Beecham, Athena Hadjixenofontos, Jacob L. McCauley, Valeria Saddi, Marialuisa Piras, Luisa Bernardinelli, and Davide Gentilini. 2021. "Homozygosity Haplotype and Whole-Exome Sequencing Analysis to Identify Potentially Functional Rare Variants Involved in Multiple Sclerosis among Sardinian Families" Current Issues in Molecular Biology 43, no. 3: 1778-1793. https://doi.org/10.3390/cimb43030125

Article Metrics

Back to TopTop