Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome
Abstract
1. Introduction
2. Results
3. Discussion
4. Materials and Methods
4.1. DNA Sample Source
4.2. Genomic DNA Extraction
4.3. Library Preparation
4.3.1. Nanopore Sequencing
4.3.2. SRS Using Illumina Sequencing
4.4. Whole-Genome Sequencing
4.4.1. Nanopore Analytic Pipeline
Basecalling and Mapping
SV Calling
4.4.2. SV Calls Using SRS
4.5. Comparing Variants Called by Both Platforms
4.6. Overlapping Set and Validation
4.7. Annotation
4.8. Plots and Graphs
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
WGS | Whole-Genome Sequencing |
ONT | Oxford Nanopore Technology |
NcRNA | Non-coding RNA |
MZ | Monozygotic Twins |
RO | Reciprocal Overlap |
References
- Uddin, M.; Msc, B.T.; Walker, S.; Wang, Z.; Hu, P.; Lamoureux, S.; Wei, J.; Macdonald, J.R.; Pellecchia, G.; Lu, C.; et al. A high-resolution copy-number variation resource for clinical and population genetics. Genet. Med. 2015, 17, 747–752. [Google Scholar] [CrossRef] [PubMed]
- Takumi, T.; Tamada, K. CNV biology in neurodevelopmental disorders. Curr. Opin. Neurobiol. 2018, 48, 183–192. [Google Scholar] [CrossRef] [PubMed]
- Uddin, M.; Tammimies, K.; Pellecchia, G.; Alipanahi, B.; Hu, P.; Wang, Z.; Pinto, D.; Lau, L.; Nalpathamkalam, T.; Marshall, C.R.; et al. Brain-expressed exons under purifying selection are enriched for de novo mutations in autism spectrum disorder. Nat. Genet. 2014, 46, 742–747. [Google Scholar] [CrossRef] [PubMed]
- Halvorsen, M.; Huh, R.; Oskolkov, N.; Wen, J.; Netotea, S.; Giusti-Rodriguez, P.; Karlsson, R.; Bryois, J.; Nystedt, B.; Ameur, A.; et al. Increased burden of ultra-rare structural variants localizing to boundaries of topologically associated domains in schizophrenia. Nat. Commun. 2020, 11, 1842. [Google Scholar] [CrossRef] [PubMed]
- Tsuchida, N.; Nakashima, M.; Kato, M.; Heyman, E.; Inui, T.; Haginoya, K.; Watanabe, S.; Chiyonobu, T.; Morimoto, M.; Ohta, M.; et al. Detection of copy number variations in epilepsy using exome data. Clin. Genet. 2018, 93, 577–587. [Google Scholar] [CrossRef]
- Mizuguchi, T.; Suzuki, T.; Abe, C.; Umemura, A.; Tokunaga, K.; Kawai, Y.; Nakamura, M.; Nagasaki, M.; Kinoshita, K.; Okamura, Y.; et al. A 12-kb structural variation in progressive myoclonic epilepsy was newly identified by long-read whole-genome sequencing. J. Hum. Genet. 2019, 64, 359–368. [Google Scholar] [CrossRef]
- Scherer, S.W.; Lee, C.; Birney, E.; Altshuler, D.M.; Eichler, E.E.; Carter, N.P.; Hurles, M.E.; Feuk, L. Challenges and standards in integrating surveys of structural variation. Nat. Genet. 2007, 39 (Suppl. S7), S7–S15. [Google Scholar] [CrossRef]
- Alkan, C.; Sajjadian, S.; Eichler, E.E. Limitations of next-generation genome sequence assembly. Nat. Methods 2010, 8, 61–65. [Google Scholar] [CrossRef]
- Mahmoud, M.; Gobet, N.; Cruz-Dávalos, D.I.; Mounier, N.; Dessimoz, C.; Sedlazeck, F.J. Structural variant calling: The long and the short of it. Genome Biol. 2019, 20, 246. [Google Scholar] [CrossRef]
- Amarasinghe, S.L.; Su, S.; Dong, X.; Zappia, L.; Ritchie, M.E.; Gouil, Q. Opportunities and challenges in long-read sequencing data analysis. Genome Biol. 2020, 21, 30. [Google Scholar] [CrossRef]
- Mantere, T.; Kersten, S.; Hoischen, A. Long-Read Sequencing Emerging in Medical Genetics. Front. Genet. 2019, 10, 426. [Google Scholar] [CrossRef] [PubMed]
- Pang, A.W.; Macdonald, J.R.; Pinto, D.; Wei, J.A.; Rafiq, M.; Conrad, D.F.; Park, H.; Hurles, M.E.; Lee, C.; Venter, J.C.; et al. Towards a comprehensive structural variation map of an individual human genome. Genome Biol. 2010, 11, R52. [Google Scholar] [CrossRef] [PubMed]
- Tyson, J.R.; O’Neil, N.J.; Jain, M.; Olsen, H.E.; Hieter, P.; Snutch, T.P. MinION-based long-read sequencing and assembly extends the Caenorhabditis elegans reference genome. Genome Res. 2018, 28, 266–274. [Google Scholar] [CrossRef] [PubMed]
- Tyler, A.D.; Mataseje, L.; Urfano, C.J.; Schmidt, L.; Antonation, K.S.; Mulvey, M.R.; Corbett, C.R. Evaluation of Oxford Nanopore’s MinION Sequencing Device for Microbial Whole Genome Sequencing Applications. Sci. Rep. 2018, 8, 10931. [Google Scholar] [CrossRef] [PubMed]
- Jain, M.; Koren, S.; Miga, K.H.; Quick, J.; Rand, A.C.; Sasani, T.A.; Tyson, J.R.; Beggs, A.D.; Dilthey, A.T.; Fiddes, I.T.; et al. Nanopore sequencing and assembly of a human genome with ultra-long reads. Nat. Biotechnol. 2018, 36, 338–345. [Google Scholar] [CrossRef]
- Malmberg, M.M.; Spangenberg, G.C.; Daetwyler, H.D.; Cogan, N.O.I. Assessment of low-coverage nanopore long read sequencing for SNP genotyping in doubled haploid canola (Brassica napus L.). Sci. Rep. 2019, 9, 8688. [Google Scholar] [CrossRef]
- Sakamoto, Y.; Xu, L.; Seki, M.; Yokoyama, T.T.; Kasahara, M.; Kashima, Y.; Ohashi, A.; Shimada, Y.; Motoi, N.; Tsuchihara, K.; et al. Long read sequencing reveals a novel class of structural aberrations in cancers: Identification and characterization of cancerous local amplifications. bioRxiv 2019, 620047. [Google Scholar] [CrossRef]
- Abel, H.J.; Genomics, N.C.F.C.D.; Larson, D.E.; Regier, A.A.; Chiang, C.; Das, I.; Kanchi, K.L.; Layer, R.M.; Neale, B.M.; Salerno, W.J.; et al. Mapping and characterization of structural variation in 17,795 human genomes. Nat. Cell Biol. 2020, 583, 83–89. [Google Scholar] [CrossRef]
- Sharp, A.J.; Cheng, Z.; Eichler, E.E. Structural Variation of the Human Genome. Annu. Rev. Genom. Hum. Genet. 2006, 7, 407–442. [Google Scholar] [CrossRef]
- Distefano, J.K. The Emerging Role of Long Noncoding RNAs in Human Disease. Toxic. Assess. 2018, 1706, 91–110. [Google Scholar] [CrossRef]
- Ardekani, A.M.; Naeini, M.M. The Role of MicroRNAs in Human Diseases. Avicenna J. Med. Biotechnol. 2010, 2, 161–179. [Google Scholar] [PubMed]
- Schepici, G.; Cavalli, E.; Bramanti, P.; Mazzon, E. Autism Spectrum Disorder and miRNA: An Overview of Experimental Models. Brain Sci. 2019, 9, 265. [Google Scholar] [CrossRef] [PubMed]
- Hardwick, S.A.; Bassett, S.D.; Kaczorowski, D.; Blackburn, J.; Barton, K.; Bartonicek, N.; Carswell, S.L.; Tilgner, H.U.; Loy, C.; Halliday, G.; et al. Targeted, High-Resolution RNA Sequencing of Non-coding Genomic Regions Associated with Neuropsychiatric Functions. Front. Genet. 2019, 10, 309. [Google Scholar] [CrossRef] [PubMed]
- Zhang, X.; Meyerson, M. Illuminating the noncoding genome in cancer. Nat. Rev. Cancer 2020, 1, 864–872. [Google Scholar] [CrossRef]
- Tham, C.Y.; Tirado-Magallanes, R.; Goh, Y.; Fullwood, M.J.; Koh, B.T.; Wang, W.; Ng, C.H.; Chng, W.-J.; Thiéry, A.H.; Tenen, D.G.; et al. NanoVar: Accurate characterization of patients’ genomic structural variants using low-depth nanopore sequencing. Genome Biol. 2020, 21, 56. [Google Scholar] [CrossRef] [PubMed]
- Stancu, M.C.; Van Roosmalen, M.J.; Renkens, I.; Nieboer, M.M.; Middelkamp, S.; De Ligt, J.; Pregno, G.; Giachino, D.; Mandrile, G.; Valle-Inclan, J.E.; et al. Mapping and phasing of structural variation in patient genomes using nanopore sequencing. Nat. Commun. 2017, 8, 1326. [Google Scholar] [CrossRef]
- Uddin, M.; Pellecchia, G.; Thiruvahindrapuram, B.; D’Abate, L.; Merico, D.; Chan, A.; Zarrei, M.; Tammimies, K.; Walker, S.; Gazzellone, M.J.; et al. Indexing Effects of Copy Number Variation on Genes Involved in Developmental Delay. Sci. Rep. 2016, 6, 28663. [Google Scholar] [CrossRef] [PubMed]
- Lupski, J.R. Structural variation mutagenesis of the human genome: Impact on disease and evolution. Environ. Mol. Mutagen. 2015, 56, 419–436. [Google Scholar] [CrossRef]
- De Coster, W.; Van Broeckhoven, C. Newest Methods for Detecting Structural Variations. Trends Biotechnol. 2019, 37, 973–982. [Google Scholar] [CrossRef]
- De Coster, W.; Strazisar, M.; De Rijk, P. Critical length in long-read resequencing. NAR Genom. Bioinform. 2020, 2. [Google Scholar] [CrossRef]
- Osborne, L.R.; Li, M.; Pober, B.; Chitayat, D.; Bodurtha, J.; Mandel, A.; Costa, T.; Grebe, T.; Cox, S.; Tsui, L.-C.; et al. A 1.5 million–base pair inversion polymorphism in families with Williams-Beuren syndrome. Nat. Genet. 2001, 29, 321–325. [Google Scholar] [CrossRef] [PubMed]
- Zhang, S.; Liang, F.; Lei, C.; Wu, J.; Fu, J.; Yang, Q.; Luo, X.; Yu, G.; Wang, D.; Zhang, Y.; et al. Long-read sequencing and haplotype linkage analysis enabled preimplantation genetic testing for patients carrying pathogenic inversions. J. Med. Genet. 2019, 56, 741–749. [Google Scholar] [CrossRef] [PubMed]
- Cabianca, S.D.; Casa, V.; Gabellini, D. A novel molecular mechanism in human genetic disease: A DNA repeat-derived lncRNA. RNA Biol. 2012, 9, 1211–1217. [Google Scholar] [CrossRef][Green Version]
- Srijyothi, L.; Ponne, S.; Prathama, T.; Ashok, C.; Baluchamy, S. Roles of Non-Coding RNAs in Transcriptional Regulation. In Transcriptional and Post-Transcriptional Regulation, 1st ed.; Ghedirach, K., Ed.; IntechOpen: London, UK, 2018; Chapter 4; pp. 55–75. [Google Scholar]
- Chen, J.; Ao, L.; Yang, J. Long non-coding RNAs in diseases related to inflammation and immunity. Ann. Transl. Med. 2019, 7, 494. [Google Scholar] [CrossRef] [PubMed]
- Cogill, S.B.; Srivastava, A.K.; Yang, M.Q.; Wang, L. Co-expression of long non-coding RNAs and autism risk genes in the developing human brain. BMC Syst. Biol. 2018, 12 (Suppl. S7), 91. [Google Scholar] [CrossRef]
- Zhang, S.-F.; Gao, J.; Liu, C.-M. The Role of Non-Coding RNAs in Neurodevelopmental Disorders. Front. Genet. 2019, 10, 1033. [Google Scholar] [CrossRef]
- Rennert, O.M.; Ziats, M.N. Editorial: Non-Coding RNAs in Neurodevelopmental Disorders. Front. Neurol. 2017, 8, 629. [Google Scholar] [CrossRef]
- Costain, G.; Walker, S.; Marano, M.; Veenma, D.; Snell, M.; Curtis, M.; Luca, S.; Buera, J.; Arje, D.; Reuter, M.S.; et al. Genome Sequencing as a Diagnostic Test in Children With Unexplained Medical Complexity. JAMA Netw. Open 2020, 3, e2018109. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Begum, G.; Albanna, A.; Bankapur, A.; Nassir, N.; Tambi, R.; Berdiev, B.K.; Akter, H.; Karuvantevida, N.; Kellam, B.; Alhashmi, D.; et al. Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome. Int. J. Mol. Sci. 2021, 22, 2060. https://doi.org/10.3390/ijms22042060
Begum G, Albanna A, Bankapur A, Nassir N, Tambi R, Berdiev BK, Akter H, Karuvantevida N, Kellam B, Alhashmi D, et al. Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome. International Journal of Molecular Sciences. 2021; 22(4):2060. https://doi.org/10.3390/ijms22042060
Chicago/Turabian StyleBegum, Ghausia, Ammar Albanna, Asma Bankapur, Nasna Nassir, Richa Tambi, Bakhrom K. Berdiev, Hosneara Akter, Noushad Karuvantevida, Barbara Kellam, Deena Alhashmi, and et al. 2021. "Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome" International Journal of Molecular Sciences 22, no. 4: 2060. https://doi.org/10.3390/ijms22042060
APA StyleBegum, G., Albanna, A., Bankapur, A., Nassir, N., Tambi, R., Berdiev, B. K., Akter, H., Karuvantevida, N., Kellam, B., Alhashmi, D., Sung, W. W. L., Thiruvahindrapuram, B., Alsheikh-Ali, A., Scherer, S. W., & Uddin, M. (2021). Long-Read Sequencing Improves the Detection of Structural Variations Impacting Complex Non-Coding Elements of the Genome. International Journal of Molecular Sciences, 22(4), 2060. https://doi.org/10.3390/ijms22042060