Progress in Flax Genome Assembly from Nanopore Sequencing Data
Abstract
1. Introduction
2. Results and Discussion
2.1. Genome of the Variety K-3018
2.2. Genome of the Variety Svyatogor
2.3. Progress in the Flax Genome Assembly
3. Materials and Methods
3.1. DNA Preparation and Sequencing on the ONT Platform
3.2. Construction and Sequencing of Genomic and Hi-C Libraries on the Illumina Platform
3.3. Assembly of Flax Genomes
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Sun, Y.; Shang, L.; Zhu, Q.H.; Fan, L.; Guo, L. Twenty years of plant genome sequencing: Achievements and challenges. Trends Plant Sci. 2022, 27, 391–401. [Google Scholar] [CrossRef]
- Espinosa, E.; Bautista, R.; Larrosa, R.; Plata, O. Advancements in long-read genome sequencing technologies and algorithms. Genomics 2024, 116, 110842. [Google Scholar] [CrossRef] [PubMed]
- Zhang, T.; Zhou, J.; Gao, W.; Jia, Y.; Wei, Y.; Wang, G. Complex genome assembly based on long-read sequencing. Brief. Bioinform. 2022, 23, bbac305. [Google Scholar] [CrossRef]
- Medhi, U.; Chaliha, C.; Singh, A.; Nath, B.K.; Kalita, E. Third generation sequencing transforming plant genome research: Current trends and challenges. Gene 2025, 940, 149187. [Google Scholar] [CrossRef] [PubMed]
- Li, H.; Durbin, R. Genome assembly in the telomere-to-telomere era. Nat. Rev. Genet. 2024, 25, 658–670. [Google Scholar] [CrossRef]
- Diaz-Riano, J.I.; Duitama, J. Current progress in phased genome assembly from long-read DNA sequencing data. Methods Mol. Biol. 2025, 2955, 51–70. [Google Scholar] [CrossRef] [PubMed]
- Mahmoud, M.; Agustinho, D.P.; Sedlazeck, F.J. A Hitchhiker’s guide to long-read genomic analysis. Genome Res. 2025, 35, 545–558. [Google Scholar] [CrossRef]
- Bernal-Gallardo, J.J.; de Folter, S. Plant genome information facilitates plant functional genomics. Planta 2024, 259, 117. [Google Scholar] [CrossRef]
- Garg, S.; Nain, P.; Kumar, A.; Joshi, S.; Punetha, H.; Sharma, P.K.; Siddiqui, S.; Alshaharni, M.O.; Algopishi, U.B.; Mittal, A. Next generation plant biostimulants & genome sequencing strategies for sustainable agriculture development. Front. Microbiol. 2024, 15, 1439561. [Google Scholar] [CrossRef] [PubMed]
- Dmitriev, A.A.; Pushkova, E.N.; Melnikova, N.V. Plant genome sequencing: Modern technologies and novel opportunities for breeding. Mol. Biol. 2022, 56, 495–507. [Google Scholar] [CrossRef]
- Schreiber, M.; Jayakodi, M.; Stein, N.; Mascher, M. Plant pangenomes for crop improvement, biodiversity and evolution. Nat. Rev. Genet. 2024, 25, 563–577. [Google Scholar] [CrossRef]
- Kumar, R.; Das, S.P.; Choudhury, B.U.; Kumar, A.; Prakash, N.R.; Verma, R.; Chakraborti, M.; Devi, A.G.; Bhattacharjee, B.; Das, R.; et al. Advances in genomic tools for plant breeding: Harnessing DNA molecular markers, genomic selection, and genome editing. Biol. Res. 2024, 57, 80. [Google Scholar] [CrossRef]
- Jayakodi, M.; Shim, H.; Mascher, M. What are we learning from plant pangenomes? Annu. Rev. Plant Biol. 2025, 76, 663–686. [Google Scholar] [CrossRef] [PubMed]
- Naithani, S.; Deng, C.H.; Sahu, S.K.; Jaiswal, P. Exploring pan-genomes: An overview of resources and tools for unraveling structure, function, and evolution of crop genes and genomes. Biomolecules 2023, 13, 1403. [Google Scholar] [CrossRef]
- Garg, V.; Bohra, A.; Mascher, M.; Spannagl, M.; Xu, X.; Bevan, M.W.; Bennetzen, J.L.; Varshney, R.K. Unlocking plant genetics with telomere-to-telomere genome assemblies. Nat. Genet. 2024, 56, 1788–1799. [Google Scholar] [CrossRef]
- Tse, T.J.; Guo, Y.; Shim, Y.Y.; Purdy, S.K.; Kim, J.H.; Cho, J.Y.; Alcorn, J.; Reaney, M.J.T. Availability of bioactive flax lignan from foods and supplements. Crit. Rev. Food Sci. Nutr. 2023, 63, 9843–9858. [Google Scholar] [CrossRef]
- Gao, Z.; Cao, Q.; Deng, Z. Unveiling the power of flax lignans: From plant biosynthesis to human health benefits. Nutrients 2024, 16, 3520. [Google Scholar] [CrossRef]
- Stepien, A.E.; Trojniak, J.; Tabarkiewicz, J. Anti-oxidant and anti-cancer properties of flaxseed. Int. J. Mol. Sci. 2025, 26, 1226. [Google Scholar] [CrossRef]
- Campos, J.R.; Severino, P.; Ferreira, C.S.; Zielinska, A.; Santini, A.; Souto, S.B.; Souto, E.B. Linseed essential oil—source of lipids as active ingredients for pharmaceuticals and nutraceuticals. Curr. Med. Chem. 2019, 26, 4537–4558. [Google Scholar] [CrossRef] [PubMed]
- Kezimana, P.; Dmitriev, A.A.; Kudryavtseva, A.V.; Romanova, E.V.; Melnikova, N.V. Secoisolariciresinol diglucoside of flaxseed and its metabolites: biosynthesis and potential for nutraceuticals. Front. Genet. 2018, 9, 641. [Google Scholar] [CrossRef] [PubMed]
- Goudenhooft, C.; Bourmaud, A.; Baley, C. Flax (Linum usitatissimum L.) Fibers for composite reinforcement: Exploring the link between plant growth, cell walls development, and fiber properties. Front. Plant Sci. 2019, 10, 411. [Google Scholar] [CrossRef]
- Wang, Z.; Hobson, N.; Galindo, L.; Zhu, S.; Shi, D.; McDill, J.; Yang, L.; Hawkins, S.; Neutelings, G.; Datla, R. The genome of flax (Linum usitatissimum) assembled de novo from short shotgun sequence reads. Plant J. 2012, 72, 461–473. [Google Scholar] [CrossRef]
- You, F.M.; Xiao, J.; Li, P.; Yao, Z.; Jia, G.; He, L.; Zhu, T.; Luo, M.C.; Wang, X.; Deyholos, M.K. Chromosome-scale pseudomolecules refined by optical, physical and genetic maps in flax. Plant J. 2018, 95, 371–384. [Google Scholar] [CrossRef]
- Zhang, J.; Qi, Y.; Wang, L.; Wang, L.; Yan, X.; Dang, Z.; Li, W.; Zhao, W.; Pei, X.; Li, X. Genomic comparison and population diversity analysis provide insights into the domestication and improvement of flax. Iscience 2020, 23, 100967. [Google Scholar] [CrossRef]
- Dmitriev, A.A.; Pushkova, E.N.; Novakovskiy, R.O.; Beniaminov, A.D.; Rozhmina, T.A.; Zhuchenko, A.A.; Bolsheva, N.L.; Muravenko, O.V.; Povkhova, L.V.; Dvorianinova, E.M. Genome sequencing of fiber flax cultivar Atlant using Oxford Nanopore and Illumina platforms. Front. Genet. 2021, 11, 590282. [Google Scholar] [CrossRef] [PubMed]
- Sa, R.; Yi, L.; Siqin, B.; An, M.; Bao, H.; Song, X.; Wang, S.; Li, Z.; Zhang, Z.; Hazaisi, H. Chromosome-level genome assembly and annotation of the fiber flax (Linum usitatissimum) genome. Front. Genet. 2021, 12, 735690. [Google Scholar] [CrossRef]
- Dvorianinova, E.M.; Bolsheva, N.L.; Pushkova, E.N.; Rozhmina, T.A.; Zhuchenko, A.A.; Novakovskiy, R.O.; Povkhova, L.V.; Sigova, E.A.; Zhernova, D.A.; Borkhert, E.V.; et al. Isolating Linum usitatissimum L. nuclear DNA enabled assembling high-quality genome. Int. J. Mol. Sci. 2022, 23, 13244. [Google Scholar] [CrossRef]
- Zhao, X.; Yi, L.; Zuo, Y.; Gao, F.; Cheng, Y.; Zhang, H.; Zhou, Y.; Jia, X.; Su, S.; Zhang, D. High-quality genome assembly and genome-wide association study of male sterility provide resources for flax improvement. Plants 2023, 12, 2773. [Google Scholar] [CrossRef]
- Arkhipov, A.A.; Pushkova, E.N.; Bolsheva, N.L.; Rozhmina, T.A.; Borkhert, E.V.; Zhernova, D.A.; Rybakova, T.Y.; Barsukov, N.M.; Moskalenko, O.D.; Sigova, E.A.; et al. Nanopore data-driven chromosome-level assembly of flax genome. Plants 2024, 13, 3465. [Google Scholar] [CrossRef] [PubMed]
- Lu, J.; Wu, H.; Wang, F.; Li, J.; Wang, Y.; Zhao, Q.; Wang, Y.; Wang, X.; Lei, X.; Sun, R.; et al. Telomere to telomere flax (Linum usitatissimum L.) genome assembly unlocks insights beyond fatty acid metabolism pathways. Hortic. Res. 2025, 12, uhaf127. [Google Scholar] [CrossRef] [PubMed]
- Yadav, H.K.; Singh, N.; Singh, B.; Kaur, V.; Sawant, S.V. Telomere-to-telomere genome assembly of linseed (Linum usitatissimum L.) for functional genomics and accelerated genetic improvement. Plant Biotechnol. J. 2025, 23, 3919–3933. [Google Scholar] [CrossRef]
- Pushkova, E.N.; Borkhert, E.V.; Novakovskiy, R.O.; Dvorianinova, E.M.; Rozhmina, T.A.; Zhuchenko, A.A.; Zhernova, D.A.; Turba, A.A.; Yablokov, A.G.; Sigova, E.A.; et al. Selection of flax genotypes for pan-genomic studies by sequencing tagmentation-based transcriptome libraries. Plants 2023, 12, 3725. [Google Scholar] [CrossRef]
- Cloutier, S.; Ragupathy, R.; Miranda, E.; Radovanovic, N.; Reimer, E.; Walichnowski, A.; Ward, K.; Rowland, G.; Duguid, S.; Banik, M. Integrated consensus genetic and physical maps of flax (Linum usitatissimum L.). Theor. Appl. Genet. 2012, 125, 1783–1795. [Google Scholar] [CrossRef]
- Bolsheva, N.L.; Semenova, O.Y.; Muravenko, O.; Nosova, I.V.; Popov, K.; Zelenin, A.V. Localization of telomere sequences in chromosomes of two flax species. Biol. Membr. 2005, 22, 227–231. [Google Scholar]
- Dvorianinova, E.M.; Pushkova, E.N.; Bolsheva, N.L.; Borkhert, E.V.; Rozhmina, T.A.; Zhernova, D.A.; Novakovskiy, R.O.; Turba, A.A.; Sigova, E.A.; Melnikova, N.V.; et al. Genome of Linum usitatissimum convar. crepitans expands the view on the section Linum. Front. Genet. 2023, 14, 1269837. [Google Scholar] [CrossRef] [PubMed]
- Dvorianinova, E.M.; Pushkova, E.N.; Bolsheva, N.L.; Rozhmina, T.A.; Zhernova, D.A.; Sigova, E.A.; Borkhert, E.V.; Melnikova, N.V.; Dmitriev, A.A. Improving genome assembly of flax line 3896 with high-precision Illumina reads. Russ. J. Genet. 2023, 59, S237–S240. [Google Scholar] [CrossRef]
- Cheng, H.; Qu, H.; McKenzie, S.; Lawrence, K.R.; Windsor, R.; Vella, M.; Park, P.J.; Li, H. Efficient near telomere-to-telomere assembly of Nanopore simplex reads. bioRxiv 2025. [Google Scholar] [CrossRef]
- Stanojević, D.; Lin, D.; Nurk, S.; Florez de Sessions, P.; Šikić, M. Telomere-to-telomere phased genome assembly using HERRO-corrected simplex Nanopore reads. bioRxiv 2024. [Google Scholar] [CrossRef]
- Zhernova, D.A.; Pushkova, E.N.; Rozhmina, T.A.; Borkhert, E.V.; Arkhipov, A.A.; Sigova, E.A.; Dvorianinova, E.M.; Dmitriev, A.A.; Melnikova, N.V. History and prospects of flax genetic markers. Front. Plant Sci. 2024, 15, 1495069. [Google Scholar] [CrossRef]
- Zelenka, T.; Spilianakis, C. HiChIP and Hi-C protocol optimized for primary murine T cells. Methods Protoc. 2021, 4, 49. [Google Scholar] [CrossRef]
- De Coster, W.; Rademakers, R. NanoPack2: Population-scale evaluation of long-read sequencing data. Bioinformatics 2023, 39, btad311. [Google Scholar] [CrossRef] [PubMed]
- Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef]
- Gurevich, A.; Saveliev, V.; Vyahhi, N.; Tesler, G. QUAST: Quality assessment tool for genome assemblies. Bioinformatics 2013, 29, 1072–1075. [Google Scholar] [CrossRef]
- Flynn, J.M.; Hubley, R.; Goubert, C.; Rosen, J.; Clark, A.G.; Feschotte, C.; Smit, A.F. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl. Acad. Sci. USA 2020, 117, 9451–9457. [Google Scholar] [CrossRef] [PubMed]
- Bao, W.; Kojima, K.K.; Kohany, O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob. DNA 2015, 6, 11. [Google Scholar] [CrossRef] [PubMed]
- Storer, J.; Hubley, R.; Rosen, J.; Wheeler, T.J.; Smit, A.F. The Dfam community resource of transposable element families, sequence models, and genome annotations. Mob. DNA 2021, 12, 2. [Google Scholar] [CrossRef]
- Gabriel, L.; Bruna, T.; Hoff, K.J.; Ebel, M.; Lomsadze, A.; Borodovsky, M.; Stanke, M. BRAKER3: Fully automated genome annotation using RNA-seq and protein evidence with GeneMark-ETP, AUGUSTUS and TSEBRA. Genome Res. 2024, 34, 769–777. [Google Scholar] [CrossRef]
- Zhernova, D.A.; Arkhipov, A.A.; Rozhmina, T.A.; Zhuchenko, A.A.; Bolsheva, N.L.; Sigova, E.A.; Dvorianinova, E.M.; Borkhert, E.V.; Pushkova, E.N.; Melnikova, N.V.; et al. Transcriptome map and genome annotation of flax line 3896. Front. Plant Sci. 2025, 16, 1520832. [Google Scholar] [CrossRef]
- Kuznetsov, D.; Tegenfeldt, F.; Manni, M.; Seppey, M.; Berkeley, M.; Kriventseva, E.V.; Zdobnov, E.M. OrthoDB v11: Annotation of orthologs in the widest sampling of organismal diversity. Nucleic Acids Res. 2023, 51, D445–D451. [Google Scholar] [CrossRef]
- Manni, M.; Berkeley, M.R.; Seppey, M.; Simao, F.A.; Zdobnov, E.M. BUSCO update: Novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol. Biol. Evol. 2021, 38, 4647–4654. [Google Scholar] [CrossRef]
- Rhie, A.; Walenz, B.P.; Koren, S.; Phillippy, A.M. Merqury: Reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol. 2020, 21, 245. [Google Scholar] [CrossRef]
- Kielbasa, S.M.; Wan, R.; Sato, K.; Horton, P.; Frith, M.C. Adaptive seeds tame genomic sequence comparison. Genome Res. 2011, 21, 487–493. [Google Scholar] [CrossRef]
- Brown, M.R.; Manuel Gonzalez de La Rosa, P.; Blaxter, M. tidk: A toolkit to rapidly identify telomeric repeats from genomic datasets. Bioinformatics 2025, 41, btaf049. [Google Scholar] [CrossRef]
- Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv 2013, arXiv:1303.3997. [Google Scholar] [CrossRef]
- Open2C; Abdennur, N.; Fudenberg, G.; Flyamer, I.M.; Galitsyna, A.A.; Goloborodko, A.; Imakaev, M.; Venev, S.V. Pairtools: From sequencing data to chromosome contacts. bioRxiv 2023. [Google Scholar] [CrossRef] [PubMed]
- Durand, N.C.; Shamim, M.S.; Machol, I.; Rao, S.S.; Huntley, M.H.; Lander, E.S.; Aiden, E.L. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 2016, 3, 95–98. [Google Scholar] [CrossRef] [PubMed]
- Marcais, G.; Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 2011, 27, 764–770. [Google Scholar] [CrossRef]
- Ranallo-Benavidez, T.R.; Jaron, K.S.; Schatz, M.C. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat. Commun. 2020, 11, 1432. [Google Scholar] [CrossRef]




| Variety | Assembly Size, Mb | QV Score | Merqury Completeness, % | BUSCO Completeness, % | Number of Gaps | Number of Telomeres |
|---|---|---|---|---|---|---|
| K-3018 v1 | 489.1 | - | - | 95.8 | 9 | 30 |
| K-3018 v2 | 491.1 | - | - | 96.0 | 2 | 30 |
| Svyatogor | 497.8 | 55.3 | 99.0 | 95.9 | 3 | 30 |
| T397 | 494.9 | 59.6 | 96.0 | 95.8 | 5 | 28 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Pushkova, E.N.; Arkhipov, A.A.; Bolsheva, N.L.; Rozhmina, T.A.; Zhuchenko, A.A.; Borkhert, E.V.; Barsukov, N.M.; Oleshnya, G.A.; Milovanova, A.V.; Moskalenko, O.D.; et al. Progress in Flax Genome Assembly from Nanopore Sequencing Data. Plants 2026, 15, 151. https://doi.org/10.3390/plants15010151
Pushkova EN, Arkhipov AA, Bolsheva NL, Rozhmina TA, Zhuchenko AA, Borkhert EV, Barsukov NM, Oleshnya GA, Milovanova AV, Moskalenko OD, et al. Progress in Flax Genome Assembly from Nanopore Sequencing Data. Plants. 2026; 15(1):151. https://doi.org/10.3390/plants15010151
Chicago/Turabian StylePushkova, Elena N., Alexander A. Arkhipov, Nadezhda L. Bolsheva, Tatiana A. Rozhmina, Alexander A. Zhuchenko, Elena V. Borkhert, Nikolai M. Barsukov, Gavriil A. Oleshnya, Alina V. Milovanova, Olesya D. Moskalenko, and et al. 2026. "Progress in Flax Genome Assembly from Nanopore Sequencing Data" Plants 15, no. 1: 151. https://doi.org/10.3390/plants15010151
APA StylePushkova, E. N., Arkhipov, A. A., Bolsheva, N. L., Rozhmina, T. A., Zhuchenko, A. A., Borkhert, E. V., Barsukov, N. M., Oleshnya, G. A., Milovanova, A. V., Moskalenko, O. D., Kostromskoy, F. D., Ivankina, E. A., Dvorianinova, E. M., Krupskaya, D. A., Melnikova, N. V., & Dmitriev, A. A. (2026). Progress in Flax Genome Assembly from Nanopore Sequencing Data. Plants, 15(1), 151. https://doi.org/10.3390/plants15010151

