De Novo Genome Assembly, Genomic Features, and Comparative Analysis of the Sawfly Dentathalia scutellariae
Simple Summary
Abstract
1. Introduction
2. Materials and Methods
2.1. Sample Preparation and Sequencing
2.2. Genome Features Assessment
2.3. Genome Assembly and Assessment
2.4. Genome Annotation
2.5. Comparative Genomics Analysis
3. Results
3.1. Genome Features Assessment and Assembly
3.2. Repetitive Sequence Annotation
3.3. Gene Annotation
3.4. Mitochondrial Genome Assembly
3.5. Comparative Genomics
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Schoch, C.L.; Ciufo, S.; Domrachev, M.; Hotton, C.L.; Kannan, S.; Khovanskaya, R.; Leipe, D.; Mcveigh, R.; O’Neill, K.; Robbertse, B.; et al. NCBI Taxonomy: A comprehensive update on curation, resources and tools. Database 2020, 2020, baaa062. [Google Scholar] [CrossRef]
- Niu, G.Y.; Budak, M.; Korkmaz, E.M.; Doğan, Ö.; Nel, A.; Wan, S.Y.; Cai, C.Y.; Jouault, C.; Li, M.; Wei, M.C. Phylogenomic analyses of the Tenthredinoidea support the familial rank of Athaliidae (Insecta, Tenthredinoidea). Insects 2022, 13, 858. [Google Scholar] [CrossRef]
- Opitz, S.E.; Boevé, J.L.; Nagy, Z.T.; Sonet, G.; Koch, F.; Müller, C. Host shifts from Lamiales to Brassicaceae in the sawfly genus Athalia. PLoS ONE 2012, 7, e33649. [Google Scholar] [CrossRef][Green Version]
- Benson, R.B. A revision of the Athaliini (Hymenoptera: Tenthredinidae). B. Brit. Mus. (Nat. Hist.) Entomol. 1962, 11, 334–382. [Google Scholar]
- Oeyen, J.P.; Baa-Puyoulet, P.; Benoit, J.B.; Beukeboom, L.W.; Bornberg-Bauer, E.; Buttstedt, A.; Calevro, F.; Cash, E.I.; Chao, H.; Charles, H.; et al. Sawfly genomes reveal evolutionary acquisitions that fostered the mega-radiation of parasitoid and eusocial Hymenoptera. Genome Biol. Evol. 2020, 12, 1099–1188. [Google Scholar] [CrossRef]
- Crowley, L.M.; Broad, G.R.; University of Oxford and Wytham Woods Genome Acquisition Lab; Natural History Museum Genome Acquisition Lab; Darwin Tree of Life Barcoding collective; Wellcome Sanger Institute Tree of Life programme; Wellcome Sanger Institute Scientific Operations: DNA Pipelines collective; Tree of Life Core Informatics collective; Green, A.; Darwin Tree of Life Consortium. The genome sequence of the Turnip Sawfly, Athalia rosae (Linnaeus, 1758). Wellcome Open Res. 2023, 8, 87. [Google Scholar] [CrossRef]
- Halstead, A.; Falk, S.; University of Oxford and Wytham Woods Genome Acquisition Lab; Natural History Museum Genome Acquisition Lab; Darwin Tree of Life Barcoding collective; Wellcome Sanger Institute Tree of Life Management; Samples and Laboratory team; Wellcome Sanger Institute Scientific Operations: Sequencing Operations; Wellcome Sanger Institute Tree of Life Core Informatics team; Tree of Life Core Informatics collective; et al. The genome sequence of a sawfly, Athalia cordata Serville, 1823. Wellcome Open Res. 2025, 10, 15. [Google Scholar] [CrossRef]
- Opitz, S.E.; Jensen, S.R.; Müller, C. Sequestration of glucosinolates and iridoid glucosides in sawfly species of the genus Athalia and their role in defense against ants. J. Chem. Ecol. 2010, 36, 148–157. [Google Scholar] [CrossRef]
- Opitz, S.E.; Mix, A.; Winde, I.B.; Müller, C. Desulfation followed by sulfation: Metabolism of benzylglucosinolate in Athalia rosae (Hymenoptera: Tenthredinidae). Chembiochem 2011, 12, 1252–1257. [Google Scholar] [CrossRef]
- Zhao, T.T.; Tang, H.L.; Xie, L.; Zheng, Y.; Ma, Z.B.; Sun, Q.; Li, X.F. Scutellaria baicalensis Georgi. (Lamiaceae): A review of its traditional uses, botany, phytochemistry, pharmacology and toxicology. J. Pharm. Pharmacol. 2019, 71, 1353–1369. [Google Scholar] [CrossRef]
- Müller, C.; Agerbirk, N.; Olsen, C.E.; Boevé, J.L.; Schaffner, U.; Brakefield, P.M. Sequestration of host plant glucosinolates in the defensive hemolymph of the sawfly Athalia rosae. J. Chem. Ecol. 2001, 27, 2505–2516. [Google Scholar] [CrossRef]
- Simon, S.; Breeschoten, T.; Jansen, H.J.; Dirks, R.P.; Schranz, M.E.; Ros, V.I.D. Genome and transcriptome analysis of the beet armyworm Spodoptera exigua reveals targets for pest control. G3 Genes|Genomes|Genet. 2021, 11, jkab311. [Google Scholar] [CrossRef]
- Li, F.; Zhao, X.; Li, M.; He, K.; Huang, C.; Zhou, Y.; Li, Z.; Walters, J.R. Insect genomes: Progress and challenges. Insect Mol. Biol. 2019, 28, 739–758. [Google Scholar] [CrossRef]
- Chen, S.F.; Zhou, Y.Q.; Chen, Y.R.; Gu, J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef]
- Ranallo-Benavidez, T.R.; Jaron, K.S.; Schatz, M.C. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat. Commun. 2020, 11, 1432. [Google Scholar] [CrossRef]
- Hu, J.; Wang, Z.; Sun, Z.; Hu, B.; Ayoola, A.O.; Liang, F.; Li, J.; Sandoval, J.R.; Cooper, D.N.; Ye, K.; et al. NextDenovo: An efficient error correction and accurate assembly tool for noisy long reads. Genome Biol. 2024, 25, 107. [Google Scholar] [CrossRef]
- Cheng, H.; Concepcion, G.T.; Feng, X.; Zhang, H.; Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods 2021, 18, 170–175. [Google Scholar] [CrossRef]
- Simão, F.A.; Waterhouse, R.M.; Ioannidis, P.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 2015, 31, 3210–3212. [Google Scholar] [CrossRef]
- Rhie, A.; Walenz, B.P.; Koren, S.; Phillippy, A.M. Merqury: Reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol. 2020, 21, 245. [Google Scholar] [CrossRef]
- Formenti, G.; Rhie, A.; Walenz, B.P.; Thibaud-Nissen, F.; Shafin, K.; Koren, S.; Myers, E.W.; Jarvis, E.D.; Phillippy, A.M. Merfin: Improved variant filtering, assembly evaluation and polishing via k-mer validation. Nat. Methods 2022, 19, 696–704. [Google Scholar] [CrossRef]
- Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R.; 1000 Genome Project Data Processing Subgroup. The sequence alignment/map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef]
- Garrison, E.; Marth, G. Haplotype-based variant detection from short-read sequencing. arXiv 2012, arXiv:1207.3907. [Google Scholar]
- Li, H. Minimap and miniasm: Fast mapping and de novo assembly for noisy long sequences. Bioinformatics 2016, 32, 2103–2110. [Google Scholar] [CrossRef]
- Li, H.; Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 2010, 26, 589–595. [Google Scholar] [CrossRef]
- Uliano-Silva, M.; Ferreira, J.G.R.N.; Krasheninnikova, K.; Darwin Tree of Life Consortium; Formenti, G.; Abueg, L.; Torrance, J.; Myers, E.W.; Durbin, R.; Blaxter, M.; et al. MitoHiFi: A python pipeline for mitochondrial genome assembly from PacBio high fidelity reads. BMC Bioinform. 2023, 24, 288. [Google Scholar] [CrossRef]
- Allio, R.; Schomaker-Bastos, A.; Romiguier, J.; Prosdocimi, F.; Nabholz, B.; Delsuc, F. MitoFinder: Efficient automated large-scale extraction of mitogenomic data in target enrichment phylogenomics. Mol. Ecol. Resour. 2020, 20, 892–905. [Google Scholar] [CrossRef]
- Greiner, S.; Lehwark, P.; Bock, R. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: Expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 2019, 47, W59–W64. [Google Scholar] [CrossRef]
- Tarailo-Graovac, M.; Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinform. 2009, 4, 4.10.1–4.10.14. [Google Scholar] [CrossRef]
- Mei, Y.; Jing, D.; Tang, S.Y.; Chen, X.; Chen, H.; Duanmu, H.N.; Cong, Y.Y.; Chen, M.Y.; Ye, X.H.; Zhou, H.; et al. InsectBase 2.0: A comprehensive gene resource for insects. Nucleic Acids Res. 2022, 50, D1040–D1045. [Google Scholar] [CrossRef]
- Hu, K.; Ni, P.; Xu, M.H.; Zou, Y.; Chang, J.Y.; Gao, X.; Li, Y.H.; Ruan, J.; Hu, B.; Wang, J.X. HiTE: A fast and accurate dynamic boundary adjustment approach for full-length transposable element detection and annotation. Nat. Commun. 2024, 15, 5573. [Google Scholar] [CrossRef]
- Hoff, K.J.; Lomsadze, A.; Borodovsky, M.; Stanke, M. Whole-genome annotation with BRAKER. Methods Mol. Biol. 2019, 1962, 65–95. [Google Scholar]
- Finn, R.D.; Clements, J.; Eddy, S.R. HMMER web server: Interactive sequence similarity searching. Nucleic Acids Res. 2011, 39, W29–W37. [Google Scholar] [CrossRef]
- Buchfink, B.; Xie, C.; Huson, D.H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods 2015, 12, 59–60. [Google Scholar] [CrossRef]
- Cantalapiedra, C.P.; Hernández-Plaza, A.; Letunic, I.; Bork, P.; Huerta-Cepas, J. eggNOG-mapper v2: Functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Mol. Biol. Evol. 2021, 38, 5825–5829. [Google Scholar] [CrossRef]
- Nawrocki, E.P.; Eddy, S.R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 2013, 29, 2933–2935. [Google Scholar] [CrossRef]
- Ontiveros-Palacios, N.; Cooke, E.; Nawrocki, E.P.; Triebel, S.; Marz, M.; Rivas, E.; Griffiths-Jones, S.; Petrov, A.I.; Bateman, A.; Sweeney, B. Rfam 15: RNA families database in 2025. Nucleic Acids Res. 2025, 53, D258–D267. [Google Scholar] [CrossRef]
- Emms, D.M.; Kelly, S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 2019, 20, 238. [Google Scholar] [CrossRef]
- Katoh, K.; Misawa, K.; Kuma, K.; Miyata, T. MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30, 3059–3066. [Google Scholar] [CrossRef]
- Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef]
- Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; von Haeseler, A.; Jermiin, L.S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods 2017, 14, 587–589. [Google Scholar] [CrossRef]
- Sanderson, M.J. r8s: Inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock. Bioinformatics 2003, 19, 301–302. [Google Scholar] [CrossRef]
- Misof, B.; Liu, S.L.; Meusemann, K.; Peters, R.S.; Donath, A.; Mayer, C.; Frandsen, P.B.; Ware, J.; Flouri, T.; Beutel, R.G.; et al. Phylogenomics resolves the timing and pattern of insect evolution. Science 2014, 346, 763–767. [Google Scholar] [CrossRef]
- Vasilikopoulos, A.; Misof, B.; Meusemann, K.; Lieberz, D.; Flouri, T.; Beutel, R.G.; Niehuis, O.; Wappler, T.; Rust, J.; Peters, R.S.; et al. An integrative phylogenomic approach to elucidate the evolutionary history and divergence times of Neuropterida (Insecta: Holometabola). BMC Evol. Biol. 2020, 20, 64. [Google Scholar]
- He, C.; Yang, Y.; Zhao, X.X.; Li, J.J.; Cai, Y.T.; Peng, L.J.; Liu, Y.Y.; Xiong, S.J.; Mei, Y.; Yan, Z.C.; et al. Large-scale genome analyses provide insights into Hymenoptera evolution. Mol. Biol. Evol. 2025, 42, msaf221. [Google Scholar] [CrossRef]
- Mendes, F.K.; Vanderpool, D.; Fulton, B.; Hahn, M.W. CAFE 5 models variation in evolutionary rates among gene families. Bioinformatics 2021, 36, 5516–5518. [Google Scholar] [CrossRef]
- Wu, T.Z.; Hu, E.Q.; Xu, S.B.; Chen, M.J.; Guo, P.F.; Dai, Z.H.; Feng, T.Z.; Zhou, L.; Tang, W.L.; Zhan, L.; et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation 2021, 2, 100141. [Google Scholar] [CrossRef]
- Broad, G.R.; Sivess, L.; Holt, S.; Fletcher, C.; Januszczak, I.; Natural History Museum Genome Acquisition Lab; Darwin Tree of Life Barcoding collective; Wellcome Sanger Institute Tree of Life Management; Samples and Laboratory team; Wellcome Sanger Institute Scientific Operations: Sequencing Operations; et al. The genome sequence of the cephid sawfly, Cephus spinipes (Panzer, 1800). Wellcome Open Res. 2024, 9, 557. [Google Scholar] [CrossRef]
- Falk, S.; Crowley, L.M.; Green, A.; University of Oxford and Wytham Woods Genome Acquisition Lab; Darwin Tree of Life Barcoding collective; Wellcome Sanger Institute Tree of Life Management; Samples and Laboratory team; Wellcome Sanger Institute Scientific Operations: Sequencing Operations; Wellcome Sanger Institute Tree of Life Core Informatics team; Tree of Life Core Informatics collective; et al. The genome sequence of the Figwort Sawfly Tenthredo scrophulariae Linnaeus, 1758. Wellcome Open Res. 2024, 9, 650. [Google Scholar] [CrossRef]
- Xiao, S.; Ye, X.H.; Wang, S.P.; Yang, Y.; Fang, Q.; Wang, F.; Ye, G.Y. Genome assembly of the ectoparasitoid wasp Theocolax elegans. Sci. Data 2023, 10, 159. [Google Scholar] [CrossRef]
- Abou-Zaid, M.M.; Beninger, C.W.; Arnason, J.T.; Nozzolillo, C. The effect of one flavone, two catechins and four flavonols on mortality and growth of the European corn borer (Ostrinia nubilalis hubner). Biochem. Syst. Ecol. 1993, 21, 415–420. [Google Scholar] [CrossRef]
- Onyilagha, J.C.; Lazorko, J.; Gruber, M.Y.; Soroka, J.J.; Erlandson, M.A. Effect of flavonoids on feeding preference and development of the crucifer pest Mamestra configurata walker. J. Chem. Ecol. 2004, 30, 109–124. [Google Scholar] [CrossRef]
- Pan, L.; Ren, L.; Chen, F.; Feng, Y.; Luo, Y. Antifeedant activity of ginkgo biloba secondary metabolites against Hyphantria cunea larvae: Mechanisms and Applications. PLoS ONE 2016, 11, e0155682. [Google Scholar] [CrossRef]
- Burghardt, F.; Knüttel, H.; Becker, M.; Fiedler, K. Flavonoid wing pigments increase attractiveness of female common blue (Polyommatus icarus) butterflies to mate-searching males. Naturwissenschaften 2000, 87, 304–307. [Google Scholar] [CrossRef]
- Simmonds, M.S.J. Flavonoid-insect interactions: Recent advances in our knowledge. Phytochemistry 2003, 64, 21–30. [Google Scholar] [CrossRef]
- War, A.R.; Paulraj, M.G.; Ahmad, T.; Buhroo, A.A.; Hussain, B.; Ignacimuthu, S.; Sharma, H.C. Mechanisms of plant defense against insect herbivores. Plant Signal. Behav. 2012, 7, 1306–1320. [Google Scholar] [CrossRef]
- Johnson, R.M.; Mao, W.; Pollock, H.S.; Niu, G.; Schuler, M.A.; Berenbaum, M.R. Ecologically appropriate xenobiotics induce cytochrome P450s in Apis Mellifera. PLoS ONE 2012, 7, e31051. [Google Scholar] [CrossRef]
- Yu, Z.W.; Yang, C.J.; Xie, L.; Yang, F.; Yuan, Y.Y. Physiological and biochemical mechanisms of Aoria nigripes (Coleoptera, Chrysomelidae) adaption to flavonoid-rich plant Nekemias grossedentata. Insects 2025, 16, 399. [Google Scholar] [CrossRef]
- Rupasinghe, S.G.; Wen, Z.; Chiu, T.L.; Schuler, M.A. Helicoverpa zea CYP6B8 and CYP321A1: Different molecular solutions to the problem of metabolizing plant toxins and insecticides. Protein Eng. Des. Sel. 2007, 20, 615–624. [Google Scholar] [CrossRef]
- Zhang, C.; Wong, A.; Zhang, Y.; Ni, X.; Li, X. Common and unique cis-acting elements mediate xanthotoxin and flavone induction of the generalist P450 CYP321A1. Sci. Rep. 2014, 4, 6490. [Google Scholar] [CrossRef]
- Deng, Z.Y.; Zhang, Y.T.; Fang, L.Y.; Zhang, M.; Wang, L.X.; Ni, X.Z.; Li, X.C. Identification of the flavone-inducible counter-defense genes and their cis-elements in Helicoverpa armigera. Toxins 2023, 15, 365. [Google Scholar] [CrossRef]
- Gu, C.Z.; Zeng, B.X.; Wang, M.M.; Zhang, Y.J.; Yan, C.X.; Lin, Y.Z.; Khan, A.; Zeng, R.S.; Song, Y.Y. Study on active components and mechanism of lettuce latex against Spodoptera litura. Chem. Biodivers. 2024, 21, e202400993. [Google Scholar] [CrossRef]
- Li, T.; Yin, Y.S.; Zhang, K.X.; Li, Y.; Kong, X.X.; Liu, D.; Luo, Y.; Zhang, R.L.; Zhang, Z. Ecotoxicity effect of aspirin on the larvae of Musca domestica through retinol metabolism. Ecotoxicol. Environ. Saf. 2024, 270, 115845. [Google Scholar] [CrossRef]




| Element Type | Dentathalia scutellariae | Athalia rosae | Athalia cordata | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Num 1 | LO (bp) 2 | PS 3 | Num | LO (bp) | PS | Num | LO (bp) | PS | |
| Total interspersed repeats | - | 6,819,110 | 4.34% | - | 8,605,934 | 5.00% | - | 6,828,764 | 4.04% |
| SINEs | 250 | 23,817 | 0.02% | 0 | 0 | 0.00% | 49 | 9578 | 0.01% |
| LINEs | 3407 | 267,248 | 0.17% | 0 | 0 | 0.00% | 72 | 63,300 | 0.04% |
| LTR elements | 3940 | 2,188,477 | 1.39% | 5666 | 2,762,375 | 1.61% | 1414 | 1,877,030 | 1.11% |
| DNA transposons | 18,871 | 3,620,076 | 2.31% | 20,299 | 5,843,559 | 3.40% | 12,717 | 4,878,856 | 2.89% |
| Unclassified | 7034 | 719,492 | 0.46% | 0 | 0 | 0.00% | 0 | 0 | 0.00% |
| Small RNA | 329 | 61,958 | 0.04% | 0 | 0 | 0.00% | 49 | 9578 | 0.01% |
| Satellites | 1 | 79 | 0.00% | 0 | 0 | 0.00% | 0 | 0 | 0.00% |
| Simple repeats | 156,653 | 6,140,166 | 3.91% | 129,325 | 5,133,883 | 2.99% | 123,472 | 5,184,289 | 3.07% |
| Low complexity | 33,557 | 1,607,487 | 1.02% | 26,068 | 1,315,192 | 0.76% | 29,831 | 1,445,383 | 0.85% |
| Feature | Dentathalia scutellariae | Athalia rosae [6] |
|---|---|---|
| Protein-coding genes | 14,904 | 11,393 |
| BUSCO (%) (Annotation) | C 1: 98.4 | C: 99.3 |
| Average gene length (bp 2) | 3558.31 | 8560.96 |
| Average number of exons per transcript | 6.11 | 6.78 |
| Average CDS 3 length (bp) | 1663.10 | 1767.84 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Wang, S.; Liu, C.; Mei, Y.; Yang, D.; Pang, H.; Wang, F.; Ye, G.; Fang, Q.; Ye, X.; Yang, Y. De Novo Genome Assembly, Genomic Features, and Comparative Analysis of the Sawfly Dentathalia scutellariae. Biology 2026, 15, 214. https://doi.org/10.3390/biology15030214
Wang S, Liu C, Mei Y, Yang D, Pang H, Wang F, Ye G, Fang Q, Ye X, Yang Y. De Novo Genome Assembly, Genomic Features, and Comparative Analysis of the Sawfly Dentathalia scutellariae. Biology. 2026; 15(3):214. https://doi.org/10.3390/biology15030214
Chicago/Turabian StyleWang, Shasha, Chang Liu, Yang Mei, Deqing Yang, Huiwen Pang, Fang Wang, Gongyin Ye, Qi Fang, Xinhai Ye, and Yi Yang. 2026. "De Novo Genome Assembly, Genomic Features, and Comparative Analysis of the Sawfly Dentathalia scutellariae" Biology 15, no. 3: 214. https://doi.org/10.3390/biology15030214
APA StyleWang, S., Liu, C., Mei, Y., Yang, D., Pang, H., Wang, F., Ye, G., Fang, Q., Ye, X., & Yang, Y. (2026). De Novo Genome Assembly, Genomic Features, and Comparative Analysis of the Sawfly Dentathalia scutellariae. Biology, 15(3), 214. https://doi.org/10.3390/biology15030214

