Complete Chloroplast Genome Characteristics and Phylogenetic Analysis of Brassica juncea L.
Abstract
1. Introduction
2. Results
2.1. Basic Characteristics of the Chloroplast Genome of Brassica juncea L.
2.2. Functional Annotation of Chloroplast Genome in Brassica juncea L.
2.3. Analysis of Codon Preference
2.4. Repetitive Sequence Analysis
2.5. Nucleotide Diversity and Boundary Analysis
2.6. Ka/Ks Analysis
2.7. Phylogenetic Analysis
3. Discussion
4. Materials and Methods
4.1. Test Materials and Sequencing
4.2. Chloroplast Genome Assembly and Functional Annotation
4.3. Analysis of Dispersed Repeats and Simple Sequence Repeats
4.4. Chloroplast Genome Nucleotide Diversity and Boundary Analysis
4.5. Phylogenetic Analysis
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest

| Region | Base | Count | Percentage |
|---|---|---|---|
| Large single-copy region (LSC) | A | 26,695 | 32.05% |
| | C | 14,613 | 17.54% |
| | G | 13,806 | 16.58% |
| | T | 28,179 | 33.83% |
| | GC | 28,419 | 34.12% |
| | All | 83,293 | 100.00% |
| Small single-copy region (SSC) | A | 6309 | 35.49% |
| | C | 2695 | 15.16% |
| | G | 2496 | 14.04% |
| | T | 6275 | 35.30% |
| | GC | 5191 | 29.20% |
| | All | 17,775 | 100.00% |
| Inverted repeat a (IRa) | A | 7575 | 28.90% |
| | C | 5775 | 22.03% |
| | G | 5323 | 20.31% |
| | T | 7538 | 28.76% |
| | GC | 11,098 | 42.34% |
| | All | 26,211 | 100.00% |
| Inverted repeat b (IRb) | A | 7538 | 28.76% |
| | C | 5323 | 20.31% |
| | G | 5775 | 22.03% |
| | T | 7575 | 28.90% |
| | GC | 11,098 | 42.34% |
| | All | 26,211 | 100.00% |
| Whole genome | A | 48,117 | 31.35% |
| | C | 28,406 | 18.51% |
| | G | 27,400 | 17.85% |
| | T | 49,567 | 32.29% |
| | GC | 55,806 | 36.36% |
| | All | 153,490 | 100.00% |
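
The GC rows in the table above follow directly from the per-base counts: GC% = (G + C) / (A + C + G + T) × 100. The following is a minimal Python sketch that recomputes them from the tabulated counts. It is an illustration only, not the authors' pipeline; the helper names `gc_percent`, `combine`, and `complement_counts` are hypothetical. It also checks that the IRb counts are the strand complement of the IRa counts, as expected for an inverted repeat pair.

```python
# Base counts per region, copied from the table above.
REGIONS = {
    "LSC": {"A": 26695, "C": 14613, "G": 13806, "T": 28179},
    "SSC": {"A": 6309, "C": 2695, "G": 2496, "T": 6275},
    "IRa": {"A": 7575, "C": 5775, "G": 5323, "T": 7538},
    "IRb": {"A": 7538, "C": 5323, "G": 5775, "T": 7575},
}

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def gc_percent(counts):
    """Return GC content as a percentage of all bases in `counts`."""
    total = sum(counts.values())
    return 100.0 * (counts["G"] + counts["C"]) / total


def combine(regions):
    """Sum per-base counts over several regions (hypothetical helper)."""
    merged = {base: 0 for base in "ACGT"}
    for counts in regions:
        for base in "ACGT":
            merged[base] += counts[base]
    return merged


def complement_counts(counts):
    """Base counts of the complementary strand: A<->T and C<->G swap."""
    return {COMPLEMENT[base]: n for base, n in counts.items()}


# Whole-genome counts are the sum of the four regions.
genome = combine(REGIONS.values())

if __name__ == "__main__":
    for name, counts in REGIONS.items():
        print(f"{name}: {sum(counts.values())} bp, GC = {gc_percent(counts):.2f}%")
    print(f"Genome: {sum(genome.values())} bp, GC = {gc_percent(genome):.2f}%")
```

Summing the four regions reproduces the whole-genome length of 153,490 bp, and the higher GC content of the IR regions relative to LSC and SSC is visible directly in the recomputed percentages.
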

| Category | Gene Group | Gene Name |
|---|---|---|
| Photosynthesis | Subunits of photosystem I | psaA, psaB, psaC, psaI, psaJ |
| | Subunits of photosystem II | psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ |
| | Subunits of NADH dehydrogenase | ndhA *, ndhB * (2), ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK |
| | Subunits of cytochrome b/f complex | petA, petB *, petD *, petG, petL, petN |
| | Subunits of ATP synthase | atpA, atpB, atpE, atpF *, atpH, atpI |
| | Large subunit of Rubisco | rbcL |
| | Subunits of protochlorophyllide reductase | - |
| Self-replication | Proteins of large ribosomal subunit | rpl14, rpl16 *, rpl2 * (2), rpl20, rpl22, rpl23 (2), rpl32, rpl33, rpl36 |
| | Proteins of small ribosomal subunit | rps11, rps12 ** (2), rps14, rps15, rps16 *, rps18, rps19, rps2, rps3, rps4, rps7 (2), rps8 |
| | Subunits of RNA polymerase | rpoA, rpoB, rpoC1 *, rpoC2 |
| | Ribosomal RNAs | rrn16 (2), rrn23 (2), rrn4.5 (2), rrn5 (2) |
| | Transfer RNAs | trnA-UGC * (2), trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnG-GCC, trnG-UCC *, trnH-GUG, trnI-CAU (2), trnI-GAU * (2), trnK-UUU *, trnL-CAA (2), trnL-UAA *, trnL-UAG, trnM-CAU, trnN-GUU (2), trnP-UGG, trnQ-UUG, trnR-ACG (2), trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGU, trnV-GAC (2), trnV-UAC *, trnW-CCA, trnY-GUA, trnfM-CAU |
| Other genes | Maturase | matK |
| | Protease | clpP ** |
| | Envelope membrane protein | cemA |
| | Acetyl-CoA carboxylase | accD |
| | c-type cytochrome synthesis gene | ccsA |
| | Translation initiation factor | - |
| | Other | - |
| Genes of unknown function | Conserved hypothetical chloroplast ORF | ycf1 (2), ycf15 (2), ycf2 (2), ycf3 **, ycf4 |

| Amino acid | Codon | Count | RSCU | Amino acid | Codon | Count | RSCU | Amino acid | Codon | Count | RSCU |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Ter * | UAA | 48 | 1.8228 | Ile | AUA | 624 | 0.9474 | Pro | CCC | 161 | 0.7024 |
| Ter * | UAG | 19 | 0.7215 | Ile | AUC | 350 | 0.5313 | Pro | CCG | 124 | 0.5408 |
| Ter * | UGA | 12 | 0.4557 | Ile | AUU | 1002 | 1.5213 | Pro | CCU | 366 | 1.5964 |
| Ala | GCA | 348 | 1.1208 | Lys | AAA | 995 | 1.5644 | Gln | CAA | 641 | 1.571 |
| Ala | GCC | 177 | 0.57 | Lys | AAG | 277 | 0.4356 | Gln | CAG | 175 | 0.429 |
| Ala | GCG | 131 | 0.422 | Leu | CUA | 323 | 0.8022 | Arg | AGA | 384 | 1.755 |
| Ala | GCU | 586 | 1.8872 | Leu | CUC | 149 | 0.3702 | Arg | AGG | 126 | 0.576 |
| Cys | UGC | 62 | 0.461 | Leu | CUG | 138 | 0.3426 | Arg | CGA | 296 | 1.3524 |
| Cys | UGU | 207 | 1.539 | Leu | CUU | 497 | 1.2342 | Arg | CGC | 96 | 0.4386 |
| Asp | GAC | 169 | 0.3898 | Leu | UUA | 871 | 2.163 | Arg | CGG | 102 | 0.4662 |
| Asp | GAU | 698 | 1.6102 | Leu | UUG | 438 | 1.0878 | Arg | CGU | 309 | 1.4118 |
| Glu | GAA | 928 | 1.5492 | Met | AUA | 0 | 0 | Ser | AGC | 100 | 0.354 |
| Glu | GAG | 270 | 0.4508 | Met | AUC | 0 | 0 | Ser | AGU | 355 | 1.2576 |
| Phe | UUC | 401 | 0.5876 | Met | AUG | 512 | 6.9867 | Ser | UCA | 339 | 1.2006 |
| Phe | UUU | 964 | 1.4124 | Met | AUU | 0 | 0 | Ser | UCC | 235 | 0.8322 |
| Gly | GGA | 624 | 1.6104 | Met | CUG | 0 | 0 | Ser | UCG | 156 | 0.5526 |
| Gly | GGC | 153 | 0.3948 | Met | GUG | 1 | 0.0133 | Ser | UCU | 509 | 1.803 |
| Gly | GGG | 254 | 0.6556 | Met | UUG | 0 | 0 | Thr | ACA | 366 | 1.2344 |
| Gly | GGU | 519 | 1.3392 | Asn | AAC | 244 | 0.444 | Thr | ACC | 211 | 0.7116 |
| His | CAC | 125 | 0.4922 | Asn | AAU | 855 | 1.556 | Thr | ACG | 121 | 0.408 |
| His | CAU | 383 | 1.5078 | Pro | CCA | 266 | 1.1604 | Thr | ACU | 488 | 1.646 |
| Val | GUA | 455 | 1.456 | | | | | | | | |
| Val | GUC | 152 | 0.4864 | | | | | | | | |
| Val | GUG | 168 | 0.5376 | | | | | | | | |
| Val | GUU | 475 | 1.52 | | | | | | | | |
| Trp | UGG | 396 | 1 | | | | | | | | |
| Tyr | UAC | 155 | 0.3694 | | | | | | | | |
| Tyr | UAU | 684 | 1.6306 | | | | | | | | |
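
Relative synonymous codon usage (RSCU) is the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally: RSCU = n × x / Σx, where x is the codon's count, Σx is the total over the amino acid's synonymous codons, and n is the number of those codons. The following is a minimal Python sketch of that standard definition applied to a few rows of the table above. The function name `rscu` is hypothetical and this is not necessarily the software the authors used. The recomputed values match the table to about three decimal places; small differences in the fourth decimal would be consistent with intermediate rounding in the original calculation, which is an assumption here.

```python
def rscu(codon_counts):
    """Compute RSCU for one amino acid.

    `codon_counts` maps each synonymous codon to its observed count.
    RSCU = (number of synonymous codons) * count / (total count).
    """
    n = len(codon_counts)
    total = sum(codon_counts.values())
    if total == 0:
        # No observations for this amino acid: report zero usage.
        return {codon: 0.0 for codon in codon_counts}
    return {codon: n * count / total for codon, count in codon_counts.items()}


# Codon counts copied from the table above.
LYS = {"AAA": 995, "AAG": 277}
TER = {"UAA": 48, "UAG": 19, "UGA": 12}
SER = {"AGC": 100, "AGU": 355, "UCA": 339, "UCC": 235, "UCG": 156, "UCU": 509}

if __name__ == "__main__":
    for name, counts in (("Lys", LYS), ("Ter", TER), ("Ser", SER)):
        for codon, value in rscu(counts).items():
            print(f"{name} {codon}: RSCU = {value:.4f}")
```

An RSCU above 1 marks a codon used more often than expected under equal usage. In the rows sampled here the preferred codons (AAA, UAA, UCU) all end in A or U, in line with the low GC content of the genome.
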

| Repeat length (bp) | Forward (F) | Palindromic (P) | Reverse (R) | Complement (C) | Total |
|---|---|---|---|---|---|
| 30 | 3 | 7 | 0 | 1 | 11 |
| 31 | 1 | 0 | 2 | 0 | 3 |
| 32 | 3 | 2 | 0 | 0 | 5 |
| 33 | 0 | 1 | 0 | 0 | 1 |
| 34 | 2 | 2 | 0 | 0 | 4 |
| 35 | 0 | 0 | 1 | 0 | 1 |
| 36 | 0 | 1 | 0 | 0 | 1 |
| 37 | 1 | 1 | 0 | 1 | 3 |
| 40 | 0 | 1 | 0 | 0 | 1 |
| 42 | 1 | 0 | 0 | 0 | 1 |
| 43 | 1 | 0 | 0 | 0 | 1 |
| 44 | 0 | 1 | 0 | 0 | 1 |
| 45 | 0 | 1 | 0 | 0 | 1 |
| 46 | 1 | 0 | 0 | 0 | 1 |
| 58 | 1 | 0 | 0 | 0 | 1 |
| 26,211 | 0 | 1 | 0 | 0 | 1 |
| Total | 14 | 18 | 3 | 2 | 37 |

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Tang, S.; Liu, J.; Zhu, Z.; An, X.; Dong, J.; Luo, X.; Chen, C.; Liu, T.; Zou, L.; Li, S.; et al. Complete Chloroplast Genome Characteristics and Phylogenetic Analysis of Brassica juncea L. Int. J. Mol. Sci. 2026, 27, 2882. https://doi.org/10.3390/ijms27062882
