Complete Chloroplast Genome of Cnidium monnieri (Apiaceae) and Comparisons with Other Tribe Selineae Species

: Cnidium monnieri is an economically important traditional Chinese medicinal plant. In this study, the complete chloroplast (cp) genome of C. monnieri was determined using the Illumina paired-end sequencing, the GetOrganelle de novo assembly strategy, as well as the GeSeq annotation method. Our results showed that the cp genome was 147,371 bp in length with 37.4% GC content and included a large single-copy region (94,361 bp) and a small single-copy region (17,552 bp) separated by a pair of inverted repeat regions (17,729 bp). A total of 129 genes were contained in the cp genome, including 85 protein-coding genes, 36 tRNA genes, and eight rRNA genes. We also investigated codon usage, RNA editing, repeat sequences, simple sequence repeats (SSRs), IR boundaries, and pairwise Ka/Ks ratios. Four hypervariable regions ( trnD-trnY-trnE-trnT , ycf2 , ndhF-rpl32-trnL , and ycf1 ) were identiﬁed as candidate molecular markers for species authentication. The phylogenetic analyses supported non-monophyly of Cnidium and C. monnieri located in tribe Selineae based on the cp genome sequences and internal transcribed spacer (ITS) sequences. The incongruence of the phylogenetic position of C. monnieri between ITS and cpDNA phylogenies suggested that C. monnieri might have experienced complex evolutions with hybrid and incomplete lineage sorting. All in all, the results presented herein will provide plentiful chloroplast genomic resources for studies of the taxonomy, phylogeny, and species authentication of C. monnieri . Our study is also conducive to elucidating the phylogenetic relationships and taxonomic position of Cnidium .


Introduction
Cnidium monnieri (L.) Cuss, an annual herb in Apiaceae with excellent medicinal and economic values, is mainly grown in the fields, roadsides, grasslands, and riverside wetlands of China [1]. The mature and dry fruit of C. monnieri is the traditional Chinese medicine "She Chuang Zi" (Figure 1) and is generally used for killing parasites, antiitch, wind-expelling, and removing dampness [1,2]. Owing to the good medicinal value, previous studies about C. monnieri largely focused on their pharmacological effect, chemical extraction, and herb authentication [3][4][5][6]. In addition, C. monnieri is the type species of the genus Cnidium. Downie et al. [7] found Cnidium is not monophyletic and identified two clades (tribe Selineae and Sinodielsia Clade) within Cnidium based on nuclear ribosomal DNA internal transcribed spacer (ITS), which indicated that the taxonomy and phylogeny of this genus needs to be further studied. The determination of the phylogenetic position of type species can pave the way for resolving the phylogeny and taxonomy of Cnidium. However, there are few studies on the molecular phylogeny of C. monnieri, and these studies only involve this species instead of concentrating on it [7,8]. Therefore, it is particularly necessary to analyze the phylogeny of C. monnieri. Recently, with the increasing use of medicinal plants, species identification is especially important. However, Apiaceae plants The chloroplast is an important organelle for green plants, which participates in the photosynthesis, the biosynthesis of fatty acids, starch, pigments, and amino acids [9]. The chloroplast (cp) genome generally ranges from 115 to 165 kb and has a quadripartite structure comprising a pair of inverted repeat (IR) regions separated by the large single-copy (LSC) and small single-copy (SSC) regions in most angiosperms [10]. The uniqueness of the cp genome, including maternal inheritance, small size, conserved sequences, and simple structure [11], makes it suitable for divergence dating, DNA barcoding, taxonomy, and phylogeny studies [12][13][14][15]. Currently, the high-throughput sequencing technology has been very mature and widely used. For this reason, more and more cp genomes of Apiaceae have been reported and performed in phylogenomic research, such as Angelica The chloroplast is an important organelle for green plants, which participates in the photosynthesis, the biosynthesis of fatty acids, starch, pigments, and amino acids [9]. The chloroplast (cp) genome generally ranges from 115 to 165 kb and has a quadripartite structure comprising a pair of inverted repeat (IR) regions separated by the large single-copy (LSC) and small single-copy (SSC) regions in most angiosperms [10]. The uniqueness of the cp genome, including maternal inheritance, small size, conserved sequences, and simple structure [11], makes it suitable for divergence dating, DNA barcoding, taxonomy, and phylogeny studies [12][13][14][15]. Currently, the high-throughput sequencing technology has been very mature and widely used. For this reason, more and more cp genomes of Apiaceae have been reported and performed in phylogenomic research, such as Angelica [16], Bupleurum [17,18], Notopterygium [19], Ligusticum [20], etc. There has been a lack of studies using the cp genome to resolve the phylogeny of C. monnieri with respect to Apiaceae.
In this study, we obtained the complete cp genome of C. monnieri by de novo assembly. We described the cp genome characteristics, including length, gene content, GC content, repeat sequences, SSRs, codon usage, and RNA editing. We simultaneously compared and analyzed the cp genome with 12 members of the tribe Selineae. These 12 species involved nine genera of published cp genome in the tribe Selineae. We additionally performed a phylogenetic analysis using cp genome sequences and internal transcribed spacer (ITS) sequences to reconstruct the phylogeny and infer the phylogenetic position of C. monnieri. In a word, our study is useful for the studies on phylogeny, taxonomy, and species authentication of C. monnieri, and is also conducive to elucidating the phylogenetic relationships and taxonomic position of Cnidium.

Taxon Sampling and DNA Sequencing
The fresh plant leaves of C. monnieri were collected from Huanren, Liaoning, China (41 • 7 6.77 N, 125 • 17 29.72 E, October 2020). A voucher specimen (RT2020100801) was deposited at the herbarium of Sichuan University (Chengdu, China). Total DNA was extracted using the CTAB method [21]. 1% agarose gel electrophoresis and a Quant-iT PicoGreen dsDNA Assay Kit (Life Technologies, Carlsbad, CA, USA) were used to measure the DNA integrity and concentration. The total DNA sample was sent to Shanghai Personal Biotechnology Co., Ltd. (Shanghai, China) for library construction (400 bp) and Illumina sequencing. The TruSeq DNA Sample Preparation Kits (Illumina, San Diego, CA, USA) were used to construct an Illumina paired-end library. The paired-end (2 × 150 bp) sequencing was conducted on the Illumina NovaSeq platform (Illumina, San Diego, CA, USA), yielding approximately 5 Gb of raw data for C. monnieri.

Chloroplast Genome Assembly and Annotation
Firstly, the obtained raw data were trimmed by removing adaptors and low-quality reads using AdapterRemoval v2 (trimwindows = 5; minlength = 50) [22]. Then, de novo assembly was performed using GetOrganelle v1.7.2 [23] with default settings (-R 15; -k 21,45,65,85,105). Finally, the annotation of the cp genome was performed using the GeSeq [24]. The Geneious v9.0.2 [25] was used to manually correct the positions of start and stop codons, as well as the boundaries of introns and exons by comparing the sequences with the related species to ensure the accuracy of the annotation results. The gene map of C. monnieri was plotted by OGDRAW [26]. The annotated complete cp genome of C. monnieri was deposited in GenBank with accession no. OL839918.

Codon Usage, RNA Editing, Repeat Sequence, and SSR of C. monnieri Cp Genome
The codon usage of all protein-coding genes was analyzed using MEGA6.0 software [27]. RNA editing sites were detected by the PREP suite [28] with a cut-off value of 0.8. Simple sequence repeats (SSRs) were detected using MISA Perl script [29], with the following parameters: 10 repeat units for mononucleotide SSRs, 5 repeat units for dinucleotide repeat SSRs, 4 repeat units for trinucleotide repeat SSRs, and 3 repeat units for tetra-, penta-, and hexanucleotide repeat SSRs. The forward, reverse, palindromic and complementary repeats were detected by the online REPuter program [30] with a minimal size ≥ 30 bp, and a hamming distance = 3.

Evolutionary and Phylogenomic Analyses
We used the KaKs_Calculator2.0 program [37] with the NG model to calculate the rates of non-synonymous substitutions (Ka), synonymous substitutions (Ks), and their ratio (Ka/Ks) using C. monnieri as a reference. The 79 protein-coding sequences were aligned using the ClustalW (Codons) method of the MEGA6.0 software [27], then axt format files were prepared using AXTConvertor.
The 48 complete cp genome sequences were used to construct phylogenetic trees to determine the phylogenetic position of C. monnieri, in which 45 completed cp genome sequences from Apioideae, and three Saniculoideae species were used as outgroup. We selected Maximum likelihood (ML) and Bayesian inference (BI) methods to perform phylogenomic analyses. 78 chloroplast protein-coding genes (PCGs) shared by 48 species were extracted using PhyloSuite v1.2.2 [38]. The shared 78 PCGs were aligned by MAFFT [39] with codon mode, trimmed by trimAl [40] with nogaps option, and were then concatenated by PhyloSuite v1.2.2 [38]. Moreover, corresponding 48 nuclear ITS sequences downloaded from the GenBank were also used to infer the phylogenetic relationships of C. monnieri. We selected the GTRGAMMA model for ML analysis implemented in RAxML v8.2.8 [41] with 1000 bootstrap replicates. MrBayes v3.1.2 [42] was used to perform the BI analysis with the nucleotide substitution model (GTR + I + G and SYM + I + G) selected by Modeltest v3.7 [43]. The Markov chain Monte Carlo (MCMC) algorithm was run for 5,000,000 generations (sampling every 100 generations) with two runs and four chains (three heated chains and one cold chain). The first 25% of trees were discarded as burn-in, and the remaining trees were used to construct the consensus tree.

Characteristics of the C. monnieri Cp Genome
The complete cp genome sequence of C. monnieri was 147,371 bp in length and had a typical quadripartite structure as found in most land plants ( Figure 1). The cp genome included a LSC region of 94,361 bp and a SSC region of 17,552 bp, separated by a pair of IR regions with 17,729 bp ( Table 1). The GC content in the whole genome, LSC, SSC, and IRs was 37.4, 35.9, 30.9, and 44.9%, respectively (Table 1). It was worth mentioning that GC content in IRs was higher than LSC and SSC regions. The GC content of the protein-coding regions (CDS) was 37.9%. Within CDS, the GC content was 45.8% for the first codon position, 38.2% for the second position, and 29.6% for the third position.
The cp genomes of 12 tribe Selineae species were selected to compare with the C. monnieri ( Table 1). The GC content was similar in different species, while the total length and the gene number had differences. M. pimpinelloideum and P. japonicum had the
The cp genomes of 12 tribe Selineae species were selected to compare with the monnieri ( Table 1). The GC content was similar in different species, while the total leng and the gene number had differences. M. pimpinelloideum and P. japonicum had the larg total length and the most gene number. The other species varied slightly in the total leng and the gene number.

Codon Usage, RNA Editing, and Ka/Ks in Protein-Coding Genes
Codon usage frequency of the C. monnieri cp genome was estimated and summariz (Table S3). All the protein-coding genes were encoded by 24,248 codons. In these codo the most frequent amino acid was Leucine (2576, 10.62%) and the least was Cysteine (2 1.06%). The start codon AUG was identified 572 times. All three stop codons were prese with UAA used most frequently (46 times) followed by UAG 22 times and UGA 17 tim The GC content of the third codon positions is significantly lower than the first and seco codon positions.
The PREP-cp program predicted 59 potential RNA editing sites for 22 protein-codi genes of the C. monnieri cp genome (Table S4). Of these 59 editing sites, 15 (25.4%) and (74.6%) were located at the first and the second codon position, but no editing site w found at the third codon position. The genes ndhB (10), rpoB (6), rpoC2 (6), ndhD (4), nd (4), accD (5), and matK (3) had a high number of RNA editing sites. Other genes, name atpA, atpB, ccsA, ndhF, ndhG, petB, petG, psaI, psbE, psbF, rpl20, rpoC2, rpoA, rps2, and rps contained two or one potential RNA editing site. The amino acid conversion S to L o curred most frequently, while A to V and R to C occurred least. 37 out of 59 RNA editi sites changed the encoded amino acid from polar to apolar.

Comparative Analyses
A comparison of the IR boundaries was performed among the 13 tribe Selineae sp cies (Figure 4). JLB extended over the petB in M. pimpinelloideum and P. japonicum, wh JLB extended over ycf2 in the other 11 tribe Selineae species. The JSB was located in t ycf1 or ψycf1. Moreover, the JSB expanded 1-70 bp into ndhF of M. pimpinelloideum, littoralis, and A. dahurica. The JSA was located in the ycf1 and expanded 1,655-2,269

Codon Usage, RNA Editing, and Ka/Ks in Protein-Coding Genes
Codon usage frequency of the C. monnieri cp genome was estimated and summarized (Table S3). All the protein-coding genes were encoded by 24,248 codons. In these codons, the most frequent amino acid was Leucine (2576, 10.62%) and the least was Cysteine (258, 1.06%). The start codon AUG was identified 572 times. All three stop codons were present with UAA used most frequently (46 times) followed by UAG 22 times and UGA 17 times. The GC content of the third codon positions is significantly lower than the first and second codon positions.

Comparative Analyses
A comparison of the IR boundaries was performed among the 13 tribe Selineae species (Figure 4). JLB extended over the petB in M. pimpinelloideum and P. japonicum, while JLB extended over ycf2 in the other 11 tribe Selineae species. The JSB was located in the ycf1 or ψycf1. Moreover, the JSB expanded 1-70 bp into ndhF of M. pimpinelloideum, G. littoralis, and A. dahurica. The JSA was located in the ycf1 and expanded 1655-2269 bp into the ycf1 in the 13 tribe Selineae cp genomes. The trnH were all located in the LSC region and were 1632 bp away from the JLA. The petD were 1029-1472 bp away from JLA in M. pimpinelloideum and P. japonicum, while trnL were 510-1862 bp away from JLA in other 11 tribe Selineae species.  The overall sequence identity of the 13 tribe Selineae cp genomes was plotted using the mVISTA program ( Figure 5). The comparison demonstrated that the two IR regions were less divergent than the LSC and SSC regions. Non-coding regions were more divergent than coding regions, and the most divergent regions were localized in the intergenic spacers. The nucleotide diversity (Pi) was calculated to determine the sequence divergence levels ( Figure 6). The analysis reached the same conclusion that the IR regions were The overall sequence identity of the 13 tribe Selineae cp genomes was plotted using the mVISTA program ( Figure 5). The comparison demonstrated that the two IR regions were less divergent than the LSC and SSC regions. Non-coding regions were more divergent than coding regions, and the most divergent regions were localized in the intergenic spacers. The nucleotide diversity (Pi) was calculated to determine the sequence divergence levels ( Figure 6). The analysis reached the same conclusion that the IR regions were more conserved than the SC regions. Moreover, we detected four highly variable regions (Pi > 0.02), namely trnD-trnY-trnE-trnT, ycf2, ndhF-rpl32-trnL, and ycf1. more conserved than the SC regions. Moreover, we detected four highly variable regions (Pi > 0.02), namely trnD-trnY-trnE-trnT, ycf2, ndhF-rpl32-trnL, and ycf1.

Phylogeny Inference
The chloroplast genome sequences and nuclear ITS sequences were used to perform the phylogenetic analyses (Table S6). The cp tree and ITS tree produced incongruent tree topologies, while they all inferred the non-monophyly of Cnidium (Figures 7 and 8). C. monnieri and C. officinale fell into tribe Selineae and Sinodielsia Clade with 100% support

Phylogeny Inference
The chloroplast genome sequences and nuclear ITS sequences were used to perform the phylogenetic analyses (Table S6). The cp tree and ITS tree produced incongruent tree topologies, while they all inferred the non-monophyly of Cnidium (Figures 7 and 8). C. monnieri and C. officinale fell into tribe Selineae and Sinodielsia Clade with 100% support values, respectively. The cp phylogenetic tree based on 78 shared PCGs with ML and BI methods had an identical topological structure with high support values (Figure 7). C. monnieri was related to Angelica group instead of other tribe Selineae species. The Angelica group also contained two species (G. littoralis and M. pimpinelloideum) in our study [44]. C. officinale was related to L. sinense, and was then clustered with L. chuanxiong and L. jeholense formed a clade. In the ITS tree, the tree topologies resulting from ML and BI analysis were sometimes different (Figure 8). C. monnieri was related to Ses. Montanum with weak support (BS < 50%), then they clustered with other tribe Selineae species except for Angelica group by ML analysis, whereas it was resolved as a sister to other tribe Selineae species except for Angelica group with moderate support by BI analysis (PP = 0.7). As for C. officinale, it was still clustered with Ligusticum species in Sinodielsia Clade with high support. Other clades of Apioideae were generally consistent with prior studies [7,[45][46][47].

Discussion
In the present study, the cp genome of C. monnieri was sequenced, and comparative analyses were conducted with the other 12 tribe Selineae species to gain a complete view of the architecture of the C. monnieri cp genome. The C. monnieri cp genome herein revealed a typical quadripartite structure and an expected size range for angiosperm plants [11,48]. There was no gene loss in the cp genome of C. monnieri, yet gene loss actually often occurs in plant lineages. Many cp genes have been lost in different plants, such as accD, infA, clpP, ccsA, rps16, rpl23, ndh complex, and so on [49][50][51][52]. However, the rpl20 gene was considered the most stable, for it was not lost in the study of 2,511 cp genomes [49]. The high GC content in the IR region of C. monnieri might result from four rRNAs genes with a high GC content [53]. The length of the whole genome, IR, LSC, and SSC regions, as well as the gene number, showed obvious differences among the 13 tribe Selineae cp genomes, which was also observed in a recent study of Apiaceae [45].
Codons play an important role in spreading genetic information because it acts as a bridge to nucleic acids and proteins. The same amino acid has two or more codons, while

Discussion
In the present study, the cp genome of C. monnieri was sequenced, and comparative analyses were conducted with the other 12 tribe Selineae species to gain a complete view of the architecture of the C. monnieri cp genome. The C. monnieri cp genome herein revealed a typical quadripartite structure and an expected size range for angiosperm plants [11,48]. There was no gene loss in the cp genome of C. monnieri, yet gene loss actually often occurs in plant lineages. Many cp genes have been lost in different plants, such as accD, infA, clpP, ccsA, rps16, rpl23, ndh complex, and so on [49][50][51][52]. However, the rpl20 gene was considered the most stable, for it was not lost in the study of 2511 cp genomes [49]. The high GC content in the IR region of C. monnieri might result from four rRNAs genes with a high GC content [53]. The length of the whole genome, IR, LSC, and SSC regions, as well as the gene number, showed obvious differences among the 13 tribe Selineae cp genomes, which was also observed in a recent study of Apiaceae [45].
Codons play an important role in spreading genetic information because it acts as a bridge to nucleic acids and proteins. The same amino acid has two or more codons, while each has its preferred codon owing to codon usage bias (CUB). Several studies have shown that the CUB may be the result of natural selection, mutation, and genetic drift [54,55]. In C. monnieri, the most preferred codons end with A/U, which is consistent with various terrestrial plant cp genomes [50,56,57]. Perhaps high AT content in the cp genomes is the major reason for biased codons ending with A/U [58]. In higher plants, the protein-coding genes of chloroplast always occur in RNA editing and mostly convert cytidine (C) to uridine (U) [59]. RNA editing can correct DNA mutations on the RNA level, and is thereby essential for the growth and development of plants [60,61]. Here, 59 RNA editing sites were identified in 22 protein-coding genes of the cp genome of C. monnieri. Most RNA editing sites changed the encoded amino acid from polar to apolar, thus increasing the protein hydrophobicity. This tendency was also reported in other studies, which implies that the increased hydrophobicity can affect the protein structural features, protein-protein interactions, and transmembrane domains in the chloroplast protein complexes [59,61,62].
As for repeat sequences and SSRs, C. monnieri showed patterns comparable to other Apiaceae in numbers and location [17,20]. A high number of repeats were located in intergenic regions or ycf2. Repeats in the ycf2 are commonly observed in Primula and Cardiocrinum [50,63]. Most mono-and dinucleotides are generally composed of A or T and infrequently contain C or G, which contribute to the high AT content in the cp genome [56,64]. As in other species [58,64], the distribution of SSRs is uneven in the C. monnieri cp genome. These abundant SSRs in cp genomes would be served as genetic markers applied in population genetics and phylogeographical studies of C. monnieri.
The change of the IR/SC boundary is a universal phenomenon in the cp genome evolution [65,66]. In the 13 tribe Selineae species, C. monnieri, A. dahurica, A. gigas, A. laxifoliata, G. littoralis, L. buchtormensis, Lig. likiangense, Lig. thomsonii and S. divaricata showed similar characteristics, and only the length flanking showed a little difference. The organization gene of JSB in P. praeruptorum and Ses. montanum is ψycf1, which is different from other species. M. pimpinelloideum and P. japonicum showed more differences than other species in organization genes of JLB and JLA, namely, petB and petD, as well as the length of IR and LSC (35,759 bp; 76,445-75,584 bp). The IR contraction and expansion is the main reason for the size change of cp genomes [66,67]. We therefore concluded that the IR expansion of M. pimpinelloideum and P. japonicum causes a longer length of the two cp genomes (164,431-164,653 bp). Except for contraction and expansion, the absence of IR region is not uncommon in plant lineages. A recent study showed that about 10.31% of the cp genomes have lost the IR region span across all the lineages [52]. In angiosperms, some species of Erodium, Cassytha, Passiflora, and legumes [68][69][70][71] have been described as IR-lacking lineages.
The non-synonymous (Ka) and synonymous (Ks) nucleotide substitution patterns are very vital markers in gene evolution studies. The ratio of Ka/Ks < 1 indicates purifying selection, while Ka/Ks > 1 indicates positive selection [72]. Non-synonymous nucleotide substitutions have occurred less frequently than synonymous substitutions in most proteincoding genes [73]. We calculated Ka/Ks ratios of the protein-coding genes in C. monnieri cp genome versus 12 other tribe Selineae species. Notably, most of the genes indicated purifying selection based on the Ka/Ks values. However, 11 genes (ccsA, matK, ndhA, psbI, rbcL, rpl22, rpoA, rpoC2, rps3, ycf1, and ycf2) were identified under positive selection. Among these, the Ka/Ks ratio of ndhA, psbI, rpoC2, rps3, rpl22, ccsA, ycf2, and rpoA genes in one or two of the 12 comparison groups, while the Ka/Ks ratio of matK, rbcL, and ycf1 genes in 5, 6, and 9 of the 12 comparison groups, respectively. The matK gene with about 1500 bp in length encodes the only maturase (MatK) in the cp genome of land plants [74]. MatK regulates the expression of several essential genes related to protein biosynthesis [61], thus the matK gene is essential for cell viability. The rbcL gene encodes the large subunit of the ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO) [11]. It is a modulator of photosynthetic electron transport and is essential for photosynthesis [75]. The ycf1 gene is one of the largest genes in the cp genome, encoding a protein of about 1800 amino acids [11]. A study showed that ycf1 gene encodes Tic214, a vital component of the Arabidopsis TIC complex [76]. Positive selection on the three genes has also been observed in Dipsacales [77]. Furthermore, matK, rbcL, and ycf1 genes are also known as fast-evolving genes compared with other chloroplast genes, and they therefore can be used as an excellent molecular marker to be widely applied in phylogenetic inferences [78][79][80].
We investigated the phylogenetic position of C. monnieri using cp genome sequences and ITS sequences, while the cp tree and ITS tree produced incongruent tree topologies. Nevertheless, they all inferred the non-monophyly of Cnidium, which has been demonstrated in other studies [7,8]. C. monnieri fell into tribe Selineae, consistent with previous studies [8,[45][46][47]82]. In the cp tree, C. monnieri was related to Angelica group instead of other tribe Selineae species. In the ITS tree, C. monnieri was related to Ses. montanum then clustered with other tribe Selineae species except for Angelica group by ML analysis, whereas it was resolved as sister to other tribe Selineae species except for Angelica group by BI analysis. The incongruence of the phylogenetic position of C. monnieri between ITS and cp phylogenies is consistent with the previous study based on transcriptomes and chloroplast genomes [45,82], suggesting that C. monnieri may have experienced complex evolutions with hybrid and incomplete lineage sorting [82]. In addition, the incongruence may result from the different mutation rates and inherited background between cpDNA and ITS. The ITS is biparentally inherited with a higher mutation rate, while the cpDNA is maternally inherited with a lower mutation rate [83]. As for C. officinale, it fell into Sinodielsia Clade and formed a clade with L. sinense, L. chuanxiong and L. jeholense. This may indicate that C. officinale should be suggested to be transferred to Ligusticum [7,8]. This study is helpful to deepen the understanding of the phylogenetic position of C. monnieri, and provides new insight into the phylogeny and taxonomy of the genus Cnidium. In a word, our result will be beneficial to future phylogeny, taxonomy, and evolutionary studies of C. monnieri.

Conclusions
In this study, the chloroplast genome of C. monnieri was sequenced, assembled, and compared with other tribe Selineae species. The cp genome was a circular molecule of 147,371 bp with 37.4% GC content. C. monnieri cp genome harbored 129 genes, including 85 protein-coding genes, 36 tRNA genes, and eight rRNA genes. All the protein-coding genes were encoded by 24,248 codons. We detected 79 simple sequence repeats (SSRs), 47 repeat sequences, and 59 RNA editing sites. The IR boundary of C. monnieri did not show significant expansion and contraction relative to the other 12 species. Four hypervariable regions (trnD-trnY-trnE-trnT, ycf2, ndhF-rpl32-trnL, and ycf1) were identified as candidate molecular markers for species authentication. Eleven genes (ccsA, matK, ndhA, psbI, rbcL, rpl22, rpoA, rpoC2, rps3, ycf1, and ycf2) were identified under positive selection (Ka/Ks > 1). The phylogenetic analysis based on cp genome sequences and internal transcribed spacer (ITS) sequences from 48 species supported C. monnieri located in tribe Selineae with high support. The incongruence of the phylogenetic position of C. monnieri between ITS and cpDNA phylogenies suggested that C. monnieri might have experienced complex evolutions with hybrid and incomplete lineage sorting. This study provided precious genetic resources of C. monnieri, which can be used for the species authentication, phylogeny, and taxonomy studies of C. monnieri. This study would also be beneficial in elucidating the taxonomy and reconstructing the phylogeny of the genus Cnidium.