Next Article in Journal
Dynamics of Bioactive Compounds during Spontaneous Fermentation of Paste Obtained from Capsicum ssp.—Stage towards a Product with Technological Application
Next Article in Special Issue
Enhancing Testing Laboratory Engagement in Plant DNA Barcoding through a Routine Workflow—A Case Study on Chinese Materia Medica (CMM)
Previous Article in Journal
A Comparative Study of Isolated Secondary Metabolites from Lichens and Their Antioxidative Properties
Previous Article in Special Issue
Challenges in Medicinal and Aromatic Plants DNA Barcoding—Lessons from the Lamiaceae
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assessment of ITS2 Region Relevance for Taxa Discrimination and Phylogenetic Inference among Pinaceae

1
Department of Genetics, Institute of Experimental Biology, Faculty of Biology, School of Natural Sciences, Adam Mickiewicz University, Poznań, Uniwersytetu Poznańskiego 6, 61-614 Poznań, Poland
2
Institute of Dendrology, Polish Academy of Sciences, Parkowa 5, 62-035 Kórnik, Poland
*
Author to whom correspondence should be addressed.
Plants 2022, 11(8), 1078; https://doi.org/10.3390/plants11081078
Submission received: 4 March 2022 / Revised: 12 April 2022 / Accepted: 12 April 2022 / Published: 15 April 2022
(This article belongs to the Special Issue DNA Barcoding for Herbal Medicines)

Abstract

:
The internal transcribed spacer 2 (ITS2) is one of the best-known universal DNA barcode regions. This short nuclear region is commonly used not only to discriminate taxa, but also to reconstruct phylogenetic relationships. However, the efficiency of using ITS2 in these applications depends on many factors, including the family under study. Pinaceae represents the largest family of extant gymnosperms, with many species of great ecological, economic, and medical importance. Moreover, many members of this family are representatives of rare, protected, or endangered species. A simple method for the identification of Pinaceae species based on DNA is necessary for their effective protection, authentication of products containing Pinaceae representatives, or phylogenetic inference. In this study, for the first time, we conducted a comprehensive study summarizing the legitimacy of using the ITS2 region for these purposes. A total of 368 sequences representing 71 closely and distantly related taxa of the seven genera and three subfamilies of Pinaceae were characterized for genetic variability and divergence. Intra- and interspecies distances of ITS2 sequences as well as rates of sequence identification and taxa discrimination among Pinaceae at various taxonomic levels, i.e., the species complex, genus, subfamily, and family, were also determined. Our study provides a critical assessment of the suitability of the ITS2 nuclear DNA region for taxa discrimination among Pinaceae. The obtained results clearly show that its usefulness for this purpose is limited.

1. Introduction

The Pinaceae family is the largest extant family of all gymnosperms [1]. Within this family, 225 species have been distinguished, which are grouped in three subfamilies (Pinoideae Pilg., Laricoideae Melchior et Werdermann, Abietoideae Pilg.) and eleven genera, i.e., Abies Mill., Cathaya Chun & Kuang, Cedrus Belon ex Trew, Keteleeria Carriere., Larix Mill., Nothotsuga Hu ex C. N. Page, Picea A. Dietr., Pinus L., Pseudolarix Gordon, Pseudotsuga Carriere, and Tsuga Carriere [2]. Representatives of the Pinaceae are an extremely important component of temperate, boreal, subalpine, and subtropical forests in the northern hemisphere. Moreover, conifers from the Pinaceae family are of great economic importance, especially the genus Pinus, which has been used for many years as a valuable source of wood, resins, essential oils, and seeds [3]. Many studies also show that members of this family represent a valuable source of active substances with medicinal properties [4,5,6,7,8].
The high economic and medical value of members of this family may be a strong temptation to over-exploit particular species, obtain them illegally, or even falsify products containing them. Since many species of the Pinaceae family are rare, endangered, or protected, this can pose a serious threat. Another equally serious risk is the misidentification of certain species. There are many closely related taxa in the Pinaceae family with a very similar needle or cone morphology. Moreover, they have the same ecological niches, similar geographic ranges, and often grow together in sympatric populations. In such populations, gene flow occurs, which leads to the formation of hybrid individuals with a phenotype intermediate to the parent taxa. Thus, the processes of hybridization and introgression additionally hinder the simple and effective assignment of such individuals to particular species. One of the best examples of such a complex system involving all of the above-mentioned factors is the European mountain pine association, Pinus mugo Turra sensu lato. This complex has been the subject of research using various techniques and methods for several decades [9,10,11,12,13,14,15]. However, the main problem in this complex remains the same, i.e., finding easy-to-use specific diagnostic determinants for individual taxa, preferably based on DNA markers.
The unambiguous identification of plant material or distinguishing taxa plays an important role not only in the authentication of Pinaceae-containing products, but also in establishing genetic relationships in this family, characterizing the genetic resources of its taxa, determining their geographical distribution, developing conservation plans for individual taxa, monitoring gene flow, and finally analyzing various evolutionary processes, such as hybridization or introgression.
DNA barcoding is a popular method of species identification using short and standardized gene fragments as molecular markers [16]. This rapid approach has commonly been used in taxonomic research for species-level identification in different domains of the tree of life for several years. The undeniable features of good DNA barcode regions are easy PCR amplification and sequencing, and most importantly, ensuring a high level of species discrimination [17]. The short length of the PCR amplicons facilitates work with very degraded DNA, e.g., from herbarium specimens. However, the effectiveness of identifying the analyzed biological material with DNA barcodes and assigning it to a specific family, genus, or species depends on many factors (including, among others, the analyzed family, DNA region or abundance of the reference base) and is not always satisfactorily high [18,19,20,21]. Therefore, in recent years, many studies have been carried out to circumvent these limitations by selecting appropriate regions of DNA that guarantee a high percentage of satisfactory identifications, both for universal barcoding and for species identification within individual families or genera [22,23,24,25,26]. One of the best known and widely used universal DNA barcode regions is the internal transcribed spacer 2 (ITS2) [27]. This short nuclear region, due to its properties, is also used to reconstruct phylogenetic relationships at the species and genus level [28,29,30,31,32]. So far, ITS2 has been used to analyze taxa from the Pinaceae family rather sporadically, while evaluating the usefulness of this region as barcode DNA for various groups of plants and animals [26,27].To the best of the authors’ knowledge, so far, there is no comprehensive study summarizing the validity of using the ITS2 region for the discrimination and identification of taxa as well as phylogenetic inference in this family carried out on a large number of species and sequences.
Therefore, the main objective of this study is to determine whether ITS2: (1) is a useful DNA marker for the discrimination and identification of both distant and closely related taxa in the Pinaceae family; (2) has a sufficiently high potential for the authentication of products containing Pinaceae representatives; and (3) is valuable nuclear region for inferring phylogenetic relationships of conifers in this family.

2. Results

2.1. Genetic Variation of ITS2 Sequences in Pinaceae

A total of 368 sequences representing 71 Pinaceae conifer taxa were analyzed (Table S1), of which 346 sequences were downloaded from GenBank and the remaining 22 sequences were obtained in this study and deposited in the same genetic database (Table S2). The accession numbers of the newly obtained sequences are shown in Table S3.
The genetic variability of the ITS2 sequence in the Pinaceae family was precisely characterized. All analyses and calculations were performed separately for four different taxonomic levels, i.e., species complex: Pinus mugo complex; genus: Abies, Keteleeria, Tsuga, Larix, Pseudotsuga, Picea, and Pinus; subfamily: Abietoideae, Laricoideae, Pinoideae, and family: Pinaceae.
The obtained results indicate that the analyzed ITS2 sequences in Pinaceae differ significantly in the values of the basic parameters describing genetic variation depending on the considered taxonomic level (Table 1). At the species complex level, represented by the closely related European mountain pine complex, Pinus mugo complex (PMC), no sequence variation was observed. At the genera level, the alignment length varied only slightly. The shortest alignment length was noticed for the genus Pseudotsuga (232 bp), while the longest for the genus Pinus (252 bp). The number of conserved sites (CS) ranged from 70 to 237. The highest percentage of calculated conserved sites (CS) was observed for the genus Pseudotsuga (100%) and the lowest for the Pinus genus (27.78%). The number of variable sites (VS) ranged from 0 (for the genus Pseudotsuga) to 177 (for the Pinus genus), which gives 0% and 70.24% respectively. The genus Pseudotsuga was also characterized by zero parsimony informative sites (PIS) and singleton sites (SS), and consequently also zero overall mean distance (OMD). On the other hand, the highest values of these coefficients were recorded for the genus Pinus (28.97%) and Keteleeria (48.5%), for PIS and SS, respectively. The highest OMD value (0.236) was found for the genus Keteleeria. At the subfamily level, the alignment length varied from 233 bp to 254 bp. The number of conserved sites ranged from 61 to 148. The highest percentage of calculated conserved sites (CS) was observed for the Laricoideae (63.52%) and the lowest for the Pinoideae subfamily (24.21%). The number of variable sites varied substantially from 85 to 187 within the subfamilies. The lowest proportion of variable sites (VS) was observed for the Laricoideae (36.48%) and the highest for the Pinoideae (74.21%) followed by Abietoideae (73.23%). The number of parsimony informative sites (PIS) ranged from 48 (Laricoideae) to 107 (Pinoideae), which is 20.60% and 42.4%, respectively. The values of the number of singleton sites (SS) and overall mean distance (OMD) were also the lowest for the members of the Laricoideae subfamily and amounted to 15.88% and 0.023, respectively. The highest percentage of calculated singleton sites was observed for the Abietoideae subfamily (38.19%), and the highest overall mean distance (OMD) was noticed for the Pinoideae (0.149). At the family level, the estimated alignment length was equal to 270 bp. The proportion of calculated conserved sites (CS), variable sites (VS), parsimony informative sites (PIS), singleton sites (SS) was: 17 (6.30%), 250 (92.59%), 196 (72.5%), and 49 (18.15%), respectively.
The overall mean distance (OMD) for the Pinaceae is 0.342. In general, the highest values of the basic parameters characterizing the genetic variation of the ITS2 sequence in Pinaceae were found at the family level, and the lowest at the species complex level.

2.2. Genetic Divergence within and between Pinaceae Taxa

MEGA version X was used to calculate the genetic divergence of ITS2 sequences in the Pinaceae family. Table 2 summarizes in detail the values of five genetic divergence parameters, i.e., all interspecific distance, minimum interspecific distances, theta, all intraspecific distance, and coalescence depth obtained in this study at the species complex, genus, subfamily, and family levels.
At the P. mugo complex level, all genetic divergence indices were zero. At the genera level, the highest all interspecific and all intraspecific variation was observed in the genus Keteleeria (0.1998 ± 0.0973 and 0.1972 ± 0.1919, respectively). This genus was also characterized by the highest value of theta (0.1995 ± 0.0250) and coalescent depth (0.2958 ± 0.2874). The lowest values of the above-mentioned coefficients were observed for the genus Picea. The genus Pseudotsuga was the only one represented by only one taxon. Therefore, the value of all five coefficients for this type was zero. At the subfamily level, the lowest all interspecific and all intraspecific distances were revealed for Laricoideae (0.0436 ± 0.0083 and 0.0063 ± 0.0011). In turn, the highest all interspecific distance was found in Pinoideae (0.1702 ± 0.0059), and the highest all intraspecific distance in Abietoideae (0.0370 ± 0.0291). For all analyzed subfamilies, the minimum interspecific distance was equal to 0.0000. The theta value ranged from 0.0222 for Laricoideae to 0.1492 for Pinoideae. The highest value of coalescent depth was observed for Abietoideae (0.0562) while the lowest (0.0249) for Laricoideae. At the family level, all interspecific and all intraspecific distances parameters were 0.3586 and 0.0294, respectively. The theta value was equal 0.3417 and coalescent depth value was 0.0423.
The pairwise genetic distance-based method was used to determine the genetic divergence of the analyzed ITS2 sequences. Figure 1 shows the relative distribution of all intraspecific and all interspecific distances obtained for the analyzed samples from the Pinaceae family. The value of intraspecific distance in the range from 0.0% to 1.0% constituted 46.04% of the total observation, while the values of interspecific distance in the range of 0.0 to 1.0 and 1.0 to 2.0 were respectively 14.74% and 14.39%. Overall, the vast majority of intraspecific distance values (95.06%) are in the range from 0% to 8%, with 83.93% in the range from 0.0% to 2.0%. The interspecific value of the distance in the range from 0% to 14% constitutes 90.49% of the calculations, and the range of distance from 0% to 6% constitutes 67.75% (Table S4).

2.3. Rates of Sequences Identification and Taxa Discrimination among Pinaceae

BLAST1 and TAXONDNA/Species Identifier 1.8 were used to determine the percentage of correct, ambiguous, and incorrect ITS2 sequence identification and taxa discrimination among Pinaceae.
Using the BLAST1 method, the correct ITS2 sequence identification rate was the highest for the Tsuga genus (100%) and the lowest for Pseudotsuga (0%). More than 40% of ITS2 sequences were correctly identified in the genus Abies and just over 30% in the genus Pinus. In contrast, in the genus Picea, only 8.33% of the ITS2 sequences were correctly identified. Ambiguous ITS2 sequences identification was highest for Pseudotsuga (100%) and lowest (0%) for Tsuga. Generally, for six out of seven analyzed Pinaceae genera, the value of ambiguous identification rate was more than 50%. Incorrect identification rates were highest for Picea (25%), followed by Pinus (13.51%) and Abies (4.76%). Overall success in correct species discrimination based on the BLAST1 method and ITS2 sequences was moderate (32.88%). Figure 2 and Table S5 summarize the results obtained with BLAST1 method.
TAXONDNA/Species Identifier 1.8 program, using “Best Match” (BM) and “Best Close Match” (BCM) options, was used to calculate the percentage of correct ITS2 sequence identification at the species complex, genus, subfamily, and family level for taxa represented in the analysis by at least two sequences. The summary of the success of taxa identification for analyzed genera among the Pinaceae family using the TaxonDNA method is shown in Table 3. At the species complex level, represented by the closely related European mountain pine complex, Pinus mugo complex (PMC), 100% of the analyzed ITS2 sequences were ambiguously identified using the BM and BCM options. None of the ITS2 sequence in this species complex were identified as correct (0%) or incorrect (0%). At the genera level, Pseudotsuga and Tsuga were the most successfully discriminated (100%), while Keteleeria had the lowest discriminatory success. At the subfamily level, the highest success of identification was noticed for Laricoideae (23.48%), and the lowest success for the subfamily Abietoideae (17.80%). At the family level, the success rate of species discrimination among the Pinaceae was relatively low. Only 71 of 368 sequences were correctly identified to species level (19.2%) based on best match (BM) analysis. It is worth emphasizing that the ambiguous identification was over three times higher than the correct identification (74.1%). Incorrect identification concerned 7.5% of the sequences. Similar results were obtained based on the best close match (BCM) analysis.

2.4. Phylogenetic Inference

Phylogenetic inference was performed in the RaxML v8.2.11 [33] using 371 ITS2 sequences and the maximum likelihood (ML) method. The resulting phylogenetic tree sorted all analyzed Pinaceae taxa into the appropriate genera according to the commonly accepted taxonomy, most with high bootstrap support. Although the ITS2 sequences allowed the assignment of taxa to the different Pinaceae genera, their distinction within these genera was not so obvious. These observations are fully consistent with the results obtained by us with BLAST1 and Taxon DNA in terms of the correct sequence identification and discrimination of Pinaceae taxa, where the species level also turned out to be the most critical.
The overall topology of the obtained phylogenetic ML tree based on ITS2 sequences is consistent with the commonly accepted division of Pinaceae and strongly supports the concept of monophyly of particular Pinaceae genera (Figure S1).

3. Discussion

The main aim of the article was to assess the suitability of the ITS2 region for the discrimination and identification of taxa in Pinaceae and phylogenetic inference in this large family of conifers. Detailed analysis of the genetic variation of the ITS2 sequence was performed for the first time in one comprehensive study. Particular attention has been paid to precisely characterize the level of genetic variation in the ITS2 sequence at four different taxonomic levels and to determine the success of sequence and taxa discrimination using both phylogenetically distant and closely related conifers.
In contrast to some reports in the literature [26,27] our observations clearly showed that although ITS2 is easy to analyze, it has severe limitations (mainly low level of genetic variability) and is not the best choice for identification, authentication, or phylogenetic inference in Pinaceae. The finding was quite unexpected as there are quite a few ITS2 sequences in the databases. Moreover, many studies show that ITS2 is a very good DNA barcode region [22,23,25,34,35]. According to China Plant BOL Group [36], the nuclear ribosomal DNA region, ITS, is characterized by a much better ability to discriminate species compared to plastid regions. Internal transcribed spacer (ITS1, 5.8S, ITS2) is characterized by a higher rate of evolution than the coding regions, is inherited biparentally, and shows a high level of divergence at the species level, which allows it to be used to identify even closely related species [37]. This is one of the reasons why the assessment of the ability of ITS2 to discriminate between closely related taxa in the Pinaceae family was one of the goals of our work.
In seed plants, the length of ITS varies usually from 500 to 700 bp [38], while in gymnosperms it is much longer, especially in Pinaceae (1500–3700 bp) [37,39], except that half the length of ITS1 consists of subrepeats [40,41,42,43], which results in some difficulty in PCR amplification and typical Sanger sequencing reads. Hence, the availability of complete ITS sequences in genetic databases for different species is still very limited. Therefore, China Plant BOL Group has suggested using only the ITS2 sequence as an alternative to the complete ITS region, due to the corresponding variability of the primary and secondary structure [23,27,44].
There are many reports in the literature on the assessment of the identification potential of various genomic regions postulated for distinguishing and analyzing different groups of organisms that would be ideal DNA barcodes. For animals, the cytochrome c oxidase 1 subunit 1 has been proposed as the main bar code [16]. In the case of plant identification, this region will not work well due to the insufficient rate of nucleotide substitution in the plant’s mitochondrial genome [45]. To solve this problem, several promising highly variable plastid DNA regions have been proposed, including both coding and non-coding loci, which can be used singly or together in various combinations. In this way, among others, rbcL [45,46,47], matK [48,49,50,51,52], combination rbcL + matK [53], intergenic spacer region- trnH-psbA [46,49,50,54,55,56,57], rpoB [45,49,51], rpoC1 [49,51], atpF-atpH [45], ndhJ, ycf5, or accD [51], were selected as valuable. Recently, the ycf1 region was indicated as very promising due to its very high variability, and even recommended as the main barcode in terrestrial plants [58]. Although the usefulness of ycf1 as a species-diagnostic marker has been demonstrated in the case of the Pinaceae family [59], it should be used in the analyses with caution as it was confirmed that this region is under positive selection in all Pinus plastomes [60], as well as frequent hybridizations within the genus Pinus, and finally the complex model of inheritance of the chloroplast genome [61]. Some difficulty in the widespread use of the complete ycf1 region may also arise from its considerable length, which makes it difficult to apply conventional Sanger sequencing with readings of 600–800 bp.
In this respect, nuclear markers, especially short ITS2, seem to be a better solution than chloroplast regions, especially given that the ITS2 region has so far been used quite successfully for the discrimination and identification of taxa in many angiosperm families. ITS2 turned out to be the best marker differentiating species from the Araliaceae family [24]. Several universal and popular DNA barcode regions (matK, rbcL, ITS2, psbA-trnH, and ycf5) were assessed by Liu [24] for their ability to identify 1113 sequences derived from 276 species from 23 genera. ITS2 correctly identified 85.23% and 97.29% of the sequences at the species and genus levels, respectively. Additionally, it was suggested that the psbA-trnH region could be an additional candidate DNA barcode for the identification of the Araliaceae family. Similarly, the high efficiency of the ITS2 region in discriminating taxa (>90% and 100% at the species and genus levels, respectively) was demonstrated in the Euphorbiaceae family based on the analysis of 1183 samples representing 871 species and 66 genera [34]. Similar results were obtained in the study of the Rutaceae family [62], where ITS2 was shown to be superior to the other six barcodes (psbA-trnH, matK, ycf5, rpoC1, rbcL, ITS). It was characterized by the highest interspecific divergence with regard to intraspecific divergence. Moreover, it has also been proven to be effective in distinguishing between closely related species. However, in our research, the effectiveness of the ITS2 sequence is zero at the P. mugo complex level. Our research clearly shows that the higher the taxonomic level, the higher the percentage of success in discriminating Pinaceae taxa. However, it seems that this percentage is decreasing not only with decreasing phylogenetic distance of analyzed samples, but also with increasing number of available sequences and individuals in the database.
ITS2 was also used with greater or lesser success in the analysis of the Apocynaceae family [22] or Asteraceae [35]. In the latter family, ITS2 was considered as a suitable, but not ideal, barcode for identifying species of high medical importance, belonging to the largest family of flowering plants. Compared to other barcodes, ITS2 was characterized by the greatest universality, specific divergence, and discrimination, which makes it a promising marker for Asteraceae authentication. In the Fabaceae family, ITS2 has been shown to be effective in identifying medicinal plants. It is worth noting that ITS2 also turned out to be an appropriate phylogenetic marker, but it did not solve all taxonomic problems [23]. This suggests that effective DNA barcoding using ITS2 varies from family to family and can be unreliable in complex taxonomic groups. In the Brassicaceae and Roasaceae families, ITS2 proved to be an imperfect barcode due to its low resolution [63,64].
As generally accepted, the ideal DNA barcode region should have a higher interspecies than intraspecies variation. In the case of the current study based on the pairwise genetic distance-based method (PWG-distance) for 368 ITS2 sequences from the Pinaceae family, the estimated inter-specific divergence parameter is higher than the intra-specific distances for all analyzed groups except for the Pinus genus. The DNA sequence similarity-based method, on the other hand, showed that in the case of the Pinaceae family, the success of taxa discrimination using ITS2 was relatively low. This shows that internal transcribed spacer 2 has some problems in identifying both subspecies and varieties, so closely related species are not properly distinguished.
Our results are fully consistent with those obtained for other gymnosperm families, including Podocarpaceae, the second largest family among gymnosperms, in which ITS2 was not characterized by too high an index of taxa discrimination [20]. Moreover, Yao`s [26] studies on several families of gymnosperms showed that ITS 2 had the least discriminatory success at the species level in comparison to other plant groups (mosses, ferns, monocotyledons, dicotyledons) as well as animals.

4. Materials and Methods

4.1. Sampling and Plant Materials

In our study, a total of 371 ITS2 sequences were analyzed, of which 368 sequences belonged to 71 Pinaceae taxa of 7 genera, and three Podocarpus sp. Sequences were outgroups only for phylogenetic analyzes (Tables S1 and S2). Of the 368 ITS2 Pinaceae sequences, 346 samples were downloaded from GenBank, and 22 sequences were obtained in this study by analyzing 20 individuals belonging to the Pinus mugo complex and 2 individuals of Scots pine (Pinus sylvestris) as the closest related taxa to the Pinus mugo complex. The analyzed specimens were collected in Poland, the Czech Republic and Germany (Table S3). The suitability of the ITS2 region to discriminate against taxa of the Pinaceae family was assessed by analyzing both phylogenetically distant and closely related taxa.

4.2. DNA Extraction and Next-Generation Sequencing

The 100 mg of fresh plant tissue was used to extract genomic DNA using the Genomic Mini AX Plant Spin kit (A&A Biotechnology, Gdańsk, Poland). The quality and concentration of the extracted DNA were verified using gel electrophoresis and a NanoDrop spectrophotometer (Thermo Fisher Scientific, Carlsbad, CA, United States). The genomic libraries were constructed with protocol: Ion Xpress™ Plus gDNA Fragment Library Preparation, using Ion Xpress Plus Fragment Library Kit (Pub. No. MAN0009847) (ThermoFisher Scientific, Waltham, MA, USA). Next-generation sequencing was performed on the GeneStudio™ S5 System (Thermo Fisher Scientific, Waltham, MA, USA) according to the manufacturer’s instructions using the protocol: Ion 540 ™ Kit—OT2 User Guide (Cat. No. A27753, Publication No. MAN0010850, Revision D). The reads generated from next-generation sequencing were assembled and annotated using the Geneious Prime 2020.2.5 package. Reads were mapped to MT735327 sequence from genbank using Genious mapper with default settings and minimum mapping quality 30%. Sequences that were mapped were subsequently assembled de novo using Geneious algorithm on default settings and annotated based on MT735327 sequence. The obtained 22 complete ITS2 sequences were submitted to GenBank. Each sequence was assigned an accession number (Table S2).

4.3. Data Validation

The sequences of internal transcribed spacer 2 from GenBank were downloaded using query “internal transcribed spacer 2 Pinaceae’’ (in July 2021). The nuclear ribosomal ITS2 sequences with ambiguous bases were discarded from further analyzes. In order to extract ITS2 sequence, the ITS2 database (Internal Transcribed Spacer 2 Ribosomal DNA Database) (version 3.0.13) available online (http://its2.bioapps.biozentrum.uni-wuerzburg.de (accessed on 20 August 2021) was used [65,66,67]. Single sequence species were excluded from the analysis.

4.4. Data Analysis

In order to carry out a deeper and more precise genetic characterization, i four taxonomic levels were considered, namely species complex, genus, subfamily, and family. ITS2 sequences were aligned using the ClustalW with default parameters available in MEGA version X [68]. Then, the length of the alignment was estimated and the percentage of conserved sites (CS), variable sites (VS), parsimony-informative sites (PIS), singleton sites (SS), and overall mean distance (OMD) were calculated. To assess the suitability of the ITS2 sequence as a potential barcode at the level of genus, subfamily, and family of Pinaceae, selected methods were used, namely the pairwise genetic distance-based method (PWG-distance), the DNA sequence similarity-based method (TaxonDNA, BLAST), and phylogenetic tree method (maximum likelihood).
The pairwise genetic distance-based method was used to determine the genetic divergence of the obtained ITS2 sequences. The five parameters were calculated in MEGA version X [68] using the Kimura two-parameter distance model (K2P) [69] with pairwise deletion option to define the interspecific and intraspecific variability. The interspecific divergence has been characterized by all interspecific distance and minimum interspecific distance parameters [27,70,71]. Intraspecific variability was calculated using the K2P distance matrix by applying three parameters: all intraspecific distance, theta(θ) and coalescent depth [50,70]. Using the “Pairwise summary” option based on the K2P model available in the TaxonDNA/SpeciesIdentifier 1.8 software, the frequency of the distribution of interspecific distance and intraspecific variability was obtained. Plots were made to illustrate the barcode gap for each genus and for the entire Pinaceae family.
Sequence identification and taxa discrimination rates among Pinaceae were calculated using the two different methods. The first was the method based on DNA sequence similarity-based method implemented in TAXONDNA/Species Identifier 1.8 program. “Best Match” (BM) and “Best Close Match” (BCM) options were used to verify the percentage of correct ITS2 sequence identification at the species complex, genus, subfamily and family level for taxa represented in at least two sequences [72]. The second was BLAST1. In this method, performed with the BLAST program (http://blast.ncbi.nlm.nih.gov/Blast.cgi, accessed on 15 February 2022), all ITS2 Pinaceae sequences were used as query sequences to search the reference database. Correct identification means that the best BLAST hits of the query sequence are from the expected species, while ambiguous identification means that the best BLAST hits for the query sequence turned out to be those of several species, including the expected species. Incorrect identification in turn means that the query sequence’s best BLAST hit is not from the expected species.
Phylogenetic inference was constructed by maximum likelihood (ML) analysis using 371 ITS2 Pinaceae sequences in RaxML v8.2.11 [33], with 1000 rapid bootstrap replicates along with a search for the best-scoring ML tree in every run and parsimony random seed set to 10. Podocarpus longefoliolatus (AY083065), Podocarpus macrophyllus voucher HZ20070103 (EF660588), and Podocarpus neriifolius isolate VNMN000814 (KR674120) were used as an outgroup.

5. Conclusions

Our study provides a critical assessment of nuclear DNA ITS2 region relevance for taxa discrimination among Pinaceae. Based on the results obtained, it can be concluded that although ITS2 fulfills some of the important features of an ideal DNA barcode region, its usefulness for distinguishing taxa among Pinaceae is severely limited. It seems that the correct and successful identification of taxa is reserved only for those that are phylogenetically distant and represent different genera rather than species, and for those for which few sequences are available in genetic databases. The closely related conifers at the species complex level are indistinguishable using ITS2. The application of this region for study relationships in the Pinaceae family does not seem justified due to its low genetic variability, which results low phylogenetic resolution. Nevertheless, further research on ITS2 is needed, especially in terms of extending the available genetic databases with new records, especially for species from the genera Tsuga or Pseudotsuga. A wider database would determine whether the high success in distinguishing taxa in these two genera is due to the low number of deposited samples in these databases or to the high efficiency of ITS2. Another interesting direction for further research could also be to determine the effectiveness of complete internal transcribed spacer 1 (ITS1) in discriminating Pinaceae taxa and studying the phylogeny of this large and important family of conifers.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/plants11081078/s1, Figure S1: Evolutionary relationships analyzed 371 ITS2 sequences, Table S1: Summary of the Pinaceae samples analyzed in this study. Table S2: List of taxa and records with accession numbers analyzed in this study, Table S3: List of individuals representing Pinus mugo complex taxa sequenced in this study, Table S4: Abundance of all intraspecific and all interspecific K2P pairwise distance in Pinaceae, Table S5: The results of species discrimination in seven genera of the Pinaceae family based on the BLAST1 method.

Author Contributions

J.S. and K.C. conceived of and designed the research framework; J.S. performed the experiments; J.S. and K.C. wrote the original draft manuscript, reviewed and edited the final manuscript; J.S. and H.F. performed data analysis; K.C. participated in the data analysis; J.S. and K.C. collected the samples; K.C. supervised the project. All authors have read and agreed to the published version of the manuscript.

Funding

Research paper financed from the budget for science in 2018–2020, as a research project under the “Diamond Grant” program No. DI2017003147.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data are available within the article and Supplementary Materials.

Acknowledgments

The authors would like to thank Pascal von Sengbusch—Büro für ökologische Gutachten/Moorkundeand Ladislav Rektoris—Agentura ochrany přírody a krajiny České Republiky for their help in collecting the plant material. We also thank four anonymous reviewers for reading our manuscript carefully and for their many insightful comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Liston, A.; Vining, T.F.; Campbell, C.S.; Gernandt, D.S.; Piñero, D. Molecular phylogeny of pinaceae and pinus. Acta Hortic. 2003, 615, 107–114. [Google Scholar] [CrossRef]
  2. Farjon, A. A Natural History of Conifers; Timber Press: Portland, OR, USA, 2008. [Google Scholar]
  3. Le Maitre, D.C. Pines in cultivation: A global view. In Ecology and Biogeography of Pinus; David, M.R., Ed.; Cambridge University Press: Cambridge, UK, 1998; pp. 407–431. [Google Scholar]
  4. Leduc, C.; Coonishish, J.; Haddad, P.; Cuerrier, A. Plants used by the cree nation of eeyou istchee (Quebec, Canada) for the treatment of diabetes: A novel approach in quantitative ethnobotany. J. Ethnopharmacol. 2006, 105, 55–63. [Google Scholar] [CrossRef] [PubMed]
  5. Yasin, M.; Hussain Janbaz, K.; Imran, I.; Gilani, A.U.H.; Bashir, S. Pharmacological Studies on the antispasmodic, bronchodilator and anti-platelet activities of abies webbiana. Phyther. Res. 2014, 28, 1182–1187. [Google Scholar] [CrossRef]
  6. Kizilarslan, Ç.; Sevg, E. Ethnobotanical uses of genus pinus, L. (Pinaceae) in Turkey. Indian J. Tradit. Knowl. 2013, 12, 209–220. [Google Scholar]
  7. Hussain, K.; Nisar, M.F.; Majeed, A.; Nawaz, K.; Bhatti, K.H. Ethnomedicinal survey for important plants of Jalalpur Jattan, District Gujrat, Punjab, Pakistan. Ethnobot. Leafl. 2010, 14, 807–825. [Google Scholar]
  8. Kaushik, P.; Lal, S.; Rana, A.C.; Kaushik, D. GC-MS analysis of bioactive constituents of pinus roxburghii sarg. (Pinaceae) from Northern India. Res. J. Phytochem. 2014, 8, 42–46. [Google Scholar] [CrossRef]
  9. Celiński, K.; Kijak, H.; Wojnicka-Półtorak, A.; Buczkowska-Chmielewska, K.; Sokołowska, J.; Chudzińska, E. Effectiveness of the DNA barcoding approach for closely related conifers discrimination: A case study of the pinus mugo complex. Comptes Rendus-Biol. 2017, 340, 339–348. [Google Scholar] [CrossRef]
  10. Celiński, K.; Bonikowski, R.; Wojnicka-Półtorak, A.; Chudzińska, E.; Maliński, T. Volatiles as chemosystematic markers for distinguishing closely related species within the pinus mugo complex. Chem. Biodivers. 2015, 12, 1208–1213. [Google Scholar] [CrossRef]
  11. Bonikowski, R.; Celiński, K.; Wojnicka-Półtorak, A.; Maliński, T. Composition of essential oils isolated from the needles of Pinus Uncinata and P. Uliginosa Grown in Poland. Nat. Prod. Commun. 2015, 10, 371–373. [Google Scholar] [CrossRef] [Green Version]
  12. Celiński, K.; Kijak, H.; Barylski, J.; Grabsztunowicz, M.; Wojnicka-Półtorak, A.; Chudzińska, E. Characterization of the complete chloroplast genome of pinus uliginosa (neumann) from the pinus mugo complex. Conserv. Genet. Resour. 2017, 9, 209–212. [Google Scholar] [CrossRef] [Green Version]
  13. Celiński, K.; Chudzińska, E.; Gmur, A.; Piosik, Ł.; Wojnicka-Półtorak, A. Cytological characterization of three closely related pines—Pinus Mugo, P. Uliginosa and P. × Rhaetica from the Pinus Mugo complex (Pinaceae). Biologia 2019, 74, 751–756. [Google Scholar] [CrossRef] [Green Version]
  14. Wachowiak, W.; Celiński, K.; Prus-Głowacki, W. Evidence of natural reciprocal hybridisation between Pinus Uliginosa and P. Sylvestris in the sympatric population of the species. Flora Morphol. Distrib. Funct. Ecol. Plants 2005, 200, 563–568. [Google Scholar] [CrossRef]
  15. Sokołowska, J.; Fuchs, H.; Celiński, K. New insight into taxonomy of european mountain pines, pinus mugo complex, based on complete chloroplast genomes sequencing. Plants 2021, 10, 1331. [Google Scholar] [CrossRef] [PubMed]
  16. Hebert, P.D.N.; Cywinska, A.; Ball, S.L.; DeWaard, J.R. Biological Identifications through DNA Barcodes. Proc. R. Soc. B Biol. Sci. 2003, 270, 313–321. [Google Scholar] [CrossRef] [Green Version]
  17. Hollingsworth, P.M.; Graham, S.W.; Little, D.P. Choosing and using a plant DNA barcode. PLoS ONE 2011, 6, e19254. [Google Scholar] [CrossRef]
  18. Wang, X.C.; Liu, C.; Huang, L.; Bengtsson-Palme, J.; Chen, H.; Zhang, J.H.; Cai, D.; Li, J.Q. ITS1: A DNA barcode better than ITS2 in eukaryotes? Mol. Ecol. Resour. 2015, 15, 573–586. [Google Scholar] [CrossRef]
  19. Ran, J.H.; Wang, P.P.; Zhao, H.J.; Wang, X.Q. A test of seven candidate barcode regions from the Plastome in Picea (Pinaceae). J. Integr. Plant Biol. 2010, 52, 1109–1126. [Google Scholar] [CrossRef]
  20. Little, D.P.; Knopf, P.; Schulz, C. DNA barcode identification of Podocarpaceae—The second largest conifer family. PLoS ONE 2013, 8, e81008. [Google Scholar] [CrossRef] [Green Version]
  21. Maia, V.H.; da Mata, C.S.; Franco, L.O.; Cardoso, M.A.; Cardoso, S.R.S.; Hemerly, A.S.; Ferreira, P.C.G. DNA barcoding bromeliaceae: Achievements and pitfalls. PLoS ONE 2012, 7, e29877. [Google Scholar] [CrossRef] [Green Version]
  22. LV, Y.N.; YANG, C.Y.; SHI, L.C.; ZHANG, Z.L.; XU, A.S.; ZHANG, L.X.; LI, X.L.; LI, H.T. Identification of Medicinal plants within the apocynaceae family using ITS2 and PsbA-TrnH barcodes. Chin. J. Nat. Med. 2020, 18, 594–605. [Google Scholar] [CrossRef]
  23. Gao, T.; Yao, H.; Song, J.; Liu, C.; Zhu, Y.; Ma, X.; Pang, X.; Xu, H.; Chen, S. Identification of Medicinal plants in the family fabaceae using a potential DNA barcode ITS2. J. Ethnopharmacol. 2010, 130, 116–121. [Google Scholar] [CrossRef] [PubMed]
  24. Liu, Z.; Zeng, X.; Yang, D.; Chu, G.; Yuan, Z.; Chen, S. Applying DNA barcodes for identification of plant species in the family araliaceae. Gene 2012, 499, 76–80. [Google Scholar] [CrossRef] [PubMed]
  25. Feng, S.; Jiang, M.; Shi, Y.; Jiao, K.; Shen, C.; Lu, J.; Ying, Q.; Wang, H. Application of the ribosomal DNA ITS2 region of physalis (Solanaceae): DNA barcoding and phylogenetic study. Front. Plant Sci. 2016, 7, 1047. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Yao, H.; Song, J.; Liu, C.; Luo, K.; Han, J.; Li, Y.; Pang, X.; Xu, H.; Zhu, Y.; Xiao, P.; et al. Use of ITS2 region as the universal DNA barcode for plants and animals. PLoS ONE 2010, 5, e13102. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Chen, S.; Yao, H.; Han, J.; Liu, C.; Song, J.; Shi, L.; Zhu, Y.; Ma, X.; Gao, T.; Pang, X.; et al. Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species. PLoS ONE 2010, 5, e8613. [Google Scholar] [CrossRef]
  28. Coleman, A.W. ITS2 is a double-edged tool for eukaryote evolutionary comparisons. Trends Genet. 2003, 19, 370–375. [Google Scholar] [CrossRef]
  29. Schultz, J.; Maisel, S.; Gerlach, D.; Müller, T.; Wolf, M. A common core of secondary structure of the internal transcribed spacer 2 (ITS2) throughout the Eukaryota. RNA 2005, 11, 361–364. [Google Scholar] [CrossRef] [Green Version]
  30. Miao, M.; Warren, A.; Song, W.; Wang, S.; Shang, H.; Chen, Z. Analysis of the internal transcribed spacer 2 (ITS2) region of scuticociliates and related taxa (ciliophora, oligohymenophorea) to infer their evolution and phylogeny. Protist 2008, 159, 519–533. [Google Scholar] [CrossRef]
  31. Coleman, A.W. Pan-eukaryote ITS2 homologies revealed by RNA secondary structure. Nucleic Acids Res. 2007, 35, 3322–3329. [Google Scholar] [CrossRef] [Green Version]
  32. Schultz, J.; Wolf, M. ITS2 sequence-structure analysis in phylogenetics: A how-to manual for molecular systematics. Mol. Phylogenetics Evol. 2009, 52, 520–523. [Google Scholar] [CrossRef]
  33. Stamatakis, A. RAxML Version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30, 1312–1313. [Google Scholar] [CrossRef] [PubMed]
  34. Pang, X.; Song, J.; Zhu, Y.; Xie, C.; Chen, S. Using DNA barcoding to identify species within euphorbiaceae. Planta Med. 2010, 76, 1784–1786. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Gao, T.; Yao, H.; Song, J.; Zhu, Y.; Liu, C.; Chen, S. Evaluating the feasibility of using candidate DNA barcodes in discriminating species of the large Asteraceae family. BMC Evol. Biol. 2010, 10, 324. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. China Plant BOL Group; Li, D.Z.; Gao, L.M.; Li, H.T.; Wang, H.; Ge, X.J.; Liu, J.Q.; Chen, Z.D.; Zhou, S.L.; Chen, S.L.; et al. Comparative analysis of a large dataset indicates that Internal Transcribed Spacer (ITS) Should be incorporated into the core barcode for seed plants. Proc. Natl. Acad. Sci. USA 2011, 108, 19641–19646. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Álvarez, I.; Wendel, J.F. Ribosomal ITS sequences and plant phylogenetic inference. Mol. Phylogenet. Evol. 2003, 29, 417–434. [Google Scholar] [CrossRef] [Green Version]
  38. Baldwin, B.G.; Sanderson, M.J.; Porter, J.M.; Wojciechowski, M.F.; Campbell, C.S.; Donoghue, M.J. The ITS Region of nuclear ribosomal DNA: A valuable source of evidence on angiosperm phylogeny. Ann. Missouri Bot. Gard. 1995, 85, 247–277. [Google Scholar] [CrossRef]
  39. Liston, A.; Robinson, W.A.; Oliphant, J.M.; Alvarez-Buylla, E.R. Length variation in the nuclear ribosomal DNA internal transcribed spacer region of non-flowering seed plants. Syst. Bot. 1996, 21, 109–120. [Google Scholar] [CrossRef]
  40. Marrocco, R.; Gelati, M.T.; Maggini, F. Nucleotide sequence of the internal transcribed spacers and 5.8 s region of ribosomal DNA in Phus pinea L. Mitochondrial DNA 1996, 6, 175–177. [Google Scholar] [CrossRef]
  41. Vining, T.F.; Campbell, C.S. Phylogenetic signal in sequence repeats within nuclear ribosomal DNA internal transcribed spacer 1 in Tsuga. Am. J. Bot. 1997, 84, 702. [Google Scholar]
  42. Gernandt, D.S.; Liston, A. Internal transcribed spacer region evolution in Larix and Pseudotsuga (Pinaceae). Am. J. Bot. 1998, 86, 711–723. [Google Scholar] [CrossRef] [Green Version]
  43. Maggini, F.; Frediani, M.; Gelati, M.T. Nucleotide sequence of the internal transcribed spacers of ribosomal DNA in Picea Abies Karst. Mitochondrial DNA 2000, 11, 87–89. [Google Scholar] [CrossRef]
  44. Han, J.; Zhu, Y.; Chen, X.; Liao, B.; Yao, H.; Song, J.; Chen, S.; Meng, F. The short ITS2 sequence serves as an efficient taxonomic sequence tag in comparison with the full-length ITS. Biomed Res. Int. 2013, 2013, 3–10. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Fazekas, A.J.; Burgess, K.S.; Kesanakurti, P.R.; Graham, S.W.; Newmaster, S.G.; Husband, B.C.; Percy, D.M.; Hajibabaei, M.; Barrett, S.C.H. Multiple multilocus DNA barcodes from the plastid genome discriminate plant species equally well. PLoS ONE 2008, 3, e2802. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Kress, W.J.; Erickson, D.L. A Two-locus global DNA barcode for land plants: The coding rbcL gene complements the non-coding trnH-PsbA spacer region. PLoS ONE 2007, 2, e508. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Newmaster, S.G.; Fazekas, A.J.; Ragupathy, S. DNA barcoding in land plants: Evaluation of RbcL in a multigene tiered approach. Can. J. Bot. 2006, 84, 335–341. [Google Scholar] [CrossRef] [Green Version]
  48. Pennisi, E. Taxonomy. Wanted: A barcode for plants. Science 2007, 318, 190–191. [Google Scholar] [CrossRef]
  49. Chase, M.W.; Cowan, R.S.; Hollingsworth, P.M.; Van Den Berg, C.; Madriñán, S.; Petersen, G.; Seberg, O.; Jørgsensen, T.; Cameron, K.M.; Carine, M.; et al. A Proposal for a standardised protocol to barcode all land plants. Taxon 2007, 56, 295–299. [Google Scholar] [CrossRef]
  50. Lahaye, R.; Van Der Bank, M.; Bogarin, D.; Warner, J.; Pupulin, F.; Gigot, G.; Maurin, O.; Duthoit, S.; Barraclough, T.G.; Savolainen, V. DNA barcoding the floras of biodiversity hotspots. Proc. Natl. Acad. Sci. USA 2008, 105, 2923–2928. [Google Scholar] [CrossRef] [Green Version]
  51. Ford, C.S.; Ayres, K.L.; Toomey, N.; Haider, N.; Stahl, J.V.A.; Kelly, L.J.; Wikström, N.; Hollingsworth, P.M.; Duff, R.J.; Hoot, S.B.; et al. Selection of candidate coding DNA barcoding regions for use on land plants. Bot. J. Linn. Soc. 2009, 159, 1–11. [Google Scholar] [CrossRef] [Green Version]
  52. Starr, J.R.; Naczi, R.F.C.; Chouinard, B.N. Plant DNA barcodes and species resolution in sedges (Carex, Cyperaceae). Mol. Ecol. Resour. 2009, 9 (Suppl. S1), 151–163. [Google Scholar] [CrossRef]
  53. CBOL Plant Working Group. A DNA barcode for land plants. Proc. Natl. Acad. Sci. USA 2009, 106, 12794–12797. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Kress, W.J.; Wurdack, K.J.; Zimmer, E.A.; Weigt, L.A.; Janzen, D.H. Use of DNA barcodes to identify flowering plants. Proc. Natl. Acad. Sci. USA. 2005, 102, 8369–8374. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Newmaster, S.G.; Ragupathy, S. Esting plant barcoding in a sister species complex of pantropical acacia (Mimosoideae, Fabaceae). Mol. Ecol. Resour. 2009, 9, 172–180. [Google Scholar]
  56. Pang, X.; Liu, C.; Shi, L.; Liu, R.; Liang, D.; Li, H.; Cherny, S.S.; Chen, S. Utility of the TrnH-PsbA intergenic spacer region and its combinations as plant DNA barcodes: A meta-analysis. PLoS ONE 2012, 7, e48833. [Google Scholar] [CrossRef]
  57. Bolson, M.; De Camargo Smidt, E.; Brotto, M.L.; Silva-Pereira, V. ITS and TrnH-PsbA as efficient DNA barcodes to identify threatened commercial woody angiosperms from Southern Brazilian atlantic rainforests. PLoS ONE 2015, 10, e0143049. [Google Scholar] [CrossRef]
  58. Dong, W.; Xu, C.; Li, C.; Sun, J.; Zuo, Y.; Shi, S.; Cheng, T.; Guo, J.; Zhou, S. Ycf1, the most promising plastid DNA barcode of land plants. Sci. Rep. 2015, 5, 8348. [Google Scholar] [CrossRef] [Green Version]
  59. Olsson, S.; Grivet, D.; Cid Vian, J. Species-diagnostic markers in the genus pinus: Evaluation of the chloroplast regions Matk and Ycf1. For. Syst. 2018, 27, e016. [Google Scholar] [CrossRef] [Green Version]
  60. Zeb, U.; Dong, W.L.; Zhang, T.T.; Wang, R.N.; Shahzad, K.; Ma, X.F.; Li, Z.H. Comparative plastid genomics of pinus species: Insights into sequence variations and phylogenetic relationships. J. Syst. Evol. 2020, 58, 118–132. [Google Scholar] [CrossRef]
  61. Kormutak, A.; Galgoci, M.; Bolecek, P.; Gomory, D.; Libantova, J. Reinforced evidence on partial compatibility between pinus sylvestris and pinus mugo and on maternal inheritance of chloroplast DNA in the Pinus Mugo × Pinus Sylvestris cross. Silvae Genet. 2020, 69, 108–115. [Google Scholar] [CrossRef]
  62. Luo, K.; Chen, S.L.; Chen, K.L.; Song, J.Y.; Yao, H.; Ma, X.; Zhu, Y.J.; Pang, X.H.; Yu, H.; Li, X.W.; et al. Assessment of candidate plant DNA barcodes using the rutaceae family. Sci. China Life Sci. 2010, 53, 701–708. [Google Scholar] [CrossRef]
  63. Pang, X.; Song, J.; Zhu, Y.; Xu, H.; Huang, L.; Chen, S. Applying plant DNA barcodes for rosaceae species identification. Cladistics 2011, 27, 165–170. [Google Scholar] [CrossRef] [PubMed]
  64. Sun, X.Q.; Qu, Y.Q.; Yao, H.; Zhang, Y.M.; Yan, Q.Q.; Hang, Y.Y. Applying DNA barcodes for identification of economically important species in Brassicaceae. Genet. Mol. Res. 2015, 14, 15050–15061. [Google Scholar] [CrossRef] [PubMed]
  65. Koetschan, C.; Förster, F.; Keller, A.; Schleicher, T.; Ruderisch, B.; Schwarz, R.; Müller, T.; Wolf, M.; Schultz, J. The ITS2 database III sequences and structures for phylogeny. Nucleic Acids Res. 2009, 38 (Suppl. S1), 275–279. [Google Scholar] [CrossRef] [PubMed]
  66. Selig, C.; Wolf, M.; Müller, T.; Dandekar, T.; Schultz, J. The ITS2 database II: Homology modelling RNA Structure for molecular systematics. Nucleic Acids Res. 2008, 36 (Suppl. S1), 377–380. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Schultz, J.; Müller, T.; Achtziger, M.; Seibel, P.N.; Dandekar, T.; Wolf, M. The internal transcribed spacer 2 database—A web server for (not only) low level phylogenetic analyses. Nucleic Acids Res. 2006, 34, 704–707. [Google Scholar] [CrossRef]
  68. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef]
  69. Kimura, M. A Simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 1980, 16, 111–120. [Google Scholar] [CrossRef]
  70. Meyer, C.P.; Paulay, G. DNA barcoding: Error rates based on comprehensive sampling. PLoS Biol. 2005, 3, e422. [Google Scholar] [CrossRef] [Green Version]
  71. Meier, R.; Zhang, G.; Ali, F. The use of mean instead of smallest interspecific distances exaggerates the size of the “barcoding gap” and leads to misidentification. Syst. Biol. 2008, 57, 809–813. [Google Scholar] [CrossRef]
  72. Meier, R.; Shiyang, K.; Vaidya, G.; Ng, P.K.L. DNA barcoding and taxonomy in Diptera: A tale of high intraspecific variability and low identification success. Syst. Biol. 2006, 55, 715–728. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Abundance of all intraspecific and all interspecific K2P pairwise distance in Pinaceae.
Figure 1. Abundance of all intraspecific and all interspecific K2P pairwise distance in Pinaceae.
Plants 11 01078 g001
Figure 2. Species discrimination rates based on BLAST1 method.
Figure 2. Species discrimination rates based on BLAST1 method.
Plants 11 01078 g002
Table 1. Values of basic parameters characterizing genetic variation in ITS2 sequences in Pinaceae. AL—alignment length; CS—conserved sites; VS-variable sites; PIS—parsimony informative sites; SS—singleton sites; OMD—overall mean distance. Values in brackets are given as percentages.
Table 1. Values of basic parameters characterizing genetic variation in ITS2 sequences in Pinaceae. AL—alignment length; CS—conserved sites; VS-variable sites; PIS—parsimony informative sites; SS—singleton sites; OMD—overall mean distance. Values in brackets are given as percentages.
Taxonomic Group/LevelNameALCSVSPISSSOMD
Species complexP. mugo complex243243 (100.00)0 (0.00)0 (0.00)0 (0.00)0.000
GeneraAbies246169 (68.70)75 (30.49)27 (10.98)48 (19.51)0.016
Keteleeria249118 (47.39)122 (48.99)0 (0.00)121 (48.5)0.236
Tsuga247237 (95.95)10 (4.05)7 (2.83)3 (1.21)0.026
Larix233172 (73.82)61 (26.18)15 (6.44)46 (19.74)0.013
Pseudotsuga232232 (100.00)0 (0.00)0 (0.00)0 (0.00)0.000
Picea236232 (98.31)4 (1.69)3 (1.27)1 (0.42)0.004
Pinus25270 (27.78)177 (70.24)73 (28.97)101 (40.0)0.077
SubfamilyAbietoideae25466 (25.98)186 (73.23)88 (34.65)97 (38.19)0.074
Laricoideae233148 (63.52)85 (36.48)48 (20.60)37 (15.88)0.023
Pinoideae25261 (24.21)187 (74.21)107 (42.4)77 (30.56)0.149
FamilyPinaceae27017 (6.30)250 (92.59)196 (72.5)49 (18.15)0.342
Table 2. Values of genetic divergence indices in ITS2 sequences in Pinaceae.
Table 2. Values of genetic divergence indices in ITS2 sequences in Pinaceae.
Taxonomic Group/LevelNameAll Interspecific
Distance
Minimum Interspecific DistanceAll Intraspecific DistanceThetaCoalescent Depth
Species complexP. mugo complex0.0000 ± 0.00000.0000 ± 0.00000.0000 ± 0.00000.0000 ± 0.00000.0000 ± 0.0000
GeneraAbies0.0175 ± 0.00060.0000 ± 0.00000.0085 ± 0.00120.0176 ± 0.00390.0190 ± 0.0046
Keteleeria0.1998 ± 0.09730.0000 ± 0.00000.1972 ± 0.19190.1995 ± 0.02500.2958 ± 0.2874
Tsuga0.0356 ± 0.01190.0291 ± 0.01090.0061 ± 0.00610.0258 ± 008200.0062±0.0061
Larix0.0137 ± 0.00060.0000 ± 0.00000.0067 ± 0.00100.0130 ± 0.00330.0271 ± 0.0074
Pseudotsuga0.0000 ± 0.00000.0000 ± 0.00000.0000 ± 0.00000.0000 ± 0.00000.0000 ± 0.0000
Picea0.0048 ± 0.00090.0000 ± 0.00000.0017 ± 0.00120.0043 ± 0.00230.0024 ± 0.0018
Pinus0.0392 ± 0.02570.0000 ± 0.00000.0959 ± 0.00500.0768 ± 0.01020.0486 ± 0.0259
SubfamilyAbietoideae0.1089±0.00820.0000 ± 0.00000.0370 ± 0.02910.0665 ± 0.00650.0562 ± 0.0389
Laricoideae0.0436 ± 0.00830.0000 ± 0.00000.0063 ± 0.00110.0222 ± 0.00370.0249 ± 0.0071
Pinoideae0.1702 ± 0.00590.0000 ± 0.00000.0342 ± 0.02290.1492 ± 0.01570.0396 ± 0.0210
FamilyPinaceae0.3586 ± 0.00410.0000 ± 0.00000.0294 ± 0.01410.3417 ± 0.02870.0423 ± 0.0162
Table 3. Percentage of correct, incorrect and ambiguous sequence identifications based on the ‘best match’ and ‘best close match’ with TaxonDNA software.
Table 3. Percentage of correct, incorrect and ambiguous sequence identifications based on the ‘best match’ and ‘best close match’ with TaxonDNA software.
Taxonomic Group/LevelNameBest Match (BM)Best Close Match (BCM)
CorrectIncorrectAmbiguousCorrectIncorrectAmbiguousNo Match
Species complexP. mugo complex0.000.00100.000.000.00100.000.00
GeneraAbies16.198.5775.2416.198.5775.240.00
Keteleeria11.110.0088.8911.100.0088.890.00
Tsuga100.000.000.00100.000.000.000.00
Larix21.433.5775.0021.432.6875.000.89
Pseudotsuga100.000.000.00100.000.000.000.00
Picea20.830.0079.1720.830.0079.170.00
Pinus16.229.0174.7715.327.2172.075.40
SubfamilyAbietoideae17.808.4773.7317.807.6373.730.84
Laricoideae23.483.4873.0423.402.6175.000.87
Pinoideae17.047.4075.5016.305.9373.334.44
FamilyPinaceae19.206.5274.1019.005.4373.382.17
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Sokołowska, J.; Fuchs, H.; Celiński, K. Assessment of ITS2 Region Relevance for Taxa Discrimination and Phylogenetic Inference among Pinaceae. Plants 2022, 11, 1078. https://doi.org/10.3390/plants11081078

AMA Style

Sokołowska J, Fuchs H, Celiński K. Assessment of ITS2 Region Relevance for Taxa Discrimination and Phylogenetic Inference among Pinaceae. Plants. 2022; 11(8):1078. https://doi.org/10.3390/plants11081078

Chicago/Turabian Style

Sokołowska, Joanna, Hanna Fuchs, and Konrad Celiński. 2022. "Assessment of ITS2 Region Relevance for Taxa Discrimination and Phylogenetic Inference among Pinaceae" Plants 11, no. 8: 1078. https://doi.org/10.3390/plants11081078

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop