Compensatory Base Changes in ITS2 Secondary Structure Alignment, Modelling, and Molecular Phylogeny: An Integrated Approach to Improve Species Delimitation in Tulasnella (Basidiomycota)

Background: The delimitation of species of Tulasnella has been extensively studied, mainly at the morphological (sexual and asexual states) and molecular levels—showing ambiguity between them. An integrative species concept that includes characteristics such as molecular, ecology, morphology, and other information is crucial for species delimitation in complex groups such as Tulasnella. Objectives: The aim of this study is to test evolutionary relationships using a combination of alignment-based and alignment-free distance matrices as an alternative molecular tool to traditional methods, and to consider the secondary structures and CBCs from ITS2 (internal transcribed spacer) sequences for species delimitation in Tulasnella. Methodology: Three phylogenetic approaches were plotted: (i) alignment-based, (ii) alignment-free, and (iii) a combination of both distance matrices using the DISTATIS and pvclust libraries from an R package. Finally, the secondary structure consensus was modeled by Mfold, and a CBC analysis was obtained to complement the species delimitation using 4Sale. Results and Conclusions: The phylogenetic tree results showed delimited monophyletic clades in Tulasnella spp., where all 142 Tulasnella sequences were divided into two main clades A and B and assigned to seven species (T. asymmetrica, T. andina, T. eichleriana ECU6, T. eichleriana ECU4 T. pinicola, T. violea), supported by bootstrap values from 72% to 100%. From the 2D secondary structure alignment, three types of consensus models with helices and loops were obtained. Thus, T. albida belongs to type I; T. eichleriana, T. tomaculum, and T. violea belong to type II; and T. asymmetrica, T. andina, T. pinicola, and T. spp. (GER) belong to type III; each type contains four to six domains, with nine CBCs among these that corroborate different species.


Introduction
Species delimitation is an uncertain issue, particularly in fungi.Several criteria are available, and the selection of the better alternative depends on the characteristics of each taxonomic group [1].During the last two decades, and thanks to the advent of highthroughput sequencing technologies, many efforts have approached species delimitation at the level of DNA sequencing [2].Such efforts have led to the development of the field of molecular phylogenetics, which aims to address the concept of species delimitation through phylogeny estimation from sequence data [3,4].The hypothesis behind molecular phylogenetics is that genomes evolve by the gradual accumulation of mutations.Therefore, the level of difference in the nucleotide sequences between a pair of genomes should indicate how recently those two genomes shared a common ancestor [5].
Up to now, the most accepted approach for species delimitation from sequence information relies on the existence of so-called universal barcodes-for instance, short DNA regions placed in the nuclear, mitochondrial, and/or chloroplast DNA that evolve at a rather constant rate.DNA barcoding [6] has provided valuable evidence for resolving misidentified species and identifying cryptic species within morphologically complex groups such as Tulasnella [7].
In addition, species delimitation from barcode sequences has traditionally relied on the generation of multiple sequence alignments (MSA) [8].The interpretation of MSA assumes that homologous positions across species occur in the same order relative to one another (homology), which is not always true due to the recombination, rearrangement, and lateral genetic transfers that all occur naturally in genes and genomes [9,10].Therefore, in many species, the positional hypothesis of homology generated by MSA is incomplete or incorrect, which has the effect of diffusing the phylogenetic signal of barcodes and hampering species delimitation-especially when approached by a single-locus strategy.
Some alternatives to MSA in phylogenetic inference were introduced to improve species delimitation from sequence data: the so-called alignment-free approach, which computes pairwise similarity from sequences [11], and the distance integration method.In this sense, there are two popular methods for performing such data integration steps: (i) concatenation (the supermatrix method) and (ii) consensus (the supertree method) [8].
Despite the clear benefits of using a multi-locus strategy instead of a single-locus approach, both have the handicap of relying on MSA.Generating accurate alignments is challenging, as errors introduced during this step significantly affect phylogenetic reconstruction and species delimitation.However, computational methods that implement data integration schemes are known to offer improved solutions [9,10].Abeysundera et al. [11] reported an accurate alternative method for combining distance matrices (and not gene trees) from multiple single genes on identical taxon sets to obtain a single representative distance matrix, and to derive a species tree.
Consequently, the secondary structure of ITS2 provides additional information for resolving the challenges of identifying closely related fungal species [10][11][12].The ITS2 secondary structure is being increasingly used in the field of phylogenetics due to several reasons: (i) Secondary structure analysis extends the taxonomic applicability of this marker [13]; (ii) the results from secondary structure analysis can improve the phylogenetic resolution obtained from the primary sequence [18]; and (iii) the structural data can provide new information via compensatory base change (CBC) analysis, and thus provide a tool for species delimitation [19].In fact, the secondary structure is a prerequisite for all conclusions derived from the phylogenetic analysis.
This has led to active research in molecular phylogenetics on alignment-free methods for reconstructing phylogenies.These methods integrate structural alignment-based and alignment-free features into distance matrices, which are then used to infer phylogenetic trees with good resolution at the species level [20,21].
The ITS2 secondary structure provides features such as the so called compensatory base changes (CBCs, nucleotide substitutional events), suggested as additional evidence for the distinguishing of some closely related species [22,23].Wolf et al. [23] mentioned that if there is a CBC, then the samples are two different species, at a probability of ~93%; if there is not a CBC, then the samples are the same species, at a probability of ~76%.This approach includes searching for co-variations (i.e., CBCs-e.g., C-G → U-A-and hemi-CBCs, e.g., A → U), common structural motifs, and indels by comparing every single nucleotide/base pair in homologous ITS2 regions of the predicted structures [24,25].
Many studies have used structural information to estimate phylogenies.Agüero-Chapin et al. [26] reported a pure alignment-free strategy for estimating phylogenies from ITS2 sequences, which was used to complement the morphological characterization of some endophytic fungi (Neurospora and Gelasinospora genus) of living plants that had been challenging to place taxonomically.Additionally, in [27,28], several alignment-free methods for phylogeny estimation were published.Furthermore, Cheng et al. [29] created a web server to facilitate the use of various alignment-free approaches for alignment-free phylogeny estimation.
Despite the vast number of available methods for phylogeny estimation (single-locus, multi-locus, structural feature-based, and alignment-free), the benefits of integrating different approaches have been poorly explored; this is probably due to the limited availability of algorithms to combine distance matrices derived from different metrics [9,10].
In previous studies, Abeysundera et al. [11] combined alignment-free distances based on spectral covariance (SpCov) [30].Inspired by the work of Agüero-Chapin et al. [7] and Abeysundera et al. [30], we hypothesized that integrating the alignment-free (structural features) and alignment-based distances of ITS2 sequences into a single matrix, including the identification of CBCs between the 2D structural helices, would improve species recognition.To test our hypothesis, we used a published dataset of the complex fungal group Tulasnella (Basidiomycota), where species delimitation based on molecular and taxonomical data is usually ambiguous.

Datasets
One fungal dataset involving 142 sequences of Tulasnella spp. was selected for this analysis.Most of the delimited species had already been discussed on the morpho-molecular level by Cruz et al. 2014 [7].Sequences including the complete ribosomal nrITS-5.8Sregion were downloaded from the Genbank database National Center for Biotechnology Information (NCBI; http://www.ncbi.nlm.nih.gov/,accessed on 1 February 2022) [31].The flanking 28S and 5.8S regions were excluded by the ITSx v1.0.11tool [17,32].The molecular data used to reconstruct the phylogenies consisted of only ITS2 sequences.Finally, one outgroup sequence belonging to Puccinia boraniae (AY348716) was added to the dataset using the Mafft-add tool [33].This outgroup was selected because it does not share close characters with Tulasnella spp.and allowed us to strongly mark the formation of phylogenetic clades in this highly variable group.
Figure 1 shows the workflow summary proposed to delimit Tulasnella species in this work.Tulasnella sequences with total region ITS-5.8Sfrom Genbank database [31]; ITS2 region extraction with ITSx v1.0.11software [17,32]; (ii) Obtain two distance matrices from Alignment-based and Alignment-free; (iii) Combine the distance matrices by Distatist [9], (iv) Plot the resulting phylogenetic tree using the pvclust library (R package) [34]; (v) Secondary structure modelling and; (vi) Define the CBCs as additional molecular markers to delimit Tulasnella species.

Phylogenetic Reconstruction
The phylogenetics strategy for the distance matrices followed these next steps: (i) Alignment-free (AF) matrices from ITS2 secondary structure.First, the 2D structures were inferred by a MFE (free energy minimization) parameter from Mfold [35].Then, the distance matrix from the predicted structures was obtained Tulasnella sequences with total region ITS-5.8Sfrom Genbank database [31]; ITS2 region extraction with ITSx v1.0.11software [17,32]; (ii) Obtain two distance matrices from Alignment-based and Alignment-free; (iii) Combine the distance matrices by Distatist [9], (iv) Plot the resulting phylogenetic tree using the pvclust library (R package) [34]; (v) Secondary structure modelling and; (vi) Define the CBCs as additional molecular markers to delimit Tulasnella species.

Phylogenetic Reconstruction
The phylogenetics strategy for the distance matrices followed these next steps: (i) Alignment-free (AF) matrices from ITS2 secondary structure.First, the 2D structures were inferred by a MFE (free energy minimization) parameter from Mfold [35].Then, the distance matrix from the predicted structures was obtained with RNA distance (Vienna Package v2.0) [36,37], which was used as the input for constructing a first Neighbor Joining (NJ) phylogenetic tree with MEGA6.0 [38] to estimate an AF-based phylogenetic tree.
The AB tree was developed based on Opal MSA with bootstrap = 3000 repetitions, which obtained the highest negative score (-InL of −4755.596439)compared to the other two methods: Mafft (InL = −3760.694766)and Clustal Omega (InL = −3766.491609).
Then, DNAdist (https://evolution.genetics.washington.edu/phylip/doc/dnadist. html, accessed on 2 February 2022) was used to obtain the distance matrix from Opal MSA, which was used as the input for constructing a second NJ phylogenetic tree with MEGA6.0 [40] to obtain an AB phylogenetic tree.
(iii) AB and AF distance matrices combination.
To delimit species boundaries by phylogenetic analyses, the two resulting matrices from AB and AF were combined into a single-compromise matrix (data) using DistatisR, which evaluates the similarity between distance matrices.The compromise matrix represents the best aggregate of the original matrices.The following parameters were used: test<-distatis (data, Norm ="MFA", Distance = TRUE, RV = TRUE, nfact2keep = 2, compact = FALSE) The compromise matrix was used as the input data to construct the third phylogenetic tree using the pvclust [35] library (R package v1.0), using a bootstrap analysis (nboot = 1000), distance measure matrix (method.dist= minkowski), and single cluster analysis (method.hclust= single).A summary of the procedure to calculate the distance matrix is given in [9,11].

ITS2 Consensus Secondary Structure Modelling
The ITS2 database http://its2.bioapps.biozentrum.uni-wuerzburg.de/,accessed on 2 February 2022 [44] does not provide the right Tulasnella models due to its lack of templates; thus, we used the Uncultured Tulasnellaceae (design by Wolf M, collaborator 2010) as a secondary structure comparative model with respect to Colletotrichum gloeosporioide (modeled by ITS2 database), which helped as a control for the genus Tulasnella-see Figure 2a,b.

AB and AF Phylogenetic Reconstruction
The phylogenetic trees generated using the NJ-based methods-alignment-based (AB) see Figure 4a and alignment-free (AF) see Figure 4b-showed different monophyletic clades.The relationship between the clades could not be fully resolved.

AB and AF Phylogenetic Reconstruction
The phylogenetic trees generated using the NJ-based methods-alignment-based (AB) see Figure 4a and alignment-free (AF) see Figure 4b-showed different monophyletic clades.The relationship between the clades could not be fully resolved.

AB and AF Distance Integration with DistatisR
The distance integration phylogenetic tree obtained from the AB and AF matrix combination is shown in the next phylogenetic tree (Figure 5), where clades were significantly delimited according to their morphology in clade A and B. The clades reflected significant bootstrap values (range from 72% to 100%) between species, and most of the molecular characteristics matched with the results submitted by Cruz et al. [49], who also carried out a morphological analysis (the size and shape of hyphae, basidia, sterigmata, and basidiospores) to complement Tulasnella species delimitation.Only two clades did not have complete correspondence (T.eichleriana, which includes T. tomaculum, T. sp_ECU5) and (T.asymmetrica, which includes T. sp_ECU3), as defined in [49].combination is shown in the next phylogenetic tree (Figure 5), where clades were significantly delimited according to their morphology in clade A and B. The clades reflected significant bootstrap values (range from 72% to 100%) between species, and most of the molecular characteristics matched with the results submitted by Cruz et al. [49], who also carried out a morphological analysis (the size and shape of hyphae, basidia, sterigmata, and basidiospores) to complement Tulasnella species delimitation.Only two clades did not have complete correspondence (T.eichleriana, which includes T. tomaculum, T. sp_ECU5) and (T.asymmetrica, which includes T. sp_ECU3), as defined in [49].(A1,A2,B1,B2-I,B2-II,B2-III,B3) represent monophyletic clades identified by M-GMYC analysis using the R tool (see Table 1).(A1,A2,B1,B2-I,B2-II,B2-III,B3) represent monophyletic clades identified by M-GMYC analysis using the R tool (see Table 1).
As is depicted in Table 1, the LR scores for the M-GMYC were the lowest: p-value < 0.05 for the Distance method with respect to the other two.However, AB tree-grouping species of the same genus showed better p-values than the AF tree.

Consensus Secondary Structure Modelling for Intra-Specific Differentiation
Hence, in relation to the previous monophyletic clades and subclades, the ITS2 consensus secondary structure was modeled using structure-based alignment.All predicted secondary structures were arranged to obtain a robust consensus model [50]; it was essential to approximate the phylogenetic and evolutionary relationships between the organisms.The shape and length of each structural helices were manually inspected, and the final consensus models are illustrated in Figure 6.

CBCs in the ITS2 Secondary Structure
As exemplified for each internal sub-clade A (A1, A2[I-II]) and B (B1, B2[I-IV], B3), typically, the CBCs were equal to 0 between the same species (e.g., A1 -> A1).However, among the external sub-clades (the variants between clades A and B), there was a group of variants with at least one CBC with respect to the remaining groups-with a total of 645 CBCs between all the internal and external subclades (Appendix B).We directly averaged the whole variant CBC matrix (presence/absence) for each species [23,51], as shown in Table 2. Based on the sequence-structure information, CBC matrices were calculated using the algorithms implemented in 4SALE [48], where different numbers of CBCs were discovered in the conserved regions (helices II and III) between two organisms, which were correlated with morphological incompatibility.Type II included T. violea, T. eichleriana (ECU6), and T. tomaculum with a 5-domains, where domain III and IV were divided into two and three subdomains (a-b-c).T. eichleriana was divided into original and genetic species; clade A contained the sequences from Europe and was considered as the original species, and clade B contained T. aff.eicheleriana ECU, considered genetic species that are different but that resemble each other in their morphology-according to Cruz et al.Thus, the secondary structure consensus type II model was applicable to both groups, and so was considered a conspecific species.Type III included T. sp.(GER), T. pinicola, T. asymmetrica, and T. andina with 6-domains, whose helices were divided into two and three subdomains (a-b-c).

CBCs in the ITS2 Secondary Structure
As exemplified for each internal sub-clade A (A1, A2[I-II]) and B (B1, B2[I-IV], B3), typically, the CBCs were equal to 0 between the same species (e.g., A1 -> A1).However, among the external sub-clades (the variants between clades A and B), there was a group of variants with at least one CBC with respect to the remaining groups-with a total of 645 CBCs between all the internal and external subclades (Appendix B).We directly averaged the whole variant CBC matrix (presence/absence) for each species [23,51], as shown in Table 2. Based on the sequence-structure information, CBC matrices were calculated using the algorithms implemented in 4SALE [48], where different numbers of CBCs were discovered in the conserved regions (helices II and III) between two organisms, which were correlated with morphological incompatibility.
In summary, clade A could be delimitated by their secondary structures, having 4-7 CBCs between the helices-as shown in Figure 6.In this sense, T. violea showed 3-5 CBCs with respect to T. tomaculum; T. eichleriana (ECU6) exhibited 1-5 CBCs with respect to T. tomaculum and 1CBC with respect to T. sp.(ECU5).

Discussion
The phylogenetic results taken from the ITS2 sequence distance matrix combinations allowed us to identify the two main monophyletic clades A and B with bootstrap values, ranging from 72% to 100%, and with internal (A1, A2[I-II]) and external sub-clades (B1, B2[I-III], B3), respectively.The CBCs in the ITS2 secondary structures correlated with seven different Tulasnella species (A1, A2, B1, B2-I, B2-II, B2-III, and B3) and three types of domains.Although this integration approach improved the inference of evolutionary relationships and the species delimitation of Tulasnella sequences compared to traditional approaches, which have shown discrepancies between molecular and taxonomic data [49], it is still not sufficient for defining more complex group-for example, the cryptic species T. sp.ECU 5 (A2-II) and T. sp.ECU 3 (B2-IV), and T. tomaculum (A2-I) and T. albida (B2-I), which remained unclassified-as also discussed by Cruz et al., 2016. [49] Tulasnellaceae species are especially difficult to classify taxonomically because of their ambiguous morphological and molecular characteristics [30].Members of the genus Tulasnella were studied morphologically by [50] on samples collected in Ecuador, Germany, and other countries, with some of them distinguished as T. eichleriana, T. cf.pinicola, T. sp., and T. violea based on the size and shape of their hyphae, basidia, sterigmata, and basidiospores.However, the morphological structures of Tulasnella often overlap in shape and size among species, making correct identification difficult [49].
Cruz et al. [7,49] proposed a new threshold for defining phylogenetic species in fungi-particularly Tulasnella.They examined the variation within species and divergence among species by using the ITS-5.8Sregion and the morphological data of the basidiomata of Tulasnella spp.Their analysis suggested that morphological characteristics alone are insufficient to delimit species of Tulasnella-a situation that is widespread in the Basidiomycota.Furthermore, in the phylogenetic tree provided by Cruz et al., 2014 [7] the specimen DC271 was classified within the subclade Tulasnella sp.ECU4-which, according to the morphology sequences, belongs to the clade T. asymmetrica.By contrast, our phylogenetic tree identified the specimen DC271 as the species most related to T. eichleriana (B2-III) and T. asymmetrica; in agreement with Cruz et al., 2014, [7] their clades were related to each other morphologically by the similar growth of their anamorphs in culture.
Consequently, motivated by providing better solutions than single-and multi-locus strategies for species delimitation, we applied a new phylogenetic approach based on distance matrix combination.
The Distatis tree results shown in Figure 5 indicate two delimited clades A and B, specified according to their morphology.Clade A included subgroups (A1 and A2) that correspond to morphologically close species in Tulasnella (T.violea and T. eichleriana), with globose-to-subglobose basidiospores from Germany (DC), England (KM), Ecuador (DC) and Wales (KM)-already discussed in depth by Cruz et al., 2016 [49].
The Tulasnella eichleriana ECU6 and Tulasnella eichleriana ECU4 were externally divided into Clade A2 and Clade B2-III, respectively.This separation could be associated with the sample collection location-for example, clade A2 contained sequences sampled from Germany, England, and Ecuador (ECU6), while clade B2-III contained sequences sampled only from Ecuador (ECU4).However, we can observe some incongruences in clade A2 (T.eichleriana), which was internally subdivided into T. tomaculum from England and Wales (A2-I, in light green) and T. sp. from Ecuador-ECU5 (A2-II, in light green).
On the other hand, clade B contained subgroups (B1, B2, and B3); B1 included T. spp.and Uncultured Tulasnella from Germany and Ecuador, and their sequences showed a high bootstrap value (100% BP) in the taxonomy.Clade B2 was subdivided into four (I-IV) classes: T. pinicola (Germany and Ecuador), T. albida (England and Wales), T. asymmetrica (Australia), and T. sp.(ECU3), with BP values between 72% and 99%.In agreement with Cruz et al.'s phylogenetic results, all these were clustered into the same clade B2.
Although the distance method has been successfully tested, some inconsistencies were still identified in our phylogenetic tree (Figure 5).These incongruences were specifically identified in clade A2[I-II] (T.tomaculum, T. spp.ECU5) and in clade B2-IV (T.spp.ECU3), which were misclassified in the tree; this can be attributed to their variability and morphologic traits, as has been mentioned by Cruz et al. [49].
This information was also integrated with secondary structure modelling and CBC identification between helices to improve Tulasnella species delimitation.
The Tulasnella ITS2 models in this work did not match the consensus models reported in the literature [50,51], which normally have four stems (helices)-with the longest being stem III-and a UGGU motif.Stem II contains a pyrimidine-pyrimidine bulge and the loop between the stems has a pronounced adenine-both of which are common features of ITS2 among angiosperms.The species of the Tulasnella group have truly divergent ITS-5.8Sregions ranging from 600 to 900 bp for the different species, allowing for the formation of ingroups, as reported by Cruz et al. [7,49] and Freitas et al. [52].The presence of indels in this group represents interspecific variability, displaying different consensus secondary structure prototypes and revealing three types of models.
Some molecular phylogenetic studies on the nrITS-5.8Ssequences of Tulasnella species previously isolated from the mycorrhizas of epiphytic orchids have shown genomic variability among clones that is difficult to interpret as intra-or interspecific variations or to correlate with described Tulasnella species; thus, it could be attributed to the apparent variability in the number of helices and structural details-for example, four and six helices were modelled in this study.
Due to the diversity of mycorrhizal Tulasnella associated with epiphytic orchids, they show morphological variability.
In Eukaryotics that also have a predefined pattern of four helices, the slippage of RNA polymerase during transcription may result in the production of mononucleotide repeats ("UUUU") in the RNA sequences [52,53].These inadvertent errors in transcription may lead to an increase in the number of detected ITS2 ribotypes [54].Furthermore, this could be attributed to the apparent variability in the number of helices and structural details that occur in the ITS1 transcript among Eukaryotics-for example, seven helices were proposed for Digenea [51,[55][56][57][58].
According to Cao et al. [22] and Zhang et al. [59], the site covariation (CBC)-in combination with phylogenetic comparisons-is increasingly being used as a more effective method for structure prediction.Hence, to corroborate the species delimitation revealed by the phylogeny and consensus model (Type I-IV with a total of four to six domains), a minimum of nine CBCs among individual structural helices and a total of 645 CBCs were identified between the internal and external subclasses of different species, as an additional molecular marker.It has been experimentally demonstrated that taxa separated by one CBC are totally incapable of intercrossing [24,25,60], and that it could be said that they represent separate species.

Conclusions
The proposed phylogenetic method delineated the Tulasnella species into two main clades (A and B), with seven monophyletic internal subclades.All predicted consensus secondary structures were essential to approximate the phylogenetic and evolutionary relationships between the organisms.The ITS2 consensus models revealed three different types of domains with four and six helices, which does not match with the consensus models reported in the literature, which normally have four helices.
Clades A and B were delimitated by one to nine CBCs in the conserved regions (helices II and III), and correlated with morphological incompatibility and intragenomic variability in the ITS2 sequences.However, the integration method depends on the data's complexity-namely, it relies on the variability in each sequence's group.
To the best of our knowledge, this is the first study to systematically evaluate CBCs in the predicted secondary structures for the rRNA sequences that belong to Tulasnella species.For future work, we propose to use the artificial intelligence method of special neural networks as a species recognition approach, as well as for secondary structure modelling and automatic CBC detection.

Figure 3 .
Figure 3. ITS2 sequence structure-based alignment generated by LocaRNA-P.Dot-bracket notation represents unpaired bases and matching parenthesized positions represent paired bases.The red regions indicate structure reliability, the yellow regions represent sequence reliability, and the thin line shows the combined column-reliability.

Figure 3 .
Figure 3. ITS2 sequence structure-based alignment generated by LocaRNA-P.Dot-bracket notation represents unpaired bases and matching parenthesized positions represent paired bases.The red regions indicate structure reliability, the yellow regions represent sequence reliability, and the thin line shows the combined column-reliability.

Figure 4 .
Figure 4. (a).AB tree built with Opal[42] and MEGA[38] tools, showing 11 clades, and (b).AF built with VRNAdistance[37] and MEGA[38] tools, with 9 clades.AB tree matrix car information about the evolutionary changes in the ITS2 sequences and the AF matrix car information about changes in their secondary structure.

Figure 4 .
Figure 4. (a).AB tree built with Opal[42] and MEGA[38] tools, showing 11 clades, and (b).AF tree built with VRNAdistance[37] and MEGA[38] tools, with 9 clades.AB tree matrix carries information about the evolutionary changes in the ITS2 sequences and the AF matrix carries information about changes in their secondary structure.

Figure 6 .
Figure 6.A consensus secondary structures of ITS2 based on minimum free energy (MFE).The colors indicate structural conservation according to the key in the figure, where the homologous regions highlighted in red contain the highest probability of compatibility, with a radiating central and internal loop interconnected with unpaired nucleotides (colorless) in the helices.

Figure 6 .
Figure 6.A consensus secondary structures of ITS2 based on minimum free energy (MFE).The colors indicate structural conservation according to the key in the figure, where the homologous regions highlighted in red contain thehighest probability of compatibility, with a radiating central and internal loop interconnected with unpaired nucleotides (colorless) in the helices.

Figure 6
Figure 6 displays different consensus secondary structure prototypes, revealing three types of models that matched with the subclades shown in Figure 5. Type I consisted of T. albida, with the standard model of four domains (helices from I to IV).Type II included T. violea, T. eichleriana (ECU6), and T. tomaculum with a 5-domains, where domain III and IV were divided into two and three subdomains (a-b-c).T. eichleriana was divided into original and genetic species; clade A contained the sequences from Europe and was considered as the original species, and clade B contained T. aff.eicheleriana ECU, considered genetic species that are different but that resemble each other in their morphology-according to Cruz et al.Thus, the secondary structure consensus type II model was applicable to both groups, and so was considered a conspecific species.Type III included T. sp.(GER), T. pinicola, T. asymmetrica, and T. andina with 6-domains, whose helices were divided into two and three subdomains (a-b-c).

Table 1 .
M-GMYC statistics obtained during the identification of monophyletic clades in the AB, AF, and Distance phylogenetic trees.The Likelihood Rate is shown in Column 3 and Column 4 refers to the number of Maximum Likelihood cluster-delimited groups with more than two individuals/sequences.