Abstract
Despite the common use of Thesium species as food and medicine and the prevailing systematic (taxonomic) difficulties of the genus, the diversity of phenolic compounds and their chemophenetic value remain largely unknown. As part of ongoing systematic research on Thesium, phenolic compounds and their composition were investigated at four taxonomic ranks: generic, infrageneric, interspecific and infraspecific. Liquid chromatography–mass spectrometry, multivariate analyses, comparisons to DNA data and manual investigations of total ion chromatograms were conducted for 156 samples of 50 Thesium species, as well the two monotypic genera sister to Thesium (Lacomucinea and Osyridicarpos). A wide diversity of phenolic compounds, primarily flavonols, carboxylic acids, phenolic acids and associated derivatives, were observed. Rutin was the most common compound, followed by citric acid, isorhamnetin O-glucoside O-rhamnoside, kaempferol O-rutinoside, quinic acid and cryptochlorogenic acid. Chemophenetic data proved valuable at the generic and interspecific level and, to a lesser degree, at the infraspecific level. On the other hand, no distinct patterns were observed at the infrageneric level, nor did chemophenetic data correlate with infrageneric classifications based on genetic, geographical or morphological data. At the generic level, the patterns of phenolic compounds in Lacomucinea and Osyridicarpos overlapped with those of Thesium species, but no compound or group of compounds were unique to Thesium. At the interspecific level, total ion chromatograms of the species were largely distinct from one another and, with the exception of four species, remarkably consistent. Two related species (T. triflorum and T. scandens) with doubtful species boundaries had similar but distinct chromatograms, providing evidence of their retention as separate species. At the infraspecific level, the data were mostly inconclusive, but variation in samples of T. asterias revealed two morphologically distinct populations, one of which is possibly a species new to science. This study provides valuable preliminary insights into the phenolic diversity and chemophenetic relationships in Thesium, Lacomucinaea and Osyridicarpos and highlights the potential of chemophenetics in taxonomically problematic plant groups.
1. Introduction
Thesium L. is the largest genus [1] in the parasitic plant family Santalaceae [2], with ±350 species occurring naturally in Africa, Madagascar, Australia, Asia, Europe and South America, and one species being introduced to North America [3]. The centre of diversity for the genus is in southern Africa where ±175 species occur [3]. Thesium is monophyletic and sister to Lacomucinaea Nickrent and M.A.García plus Osyridicarpos A.DC., two monotypic genera found in South Africa, and southern and tropical Africa, respectively [1,4]. Thesium plants are root hemiparasites that usually grow as herbs or subshrubs. They have linear or scale-like leaves that lack petioles and dry, nut-like fruits that often form elaiosomes [1,5,6]. Several species of Thesium have traditional and contemporary uses, particularly as medicines and functional foods in Africa and Asia [3].
Thesium has a long history of different circumscriptions and infrageneric classifications (see Ref [7]), since it was described by Linnaeus in 1753 [8]. Prior to a recent molecular study by Zhigila et al. (2020) [7], infrageneric groupings were based predominantly on morphological characters, and the proposed infrageneric classification systems (subgenera, sections, series, etc.) were geographically biased (e.g., Refs [9,10,11,12]). Zhigila et al. (2020) [7] presented a revised infrageneric classification for Thesium based on molecular data from one nuclear and three plastid gene regions, for 160 samples representing 116 Thesium species. They recognised five subgenera within Thesium based on five well-supported phylogenetic clades: Subgenus Hagnothesium (A.DC.) Zhigila, Verboom and Muasya, Subgenus Thesium, Subgenus Discothesium (A.DC.) Zhigila, Verboom and Muasya, Subgenus Psilothesium (A.DC.) Zhigila, Verboom and Muasya and Subgenus Frisea (Rchb.) Hendrych [7]. These five subgenera are supported by a strong geographical component, with Subgenus Thesium occurring mainly in Eurasia, Subgenus Psilothesium in tropical Africa and South America, and Subgenera Hagnothesium, Discothesium and Frisea in South Africa. The molecular phylogeny also confirmed that previous groupings based on morphology, particularly those proposed by Hill (1915) [9] for South African species, were artificial. However, although some clades, such as Subgenus Hagnothesium, and, to a degree, Subgenus Thesium and Subgenus Discothesium, have clear morphological apomorphies, the prevalence of homoplasious characters within the genus complicates the morphological diagnoses of some clades and subclades proposed by Zhigila et al. (2020) [7]. Furthermore, while their study presented strong molecular support for the five main clades, some internal nodes were unresolved and showed incongruent placements between nuclear and plastid phylogenetic trees [7]. This incongruence was especially prominent in the largest clade, Subgenus Frisea (clade 5), which consists of ±103 southern African Thesium species [7]. It is clear that although broad groupings within Thesium have largely been resolved, uncertainty remains regarding the placement of numerous species, as well as several natural groups within Thesium. Furthermore, some molecular clades lack clear diagnostic morphological characters (synapomorphies) to inform taxonomic revisions, and the placement of species not included in any molecular studies to date remains uncertain. Additional non-molecular and non-morphological data, such as information on patterns of variation in phytochemical compounds, as reported by Stander et al. (2019) [13] in a few Thesium species used as Cape herbal teas, might shed light on several remaining questions on the systematics of Thesium and provide additional evidence for the current groupings.
Chemophenetic information on Thesium is limited. The only chemophenetic report for Thesium was completed by De Kock and Rapson in 1938 [14] on 17 Thesium species endemic to the Greater Cape Floristic Region (GCFR) in South Africa. They showed that the presence or absence of an unidentified phlobatannin and/or an unidentified volatile oil glycoside in Thesium species corresponded with infrageneric sections of the time that were proposed by Hill (1915) [9] based on floral morphology. Species from Section Imberbia A.W.Hill contained both the phlobatannin and the volatile oil glycoside; species from Section Annulata A.W.Hill contained neither of the compounds; and species from Section Barbata A.W.Hill contained only the phlobatannin, only the volatile oil glycoside, both compounds or neither compounds [9,14]. While their results show some phytochemical support for the infrageneric sections of the time, several recent molecular studies have not only shown that Hill’s sections are paraphyletic [4,7,15] but also that the species grouped together by De Kock and Rapson (1938) [14] are paraphyletic.
As with chemophenetics, knowledge on the phytochemical compounds found in Thesium is limited. In a recent literature review, Lombard et al. (2020) [3] found that the phytochemical compounds of only eight Thesium species have been studied. A total of 70 phytochemical compounds (mainly phenolics, fatty acids and alkaloids) were isolated from these eight species. It should be noted that 60 of the 70 phytochemical compounds were isolated from only two species (T. chinense Turcz. from Asia and T. humile Vahl from the Mediterranean area), and the phytochemical composition of ±340 species remains unknown [3]. This limited knowledge is surprising, considering that Thesium species are used in ±40 functional foods, as medicine to treat ±130 ailments and its reported toxicity in some species [3].
Given (1) the remaining knowledge gaps in species relationships and natural groups within Thesium, (2) the need for apomorphic characters that support molecular clades, (3) the paucity of information on phytochemical composition patterns in Thesium and (4) the outdated nature of current chemophenetic research on the genus, an exploratory chemophenetic study of Thesium is warranted. An investigation of phenolic compounds was a logical starting point for such an exploratory study as phenolics; in particular, flavonoids are the most common compounds found thus far in Thesium [3], especially those traditionally used as herbal teas. The aims of this study, as part of ongoing systematic studies of Thesium, were to (1) conduct an exploratory investigation on the phenolic compounds present in Thesium and to (2) investigate the chemophenetic value of patterns in phenolic compound variation in Thesium at the generic level, infrageneric level, interspecific level and infraspecific level. This study provides the first account of phenolic variation in Thesium, Lacomucinaea and Osyridicarpos, including the first phytochemical accounts of 49 Thesium species and O. schimperianus.
2. Materials and Methods
2.1. Plant Material Used
A total of 156 samples of 50 Thesium species from Africa, Madagascar, Europe and Asia were included in this study. All five subgenera within Thesium are represented: 1 species (of 8) from Subgenus Hagnothesium, 3 species (of ±65) from Subgenus Thesium, 2 species (of 13) from Subgenus Discothesium, 21 species (of ±98) from Subgenus Psilothesium and 23 species (of ±103) from Subgenus Frisea. Of the 156 Thesium samples (50 species) included here, 142 samples (44 species) were collected by the authors from live plants at several sites across South Africa during the summer seasons of 2016 to 2019 and air dried before extraction. Where possible, multiple populations (one to three) of a species were sampled and multiple samples (one to five) collected from each population. Each species is therefore represented by between one and nine samples. A further six air-dried samples of two Eurasian species, T. ebracteatum Hayne and T. ramosum Hayne, were obtained from collections made by Zigmantas Gudzinskas in Lithuania. Eight samples of four species (T. chinense from Asia, T. cymosum A.W.Hill and T. cf. ussanguense Engl. from southern and eastern Africa, and T. cf. leandrianum Cavaco and Keraudren from Madagascar) were also taken from herbarium specimens. Every attempt was made to include samples from the full geographical range and morphological variation of the genus.
The two monotypic genera that are sister to Thesium, Lacomucinaea and Osyridicarpos [1] were also investigated using three and one samples, respectively. A list of all samples with corresponding voucher information is provided in Table 1.
Table 1.
List of samples used in this study, including their species names, subgenera, sample codes, countries of origin, collection localities and dates, and voucher details. Vouchers are housed in the Pretoria National Herbarium, South Africa (PRE). Samples included in multivariate statistics are indicated with an “*”.
2.2. Extraction
Samples consisted of above-ground parts (including leaves, stems, flowers and fruits) of plants and were finely ground using a mortar and pestle. Approximately 0.2 g of each sample was extracted using 50% methanol in water containing 1% formic acid (1.5 mL) by soaking it overnight. This was followed by extraction in an ultrasonic bath (0.5 Hz, Integral Systems, South Africa) for 60 min at room temperature. The extracts were lastly centrifuged (Hermle Z160 m, 3000× g for 5 min).
2.3. Standards
Standards of nine compounds (flavonoids and organic acids), namely caffeic acid, catechin, citric acid, epicatechin, ferulic acid, gallic acid, p-coumaric acid, rutin and quinic acid were prepared at concentrations of 1.95 μg/mL, 3.9 μg/mL, 15.6 μg/mL, 31.25 μg/mL, 62.5 μg/mL, 125 μg/mL and 500 μg/mL with the same solvent used for sample extraction (50% methanol in water containing 1% formic acid).
2.4. Liquid Chromatography–Mass Spectrometry (LCMS)
High-resolution UPLC-MS analysis was conducted using a Waters Synapt G2 Quadrupole time-of-flight (QTOF) mass spectrometer (MS) and a Waters Acquity ultra-performance liquid chromatograph (UPLC) (Waters, Milford, MA, USA). Negative mode was used for electrospray ionisation with a cone voltage of 15 V, desolvation gas at 650 L/h and desolvation temperature of 275 °C. The other mass spectrometry settings were optimised for best resolution and sensitivity. Data were attained by scanning from m/z 150 to 1500 in the resolution mode and MSE mode. In the MSE mode, two channels of mass spectrometry data were collected: the first at low collision energy (4 V) and the second with a collision energy ramp (40–100 V) for fragmentation data. Accurate mass determination was performed with leucine enkephalin as a mass reference, and the instrument was calibrated with sodium formate. A Waters HSS T3, 2.1 × 100 mm, 1.7 μm column was used for separation. The injection volume was 2 μL, and the mobile phase consisted of 0.1% formic acid (solvent A), as well as acetonitrile with 0.1% formic acid (solvent B). After 1 min of 100% solvent A, the gradient changed linearly to 28% solvent B over 22 min, 40% solvent B over 50 s, followed by a wash step of 1.5 min at 100% solvent B and, finally, re-equilibration to the initial conditions for 4 min. The column temperature was sustained at 55 °C, and the flow rate was 0.3 mL/min.
2.5. Compound Identification
Provisional identifications of compounds are provided based on a combination of the following information: correlation with standards, data in previous publications on Thesium, as well as the fragmentation data, elemental composition (based on accurate mass), relative retention times and UV data of peaks. Online databases, including Chemspider (www.chemspider.com, accessed 11 November 2020), KNApSAck (www.knapsackfamily.com/knapsack.core, accessed 10 January 2022) and METLIN (metlin.scripps.edu, accessed 18 January 2022), were used. Quinic acid derivatives showing an m/z 191 fragment ion, ferulic acids (m/z 193), coumaric acids (m/z 163) and combinations of these occurred in many samples. Flavonol bases (fragment ions of kaempferol m/z 285, quercetin m/z 300 and m/z 315 isorhamnetin) with different mono-and diglycosides (loss of m/z 162 for hexose, -146 for rhamnose, -308 for dihexoside) were also detected. Further identifications of observed compounds are beyond the scope of this study and may be addressed in future studies, especially in those species with interesting and as yet unidentified compounds.
2.6. Multivariate Analysis
To ensure accurate peak alignment, only samples with little retention time drift were included in the multivariate analysis. A total of 128 samples (46 species) fitted this criterion and are indicated with an “*” in Table 1. The remaining 28 samples were manually studied and compared. Raw mass spectrometry data were processed using the MarkerLynx XS function of MassLynx version 4.1 software (Waters Inc., Johannesburg, South Africa, 2012). The peak detection analysis was used to align peaks, as well as to convert raw data to retention time–mass pairs with the signal intensity for each peak. Only peaks between 4 and 25 min retention time were included, and the analysis was run with the following method parameters: low mass = 120, high mass = 1300, XIC window (Da) = 0.01, use relative retention time = no, peak width at 5% height (seconds) = 15, apply smoothing = yes, marker intensity threshold = 500, mass window = 0.05, retention time window = 0.80, noise elimination level = 200 and deisotope data = yes. Peak alignment was checked manually for accuracy. Pareto scaling was applied.
Generic, infrageneric, interspecific and infraspecific relationships were investigated and visualised using Python version 3.8 (Python Software Foundation, 2001–2022; https://www.python.org/downloads/, accessed on 8 December 2020) with the Scikit-learn 0.23.2 and Matplotlib 3.3 Libraries, as well as MetaboAnalyst version 5.0 software (Xia Lab, McGill University, 2020; https://www.metaboanalyst.ca/home.xhtml, accessed on 8 December 2020). In Python, the Markerlynx mass spectra data were firstly normalised to compensate for the variance in concentration and to ensure equal representation in the dataset, thereby facilitating comparative analysis. Normalisation involved scaling each sample vector to a unit norm, independently of other samples, so that all values were represented on a common scale. The dimensionality of the dataset was then reduced using principal component analysis (PCA). This was performed both to gain a better understanding of the important features in the dataset and to reduce dimensional complexity for further analysis steps without significant reduction in accuracy. The minimum number of PCA components to be evaluated were selected, so that the amount of variance that needed to be explained by this number of components was greater than two times the standard deviation (95.45%) of data coverage. In this case, 44 PCA components were sufficient to explain the variation in the data to more than two standard deviations, and by examining the loading factors for each of these PCA components, it is possible to visualise the causative factors, which contributed the most variation within the dataset. Next, unsupervised clustering analysis was preformed using the mean shift clustering algorithm (or the mode-seeking algorithm). This is a non-parametric kernel density estimation-based algorithm that attempts to seek the densest region of samples iteratively. The mean shift algorithm does not require specifying the number of clusters in advance; the number of clusters is conferenced by the data. The algorithm initially assigns each point in the feature space as a possible cluster centroid, and the distribution distances of points in the feature space are used to estimate the kernel bandwidth parameter, also known as the attractive interaction distance between samples. Iteratively, the mean of all additional points within the bandwidth of the initial centroids determines the new set of centroids (kernel density estimation) and assigns each data point to the closest cluster centroid until convergence is achieved. After convergence, the remaining cluster centres define the number of clusters, and the data points associated with the same centroid are members of the same cluster. The kernel density estimate bandwidth parameter significantly impacts the accuracy and precision performance of the mean shift algorithm. The bandwidth parameter was estimated for this feature space to be the average of the distance of each vector to its k-nearest neighbour (kNN), relative to the furthest distance between the feature vectors. Setting the number of k-nearest neighbours to 6–7% of the total sample set yielded optimal results.
In MetaboAnalyst, raw peak intensity data from MarkerLynx were normalised by sum, log transformed and scaled using automatic scaling, after which two- and three-dimensional PCA plots and hierarchical clustering dendrograms (using Euclidean distance and Ward clustering) were computed to show infraspecific relationships.
2.7. Phylogenetic Comparison
Generic, infrageneric and interspecific relationships were also investigated in a phylogenetic framework by plotting the main 33 compounds observed (see Table 2) in Thesium and Lacomucinaea (see Table 3) on the latest and most complete phylogeny [7]. A total of 27 Thesium species spanning all five subgenera overlapped between this study and the phylogeny, as well as L. lineata.
Table 2.
The most common phenolic compounds and organic acids (peaks) detected in a total of 156 samples from 50 species of Thesium, along with their retention time, molecular mass, molecular formula, fragments, standards used and preliminary identifications with associated references. The number of samples and species in which each compound was observed at high levels (peak intensity > 30%) are also provided.
Table 3.
Comparison of the main phenolic compounds and organic acids (peak intensity >30%) detected in Lacomucinaea, Osyridicarpos and Thesium.
3. Results
3.1. Overview of Phenolic Compounds
More than 200 individual peaks were recorded during the manual examination of 156 total ion chromatograms of 50 Thesium species, L. lineata and O. schimperianus (all chromatograms are provided in Supplementary Figure S1). While the identification of all peaks was not the aim of this study, preliminary identifications and information on the ±30 most common peaks observed in Thesium are provided in Table 2 and those of L. lineata and O. schimperianus in Table 3. In all three genera, flavonols, carboxylic acids, phenolic acids, organic acids and associated derivatives were the main compounds observed. A flavanone (eriodictyol-O-glucoside) was also detected in 10 samples from four species.
In Thesium, rutin was the most common compound observed. Rutin was detectable (peak intensity > 0) in all but 2 species (T. cymosum and T. cf. leandrianum) and was the main peak (peak intensity > 30%) in 27 species. Other common compounds, present at peak intensities of more than 30%, included citric acid (26 species), isorhamnetin O-glucoside O-rhamnoside (20 species), kaempferol O-rutinoside (15 species), quinic acid (14 species), cryptochlorogenic acid (4-caffeoylquinic acid) (12 species), quercetin rhamnosyl-rhamnosyl-glucoside (11 species), dehydrodiferulic acid (10 species) and quercetin robinobioside (10 species) (Table 2). Numerous peaks of coumaroylquinic acid, feruloylquinic acid and flavonoid glycosides were also observed (Table 2).
Lacomucinaea had several prominent peaks, including citric acid, cryptochlorogenic acid, neochlorogenic acid, sinapoyl hexose, quercetin-O-dihexoside and an unknown compound (F121.2, Table 3). The main peaks observed in Osyridicarpos were citric acid, cryptochlorogenic acid, neochlorogenic acid, hydroxytyrosol O-glucoside, feruloylquinic acid and one unknown compound (F94.1, Table 3).
3.2. Chemophenetic Value at Generic Level
The PCA analysis and dendrogram showed both L. lineata and O. schimperianus to be deeply embedded within Thesium (Figure 1 and Figure 2), and no compound or group of compounds were found to universally distinguish Thesium from Lacomucinaea or Osyridicarpos. Similarly, the main compounds plotted on the Zhigila et al. (2020) [7] phylogeny overlapped between Thesium and Lacomucinaea (Figure 3).
Figure 1.
Principal component analysis with cluster map showing the relationships among 46 Thesium species, Lacomucinaea lineata and Osyridicarpos schimperianus, based on phenolic compound composition. Clusters are indicated with different colours. Species names corresponding to sample labels are provided in Table 1.

Figure 2.
Dendrogram with clusters showing the relationships among 46 Thesium species, Lacomucinaea lineata and Osyridicarpos schimperianus, based on phenolic compound composition. The placements of subgenera are also shown: Discothesium (clear triangles), Frisea (solid triangles), Hagnothesium (clear circles), Psilothesium (solid circles) and Thesium (clear squares). Clusters are shown in different colours. Species names corresponding to sample labels are provided in Table 1.
Figure 3.
Patterns of phenolic compound variation observed in Thesium (27 species) and Lacomucinaea (1 species) within a phylogenetic framework. The most common phenolic compounds (peaks) detected in Thesium (see Table 2) and Lacomucinaea (see Table 3) are plotted on the most recent molecular phylogeny (adapted from [7]). The presence of compounds (peaks) in species is indicated with solid circles, and the five subgenera are shown in different colours. Compound names correspond with the information provided in Table 2 and Table 3.
With the exception of T. microcarpum A.DC. (see Section 4), all Thesium species had one or more of the following six compounds in detectable amounts (peak intensity > 50): rutin, citric acid, isorhamnetin O-glucoside O-rhamnoside, kaempferol O-rutinoside, quinic acid and cryptochlorogenic acid. Three of the previously mentioned six compounds (isorhamnetin O-glucoside O-rhamnoside, kaempferol O-rutinoside and quinic acid) were not detected in L. lineata or O. schimperianus, and rutin was only detected at very low levels (peak intensity < 0.2 in Lacomucinaea and <35 in Osyridicarpos) (Table 3). All three genera had relatively high levels of citric acid, cryptochlorogenic acid, neochlorogenic acid and feruloylquinic acid (Table 3; Supplementary Figure S1).
Lacomucinaea and Osyridicarpos seem to be distinguishable from one another based on several compounds observed at high levels (peak intensity > 180) in each genus (see Table 3). While these compounds are observed at high levels in Lacomucinaea and Osyridicarpos, respectively, low levels (peak intensity < 100) were also occasionally present in the other genera studied. A larger sampling of Lacomucinaea and Osyridicarpos plants is needed to affirm these results.
3.3. Chemophenetic Value at Infrageneric Level
No discernible groupings were apparent among Thesium species using multivariate statistics (Figure 1 and Figure 2), comparison with the Zhigila et al. (2020) phylogeny [7] (Figure 3) or manual examination of species’ total ion chromatograms (Supplementary Figure S1). In the multivariate framework, neither the subgenera of the current DNA-based classification (Figure 2) nor the species from the same geographic regions (Supplementary Figure S2) or species with similar morphological characters (Supplementary Figure S3) grouped together. One exception was a cluster of three morphologically similar species endemic to South Africa and Lesotho: T. imbricatum Thunb., T. karooicum Compton and T. sonderianum Schltr. (Figure 2; Supplementary Figure S3). Thesium scandens E.Mey. ex Sond., T. zeyheri A.DC., T. ramosum and T. transvaalense Schltr. were somewhat separated from the other species in the PCA analysis (Figure 1) and T. zeyheri and T. transvaalense in the dendrogram (Figure 2). Thesium zeyheri was the only species in which F7 (unknown sulphated caffeic acid derivative) was detected at high levels (peak intensity > 3), and both T. zeyheri and T. transvaalense had low levels of rutin (peak intensity < 12). Other compounds responsible for variation in principal components one to three and their details are provided in Supplementary Figure S4 and Supplementary Table S1. Another exception were the slight differences in the phenolic and organic acid composition of Subgenus Hagnothesium (see Section 4).
3.4. Chemophenetic Value at Interspecific Level
While no interspecific level signals were found in the phylogenetic framework (Figure 3), the multivariate framework and manual examination of species’ total ion chromatograms revealed an interesting result. Almost all Thesium species, as well as both L. lineata and O. schimperianus, were largely distinct from one another in the combination of peaks and, to a lesser extent, relative peak intensities (Supplementary Figures S1 and S5). Indeed, 42 of the 50 species had at least one main peak that was not present in any of the other species. Furthermore, in the 38 species represented by multiple samples, the samples of each species not only grouped together (Figure 2) but also correlated more closely to one another than to those of other species (Figure 4). This indicates that phenolic total ion chromatograms might be useful in species identification when used in combination with morphological characteristics. In the three species (T. pallidum A.DC., T. impeditum A.W.Hill and T. zeyheri; Figure 2) where samples of a species did not group together, manual examination of their total ion chromatograms proved them to be conspecific (Supplementary Figure S1). Conversely, four species (T. asterias A.W.Hill, T. costatum A.W.Hill, T. strictum P.J.Bergius and T. cf. ussanguense) showed substantial variation (Supplementary Figures S1 and S2) (see Section 4).
Figure 4.
Correlation heatmap showing the similarities among 128 samples of 46 Thesium species, as well as sister genera Osyridicarpos and Lacomucinaea, based on phenolic compound composition. Darker colours indicate higher similarity between samples. Species names corresponding to sample labels are provided in Table 1.
3.5. Chemophenetic Value at Infraspecific Level
Out of the 38 species included in the PCA analysis, 4 species were represented by multiple samples from different populations. These species included T. durum Hilliard and B.L.Burtt (two populations), T. flexuosum A.DC. (two populations), T. hispidulum Lam. (three populations) and T. imbricatum (two populations). In T. durum, the samples from the same populations grouped together, thereby indicating that the two populations differed (Figure 5A). This difference was driven by slight variances in peak intensities and not dissimilarities in peak composition (Supplementary Figure S1). In the remaining three species (Figure 5B–D), the samples from populations intermixed, indicating that there were no distinguishable differences among populations in peak intensities or peak composition.
Figure 5.
Two- and three-dimensional PCA plots and hierarchical clustering dendrograms comparing the phenolic compound compositions of different populations of four Thesium species (A). T. durum. (B). T. flexuosum. (C). T. hispidulum. (D). T. imbricatum. For each species, different populations are indicated by different colours.
4. Discussion
This study investigated the phenolic compound composition and chemophenetic relationships of 50 Thesium species, L. lineata and O. schimperianus, at the generic, infrageneric, interspecific and infraspecific levels. To our knowledge, it is the first study on patterns in phenolic compound variation in Thesium, Lacomucinaea and Osyridicarpos, and the second and most comprehensive chemophenetic study of Thesium to date (after that of De Kock and Rapson in 1938 [14]). With the exception of T. chinense and L. lineata, the phytochemistry of all of the species included here is studied for the first time.
In line with previous studies (e.g., Refs [3,18,19,20,21,22,23]), the phenolic compounds found in Thesium were mainly flavonols, carboxylic acids, phenolic acids and associated derivatives. Although the phenolic compound diversity in Thesium appears to be high (more than 200 prominent peaks were observed), the majority of the compounds are relatively common in the plant kingdom (e.g., citric acid, rutin, chlorogenic acid isomers, quinic acid derivatives and flavonoid glycosides). To our knowledge, several compounds, including cryptochlorogenic acid, neochlorogenic acid, coumaroylquinic acid, feruloylquinic acid, eriodictyol and quercetin are reported here for the first time in Thesium (see Ref [3]). No flavones were detected as major compounds, even though glycosides of apigenin, luteolin and chrysoeriol were previously reported from T. chinense.
Overall, the phenolic and organic acid composition seems to have some chemophenetic value at the generic level but limited value at the infrageneric level. Preliminary results show distinctive phenolic and organic acid differences between Lacomucinaea and Osyridicarpos, which support molecular data to retain them as separate genera [1]. While both genera have leaves with distinct petioles and smooth fleshy fruit, there are noteworthy differences in, among others, the shape of the leaves and the structure of the flowers [1]. In Thesium, however, no compound or group of compounds could be found to universally differentiate all Thesium species from Lacomucinaea or Osyridicarpos. Nevertheless, the majority of Thesium species have at least one of six compounds (rutin, citric acid, isorhamnetin O-glucoside O-rhamnoside, kaempferol O-rutinoside, quinic acid and cryptochlorogenic acid) at relatively high intensities. One exception is T. microcarpum, in which only very low levels of citric acid and rutin were detected. Thesium microcarpum forms part of Subgenus Hagnothesium, which was previously recognised as a separate genus, Thesidium Sond. [7,24]. Species of Subgenus Hagnothesium are characterised by, among others, their dioecious reproductive system and mostly 4-merous flowers (versus monoecious reproductive system and predominantly 5-merous flowers in the other subgenera of Thesium) [7]. Although molecular studies have shown Thesidium to be embedded in Thesium [4,7], it is interesting to note that its phenolic and organic acid composition differs somewhat from the main trend observed in Thesium. Not only are the main compounds found in the majority of Thesium species almost absent from T. microcarpum, but it also contains several main peaks not present in the other species investigated. More extensive studies of Subgenus Hagnothesium, as well as Lacomucinaea and Osyridicarpos, are needed to further explore these apparent patterns.
Contrary to the work conducted on a phlobatannin and/or an unidentified volatile oil glycoside [13], no discernible infrageneric groupings were present among the species investigated. Although T. transvaalense was slightly further removed from the other species in the PCA analysis, and T. zeyheri and T. transvaalense in the dendrogram, no significance could be linked to these placements in terms of morphology, distribution or molecular relationships. The separation of T. zeyheri and T. transvaalense was likely driven by an unknown saccharide (F7) and rutin, as the majority of variation in the first principal component of the PCA analysis was attributed to these two compounds (Supplementary Figure S4). It is possible that the study of additional Thesium species may alter the preliminary trends and relationships observed in this study. Future studies might also investigate patterns in the imino acid cis-4-Hydroxy-L-proline [25] and acetylenic seed oils [26], as these compounds have shown chemophenetic potential at the generic level in Santalaceae.
The value of phenolic compound composition apparently lies at the interspecific level, as many Thesium species, as well as L. lineata and O. schimperianus, are recognisable to some degree by their total ion chromatograms and might therefore aid species identification when used in combination with morphological characters. However, this approach should be used with caution, as the chemistry of the majority of Thesium species has not yet been studied. The combination of peaks and their relative intensities can also be helpful in distinguishing between morphologically similar species. For instance, T. triflorum L.f. and T. scandens are closely related based on both morphology [6] and DNA data [7]. It is often difficult to distinguish them from one another—to such a degree that they might be considered conspecific based on morphological species concepts [27]. The main morphological difference between them is found in their leaves: fleshy, more or less terete leaves without clearly visible veins in T. scandens and flat, linear to linear-lanceolate leaves with one to three (five) main veins in T. triflorum (Figure 6). However, there seems to be a continuum of variation in leaf morphology between T. scandens plants, which occur in the Cape regions of South Africa, and T. triflorum plants, which occur in subtropical and tropical Africa. The total ion chromatograms of these two species, collected about 800 km apart, had several similarities (Figure 6). They shared all peaks except for neochlorogenic acid and cryptochlorogenic acid, which was present at high intensities in T. triflorum but absent from T. scandens. The presence or absence of these compounds can be a useful tool to distinguish between T. triflorum and T. scandens and illustrates the potential of phenolic total ion chromatograms to aid future systematic studies of this taxonomically difficult genus.
Figure 6.
Comparison of the phenolic compound composition and morphology between two closely related Thesium species, T. triflorum and T. scandens. (A). Total ion chromatograms of a T. triflorum plant from near St. Lucia, KwaZulu-Natal, South Africa (top) and four T. scandens plants from Eastern Cape, South Africa (bottom). (B). Flat linear to linear-lanceolate leaves of T. triflorum with one to three (five) main veins. (C). Fleshy and often terete leaves of T. scandens lacking clearly visible veins.
Trends in the infraspecific variation are less clear and require further study. The majority of species where multiple individuals were studied (38 species) showed almost no variation in peak composition and limited variation in peak intensity (Figure 2, Figure 4 and Figure 5; Supplementary Figure S1). This indicates that phenolic total ion chromatograms can be used with some confidence to distinguish among species while keeping in mind that the majority of Thesium species have not yet been studied. Four exceptions were T. asterias, T. costatum, T. strictum and T. cf. ussanguense, where substantial variation in peak composition and peak intensity was present. This variation might be attributed to several factors, including laboratory-induced differences between sample runs, widely varying local habitat conditions, possible lateral transfer of phenolic compounds, misidentified samples and erroneous species concepts. It has been shown that some parasitic plants have very similar phenolic profiles to their host plants [17] and that they may contain different major flavonoids and xanthones depending on their hosts [28]. This poorly explored aspect deserves further study but will also require in-depth studies of the host diversity and host specificity of Thesium species. In the case of T. asterias, however, there is evidence that the species delineation needs to be reconsidered. Closer examination of the two populations studied (roughly 700 km apart) revealed noteworthy morphological differences (Figure 7). Plants from Limpopo, South Africa, had thickened, perennial rootstocks and virgate, parallel branches, while plants from KwaZulu-Natal, South Africa, had slender, possibly annual rootstocks and with branches angled at about 45° from the central stem (Figure 7). Further taxonomic investigation will likely prove these two populations to be different species, one of which may be new to science. This example shows how phenolic total ion chromatograms may reveal inconsistencies in current species concepts.
Figure 7.
Differences in phenolic compound composition and morphology between two populations of Thesium asterias. (A). Total ion chromatograms of two T. asterias plants from a single population near St. Lucia, KwaZulu-Natal, South Africa (top) and two T. asterias plants from a single population near Lekgalameetse Nature Reserve, Limpopo, South Africa (bottom). (B). Habit, with branches spreading at ±45 °, and slender rootstock of T. asterias plants from KwaZulu-Natal. (C). Habit, with parallel and virgate branches, and thickened rootstock of T. asterias plants from Limpopo.
5. Conclusions
This study provides valuable preliminary insights into the phenolic diversity and chemophenetic relationships in Thesium, Lacomucinaea and Osyridicarpos. The phytochemistry of 50 Thesium species and Osyridicarpos is studied for the first time, and the first chemophenetic evaluation of the genus in more than 80 years is provided.
LCMS analysis revealed a wide diversity of compounds in Thesium (more than 200 prominent peaks), mainly flavonols, carboxylic acids, phenolic acids, organic acids and associated derivatives. This result is in line with previous studies on the genus.
The phenolic compounds in Thesium seem to have some chemophenetic value at the generic level but limited value at the infrageneric level. While Lacomucinaea and Osyridicarpos appear to be distinguishable from one another based on the presence and absence of several main peaks, this result should be confirmed with a larger sampling of individuals and populations. The true value of phenolic compound variation in Thesium lies at the species level. Due to the wide variety of observed peaks, the total ion chromatograms of the species are largely distinct from one another and may therefore assist with species identification and help tease apart unclear species boundaries. While infraspecific chemical variation seems limited in the majority of species, studies of multiple plant populations are needed to verify the degree of infraspecific variation. Further examination of about 300 Thesium species that remain phytochemically unknown will not only enhance our knowledge of phenolic diversity in the genus but might also alter the preliminary trends and relationships observed in this study.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/d14080590/s1. Supplementary Table S1. Phenolic compounds detected in Thesium responsible for the most variation in Principal Components one to four. Supplementary Figure S1. Total ion chromatograms of 156 sample extracts of 50 Thesium species (1–50), Osyridicarpos schimperianus (51) and Lacomucinaea lineata (52). Supplementary Figure S2. Dendrogram of phenolic relationships showing the geographic distributions of 46 Thesium species, Lacomucinaea lineata and Osyridicarpos schimperianus. Supplementary Figure S3. Dendrogram of phenolic relationships among 46 Thesium species, Lacomucinaea lineata and Osyridicarpos schimperianus, accompanied by the distribution of characters states of nine taxonomically important morphological characters: A. Root type. B. Maximum plant height. C. Plant surface. D. Stem type. E. Leaf type. F. Inflorescence type. G. Reproductive system. H. Inner corolla lobe surface. I. Style length. Supplementary Figure S4. Loading plots for Principal Components 1–3 (A–C) with the most significant contributors indicated. Supplementary Figure S5. Heatmap of the main phenolic peaks detected in the extracts of 128 samples, representing 46 Thesium species, as well as sister genera Osyridicarpos and Lacomucinaea.
Author Contributions
Conceptualisation, B.-E.V.W., N.L. and M.M.L.R.; sample collection, B.-E.V.W., N.L. and M.M.L.R.; formal analysis, H.R., N.L. and M.A.S.; writing—original draft preparation, N.L.; writing—review and editing, B.-E.V.W., M.A.S., H.R. and M.M.L.R. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the Foundational Biodiversity Information Programme, small grant number 104931, National Research Foundation of South Africa, grant number 84442, the Botanical Education Trust, University of Johannesburg and the South African National Biodiversity Institute.
Data Availability Statement
Not applicable.
Acknowledgments
The following people and organisations are thanked: the University of Johannesburg, South African National Biodiversity Institute and University of Stellenbosch for hosting this study; Bronwynn Egan, Daniel Nickrent, Kate and Graham Grieve, John Burrows, Magda Nel, Reuhl Lombard, Sedzani Simali, Teodor van Wyk and Wiam Haddad for assistance in the field; Zigmantas Gudzinskas for providing fresh material of T. ebracteatum and T. ramosum from Lithuania; Daniel Zhigila for assistance with species identifications; Malcolm Taylor from Central Analytical Facilities, University of Stellenbosch, for assistance with data analysis and compound identifications.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Nickrent, D.L.; García, M.A. Lacomucinaea, a new monotypic genus in Thesiaceae (Santalales). Phytotaxa 2015, 224, 173–184. [Google Scholar] [CrossRef]
- The Angiosperm Phylogeny Group. An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IV. Bot. J. Linn. Soc. 2016, 181, 1–20. [Google Scholar] [CrossRef]
- Lombard, N.; Van Wyk, B.-E.; le Roux, M.M. A review of the ethnobotany, contemporary uses, chemistry and pharmacology of the genus Thesium (Santalaceae). J. Ethnopharmacol. 2020, 256, 112745. [Google Scholar] [CrossRef] [PubMed]
- García, M.A.; Nickrent, D.L.; Mucina, L. Thesium nautimontanum, a new species of Thesiaceae (Santalales) from South Africa. PhytoKeys 2018, 109, 41–51. [Google Scholar] [CrossRef]
- De Candolle, A. Santalaceae. In Prodromus Systematis Naturalis; De Candolle, A., Ed.; Treuttel et Würtz: Paris, France, 1857; pp. 619–692. [Google Scholar]
- Hill, A.W. Order CXX. Santalaceae. In Flora Capensis, Volume 5; Thiselton-Dyer, W.T., Ed.; L. Reeve & Co. Ltd.: London, UK, 1925; pp. 135–212. [Google Scholar] [CrossRef]
- Zhigila, D.A.; Verboom, G.A.; Muasya, A.M. An infrageneric classification of Thesium (Santalaceae) based on molecular phylogenetic data. Taxon 2020, 69, 100–123. [Google Scholar] [CrossRef]
- Linnaeus, C. Species Plantarum: Exhibentes Plantas Rite Cognitas, Ad Genera Relatas, Cum Differentiis Specificis, Nominibus Trivialibus, Synonymis Selectis, Locis Natalibus, Secundum Systema Sexuale Digestas; Tomus I; Laurentius Salvius: Stockholm, Sweden, 1753; pp. 207–208. [Google Scholar]
- Hill, A.W. The genus Thesium in South Africa, with a key and descriptions of new species. Bull. Misc. Inf. 1915, 1, 1–43. [Google Scholar] [CrossRef]
- Pilger, R. Santalaceae. In Die Natürlichen Pflanzen Familien, 2nd ed.; Engler, A., Ed.; W. Engelmann: Leipzig, Germany, 1935; pp. 52–91. [Google Scholar]
- Bobrov, E.G. Family XLIX. Santalaceae R.Br. In 5. Flora of the U.S.S.R; Komarov, V.L., Ed.; Israel Program for Scientific Translations: Jerusalem, Israel, 1970; pp. 325–340. [Google Scholar]
- Hendrych, R. The natural history and systematic of the genus Thesium L. Acta Univ. Carol. Biol. 1970, 4, 293–358. [Google Scholar]
- Stander, M.A.; Brendler, T.; Redelinghuys, H.; Van Wyk, B.-E. The commercial history of Cape herbal teas and the analysis of phenolic compounds in historic teas from a depository of 1933. J. Food Compos. Anal. 2019, 76, 66–73. [Google Scholar] [CrossRef]
- De Kock, P.C.; Rapson, W.S. A correlation between the chemical constituents and morphological structure of certain Thesium species. Nature 1938, 142, 1078–1079. [Google Scholar] [CrossRef]
- Moore, T.E.; Verboom, G.A.; Forest, F. Phylogenetics and biogeography of the parasitic genus Thesium L. (Santalaceae), with an emphasis on the Cape of South Africa. Bot. J. Linn. Soc. 2010, 162, 435–452. [Google Scholar] [CrossRef][Green Version]
- Willems, J.L.; Khamis, M.M.; Mohammed-Saeid, W.; Purves, R.; Katselis, G.; Low, N.H.; El-Aneed, A. Analysis of a series of chlorogenic acid isomers using differential ion mobility and tandem mass spectrometry. Anal. Chim. Acta 2016, 933, 164–174. [Google Scholar] [CrossRef] [PubMed]
- Assanga, S.B.I.; Luján, L.M.L.; Ruiz, J.C.G.; McCarty, M.F.; Cota-Arce, J.M.; Espinoza, C.L.L.; Gil Salido, A.A.; Ángulo, D.F. Comparative analysis of phenolic content and antioxidant power between parasitic Phoradendron californicum (toji) and their hosts from Sonoran Desert. Results Chem. 2020, 2, 100079. [Google Scholar] [CrossRef]
- Lu, Y.; Wang, J. Study on chemical constituents of Thesium chinense. Chin. Tradit. Herb. Drugs 2004, 35, 491–493. [Google Scholar]
- Parveen, Z.; Deng, Y.; Saeed, M.K.; Dai, R.; Ahamad, W.; Yu, Y.H. Anti-inflammatory and analgesic activities of Thesium chinense Turcz. extracts and its major flavonoids, kaempferol and kaempferol-3-O-glucoside. Yakugaku Zasshi 2007, 127, 1275–1279. [Google Scholar] [CrossRef] [PubMed]
- Lee, I.K.; Kim, K.H.; Choi, S.U.; Lee, J.H.; Lee, K.R. Phytochemical constituents of Thesium chinense Turcz. and their cytotoxic activities in vitro. Nat. Prod. Sci. 2009, 15, 246–249. [Google Scholar]
- Liu, Y.; Zahida, P.; Deng, Y.L.; Zhang, J.H.; Dai, R.J. Study on the flavonoids compounds of Thesium chinense. Zhong Yao Cai 2009, 32, 518–520. [Google Scholar]
- Liu, C.; Li, X.-T.; Cheng, R.-R.; Han, Z.-Z.; Yang, L.; Song, Z.-C.; Wang, Z.-T. Anti-oral common pathogenic bacterial active acetylenic acids from Thesium chinense Turcz. J. Nat. Med. 2018, 72, 433–438. [Google Scholar] [CrossRef] [PubMed]
- Zou, Y.; Hong, M.; Yang, X. Isolation of chemical components from Thesium chinense. Zhongguo Shiyan Fangjixue Zazhi 2016, 22, 74–77. [Google Scholar]
- Forest, F.; Manning, J.C. The minor genera Kunkeliella and Thesidium included in Thesium. Bothalia 2013, 43, 214–216. [Google Scholar] [CrossRef]
- Kuttan, R.; Pattabhiraman, K.S.V.; Radhakrishnan, A.N. Possible chemotaxonomic significance of the occurrence of cis-4-hydroxy-L-proline in Santalaceae. Phytochemistry 1974, 13, 453–454. [Google Scholar] [CrossRef]
- Hopkins, C.Y.; Chisholm, M.J.; Cody, W.J. Fatty acid components of some Santalaceae seed oils. Phytochemistry 1969, 8, 161–165. [Google Scholar] [CrossRef]
- Hilliard, O.M. Thesium. In Flora Zambesiaca, Vol. 9, Part 3: Polygonaceae and Myriaceae; Pope, G.V., Polhill, R.M., Martins, E.S., Eds.; Royal Botanic Gardens, Kew: Richmond, UK, 2006; pp. 206–241. [Google Scholar]
- Jadhav, R.B.; Anarthe, S.J.; Surana, S.J.; Gokhale, S.B. Host-hemiparasite transfer of the C-glucosyl xanthone mangiferin between Mangifera indica and Dendrophthoe falcate. J. Plant Interact. 2005, 1, 171–177. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).