1. Introduction
Echinacea species, belonging to the Compositae (Asteraceae) family [
1], are herbaceous, perennial flowering plants native to North America. They rank among the top ten preferred dietary supplements in the U.S. [
2]. Traditionally,
Echinacea purpurea (L.) Moench,
Echinacea angustifolia DC., and
Echinacea pallida (Nutt.) Nutt. have been used in herbal medicine, primarily to improve or prevent viral upper respiratory infections. However, clinical trials assessing the treatment of these diseases, particularly rhinovirus infections, have yielded mixed results [
3]. The inconsistent performance of
Echinacea samples in clinical trials may be attributed to inadequate characterization and standardization of the test formulations.
Comprehensive profiling of the abundant constituents specific to each
Echinacea species and the particular plant parts used (typically aerial parts or roots) holds significant importance. This profiling helps establish a certain level of standardization [
4]. By conducting such profiling, a more consistent and reliable assessment of the therapeutic potential of
Echinacea samples can be achieved. This, in turn, could lead to better-informed clinical trials and enhanced efficacy in treating various diseases. However, for optimal therapeutic effects, it is crucial to identify and calibrate the active principles to facilitate comparative evaluation of different samples.
Echinacea samples used for herbal medicine and dietary supplements are relatively crude and, hence, complex mixtures. The identification of active constituents of complex mixtures that exhibit therapeutic activity is usually achieved through bioassay-guided fractionation. Even though bioassay-guided fractionation has been proven effective in the past, it has several limitations [
5,
6]. During the fractionation process, there can be a potential loss of synergistic properties among the constituents present in the crude extracts. The process of chemical separation often leads to a reduction in material, posing challenges in isolating trace constituents. Moreover, the methodology tends to prioritize the identification of dominant peaks within each fraction or extract, potentially overlooking the bioactive constituents present in low abundance [
7]. Additionally, the whole process is time-consuming and labor-intensive.
In recent years, there has been significant progress in the field of multivariate statistical analysis and chemometrics, offering valuable tools for interpreting spectra obtained from complex mixtures. These techniques allow for the correlation and potential identification of chemical constituent classes responsible for the observed biological activity of the mixtures [
8,
9]. Among the analytical techniques available, nuclear magnetic resonance (NMR) spectroscopy stands out as a highly effective method capable of detecting a wide range of metabolites [
10]. In particular,
1H-NMR-based chemometrics have emerged as a powerful analytical approach for studying biological systems [
11]. Furthermore,
1H-NMR-based chemometrics have been successfully applied in identifying bioactive constituents from plant extracts [
12,
13,
14].
The successful identification and isolation of bioactive constituents from plant extracts necessitate a comprehensive understanding of the chemical constituents present in the extract. Several chemical classes have been characterized for
Echinacea, and members of these classes have been shown to mediate pharmacological effects consistent with the herbal medicine’s potential therapeutic effects. Alkylamides of
Echinacea, for example, show anti-inflammatory properties [
15,
16], caffeic acid derivatives (CADs) that possess antioxidant properties [
17,
18], and polysaccharide-rich fractions that stimulate macrophages [
19,
20]. Specifically, a homogenous polysaccharide derived from
E. purpurea has been shown to suppress tumor progression in vivo by facilitating M1 macrophage polarization [
21]. Polyacetylenes and polyenes have been shown to decrease the viability of colorectal adenocarcinoma cells [
22].
Echinacea also exhibits several other pharmacological effects, although the specific constituent class responsible for these effects has yet to be identified [
23,
24], and interactions between the classes of constituents are rarely considered [
25].
In our previous study [
26], we demonstrated that
Echinacea stimulates myelopoiesis activity. Specifically, we observed an increase in myeloid progenitor cells (Colony-Forming Units–Granulocytes, Macrophages, CFU-GMs) in the bone marrow of female Sprague–Dawley rats following daily oral administration of an ethanol extract (EtOH, 75% v v
−1) derived from the aerial parts of
Echinacea for a duration of 7 days.
Building on the findings from our previous study [
26], in this study, we investigated the effects of aerial parts from eleven
Echinacea samples belonging to three different species, including ten samples obtained from the United States Department of Agriculture-Agricultural Research Service (USDA-ARS) Germplasm Resources Information Network (GRIN)-North Central Regional Plant Introduction Station (NCRPIS) and one commercial
Echinacea purpurea sample from Monterey Bay Spice Co., Watsonville, CA, on the production of CFU-GMs from the bone marrow of treated rats. To investigate the specific target and underlying mechanisms of
Echinacea’s stimulation of CFU-GMs, we evaluated an active sample’s impact on Colony-Forming Units–Granulocytes, Erythrocytes, Megakaryocytes, Macrophages (CFU-GEMMs) and explored potential changes in Blast Forming Units–Erythrocytes (BFU-Es). Additionally, untargeted high-density profiling of the constituents in the crude plant extracts was performed using
1H-NMR spectroscopy. The obtained spectra were binned, and the relative signals of the extracts were correlated with the stimulation of myelopoiesis activity using multivariate statistical techniques to identify the chemical classes that most likely contributed to the observed bioactivity. Knowledge gained from this approach can be used to guide the selection of optimal
Echinacea sources and design efficient bioassay-guided fractionation.
2. Materials and Methods
2.1. Plant Materials
This study utilized aerial parts from ten samples, representing three Echinacea species: Echinacea purpurea (E. purpurea), Echinacea angustifolia (E. angustifolia), and Echinacea pallida (E. pallida). Specifically, these included E. purpurea samples PI-631307, PI-633668, and PI-649040, which were grown under two distinct conditions: in the field (FG) and a shade house (SH) with approximately 50% shading density. E. angustifolia samples included PI-649026, PI-649029, and PI-633654, and E. pallida samples included PI-631300, PI-597603, and PI-633661. All samples were obtained from the seed repository of the USDA-ARS NCRPIS, cultivated, harvested, and dried by a USDA-ARS horticulturalist (JDC), and then transported to our Monroe, LA laboratory for the preparation of ethanol extracts.
In addition to the samples we obtained from USDA-ARS, we also incorporated MISS 82127, a pre-minced and dried commercial product of
E. purpurea procured from the Monterey Bay Spice Company. The product was labeled “
Echinacea purpurea herb, c-s
Echinacea purpurea 1 lb. (454 g) Lot# 16F155-985”. Both the samples from USDA-ARS and the commercial product were kept at −20 °C until used for the study. A voucher specimen from the sample MISS 82127 was sampled and deposited at the University of Mississippi, Thomas M. Pullen Herbarium (MISS).
Supplementary Information File S1 (File S1: Plant Sample Photodocumentation) provides a photographic record of the utilized plant samples.
2.2. Extraction Schemes
The dried aerial parts of the plants were first processed into a fine powder using a blender, followed by filtration through a 20 Mesh sieve (850 microns) to ensure consistency. The resulting plant powder was then macerated in a solution of 75% ethanol (0.05 g dry weight mL
−1; Pharmco-AAPER, Irvine, CA, USA) for a period of 24 h at room temperature. This process was facilitated by continuous agitation using a magnetic stirrer. After maceration, the mixture was filtered and subjected to a second round of maceration using a fresh batch of 75% ethanol for an additional day. The resulting extracts from both macerations were pooled and subsequently dried using a rotary evaporator with a vacuum pump at a temperature of 40 °C. The remaining residue underwent lyophilization to yield a semi-solid extract (
Figure 1). The percentage yield of this extraction process was calculated (
Table 1) and ranged from 9.2 to 17.8% (w w
−1) of the initial aerial part weight (
Table 1).
2.3. Animal Husbandry and Treatment
Female Sprague–Dawley (SD) rats (approximately 6–9 weeks old; 175–230 g) were obtained either from a breeding colony at the University of Louisiana at Monroe (ULM) animal facility or acquired from Harlan-Sprague Dawley Inc. (Madison, WI, USA). Prior to the study, the rats were allowed a one-week acclimation period, during which they were housed two per cage in polycarbonate cages lined with wood chip bedding (Sani-chips, Harlan Teklad, Madison, WI, USA) with free access to rodent chow food (No 7001, Harlan Teklad) and tap water. The housing conditions were maintained at a controlled temperature (21 ± 1 °C) and humidity (50 ± 10%), with a 12 h light–dark cycle. All animal husbandry and handling procedures adhered to the
Guide for Use and Care of Animals (National Research Council, 2011) [
27], and the study protocols received prior approval from ULM’s Institutional Animal Care and Use Committee (IACUC) under approval number 13OCT-SAM-01.
Rats were randomly assigned to different treatment groups, each consisting of 4 to 6 animals. The control group received a DMSO (5% v v−1 in corn oil) solution. The other groups were administered specific doses of the dried 75% ethanol extracts, namely 25 mg kg−1 d−1, 50 mg kg−1 d−1, 100 mg kg−1 d−1, or 200 mg kg−1 d−1, each initially dissolved in a DMSO (5% v v−1 in corn oil) solution. All animals were subjected to daily visual inspections throughout the dosing period to monitor for any signs of overt toxicity; no adverse symptoms were detected. The extracts were orally administered to the rats via gavage daily over a seven-day period, with a dosage volume of 10 mL kg−1. Twenty-four hours following the final dosage, the animals were weighed and then euthanized under CO2 anesthesia. Immediately after, the femurs were excised to isolate bone marrow cells.
2.4. Bone Marrow Cell Isolation
Both femurs from each rat were dissected from the carcass and cleaned of tissue. Proximal and distal ends of the femurs were removed, and bones were flushed with 3 mL of filter (0.2 µm)-sterilized Iscove’s Modified Dulbecco Medium (IMDM; Life Technologies, Waltham, MA, USA) containing 0.2% BSA (bovine serum albumin; Sigma-Aldrich, St. Louis, MO, USA) and 1% antibiotic, antimycotic (Life Technologies, Waltham, MA, USA) using an 18 ½ gauge needle. Femurs were inverted and flushed again with the same 3 mL medium, and media from both femurs were pooled (6 mL per rat). Hereafter, all procedures were performed under sterile conditions. Cells were filtered through sterile nylon mesh and centrifuged at 250× g for 10 min at room temperature. Pelleted cells were resuspended in 3 mL medium, and bone marrow cells were counted using a hemocytometer.
2.5. Mononuclear Cell Isolation
Mononuclear cells in IMDM were isolated from bone marrow cells at the interface after centrifugation (400× g, 30 min) over histopaque-1077 (6 mL; Sigma-Aldrich, St. Louis, MO, USA). After isolation, the mononuclear cells were transferred to a clean 20 mL centrifuge tube and diluted with 10 mL medium, pelleted by centrifugation (400× g, 10 min), resuspended in 5 mL medium, and pelleted two more times. Finally, the cell pellet was resuspended in 0.25 mL medium. The number of cells was determined by a hemocytometer, and the cell dilutions were made equivalent to 2 × 106 cells mL−1.
2.6. Colony-Forming Units–Granulocytes, Macrophages (CFU-GM) Assay
A CFU-GM assay was performed as described previously [
26]. The CFU-GM assay was performed using the CAMEO kits (cat # KCO1-GM1-1R) as per the supplier’s instruction (HemoGenix, Colorado Springs, CO, USA). Briefly, 20,000 mononuclear cells in 15 µL IMDM medium were mixed with 540 µL master mix containing methylcellulose and growth factors: 20 ng mL
−1 Granulocyte Macrophage–Colony-Stimulating Factor (GM-CSF), 10 ng mL
−1 Interluekin-3 (IL-3), and 50 ng mL
−1 Stem Cell Factor (SCF). The mixture was plated (100 µL) in each well of a 96-well plate. The plates were then incubated in a humidified 5% CO
2 incubator at 37 °C for 5 days. CFU-GM colonies were measured based on ATP content, quantitated as luminescence produced from luciferase, and calibrated against a standard curve generated on the same day. Luminescence was measured using a Synergy H1 hybrid reader (BioTek, Winooski, VT, USA).
2.7. Limulus Amebocyte Lysate (LAL) Assay
The presence of lipopolysaccharides (LPS, bacterial endotoxin) in the extracts was determined via the Limulus Amebocyte Lysate (LAL) assay, specifically using a Pyrochrome® LAL kit (Associates of Cape Cod, Inc., East Falmouth, MA, USA). This assay is based on the principle that LPS in the sample triggers the activation of factors in the LAL, leading to the cleavage of a peptide in Pyrochrome® LAL and the production of para-nitroaniline (pNA). Subsequently, pNA reacted with nitrite in HCl and N-(1-Naphthyl)-ethylenediamine (NEDA) to form a diazotized magenta product, which exhibited an absorbance in the optical density range of 540–550 nm. This absorbance was directly proportional to the amount of LPS present.
The LAL assay was performed using only endotoxin-free materials. Extracts (10 mg dry wt mL−1) were diluted in LAL reagent water (LRW), combined with Pyrochrome (reconstituted with 3.2 mL Glucasheild, a (1→3)-β-D-Glucan Inhibiting Buffer) at a 1:1 ratio, and incubated at 37 °C for 27 min, as per the supplier’s instructions. Post-incubation, the mixture was treated with 50 µL each of reconstituted sodium nitrite in HCl, ammonium sulfamate in water, and N-(1-Naphthyl)-ethylenediamine (NEDA) in water. The absorbance was measured at 550 nm, and the quantity of LPS, expressed as Endotoxin Units per gram (EUs g−1), was calculated via interpolation into the standard curve (concentration: 0.05–2.5 EU mL−1) generated on the same day, with LAL reagent water (LRW) serving as a negative control. To eliminate interfering glucan from bacterial-origin cellulosic material present in plant extracts, Glucasheild was employed. Each sample was analyzed in triplicate.
2.8. Statistics
The effects of Echinacea extracts on CFU-GMs were analyzed using a one-way analysis of variance (ANOVA). The normality and equality of variances were confirmed using the Shapiro–Wilk and Brown–Forsythe tests, respectively. A post hoc comparison of treatment means against the vehicle control was performed using Dunnett’s test with a significance level of p = 0.05 (GraphPad Prism 4, GraphPad Software, Inc., San Diego, CA, USA).
2.9. 1H-NMR Spectra
1H-NMR spectra were acquired using a JEOL Eclipse ECS-400 MHz NMR spectrometer, equipped with a 5 mm proton multi-frequency, z-axis pulsed field gradient NMR probe. The acquisition parameters are 13,107 original points count, 7503 sweep widths (Hz), and 16 transients, with an acquisition time of 1.7469 s at room temperature. Calibration was achieved using the solvent residual methyl proton signal (δ = 3.31 ppm). Extracts were dissolved in methanol (HPLC grade; Sigma-Aldrich, USA) at a concentration of 20 mg mL−1, followed by 10 min of sonication, filtration through Whatman filter paper, and complete evaporation under a stream of nitrogen gas. The resultant residue was then redissolved in 700 µL of deuterated methanol (MeOD-d4; Cambridge Isotope Laboratories, Andover, MA, USA). Samples were prepared, and spectra were acquired on the same day.
2.10. Chemometric Analysis
The
1H-NMR spectra were automatically reduced to ASCII files. The spectra were phased and baseline-corrected automatically to reduce manual error and maintain consistency. Solvent regions methanol (3.29–3.32 ppm) and water (4.80–5.00 ppm) were removed from the analysis, and spectral intensities were referenced to the residual non-deuterated methanol signal (3.31 ppm). Each spectrum was integrated into 244 bins corresponding to 0.04 ppm bins across the chemical shift range (0.02–10.02 ppm), and the resultant (non-negative; 194) values were divided by the total spectral intensity to normalize the values, resulting in a total of 194 variables. The data were analyzed using the ACD NMR processor academic version [
28,
29].
Further data analysis was conducted using R software, version 4.3 [
30]. The factoextra and FactoMineR packages were used for visualizing Principal Component Analysis (PCA). The ropls package facilitated the modeling of Orthogonal Partial Least Squares-Discriminant Analysis (OPLS-DA). The area under the ROC curve (AUC) was plotted using the pROC package, and the caret package was employed for the generation of the confusion matrix. Standard univariate scaling was used for all analyses.
4. Discussion
Hematopoiesis is a tightly regulated event with each division, leading to the production of further cells in the lineage thereby giving rise to mature blood cells [
32]. All mature blood cells in the hematopoietic system originate from Hematopoietic Stem Cells (HSCs) that give rise to multipotent progenitors (MPPs). MPPs further give rise to oligopotent progenitors, such as common myeloid progenitors and common lymphoid progenitors (CMPs and CLPs, respectively). CMPs advance to erythrocyte progenitors (BFU-Es), granulocyte, macrophage progenitors (CFU-GMs), and megakaryocyte precursors (CFU-megs) that mature to blood effector cells erythrocytes, granulocytes and macrophages, and platelets, respectively [
32,
33]. The proliferation of progenitor cells in the hematopoietic system is regulated by three factors: hematopoietic cytokines, transcription factors, and the Jak-Stat pathway [
34].
A number of herbal samples have been shown to stimulate bone marrow progenitor cells, including polysaccharides from
Angelica sinensis [
35], a medicinal herbal cocktail Bojungbangdocktong (BJBDT) [
36], a polysaccharide fraction from black soybean [
36], and total saponins of
Panax ginseng cells (TSPGs) [
37]. We are the first to describe the stimulation of myelopoiesis activity by aerial parts of
Echinacea species in female Sprague–Dawley rats [
26].
In our present study, we extended our investigation to identify the effects of the eleven samples that belong to three species of
Echinacea on CFU-GMs. Additionally, we studied the effects of
E. angustifolia PI-649026, which exhibited significant CFU-GM-stimulating activity, on the bone marrow upstream progenitor cells to CFU-GMs, i.e., CFU-GEMMs, as well as the erythroid lineage progenitor cells, i.e., BFU-Es. These investigations aimed to uncover the specific stage within the myeloid pathway at which
Echinacea exerts its effects (
Supplementary Information File S2, File S2 CFUGEMM BFUE).
Among the five samples tested from
E. purpurea species, MISS 82127 (
Figure 2) exhibited the highest activity (91% increase) at 100 mg kg
−1 d
−1, and MISS 82127 was the only sample that showed activity at the lowest dose, i.e., 25 mg kg
−1 d
−1. PI-631307 and PI-633668 showed statistically significant increases in CFU-GMs at 100 mg kg
−1 d
−1 and 200 mg kg
−1 d
−1, with activity plateauing at 100 mg kg
−1 d
−1 for PI-631307 and PI-633668. A statistically significant increase in CFU-GMs was observed for PI-649040 SH and PI-649040 FG at 100 mg kg
−1 d
−1; however, the activity diminished at higher concentrations, i.e., at 200 mg kg
−1 d
−1 (
Figure 2).
Only one
E. angustifolia sample, PI-649026, showed a statistically significant increase in CFU-GMs at the doses tested, while the other samples, PI-649029 and PI-633654, did not exhibit any statistically significant increase in CFU-GMs. We observed a statistically significant increase in CFU-GMs at 200 mg kg
−1 d
−1 with PI-649026 (
Figure 2). We have investigated the effects of
E. pallida samples on CFU-GMs for the first time, and we found a statistically significant increase in CFU-GMs at 100 mg kg
−1 d
−1 and 200 mg kg
−1 d
−1 for PI-597603. However, the observed effect at 200 mg kg
−1 d
−1 decreased from 100 mg kg
−1 d
−1. The samples PI-631300 and PI-633661 did not show any effect on CFU-GMs (
Figure 2). These results demonstrate that the effects of
Echinacea on CFU-GMs are not species-specific and extend across all three species examined. In each of these species, there was at least one sample that showed significant activity in the CFU-GM assay, which aligns with the findings from our previous study [
26].
Seven out of the eleven samples tested showed a statistically significant effect on CFU-GMs at either 100 mg kg
−1 d
−1 or 200 mg kg
−1 d
−1, while the remaining four samples did not show any statistically significant increase in activity. The decreasing order of CFU-GM activity of the eleven samples, based on the maximum percent increase in CFU-GMs from vehicle control for each sample, is as follows (
Figure 3;
Table 2): MISS 82127 > PI-649026 > PI-597603 > PI-631307 > PI-633668 > PI-649040 SH > PI-649040 FG > PI-633661 > PI-631300 > PI-649029 > PI-633654.
Interestingly, PI-649026 and PI-649029 samples obtained from different harvests were studied previously for their effect on CFU-GM cells [
26]. The individual effects at 100 mg kg
−1 d
−1 and 200 mg kg
−1 d
−1 for PI-649026 differed compared to the previous study; however, similar results were obtained regarding the activity or inactivity of these samples in the CFU-GM assay. This suggests that
Echinacea samples grown across different time periods and different harvesting conditions will give similar results regarding their effect on CFU-GM cells. Hence, a large part of the factors driving the stimulation of myelopoiesis appears to be intrinsic to the plant and retained in the germplasm.
A key factor when assessing how
Echinacea stimulates CFU-GMs is that the CAMEO culture media is formulated with excess cytokines that stimulate the clonal growth and maturation of in vivo affected hematopoietic stem and myeloid progenitor cells. There are three types of cytokines that are involved in this procedure: IL-3, GM-CSF, and SCF [
35], with IL-3 and GM-CSF being pleiotropic [
34,
38], meaning that they act on two different types of progenitor cells (CFU-GEMMs and CFU-GMs). Therefore, the CAMEO CFU-GM assay is not able to distinguish which cell populations (CFU-GEMM or CFU-GM) are targeted by
Echinacea.
E. angustifolia PI-649026 was chosen to investigate the potential effects of
Echinacea on CFU-GEMM cells because this sample is known to demonstrate CFU-GM activity in both the present and previous studies. Specifically, we examined the impact of PI-649026 on CFU-GEMM cells at high dosage levels of 100 mg kg
−1 d
−1 and 200 mg kg
−1 d
−1. As shown in
Figure S2a, CFU-GEMM cells are unaffected by PI-649026. These results indicate that
Echinacea may influence the efficacy of the transition from CFU-GEMM to CFU-GM, potentially by altering transcription factors involved in the differentiation of common myeloid progenitor cells or inducing signal transduction pathways related to the development of myeloid progenitor cells.
Lineage programming with transcription factor antagonism (GATA-1 and PU.1 promote erythroid, megakaryocytic, eosinophil, and myeloid differentiation, respectively) determines lineage in hematopoietic progenitor differentiation [
39]. To determine whether the effects of
Echinacea are due to its effect on transcription factors, we studied the effects of the
E. angustifolia sample, PI-649026, on BFU-Es at high doses of 100 mg kg
−1 d
−1 and 200 mg kg
−1 d
−1. Transcription factor antagonism would predict that if
Echinacea’s effect on CFU-GMs is mediated through transcription factors, it would result in a decrease in BFU-Es. However, as demonstrated in
Figure S2b, we did not observe any significant effect, neither an increase nor a decrease, of PI-649026 on BFU-Es. These findings suggest that the effects of
Echinacea on myeloid progenitor cells are unlikely to result from a shift in commitment to the erythroid lineage via modification of lineage transcription factors.
Echinacea’s effects on myeloid progenitors (CFU-GMs) are not attributable to the induction of progenitors of CFU-GMs (increase in CFU-GEMMs) or transcription factor antagonism (decrease in BFU-Es), as illustrated in
Figure S2a,b. This suggests that the
Echinacea’s effect on myeloid progenitor cells is most likely due to the Jak-Stat signaling pathway. The Jak-Stat signaling pathway is the major pathway in hematopoietic progenitor cells that regulates the proliferation and differentiation of progenitor cells. Jak2 is known to have an effect on myeloid progenitors [
40,
41]. Moreover, the Jak2 Stat5 pathway has been implicated in the development of progenitor cells by BJBDT [
36] and TSPG [
37]. Further investigation is required to validate this hypothesis.
The myeloid progenitor-stimulating activity exhibited by some Echinacea samples (PI-631307, PI-633668) plateaued, while the activity of other samples (MISS 82127, PI-649040 SH, PI-649040 FG, PI-597603) decreased at higher doses (200 mg kg−1 d−1). The active Echinacea samples, as they are complex mixtures, may contain chemical constituents that do not stimulate myeloid progenitors, and at higher doses, may inhibit myeloid progenitor stimulation by other active chemical constituents. Consequently, it is crucial to identify and isolate the specific chemical constituent classes responsible for the myeloid progenitor-stimulating activity, which will contribute to the development of consistent and standardized Echinacea formulations for use in various therapeutic applications and further enhance our understanding of the mechanisms underlying its effects on myeloid progenitor cells.
Several studies claimed to have shown that the majority of pharmacological activity exhibited by plant materials is not attributable to plants’ chemical constituents but is rather due to the bacteria and bacterial constituents (lipopolysaccharides) within plants [
42,
43,
44]. A lipopolysaccharide (LPS) assay was conducted to measure the content of the LPS in plant samples. Based on the LPS assay results, the amount of LPS present in
Echinacea species was too low (range 99–221 EUs g
−1) to stimulate myelopoiesis. Moreover, there was no correlation between LPS content and percent activity exhibited by
Echinacea samples (
Figure 5), indicating that bacterial constituents were not responsible for the stimulation of myelopoiesis by
Echinacea.
We applied an
1H-NMR-based chemometric approach to identify the chemical constituent classes correlated with the observed stimulation of myelopoiesis activity by
Echinacea. The
1H-NMR data of eleven samples were reduced to bin sizes of 0.04 ppm, resulting in 194 bins (excluding zeros). Using these 194 variables, a preliminary Principal Component Analysis (PCA) was performed to identify groups, trends, and outliers among the samples. A PCA score plot (
Figure 7) showed that there were no outliers among the eleven samples, and there was no grouping of the samples based on two major principal components (Dim 1 and Dim 2). Further evaluation of other principal components indicated the grouping of the samples based on activity with Dim 2 and Dim 4 (
Figure 8). PCA is an unsupervised approach, and its application only reveals group structure with major principal components when the within-group variation is sufficiently smaller than the between-group variation. Based on the score plots (
Figure 7 and
Figure 8), it is evident that the separation of inactive samples from active samples occurs primarily with the minor principal components (Dim 2 and Dim 4), which only account for 32.43% of the total variance. It is likely that this is due to the presence of variables in the data that do not separate samples based on activity. Thus, we used a supervised method, Orthogonal Partial Least Squares Discriminant Analysis (OPLS-DA), to identify variables responsible for the separation of samples based on activity.
Due to the high variability of the methylcellulose-based colony-forming unit assay and based on PCA score plots (
Figure 7 and
Figure 8), PLS regression [
45] for activity using a continuous metric of percentage increase over control for the stimulation of myelopoiesis as the Y variable was determined not to be useful. Therefore, discriminant analysis (DA) was applied with either the presence or absence of activity in the CFU-GM assay as the Y variable. We used OPLS-DA modeling since OPLS-DA provides a better interpretation of the model, although it provides similar predictions to PLS-DA [
28]. Additionally, OPLS-DA loadings and regression coefficients allow for a more realistic interpretation than PLS-DA since the systematic variation of predictors uncorrelated with the response variable is partitioned in a preprocessing step [
46].
After a supervised model is developed, it is validated with R
2Y and Q
2 statistics. Regression models are considered to have excellent predictability as Q
2 values approach 1 [
47]. However, using R
2Y and Q
2 statistics as validation metrics for discriminant analysis is questionable since Q
2 statistics are developed for regression models, not categorical models [
48,
49]. Moreover, it has been established that the Number of Misclassifications (NMCs) and area under the ROC curve (AUC) perform better than Q
2 statistics in determining model validity for discriminant analysis [
48,
50,
51].
An OPLS-DA model, with the presence or absence of activity as the Y variable, was developed, and internal validation was performed using the NMCs and AUC. As shown in
Figure 9, the OPLS-DA score plot (
Figure 9) was able to distinguish active from inactive samples along the first predictive component. Interestingly, among active samples, MISS 82187 and PI-649026 are separated from other active samples along the first orthogonal component similar to the PCA score plot with minor principal components (
Figure 8). Based on the confusion matrix (
Table 4), the model is able to differentiate between active and inactive samples with an error rate of zero. The p-value from Fisher’s exact test [
52] for the confusion matrix is 0.00303, indicating that the association between original and predicted groups is statistically significant. The AUC plot (
Figure 11) and AUC value (1.0) indicate perfect discrimination between active and inactive samples.
Variable Importance in Projection (VIP) is an important criterion for selecting variables with high significance in chemometrics. VIP is a combined measure of a variable’s contribution to the description of two sets of data: the dependent (Y) and the independent (X) variables. VIP is a weighted sum of squares of OPLS-DA weights that represent each variable’s influence on the model [
53]. The weights in an OPLS-DA model reflect the covariance between the independent and dependent variables. When these weights are included, VIP values are able to reflect both the quality of the description of the dependent variable, as well as its significance for the independent variables in the model [
54]. VIP ≥ 1.0 is generally used as a factor to determine the importance of variables because the average sum of squares of VIP is equal to 1. We identified 71 variables with VIP ≥ 1.0 based on our OPLS-DA model.
When using OPLS-DA modeling, a sample size of 11 can be considered a limitation for identifying important variables. After selecting 71 variables with VIP ≥ 1.0, we used an unsupervised PCA with these variables to evaluate their effectiveness in distinguishing between active and inactive samples using major principal components. As depicted in
Figure 12, the 71 selected variables distinctively separate active and inactive samples along the first principal component (Dim 1). This result aligns with findings from the OPLS-DA score plot (
Figure 9). Uniquely, a novel aspect of our validation process was the comparison of the PCA score plot (
Figure 12) with 71 variables to the PCA score plot with minor principal components (
Figure 8), generated using all 194 variables. The similar grouping patterns observed validate the successful removal of variables that do not contribute to the segregation of samples based on their activity through the selection of variables with VIP ≥ 1.0. This innovative validation approach, therefore, substantiates the proficiency of the selected 71 variables in effectively discriminating between active and inactive samples.
The variables correlated with active samples are identified based on loading plot and VIP scores, and subsequently mapped on the
1H-NMR spectrum of the most potent
Echinacea sample, MISS 82127 (
Table 5 Figure 13 and
Figure 14a–c).
Figure 13 and
Figure 14a–c indicate that structures containing aromatic, alkyne, and alkane protons constitute the most prominent chemical class correlating with the observed stimulation of myelopoiesis activity exhibited by
Echinacea. Previous investigations have demonstrated that the water fraction rich in polysaccharides has no impact on the CFU-GM assay [
26]. Hence, we postulate that the aromatics possessing alkane or alkyl side chains, possibly phenolics, within
Echinacea are responsible for the observed stimulating effect on myeloid progenitors.
It is highly unlikely that a single constituent accounts for the myeloid progenitor-stimulating effect. Rather, the activity may arise from synergistic interactions among multiple constituents, where specific constituents, side chains, and functional groups play crucial roles. In this context, the structural chemical attributes of Echinacea, such as the length of the side chain and the presence or absence of certain moieties, may significantly influence its activity, warranting further exploration of the structure–activity relationship.
A targeted phytochemical analysis of the aromatics with alkane and alkyl side chains provides a promising strategy to identify the active constituents responsible for myeloid progenitor stimulation. For example, lipophilic components are commonly extracted using solvents, such as hexane [
55] or acetonitrile [
56]. Under a typical extraction procedure, aerial parts are initially treated with hexane to remove alkylamides, and then sequentially processed with 100% ethanol and 70% ethanol to eliminate phenolic and phenolic glycosides, respectively [
57]. Leveraging these extraction methodologies in future research could yield significant insights into the contribution of the aromatic fraction with alkane or alkyl side chains in the stimulation of myeloid progenitors. As such, comprehensive testing of this specific fraction within the CFU-GM assay in further investigations is crucial to deepen our understanding of these biological processes.
In summary, the utilization of chemometrics has proven instrumental in the successful identification of potential chemical constituent classes that contribute to the stimulation of myelopoiesis exhibited by Echinacea. Through the application of an advanced statistical modeling technique, OPLS-DA, we were able to pinpoint the key variables and discriminate between active and inactive samples based on their activity profiles. Subsequent validation using PCA of the selected variables provided robust confirmation of their efficacy in distinguishing between the two groups. Finally, the aromatic compounds with alkane and alkyl side chains, possibly phenolics, emerged as a promising direction for understanding the active constituents responsible for the observed myeloid progenitor-stimulating effects.