Next Article in Journal
Impacts of Various Straw Mulching Strategies on Soil Water, Nutrients, Thermal Regimes, and Yield in Wheat–Soybean Rotation Systems
Previous Article in Journal
Transpiration and Water Use Efficiency of Mediterranean Eucalyptus Genotypes Under Contrasting Irrigation Regimes
Previous Article in Special Issue
Genomic Signatures of Environmental Adaptation in Castanopsis hainanensis (Fagaceae)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Chemotaxonomy, an Efficient Tool for Medicinal Plant Identification: Current Trends and Limitations

Department of Life Sciences, Yeungnam University, Gyeongsan 38541, Republic of Korea
*
Author to whom correspondence should be addressed.
Plants 2025, 14(14), 2234; https://doi.org/10.3390/plants14142234
Submission received: 28 May 2025 / Revised: 3 July 2025 / Accepted: 17 July 2025 / Published: 19 July 2025
(This article belongs to the Special Issue Plant Phylogeny, Taxonomy and Evolution)

Abstract

This review highlights the critical role of chemotaxonomy in the identification, authentication, and discovery of bioactive compounds in medicinal plants. By analyzing secondary metabolites using techniques like UV spectroscopy, FTIR, HPLC, GC-MS, NMR, LC-MS-Qtof, and MALDI-TOF MS, chemotaxonomy ensures accurate plant identification, supporting the safe and effective use of plants in herbal medicine. Key secondary metabolites used in chemotaxonomic identification include alkaloids, flavonoids, terpenoids, phenolics, tannins, and plant peptides. Chemotaxonomy also facilitates the discovery of novel compounds with therapeutic potential, contributing to drug development. The integration of chemotaxonomy with genomics and proteomics allows a deeper understanding of plant biosynthesis and the mechanisms behind bioactive compound production. However, challenges due to variability in metabolite profiles and the lack of standardized methods remain, and future research should focus on developing global databases, improving standardization, and incorporating artificial intelligence and machine learning to enhance plant identification and bioactive compound discovery. The integration of chemotaxonomy with personalized medicine offers the potential to tailor plant-based therapies to individual genetic profiles, advancing targeted treatments. This review underscores chemotaxonomy’s importance in bridging traditional knowledge and modern science, offering sustainable solutions for medicinal plant use and drug development.

1. Introduction

In plant sciences, “Taxonomy,” the identification, classification, and naming of plants based on shared characteristics and evolutionary relationships, is considered a foundational field of study [1,2]. Taxonomy is crucial in the conservation of endangered species, as identifying and classifying species accurately is necessary for establishing conservation priorities and protecting biodiversity [3]. In the field of medicine, proper plant identification is vital for ensuring the efficacy and safety of plant-derived medicines [4], and taxonomic classification based on secondary metabolites (alkaloids, flavonoids, terpenoids, etc.) aids in discovering new bioactive compounds for drug development [5]. Similarly, in agricultural sciences, taxonomy supports crop improvement by identifying and classifying economically important plants [6], helping to develop pest-resistant varieties and optimize cultivation practices. With the advent of molecular techniques like DNA barcoding, the accuracy and speed of plant identification have greatly improved, allowing precise classification and even parsing species complexes and identifying cryptic species [7,8].
Classical plant taxonomy emphasizes the use of stable morphological characters that are minimally influenced by environmental factors for taxonomic distinction, excluding traits with high phenotypic plasticity [9]. However, in practice, especially when dealing with commercialized plant materials often presented as ground or fragmented forms lacking key diagnostic organs, reliable identification using macroscopic morphology becomes challenging [10]. In these cases, microscopic anatomical features, molecular markers (DNA-based methods), and chemical profiling can provide more robust and accurate means of identification [11], particularly for medicinal plants, where precise authentication is critical. Palynology, which uses the size, shape, and surface texture of pollen grains to identify plant species, is another valuable method, especially in ecological and archaeological contexts [12]. Molecular methods, such as DNA barcoding, have revolutionized plant identification. By providing a genetic “barcode” using markers like rbcL and matK, precise species identification is possible even in cases where morphological features are absent or ambiguous [13]. Additionally, metabolomics, which uses advanced technologies like liquid chromatography–mass spectrometry (LC-MS) and nuclear magnetic resonance (NMR) to profile the entire metabolome, provides in-depth insights into plant chemistry, further aiding species differentiation [14,15]. Moreover, artificial intelligence (AI) and machine learning (ML) are increasingly being integrated with traditional methods, enhancing the efficiency of plant classification by automating data analysis and detecting complex patterns in chemical and molecular datasets [16]. Through these advances, chemotaxonomy has become important, since this science uses secondary metabolites like flavonoids, alkaloids, terpenoids, and phenolic compounds analyzed through techniques such as high-performance liquid chromatography (HPLC), gas chromatography–mass spectrometry (GC-MS), and LC-MS with quadrupole time of flight (LC-MS-Qtof) [17]. Such high-throughput techniques are helpful to distinguish species based on chemical profile, thus offering high accuracy in cases of closely related or cryptic species [18]. Interestingly, multiple plant identification techniques, including DNA barcoding, AI, and chemotaxonomy, are increasingly being integrated, with such protocols often referred to as hybrid methods [19].
Chemotaxonomy is an old but now rapidly evolving field that plays a significant role in plant classification and identification [20]. Secondary metabolites are often unique to plant species and provide valuable insights into their evolutionary relationships [21,22]. This integration of morphological and chemical traits allows for more accurate and reliable plant identification, which is especially critical in the study of medicinal plants. Chemotaxonomy has witnessed significant advances in recent years, particularly with the integration of molecular biology techniques. For instance, DNA barcoding has become a popular method for plant identification, offering a molecular-marker-based approach that complements chemical profiling [23]. Metabolomics, through the comprehensiveness of the catalog of metabolites, can be used for specific plant tissues and has also enhanced the capabilities of chemotaxonomy [24]. Additionally, multivariate analysis techniques, such as principal component analysis (PCA) and cluster analysis (CA), have enabled researchers to better correlate chemical data with taxonomic information, improving the accuracy of plant classification [25]. Furthermore, the integration of bioinformatic tools and databases has facilitated the analysis and storage of large volumes of chemotaxonomic data, making it easier to access and compare plant profiles from different regions and studies [20,26].
The objective of this review is to provide an in-depth exploration of the current trends in chemotaxonomy, particularly in the context of medicinal plant identification. It highlights recent technological advances in chemotaxonomy, including the latest analytical techniques and molecular methods used to study plant chemical composition. It also discusses the integration of chemotaxonomy with traditional taxonomy and molecular biology to improve the accuracy and reliability of plant identification.

2. Concept of Chemotaxonomy and Medicinal Plant Identification

Chemotaxonomy is a discipline that not only utilizes the chemical characteristics of a plant to classify plants but also distinguishes between closely related species [20] and differentiates intraspecific taxa such as subspecies and varieties [27]. This makes it an essential and nuanced tool in plant identification and classification. Unlike traditional morphological classification, chemotaxonomy focuses on the chemical compounds found within a plant. Secondary metabolites are often characteristic of specific plant species and families, making them valuable for taxonomic classification [28]. However, their levels and presence can vary significantly depending on factors such as plant developmental stage, organ type, harvest time, and environmental conditions [29]. Therefore, while secondary metabolites can complement morphological traits in classification, their variability should be carefully considered during chemotaxonomic studies. This technique is often used in conjunction with traditional morphological methods, and together, they provide a deeper, molecular-level understanding of plant diversity. However, by analyzing their chemical profiles, chemotaxonomy can uncover subtle differences between species that are otherwise unobservable (Figure 1).

3. Primary and Secondary Metabolites in Medicinal Plants

In medicinal plants, primary metabolites (i.e., carbohydrates, amino acids, proteins, and fatty acids) are inherently essential to the plant’s basic growth and cellular processes [30]. They play key roles in energy production, structural functions, and cell division within the plant, and are crucial to overall plant health and survival [31]. Secondary metabolites, on the other hand, are considered non-essential compounds not directly involved in basic plant functions but serve key ecological roles in, for example, defense against herbivores, pathogens, and environmental stressors [32]. These compounds include an array of diverse classes like alkaloids, flavonoids, terpenoids, phenolic acids, glycosides, etc., which are the primary pharmacologically active compounds in medicinal plants [33] (Figure S1). These pharmacological activities include anti-inflammatory, antimicrobial, anticancer, antidiabetic, antiviral, and antioxidant activities, making them critical in the development of new therapeutic agents [34,35] (Table 1, Figure S2).

4. Chemotaxonomy vs. Traditional Morphological Taxonomy

Morphological plant identification that is based on physical plant features including leaf shape, flower color, fruit type, and plant size, has been a foundation of plant taxonomy for centuries [39]. While this method provides an easily accessible and non-invasive approach, environmental factors, phenotypic plasticity, and developmental stage variations are key limits that can affect accuracy [40,41]. Chemotaxonomy, on the other hand, relies on the chemical composition of plants, particularly the secondary metabolites, which are more stable and consistent traits in plants [20,42]. Additionally, chemotaxonomy can identify cryptic species that may appear morphologically similar but differ in chemical composition [43]. Chemotaxonomy can therefore be considered as a “complementing tool” that can facilitate robust plant identification.
Combining morphological characteristics with chemical analysis offers a more comprehensive understanding of plant relationships. Chemotaxonomy thus has the potential to enhance the accuracy of plant classification, particularly when used in combination with molecular tools, such as DNA barcoding [9]. Together, these methods can offer a more complete and precise plant identification system, ensuring that medicinal plants are accurately classified and effectively utilized for therapeutic purposes (Table 2).

5. Applications of Chemotaxonomy in the Herbal and Medicinal Plant Sciences

The primary advantage of chemotaxonomy in medicinal plant identification is its ability to more precisely and reliably distinguish between plant species. As mentioned above, many plants appear similar in terms of their morphology but differ significantly in their chemical composition. This makes chemotaxonomy a powerful tool for identifying plants with unique medicinal properties, as the chemical profile is often directly linked to the plant’s therapeutic potential and ethnopharmacological relevance [47]. Additionally, by focusing on chemical markers, chemotaxonomy allows researchers to identify plants in situations where fresh or whole plants are unavailable, e.g., in cases of adulteration and substitution, which is essential for ensuring the correct plant species is used in herbal medicines and pharmaceuticals [48]. Moreover, chemotaxonomy supports the standardization and quality control of herbal products by providing reproducible chemical markers that act as benchmarks for authentication [49]. This ensures batch-to-batch consistency, which is critical for the efficacy and safety of herbal medicines. Furthermore, it also facilitates regulatory compliance by enabling manufacturers to present validated phytochemical profiles of their raw materials and final products [50] (Figure 2).

6. Analytical Methods Used for Compound Identification

Chemotaxonomy involves both qualitative and quantitative analyses of compounds to establish relationships between plant species. In chemotaxonomy, the identification of secondary metabolites is typically carried out using analytical techniques such as HPLC, mass spectrometry (MS, including gas chromatography–MS (GC-MS)], LC-MS-Qtof, matrix-assisted laser desorption/ionization time-of-flight MS (MALDI-TOF MS)), UV and infrared (Fourier transform infrared), and NMR. These techniques allow the efficient separation, identification, and quantification of plant metabolites, providing a chemical “fingerprint” that is unique to each species (Table 3).

6.1. HPLC

High-performance liquid chromatography has become indispensable in chemotaxonomy. Its versatility, high resolution, and reproducibility make it ideal for the analysis of complex plant samples containing a wide variety of chemical compounds [56]. Researchers have been applying HPLC to chemotaxonomic identification within plant families for decades, and almost all plant families have been extensively studied using HPLC. Here are a few examples: The Fabaceae family, which includes a large number of economically and medicinally important plants, has been the subject of several such studies. Here, HPLC has been used to identify flavonoids and alkaloids, which have become key chemotaxonomic markers [57]. Similarly, within the genera Medicago (alfalfa) and Cicer (chickpea), HPLC has been used to profile flavonoids such as quercetin, kaempferol, and their derivatives [58]. These compounds help differentiate species within these genera, as the presence and concentration of specific flavonoids vary between species. Additionally, the alkaloid profiling of Lupinus (lupine) species has been investigated using HPLC, identifying compounds such as quinolizidine alkaloids, which are characteristic of this genus [59].
Similarly, chemical identification in the family Solanaceae has been extensively performed using HPLC, with the main focus on alkaloids and other bioactive compounds. In Capsicum (chili pepper) species, for instance, the compound responsible for their signature heat, flavor, and pungency, capsaicin, was identified and characterized using HPLC [60]. Similarly, nicotine, the alkaloid that serves as a chemotaxonomic marker for Nicotiana tabacum (tobacco) within the family, was also identified using HPLC [61].
Chemotaxonomic identification based on HPLC has also been used in many other plant groups including, the family Asteraceae, which includes well-known medicinal plants, such as Artemisia annua (used for malaria treatment) and Echinacea purpurea (used for immune support). Flavonoids, sesquiterpenes, and alkaloids have been extensively profiled using HPLC [62,63]. The family Rutaceae (most notable for containing the genus Citrus) is another plant group that has been widely studied using HPLC. Citrus fruits, including oranges (Citrus sinensis) and lemons (Citrus limon), are known for their content of flavonoids and terpenoids, especially hesperidin and narirutin, and these serve as important chemotaxonomic markers within this family [64]. Additionally, detailed data is available on plants of the Rubiaceae family, especially Coffea arabica (coffee), which contains extensive concentrations of caffeine, chlorogenic acids, and other alkaloids that are critical for distinguishing between Coffea species [65]. More recently, the integration of HPLC with chemometric techniques, like PCA and CA, has further enhanced plant classification. For example, Coffea species were differentiated from each other based on their alkaloid and flavonoid content using HPLC and PCA [66]. Similarly, using analyses combining HPLC data with chemometrics, essential oil and secondary metabolite profiles helped define Coriandrum sativum (coriander) and Carum carvi (caraway) as species [67]. Traditional HPLC mainly utilizes retention time and UV-Vis spectral fingerprints for compound analysis. Advanced systems equipped with photodiode array (PDA) detectors incorporate UV-Vis spectral libraries, allowing tentative compound identification through spectral matching against reference databases (e.g., Agilent OpenLab, ChemStation) [68,69,70]. However, due to overlapping UV absorption profiles among structurally related compounds, identification based solely on UV-Vis spectra often lacks specificity and requires complementary techniques for confirmation.

6.2. GC-MS

Gas chromatography–mass spectrometry (GC-MS) is an important technique used for the analysis of volatile compounds. It is widely applied in the study of essential oils containing terpenoids, fatty acids, and polyphenolics, which are responsible for the medicinal properties of many plants [71]. The use of GC-MS in plant chemotaxonomy dates back to the 1970s and 1980s, and this technique plays a key role in chemotaxonomy of volatile compounds.
For several years, GC-Ms has been used for component analysis in a number of diverse plant families. For instance, the Lamiaceae family, which is often referred to as the “mint family,” is well known for its “aromatic” qualities. It includes well-known species like Mentha piperita (mint) and genera like Thymus (thyme) and Ocimum (basil), and it has been extensively studied using GCMS [72]. The Mentha species have been classified based on their high concentrations of menthol and other terpenoids using GCMS [73]. Similarly, Thymus vulgaris (thyme) mainly contains high concentrations of thymol and carvacrol, which are key components of its essential oil [74]. Thymol and carvacrol are considered as valuable chemotaxonomic markers of similar kinds of compounds that can be helpful in assessing the quality of medicinal herbs.
Another notable application of GC-MS is its use in the analysis of Apiaceae, which includes aromatic and medicinal plants like Carum carvi (caraway) and Co. sativum (coriander). GC-MS has been widely used to identify the volatile compounds in the essential oils of these plants, such as carvone, limonene, and linalool, which are characteristic of Ca. carvi and Co. sativum [67,75]. These compounds thus serve as chemotaxonomic markers, aiding in the differentiation of species within the Apiaceae family. Similarly, Citrus species of the Rutaceae family have been studied using GC-MS, with limonene and other terpenes identified as key chemotaxonomic markers for species identification [76].
More recently, GC-MS has been combined with chemometrics to enhance plant species delineation based on chemical profiles [77]. The integration of multivariate statistical techniques, such as PC and CA, with GC-MS data has improved our ability to differentiate closely related plant species, detecting even subtle variations in their chemical profiles [78] Additionally, combining GC-MS and LC-MS allows the development of a complete chemical fingerprint of a plant, helping researchers identify the key compounds responsible for its therapeutic properties [79].

6.3. LC-MS-Qtof

Liquid chromatography–mass spectrometry with quadrupole time of flight is an advanced analytical tool in chemotaxonomy that enables the precise identification of a plant species based on their secondary metabolites [80]. This high-resolution technique combines the separation capabilities of liquid chromatography with the accuracy of time-of-flight mass spectrometry, making it ideal for producing plant chemical profiles. The application of LC-MS-Qtof in chemotaxonomy began in the early 2000s, at which time it represented a new approach. Early studies primarily focused on the analysis of alkaloids, flavonoids, and terpenoids as key metabolites often used in chemotaxonomic studies [80,81]. However, this technique is equally efficient for essential oils. For example, in investigations focusing on the family Lamiaceae, LC-MS-Qtof has been effectively used for the chemotaxonomical profiling of essential oils, flavonoids, and phenolic compounds [82,83].
Similarly, in Citrus species (Rutaceae), LC-MS-Qtof has been used to distinguish between species based on terpenoid profiles, with particular attention paid to limonene, a characteristic compound of these species [84]. Likewise, in the Solanaceae family, LC-MS-Qtof has been extensively used to analyze bioactive alkaloids like capsaicin and nicotine [85]. In an investigation, LC-MS-Qtof was used to study Capsicum annuum (chili pepper), with capsaicinoids identified as key chemotaxonomic markers for this species [86]. Similarly, in the Asteraceae family, LC-MS-Qtof has proven valuable for identifying bioactive compounds in Artemisia (wormwood) and Echinacea species, both of which are widely used in traditional medicine. For example, A. annua is very well known for its antimalarial properties, which rely on the presence of artemisinin. It has been studied extensively using LC-MS-Qtof to determine its sesquiterpene content [87], and E. purpurea, which is known for its immune-boosting properties, has been analyzed to characterize its contents of echinacoside and other caffeic acid derivatives [88]. These examples highlight the increasing use of LC-MS-Qtof for the validation and authentication of medicinal plants. As with other analytical methods, the coupling of LC-MS-Qtof data with PCA and CA is gaining the attention of researchers for its ability to make data analysis more efficient and interpretation more reliable [89]. Modern LC-MS platforms leverage extensive mass spectral libraries to enable rapid and accurate compound identification by matching acquired spectra against reference databases. Prominent libraries include the comprehensive NIST Mass Spectral Library; METLIN, which specializes in metabolites and natural products, the open-access MassBank repository; and the commercial high-resolution mzCloud database with predictive fragmentation features [90,91,92]. Collaborative platforms like GNPS facilitate community-driven annotation of MS/MS data, especially for natural products [93]. Integrated software tools, such as Thermo Fisher’s Compound Discoverer, Agilent’s MassHunter, Waters’ UNIFI, and Sciex OS [93], utilize these libraries for automated spectral matching, molecular formula prediction, and in silico fragmentation, significantly improving identification confidence for complex mixtures.

6.4. MALDI-TOF MS

Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry has become a key technique in chemotaxonomy [94]. The MALDI-TOF MS system employs a matrix to assist in the ionization of molecules, which are then analyzed using time of flight to identify their characteristic mass-to-charge ratios (m/z). This technique’s ability to generate unique spectral fingerprints of an organism’s metabolites, particularly proteins and lipids, has made MALDI-TOF MS invaluable in taxonomic studies [95].
As a recent example, MALDI-TOF MS was utilized to identify alkaloids and other bioactive compounds in species of Cucurbita (squash plants), which are used in traditional medicines [96]. By analyzing the protein and lipid profiles, researchers were able to categorize these species more accurately than when using traditional morphological approaches. In addition, the MALDI-TOF MS analysis revealed the presence of several important bioactive compounds, including cucurbitacins (e.g., cucurbitacin B and cucurbitacin E), quinolone alkaloids, and flavonoids, as well as phenolic acids like caffeic acid and chlorogenic acid [96]. Similarly MALDI-TOF MS analysis provided a comprehensive chemical profiling of closely related species and strains of Echinacea, including E. purpurea and Echinacea angustifolia, that highlighted common chemical compounds [97]. This analysis is crucial in ensuring the correct species is used in medicinal products, as different species may have different active compounds that contribute to immune modulation.

6.5. Nuclear Magnatic Resoanace (NMR)

Nuclear magnetic resonance spectroscopy offers a non-destructive, detailed method for characterizing the structures of plant metabolites and is especially useful for those not easily characterized via other methods [98]. It provides extensive information about the chemical environment of atoms within a molecule, allowing for the identification of complex compounds with high specificity [99]. It is important to note that NMR is often used in conjunction with other techniques, such as HPLC (Preparative), GC-MS, LC-MS, or LC-MS-Qtof. In such combinations, NMR can greatly enhance plant identification in families with diverse metabolites. For instance, in the Apiaceae family, for which flavonoids and coumarins are essential secondary metabolites, studies on Angelica and Coriandrum species have successfully employed NMR to elucidate unique flavonoid, glycoside, and furanocoumarin profiles that are critical for chemotaxonomic classification within these genera [100,101]. Such metabolite profiles can be directly correlated with a plant’s therapeutic properties, not only allowing accurate species identification but also effective quality control [102]. Similarly, NMR spectroscopy has greatly facilitated chemotaxonomy in the Lamiaceae family [20,103], where it has been extensively used to profile terpenoids, such as menthol, carvacrol, and eugenol, which serve as chemical markers [104] for distinguishing between closely related species. For instance, M. piperita and Mentha spicata (spearmint) can be differentiated based on their distinct terpenoid profiles, as revealed through NMR analysis [105]. Overall, the phytochemistry and plant science literature is replete with NMR analyses, and no phytochemical research is complete without one.

7. Current Trends in the Identification of Medicinal Plants

In recent years, advancements in analytical and molecular technologies have significantly enhanced the scope and precision of chemotaxonomic studies. Furthermore, the integration of chemotaxonomy with bioinformatics and computational tools has streamlined data analysis, enabling more comprehensive studies of plant species and their medicinal properties. In this section, we review current trends in chemotaxonomy, focusing on the integration of molecular tools and multivariate analyses and the computational and technological advancements that are pushing the field forward (Figure 3).

7.1. Integrating Molecular Techniques with Chemotaxonomy

Molecular techniques complement traditional chemotaxonomy analyses well, and recent advancements are enabling even more accurate identification and classification of medicinal plants. Two molecular approaches, DNA barcoding and metabolomics, have gained significant attention [24,106]. By integrating chemical and molecular information, plant identification can be more authentic.

7.1.1. DNA Barcoding

DNA barcoding has emerged as a critical tool in modern plant taxonomy, as it provides a rapid and reliable method for plant species identification. By utilizing the DNA sequences of short, universally accepted genomic regions, such as rbcL and matK gene regions and the ITS (internal transcribed spacer) region of the ribosomal RNA gene [107], DNA barcoding enables precise, species-level identification, even for morphologically similar species or when only incomplete plant specimens are available [108]. These regions are selected due to the balance between conservation and variation their sequences exhibit, which allows differentiation among species while maintaining a high degree of genetic stability across plant families [109]. By helping to catalog species, DNA barcoding aids in the conservation of plant biodiversity, especially in regions with rich but under-studied flora [110]. Barcoding, along with the computational tools and global genomic databases that have grown in its wake, has enabled researchers to address challenges related to species identification with enhanced efficiency and accuracy, driving significant advancements in plant biology and botanical research.

7.1.2. Metabolomics

Metabolomics, the comprehensive study of metabolites in organisms, has emerged as a powerful tool in chemotaxonomy [24]. For medicinal plants, this approach involves analyzing the complete set of metabolites in a plant, including both primary and secondary metabolites, which are critical for identification. By employing techniques like HPLC, GC-MS, and NMR, metabolomics enables the profiling of plant species based on their metabolic signatures [111]. The integration of metabolomics into chemotaxonomy provides a deeper understanding of the functional roles of plant metabolites in medicinal efficacy, enhancing the identification of bioactive compounds and improving the overall quality control of medicinal plant products [20].

7.2. Multivariate Analysis in Chemotaxonomy

Multivariate analyses are crucial for processing and interpreting the large datasets generated by many chemotaxonomic techniques. Principal component analysis and CA are two commonly used methods that help researchers extract meaningful patterns and relationships from complex chemical datasets, facilitating plant identification and classification [44,112].

7.2.1. Principal Component Analyses (PCA)

Principal component analyses are widely employed in chemotaxonomy to reduce the dimensionality of large chemical datasets while retaining the most significant variance [113]. By transforming the data into a set of orthogonal components, PCA facilitates the identification of patterns and trends that may not be immediately apparent in the raw data [17]. In the context of medicinal plants, PCA helps in grouping plant species based on their chemical profiles, providing insights into the chemical diversity of plant families or genera. This technique is particularly useful when dealing with large datasets generated when combining multiple analytical techniques, such as HPLC, GC-MS, and NMR [114].

7.2.2. Cluster Analysis (CA)

Cluster analysis (CA) complements PCA by grouping plant species or samples based on similarities in their chemical profiles. This method uses algorithms, such as hierarchical clustering or k-means clustering, to classify plant species into distinct groups or clusters [115]. It is particularly valuable when dealing with closely related species or varieties that exhibit overlapping chemical traits, as it can identify subtle yet significant chemical differences [116]. By organizing species into clusters based on their chemical characteristics, CA not only aids in the identification of novel plant species but also enhances our understanding of the ecological adaptations of different plant groups and the evolutionary relationships within and between groups [117]. Overall, CA offers chemotaxonomy a powerful tool for the classification and identification of plant species, facilitating the discovery of bioactive compounds and improving the quality control of medicinal plant products.

8. The Role of AI in Chemotaxonomy

Artificial intelligence has emerged as a transformative force in chemotaxonomy, where it is used to enhance plant identification, classification, and the discovery of bioactive compounds [20,118]. In particular, AI models based on ML, deep learning, and natural language processing offer powerful tools for processing and analyzing the vast chemical datasets generated by analytical techniques like NMR, GC-MS, and LC-MS-Qtof [119]. Furthermore, AI can assist in automating plant identification, improving pattern recognition among chemical profiles, and enabling predictive analyses of plant compound bioactivity (Figure 4). In this section, we discuss the key areas where AI is making an impact in chemotaxonomy.

8.1. Data Analysis and Pattern Recognition

One of the primary applications of AI in chemotaxonomy is in the analysis of complex datasets through techniques like PCA and CA, as discussed earlier.

8.2. Automation of Plant Identification

Various AI models, especially deep learning algorithms like convolutional neural networks (CNNs), are increasingly used to automate plant species identification [120]. By training AI systems on large datasets of chemical fingerprints, these models learn to recognize specific metabolite patterns that distinguish one species from another [121]. The advantage of AI in this context is its ability to process data much faster than human experts and to handle larger datasets, leading to a more efficient identification process. For instance, CNNs have been applied to GC-MS and LC-MS-Qtof data to automate the classification of plant species [122]. These AI models are able to detect chemical signatures in chemical profiles containing secondary metabolites, such as terpenoids, flavonoids, and alkaloids, which are characteristic of specific plant families [123].

8.3. Integration of Multi-Omics Data

Given their ability to discern patterns in large datasets, AI models are playing a significant role in the integration of various omics data types (e.g., genomics, metabolomics, and environmental data), facilitating more accurate and comprehensive plant identification and classification [124,125]. By efficiently combining metabolomics data from LC-MS-Qtof and GC-MS analyses with genomic data, AI models can provide more robust predictions about plant species and their bioactive compounds [126]. This integration helps enhance plant chemotaxonomy by offering a more holistic view of a plant’s chemical, genetic, and environmental profile.

8.4. Predicting Bioactivity and Medicinal Potential

Artificial intelligence has significantly advanced the prediction of bioactivity, and thus medicinal potential, by allowing the analysis of extensive datasets to identify novel drug candidates and forecast their interactions and efficacy [127,128]. Researchers have been working to expedite drug discovery through machine and deep learning techniques, focusing on the prediction of protein structure, drug–target interactions, and molecular properties [129]. Similarly, AI models can more efficiently predict drug toxicity, bioactivity, and physicochemical properties, thereby streamlining the drug development process [130]. For example, in the evaluation of medicinal plants, AI algorithms have been trained to predict the anti-inflammatory, antimicrobial, and anticancer properties of plant compounds based on their chemical structure [131]. These AI-based methods can drastically reduce the time and resources required for experimental testing by providing preliminary insights into the therapeutic potential of specific compounds.

8.5. Advancements in AI Algorithms and Chemotaxonomy

Recent advancements in AI, particularly in deep learning and natural language processing, have further enhanced its application in chemotaxonomy. Newer algorithms are able to handle larger and more complex datasets, enabling faster and more accurate plant species identification based on chemical profiles [132]. For instance, transformer-based models like BERT (bidirectional encoder representations from transformers) have been applied to metabolomic data, improving our understanding of plant metabolites and their role in species differentiation [133,134]

9. Limitations of Chemotaxonomy

Despite its valuable contributions to the identification and classification of medicinal plants, chemotaxonomy faces several limitations that must be addressed before its full potential can be realized (Table 4). This section discusses several important points that must be considered.

9.1. Variability in Secondary Metabolite Profiles

Secondary metabolites in plants are highly influenced by environmental factors, such as soil conditions, climate, altitude, and seasonal changes, as well as genetic diversity [135]. This variability can lead to significant differences in alkaloid, terpenoid, and flavonoid profiles across regions or growing conditions, which complicates chemotaxonomic identification and classification [139].

9.2. Standardization Issues

A lack of standardized methods for analyzing plant chemical profiles presents another significant challenge in chemotaxonomy [135]. Different researchers may use different analytical techniques, such as HPLC, GC-MS, or NMR, under varying experimental conditions, which can lead to inconsistent results [140]. Additionally, the way plant extracts are prepared, including the solvents used and the extraction methods applied, can influence the chemical composition of the samples [141].

9.3. Lack of Comprehensive Databases

One unavoidable issue in chemotaxonomy is the absence of comprehensive and accessible databases that house detailed chemical profiles of medicinal plants [135]. While several databases have been developed to catalog plant species and their associated chemical compounds, these resources are incomplete, lack uniformity, or are not easily accessible [142]. This limits the utility of these databases for taxonomic and medicinal plant identification.

9.4. Accessibility and High Costs of Analytical Techniques

The high costs associated with advanced analytical techniques like HPLC, GC-MS, NMR, and LC-MS-Qtof are significant barriers to the widespread use of chemotaxonomy, especially in resource-limited settings [17]. The purchase and maintenance costs of the sophisticated equipment required for these techniques, as well as the cost of reagents and consumables, can be prohibitive for many research institutions and laboratories, particularly in developing countries.

9.5. Ethnobotanical Knowledge

Another limitation of chemotaxonomy is a lack of comprehensive ethnobotanical knowledge, which could inform the classification and identification of medicinal plants [135]. Ethnobotanical data, which includes knowledge of traditional plant usage in indigenous communities, is crucial for understanding the medicinal properties of plants and their modern applications [143]. However, much of this knowledge has not been systematically documented and is at risk of being lost as traditional practices fade away.

10. Challenges in Chemotaxonomic Identification

Despite the importance of chemotaxonomy, several limitations do also exist. For instance, there is high variability in plant chemical compositions, which can be influenced by various factors, including environmental conditions, developmental stage, and genetics [29,144]. This variability can complicate identification and classification, leading to inconsistencies in chemotaxonomic results [145]. Another challenge is the high cost and complexity of the analytical techniques required for identifying plant metabolites [146]. Similarly, a lack of standardized methods for using these techniques creates an issue [147]. Moreover, there is a deficiency in globally harmonized chemotaxonomic reference databases, limiting comparative analysis across species or genera. These technical disparities hinder the development of universal chemotaxonomic frameworks, underscoring the need for method standardization, data sharing platforms, and the integration of AI to streamline identification and interpretation.

11. Future Directions in Chemotaxonomy

As new tools and methodologies are emerging, chemotaxonomy is becoming an even more integral part of medicinal plant research and applications. This section highlights several key areas we believe should be considered important focuses of future research.

11.1. Integration of Multi-Omics Approaches

The convergence of genomics, transcriptomics, proteomics, and metabolomics is revolutionizing chemotaxonomy. Tools like MEGA, Cytoscape, and WGCNA enable pathway prediction and co-expression analysis of plant chemical components [148]. Similarly, databases such as MetaboLights, HMDB, and KEGG provide metabolite–gene linkage data that can assist in taxonomic discrimination [149]. Serval multi-omics workflows have recently been applied to cyanobacteria and algae, demonstrating the potential to open new paths in chemotaxonomy [150], which in fact adds to its relevance.

11.2. Application of AI and ML

Both AI and ML models are consistently being used to classify plants based on high-dimensional chemical fingerprints. Random forests and support vector machines have already been used in LC-MS-based medicinal plant classification [151]. Similarly, deep learning-based bioinformatics tools, such as DeepChem and DeepMetabolome, can process large metabolomics datasets for automated taxonomic classification [152].

11.3. Development of Comprehensive Chemotaxonomic Databases

Given the increasing importance of chemotaxonomy, there exists a great need to develop interoperable and centralized platforms that integrate genomics, phytochemistry, and environmental metadata. Databases like KNApSAcK, PlantCyc, MassBank, and NPASS can be used as foundations to create these resources [153].

11.4. Digital Herbarium Platforms with Integrated Chemoprofiling

Future digital herbaria could integrate high-resolution images, geospatial metadata, and chemical fingerprints from NMR, LC-MS, GC-MS, etc., enabling greater taxonomic and biochemical insight [154]. Current systems, like GBIF and iDigBio, lack chemical metadata, which can be helpful in this case [155]. Digital herbaria should allow spectral searchability, further enhancing the synergy created by the integration of data types.

11.5. Synthetic Biology for Metabolic Pathway Validation

Synthetic biology allows scientists to validate biosynthetic pathways inferred from chemotaxonomic studies [156]. By expressing gene clusters in model organisms (e.g., Nicotiana benthamiana’s secondary metabolites expressed in yeast), the origin and regulation of secondary metabolites can be studied [157]. This approach is crucial for confirming gene function, regulatory sequences, and enzyme interactions that are otherwise obscured in native plant systems due to redundancy or low expression levels [158].

11.6. Chemotaxonomy in Conservation and Drug Discovery

By linking phytochemical diversity with taxonomic and phylogenetic information, chemotaxonomy can be a very helpful tool for conservation biology and drug discovery [27]. By identifying taxa with rich and unique metabolite profiles, chemotaxonomy can guide conservation efforts toward chemically and evolutionarily valuable species, many of which may currently be endangered or underexplored [139]. This approach enhances the efficiency of bioprospecting, as taxonomic proximity to known medicinal plants often predicts similar bioactivity profiles. For instance, the chemotaxonomic mapping of phytochemical “hotspots” in the plant kingdom has been proposed as a way to identify priorities for both biodiversity protection and pharmaceutical exploration [159] (Figure 5).

12. Conclusions

Chemotaxonomy serves as a vital tool in medicinal plant research by enabling precise species identification and medicinal product authentication through the chemical profiling of secondary metabolites. Analytical techniques such as HPLC, GC-MS, NMR, LC-MS/QToF, and Fourier transform infrared spectroscopy are instrumental in detecting bioactive compounds, supporting both quality control and therapeutic agent discovery. When integrated with genomics and proteomics, chemotaxonomic approaches facilitate the elucidation of biosynthetic pathways, enhancing our understanding of metabolite diversity and function. Such methodologies are particularly critical for ensuring the safety and efficacy of herbal medicines by preventing misidentification and adulteration. Moreover, chemotaxonomy contributes to drug development by uncovering novel phytochemicals with pharmacological potential. Emerging trends in this field involve the incorporation of AI and ML to accelerate compound identification and classification. Despite these advancements, the standardization of analytical protocols and the development of comprehensive global databases remain key challenges. Continued progress in chemotaxonomy is expected to drive innovations in plant-based drug discovery and personalized phytotherapeutics and accelerate traditional medicine validation while promoting sustainable utilization of botanical resources.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants14142234/s1, Figure S1. Structure of diverse secondary metabolites (and their classes) commonly found in plants. Figure S2. Overview of plant secondary metabolites in correlation with plant identification.

Author Contributions

Conceptualization, writing—original draft preparation, resources, software, validation, visualization, A.A.; writing—review and editing, S.P. and A.A.; supervision, S.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kunhikannan, C.; Anju, M.V.; Prasanna, R. Taxonomy: Importance, Relevance and Application. In Textbook of Forest Science; Springer: Berlin/Heidelberg, Germany, 2025; pp. 65–85. [Google Scholar]
  2. Rouhan, G.; Gaudeul, M. Plant taxonomy: A historical perspective, current challenges, and perspectives. In Molecular Plant Taxonomy: Methods and Protocols, 2nd ed.; Humana Press: Totowa, NJ, USA, 2021; pp. 1–38. [Google Scholar]
  3. Zink, R.M.; Klicka, L.B. The taxonomic basis of subspecies listed as threatened and endangered under the endangered species act. Front. Conserv. Sci. 2022, 3, 971280. [Google Scholar] [CrossRef]
  4. Chaachouay, N.; Zidane, L. Plant-derived natural products: A source for drug discovery and development. Drugs Drug Candidates 2024, 3, 184–207. [Google Scholar] [CrossRef]
  5. Gong, X.; Yang, M.; He, C.N.; Bi, Y.Q.; Zhang, C.H.; Li, M.H.; Xiao, P.G. Plant pharmacophylogeny: Review and future directions. Chin. J. Integr. Med. 2022, 28, 567–574. [Google Scholar] [CrossRef] [PubMed]
  6. Demilie, W.B. Plant disease detection and classification techniques: A comparative study of the performances. J. Big Data 2024, 11, 5. [Google Scholar] [CrossRef]
  7. Antil, S.; Abraham, J.S.; Sripoorna, S.; Maurya, S.; Dagar, J.; Makhija, S.; Bhagat, P.; Gupta, R.; Sood, U.; Lal, R. DNA barcoding, an effective tool for species identification: A review. Mol. Biol. Rep. 2023, 50, 761–775. [Google Scholar] [CrossRef] [PubMed]
  8. Wang, M.; Lin, H.; Lin, H.; Du, P.; Zhang, S. From species to varieties: How modern sequencing technologies are shaping Medicinal Plant Identification. Genes 2024, 16, 16. [Google Scholar] [CrossRef] [PubMed]
  9. Zaman, W.; Ayaz, A.; Park, S. Integrating morphological and molecular data in plant taxonomy. Pak. J. Bot. 2025, 57, 1453–1466. [Google Scholar] [CrossRef] [PubMed]
  10. Paul, T.; Kumar, K.J. Standardization of Herbal Medicines for Lifestyle Diseases. In Role of Herbal Medicines: Management of Lifestyle Diseases; Springer: Berlin/Heidelberg, Germany, 2024; pp. 545–557. [Google Scholar]
  11. Intharuksa, A.; Phrutivorapongkul, A.; Thongkhao, K. Integrating DNA barcoding, microscopic, and chemical analyses for precise identification of Plumbago indica L., A prominent medicinal plant. Microchem. J. 2024, 199, 110038. [Google Scholar] [CrossRef]
  12. Erdtman, G. Pollen Morphology and Plant Taxonomy: Angiosperms (An Untroduction to Palynology); Brill: Leiden, The Netherlands, 2023. [Google Scholar]
  13. Chac, L.D.; Thinh, B.B. Species identification through DNA barcoding and its applications: A review. Biol. Bull. 2023, 50, 1143–1156. [Google Scholar] [CrossRef]
  14. Shen, S.; Zhan, C.; Yang, C.; Fernie, A.R.; Luo, J. Metabolomics-centered mining of plant metabolic diversity and function: Past decade and future perspectives. Mol. Plant 2023, 16, 43–63. [Google Scholar] [CrossRef] [PubMed]
  15. Waris, M.; Kocak, E.; Gonulalan, E.M.; Demirezer, L.O.; Kır, S.; Nemutlu, E. Metabolomics analysis insight into medicinal plant science. TrAC Trends Anal. Chem. 2022, 157, 116795. [Google Scholar] [CrossRef]
  16. Zhu, W.; Li, W.; Zhang, H.; Li, L. Big data and artificial intelligence-aided crop breeding: Progress and prospects. J. Integr. Plant Biol. 2025, 67, 722–739. [Google Scholar] [CrossRef] [PubMed]
  17. Mali, S.; Yadav, R.; Gauttam, V.; Sawale, J. An Updated Review on Taxonomy and Chemotaxonomy. Toxicol. Int. 2023, 30, 121–129. [Google Scholar] [CrossRef]
  18. Huymann, L.R.; Hannecker, A.; Giovanni, T.; Liimatainen, K.; Niskanen, T.; Probst, M.; Peintner, U.; Siewert, B. Revised taxon definition in European Cortinarius subgenus Dermocybe based on phylogeny, chemotaxonomy, and morphology. Mycol. Prog. 2024, 23, 26. [Google Scholar] [CrossRef] [PubMed]
  19. Zhang, X.; Yang, L.-E.; Hu, Y.; Wu, X.; Wang, Z.; Miao, Y.; Sun, H.; Nie, Z.; Tan, N. Integrating morphology, molecular phylogeny and chemotaxonomy for the most effective authentication in Chinese Rubia with insights into origin and distribution of characteristic Rubiaceae-type cyclopeptides. Ind. Crops Prod. 2023, 191, 115775. [Google Scholar] [CrossRef]
  20. Lawson, P.A.; Patel, N.B. The strength of chemotaxonomy. In Trends in the Systematics of Bacteria and Fungi; CABI Publishing: Wallingford, UK, 2021; pp. 141–167. [Google Scholar]
  21. Wu, Q.; Tong, W.; Zhao, H.; Ge, R.; Li, R.; Huang, J.; Li, F.; Wang, Y.; Mallano, A.I.; Deng, W. Comparative transcriptomic analysis unveils the deep phylogeny and secondary metabolite evolution of 116 Camellia plants. Plant J. 2022, 111, 406–421. [Google Scholar] [CrossRef] [PubMed]
  22. Weng, J.-K.; Lynch, J.H.; Matos, J.O.; Dudareva, N. Adaptive mechanisms of plant specialized metabolism connecting chemistry to function. Nat. Chem. Biol. 2021, 17, 1037–1045. [Google Scholar] [CrossRef] [PubMed]
  23. Kowsalya, K.; Halka, J.; Ramalashmi, K.; Anand, A.V.; Razia, M.; Anand, M.; Begam, A.K.U.; Arun, M. Molecular Marker-Assisted Breeding in Medicinal Plants for Enhanced Secondary Metabolites Production. In Biotechnology, Multiple Omics, and Precision Breeding in Medicinal Plants; CRC Press: Boca Raton, FL, USA, 2025; pp. 160–176. [Google Scholar]
  24. Peters, K.; Blatt-Janmaat, K.L.; Tkach, N.; van Dam, N.M.; Neumann, S. Untargeted metabolomics for integrative taxonomy: Metabolomics, DNA marker-based sequencing, and phenotype bioimaging. Plants 2023, 12, 881. [Google Scholar] [CrossRef] [PubMed]
  25. Rebiai, A.; Hemmami, H.; Zeghoud, S.; Ben Seghir, B.; Kouadri, I.; Eddine, L.S.; Elboughdiri, N.; Ghareba, S.; Ghernaout, D.; Abbas, N. Current application of chemometrics analysis in authentication of natural products: A review. Comb. Chem. High Throughput Screen. 2022, 25, 945–972. [Google Scholar] [CrossRef] [PubMed]
  26. Parastar, H.; Tauler, R. Big (bio) chemical data mining using chemometric methods: A need for chemists. Angew. Chem. 2022, 134, e201801134. [Google Scholar] [CrossRef]
  27. Umoh, O.T. Chemotaxonomy: The role of phytochemicals in chemotaxonomic delineation of taxa. taxon 2020, 13, 14. [Google Scholar] [CrossRef]
  28. Gershenzon, J.; Mabry, T.J. Secondary metabolites and the higher classification of angiosperms. Nord. J. Bot. 1983, 3, 5–34. [Google Scholar] [CrossRef]
  29. Li, Y.; Kong, D.; Fu, Y.; Sussman, M.R.; Wu, H. The effect of developmental and environmental factors on secondary metabolites in medicinal plants. Plant Physiol. Biochem. 2020, 148, 80–89. [Google Scholar] [CrossRef] [PubMed]
  30. Bocso, N.-S.; Butnariu, M. The biological role of primary and secondary plants metabolites. J. Nutr. Food Process. 2022, 5, 1–7. [Google Scholar]
  31. Salam, U.; Ullah, S.; Tang, Z.-H.; Elateeq, A.A.; Khan, Y.; Khan, J.; Khan, A.; Ali, S. Plant metabolomics: An overview of the role of primary and secondary metabolites against different environmental stress factors. Life 2023, 13, 706. [Google Scholar] [CrossRef] [PubMed]
  32. Ahmed, E.; Arshad, M.; Khan, M.Z.; Amjad, M.S.; Sadaf, H.M.; Riaz, I.; Sabir, S.; Ahmad, N. Secondary metabolites and their multidimensional prospective in plant life. J. Pharmacogn. Phytochem. 2017, 6, 205–214. [Google Scholar]
  33. Böttger, A.; Vothknecht, U.; Bolle, C.; Wolf, A.; Böttger, A.; Vothknecht, U.; Bolle, C.; Wolf, A. Plant secondary metabolites and their general function in plants. In Lessons on Caffeine, Cannabis & Co: Plant-Derived Drugs and Their Interaction with Human Receptors; Springer: Berlin/Heidelberg, Germany, 2018; pp. 3–17. [Google Scholar]
  34. Rai, S.K.; Chauhan, R. Phytochemicals in drug discovery. In Phytochemicals in Medicinal Plants: Biodiversity, Bioactivity and Drug Discovery; De Gruyter: Berlin, Germany, 2023; 331p. [Google Scholar]
  35. Ekiert, H.M.; Szopa, A. Biological activities of natural products. Molecules 2020, 25, 5769. [Google Scholar] [CrossRef] [PubMed]
  36. Hounsome, N.; Hounsome, B.; Lobo, M.G. Biochemistry of Vegetables: Major Classes of Primary Metabolites (Carbohydrates, Amino Acids, Vitamins, Organic Acids, and Fatty Acids). In Handbook of Vegetables and Vegetable Processing; Wiley: Hoboken, NJ, USA, 2018; pp. 25–46. [Google Scholar]
  37. Kandar, C.C. Secondary metabolites from plant sources. In Bioactive Natural Products for Pharmaceutical Applications; Springer: Berlin/Heidelberg, Germany, 2020; pp. 329–377. [Google Scholar]
  38. Simsek, M.; Whitney, K. Examination of primary and secondary metabolites associated with a plant-based diet and their impact on human health. Foods 2024, 13, 1020. [Google Scholar] [CrossRef] [PubMed]
  39. Cope, J.S.; Corney, D.; Clark, J.Y.; Remagnino, P.; Wilkin, P. Plant species identification using digital morphometrics: A review. Expert Syst. Appl. 2012, 39, 7562–7573. [Google Scholar] [CrossRef]
  40. Duminil, J.; Di Michele, M. Plant species delimitation: A comparison of morphological and molecular markers. Plant Biosyst. 2009, 143, 528–542. [Google Scholar] [CrossRef]
  41. Rouhan, G.; Gaudeul, M. Plant taxonomy: A historical perspective, current challenges, and perspectives. In Molecular Plant Taxonomy: Methods and Protocols; Humana Press: Totowa, NJ, USA, 2014; pp. 1–37. [Google Scholar]
  42. Hadacek, F. Secondary metabolites as plant traits: Current assessment and future perspectives. Crit. Rev. Plant Sci. 2002, 21, 273–322. [Google Scholar] [CrossRef]
  43. Julier, A.C.M.; Jardine, P.E.; Coe, A.L.; Gosling, W.D.; Lomax, B.H.; Fraser, W.T. Chemotaxonomy as a tool for interpreting the cryptic diversity of Poaceae pollen. Rev. Palaeobot. Palynol. 2016, 235, 140–147. [Google Scholar] [CrossRef]
  44. Marne, P.A.; Pawar, A.T.; Tagalpallewar, A.A.; Baheti, A.M. Comparative Phytochemistry and Chemotaxonomy. In Pharmacognosy and Phytochemistry: Principles, Techniques, and Clinical Applications; Wiley: Hoboken, NJ, USA, 2025; pp. 333–346. [Google Scholar]
  45. Şentürk, B.; Aytar, E.C.; Durmaz, A.; İşler, S.; Deniz, İ.G.; Kömpe, Y.Ö. Morphological and chemical characteristics of Fritillaria species: Species differentiation through morphometric measurements and GC-MS analysis. Plant Genet. Resour. 2025, 23, 217–227. [Google Scholar] [CrossRef]
  46. Abdelfattah, H.M.E.; Hussein, H.A.; Teleb, S.S.; El-Demerdash, M.M.; George, N.M. Chemotaxonomy compared to morphological and anatomical taxonomy of five Hibiscus species. J. Plant Res. 2024, 137, 967–984. [Google Scholar] [CrossRef] [PubMed]
  47. Bagul, V.S.; Bafna, P.S.; Patil, D.M.; Mutha, R.E. Classification of Crude Drugs of Natural Origin. In Pharmacognosy and Phytochemistry: Principles, Techniques, and Clinical Applications; Wiley: Hoboken, NJ, USA, 2025; pp. 17–44. [Google Scholar]
  48. Chaudhary, M.K.; Misra, A.; Kumar, B.; Srivastava, S. Chemotaxonomy-Guided Effective Herbal Product Development: Asustainable Model For Ayush Industries. In Traditional Medicines in Drug Discovery and Development; Cambridge Scholars Publishing: Newcastle upon Tyne, UK, 2024. [Google Scholar]
  49. Li, Y.; Cheng, Y.; Zhang, Y.; Nan, H.; Lin, N.; Chen, Q. Quality markers of Polygala fallax Hemsl decoction based on qualitative and quantitative analysis combined with network pharmacology and chemometric analysis. Phytochem. Anal. 2024, 35, 1496–1508. [Google Scholar] [CrossRef] [PubMed]
  50. Banerjee, D.; Parekh, N.; Kaur, G.; Sharma, S.; Buttar, H.S.; Chauhan, R.; Kaur, D.; Tuli, H.S.; Jairoun, A.A.; Shahwan, M. Chemometric perspective on herbal medicine evaluation: Tools, techniques, and trends. J. Appl. Pharm. Sci. 2025, 15, 085–093. [Google Scholar] [CrossRef]
  51. Singhal, A.; Saini, U.; Chopra, B.; Dhingra, A.K.; Jain, A.; Chaudhary, J. Uv-visible spectroscopy: A review on its pharmaceutical and bio-allied sciences applications. Curr. Pharm. Anal. 2024, 20, 161–177. [Google Scholar] [CrossRef]
  52. Lestari, N.B.; Sulistyaningsih, Y.C.; Umar, A.H.; Ratnadewi, D. Distribution and FTIR-based fingerprint of secondary metabolites in different organs of ant-plant (Myrmecodia tuberosa). Biodiversitas J. Biol. Divers. 2024, 25, 3. [Google Scholar] [CrossRef]
  53. Halder, M.; Kundu, A.; Jha, S. Secondary Metabolites Identification Techniques of the Current Era. In Plant Specialized Metabolites: Phytochemistry, Ecology and Biotechnology; Springer: Berlin/Heidelberg, Germany, 2024; pp. 1–41. [Google Scholar]
  54. Bajo-Fernández, M.; Souza-Silva, É.A.; Barbas, C.; Rey-Stolle, M.F.; García, A. GC-MS-based metabolomics of volatile organic compounds in exhaled breath: Applications in health and disease. A review. Front. Mol. Biosci. 2024, 10, 1295955. [Google Scholar] [CrossRef] [PubMed]
  55. Verma, A.; Chattopadhaya, A.; Gupta, P.; Tiwari, H.; Singh, S.; Kumar, L.; Gautam, V. Integration of Hyphenated Techniques for Characterizing and Chemical Profiling of Natural Products. Chem. Biodivers. 2025, e202500234, early view. [Google Scholar] [CrossRef] [PubMed]
  56. Marston, A. Role of advances in chromatographic techniques in phytochemistry. Phytochemistry 2007, 68, 2786–2798. [Google Scholar] [CrossRef] [PubMed]
  57. Usman, M.; Khan, W.R.; Yousaf, N.; Akram, S.; Murtaza, G.; Kudus, K.A.; Ditta, A.; Rosli, Z.; Rajpar, M.N.; Nazre, M. Exploring the phytochemicals and anti-cancer potential of the members of Fabaceae family: A comprehensive review. Molecules 2022, 27, 3863. [Google Scholar] [CrossRef] [PubMed]
  58. Šibul, F.; Orčić, D.; Vasić, M.; Anačkov, G.; Nađpal, J.; Savić, A.; Mimica-Dukić, N. Phenolic profile, antioxidant and anti-inflammatory potential of herb and root extracts of seven selected legumes. Ind. Crops Prod. 2016, 83, 641–653. [Google Scholar] [CrossRef]
  59. Ganzera, M.; Krüger, A.; Wink, M. Determination of quinolizidine alkaloids in different Lupinus species by NACE using UV and MS detection. J. Pharm. Biomed. Anal. 2010, 53, 1231–1235. [Google Scholar] [CrossRef] [PubMed]
  60. Li, J.; Dadmohammadi, Y.; Abbaspourrad, A. Flavor components, precursors, formation mechanisms, production and characterization methods: Garlic, onion, and chili pepper flavors. Crit. Rev. Food Sci. Nutr. 2022, 62, 8265–8287. [Google Scholar] [CrossRef] [PubMed]
  61. Gui, Z.-Q.; Yuan, X.-L.; Yang, J.; Du, Y.-M.; Zhang, P. An updated review on chemical constituents from Nicotiana tabacum L.: Chemical diversity and pharmacological properties. Ind. Crops Prod. 2024, 214, 118497. [Google Scholar] [CrossRef]
  62. Shinyuy, L.M.; Loe, G.E.; Jansen, O.; Mamede, L.; Ledoux, A.; Noukimi, S.F.; Abenwie, S.N.; Ghogomu, S.M.; Souopgui, J.; Robert, A. Secondary metabolites isolated from Artemisia afra and Artemisia annua and their anti-malarial, anti-inflammatory and immunomodulating properties—Pharmacokinetics and pharmacodynamics: A review. Metabolites 2023, 13, 613. [Google Scholar] [CrossRef] [PubMed]
  63. Ahmadi, F. Phytochemistry, Mechanisms, and Preclinical Studies of Echinacea Extracts in Modulating Immune Responses to Bacterial and Viral Infections: A Comprehensive Review. Antibiotics 2024, 13, 947. [Google Scholar] [CrossRef] [PubMed]
  64. Alam, F.; Mohammadin, K.; Shafique, Z.; Amjad, S.T.; Asad, M.H.H. Citrus flavonoids as potential therapeutic agents: A review. Phytother. Res. 2022, 36, 1417–1441. [Google Scholar] [CrossRef] [PubMed]
  65. López-Parra, M.B.; Gómez-Domínguez, I.; Iriondo-DeHond, M.; Villamediana Merino, E.; Sánchez-Martín, V.; Mendiola, J.A.; Iriondo-DeHond, A.; Del Castillo, M.D. The impact of the drying process on the antioxidant and anti-inflammatory potential of dried ripe coffee cherry pulp soluble powder. Foods 2024, 13, 1114. [Google Scholar] [CrossRef] [PubMed]
  66. Patil, S.; Shankar, S.R.; Murthy, P.S. Impact of different varieties and mature stages on phytochemicals from Coffea arabica and Coffea robusta leaves. Biochem. Syst. Ecol. 2023, 110, 104699. [Google Scholar] [CrossRef]
  67. Hajlaoui, H.; Arraouadi, S.; Noumi, E.; Aouadi, K.; Adnan, M.; Khan, M.A.; Kadri, A.; Snoussi, M. Antimicrobial, antioxidant, anti-acetylcholinesterase, antidiabetic, and pharmacokinetic properties of Carum carvi L. and Coriandrum sativum L. essential oils alone and in combination. Molecules 2021, 26, 3625. [Google Scholar] [CrossRef] [PubMed]
  68. Hemida, M.H.S. Portable liquid chromatography for on-site analysis. Ph.D. Thesis, University of Tasmania, Hobart, Australia, 2021. [Google Scholar]
  69. Marittimo, N.; Grasselli, G.; Arigò, A.; Famiglini, G.; Agostini, M.; Renzoni, C.; Palma, P.; Cappiello, A. Liquid electron ionization-mass spectrometry as a novel strategy for integrating normal-phase liquid chromatography with low and high-resolution mass spectrometry. Analyst 2024, 149, 2664–2670. [Google Scholar] [CrossRef] [PubMed]
  70. Dathe, B.; Otto, M. Library search for HPLC diode array detection based on spectral interpolation. Chromatographia 1993, 37, 31–36. [Google Scholar] [CrossRef]
  71. Pinto, T.; Aires, A.; Cosme, F.; Bacelar, E.; Morais, M.C.; Oliveira, I.; Ferreira-Cardoso, J.; Anjos, R.; Vilela, A.; Gonçalves, B. Bioactive (poly) phenols, volatile compounds from vegetables, medicinal and aromatic plants. Foods 2021, 10, 106. [Google Scholar] [CrossRef] [PubMed]
  72. Osarieme Imade, R.; Adesina Ayinde, B.; Alam, A. GC-MS Analysis and In Vitro Cytotoxic Effects of Ocimum gratissimum (Lamiaceae) Volatile Oil and Thymol on Cancer Cells. Pharm. Biomed. Res. 2023, 9, 115–124. [Google Scholar] [CrossRef]
  73. Tian, Y.; Xu, Z.; Liu, Z.; Zhu, R.; Zhang, F.; Liu, Z.; Si, X. Botanical discrimination and classification of Mentha plants applying two-chiral column tandem GC–MS analysis of eight menthol enantiomers. Food Res. Int. 2022, 162, 112035. [Google Scholar] [CrossRef] [PubMed]
  74. Galovičová, L.; Borotová, P.; Valková, V.; Vukovic, N.L.; Vukic, M.; Štefániková, J.; Ďúranová, H.; Kowalczewski, P.Ł.; Čmiková, N.; Kačániová, M. Thymus vulgaris essential oil and its biological activity. Plants 2021, 10, 1959. [Google Scholar] [CrossRef] [PubMed]
  75. Wei, S.; Wei, L.; Xie, B.; Li, J.; Lyu, J.; Wang, S.; Khan, M.A.; Xiao, X.; Yu, J. Characterization of volatile profile from different coriander (Coriandrum sativum L.) varieties via HS-SPME/GC–MS combined with E-nose analyzed by chemometrics. Food Chem. 2024, 457, 140128. [Google Scholar] [CrossRef] [PubMed]
  76. Sevindik, E.; Aydın, S.; Sujka, M.; Apaydın, E.; Yıldırım, K.; Palas, G. GC-MS analysis and evaluation of antibacterial and antifungal activity of essential oils extracted from fruit Peel of Citrus aurantium L. (Rutaceae) Grown in the West Anatolian Area. Erwerbs-Obstbau 2021, 63, 135–142. [Google Scholar] [CrossRef]
  77. Al-Busaidi, Y.S.A. Aroma Profile Characterization of Selected Types of Omani Honey by Gas Chromatography-Mass Spectrometry and Electronic Nose. Ph.D. Thesis, Sultan Qaboos University, Seeb, Oman, 2024. [Google Scholar]
  78. Tufariello, M.; Pati, S.; Palombi, L.; Grieco, F.; Losito, I. Use of multivariate statistics in the processing of data on wine volatile compounds obtained by HS-SPME-GC-MS. Foods 2022, 11, 910. [Google Scholar] [CrossRef] [PubMed]
  79. Asmaey, M.A. Utilizing GC-MS and UPLC-MS for rapid metabolomic analysis in medicinal plants. In Propagation to Pharmacopeia; CRC Press: Boca Raton, FL, USA, 2024; pp. 361–378. [Google Scholar]
  80. Nengovhela, N. Investigation of Biochemical Strategies Leading to Metabolome Complexity of Two Closely Related Coccinia Species Through LC-QTOF-MS-based Metabolite Fingerprinting. Master’s Thesis, University of Venda, South Africa, 2021. [Google Scholar]
  81. Li, L.; He, L.; Su, X.; Amu, H.; Li, J.; Zhang, Z. Chemotaxonomy of Aster species from the Qinghai-Tibetan Plateau based on metabolomics. Phytochem. Anal. 2022, 33, 23–39. [Google Scholar] [CrossRef] [PubMed]
  82. Aydin, E. Phytochemicals from Phillyrea latifolia L. leaves and fruit extracted with various solvents: Their identification and quantification by LC-MS and antihyperglycemic effects. Folia Hortic. 2023, 35, 233–242. [Google Scholar] [CrossRef]
  83. Kandyliari, A.; Potsaki, P.; Bousdouni, P.; Kaloteraki, C.; Christofilea, M.; Almpounioti, K.; Moutsou, A.; Fasoulis, C.K.; Polychronis, L.V.; Gkalpinos, V.K. Development of dairy products fortified with plant extracts: Antioxidant and phenolic content characterization. Antioxidants 2023, 12, 500. [Google Scholar] [CrossRef] [PubMed]
  84. Mahboubifar, M.; Zidorn, C.; Farag, M.A.; Zayed, A.; Jassbi, A.R. Chemometric-based drug discovery approaches from natural origins using hyphenated chromatographic techniques. Phytochem. Anal. 2024, 35, 990–1016. [Google Scholar] [CrossRef] [PubMed]
  85. Cong, W. Identification of Compounds that Impact Whole Wheat Bread Flavor Liking Using LC-MS Flavoromics. Ph.D. Thesis, The Ohio State University, Columbus, OH, USA, 2021. [Google Scholar]
  86. Munyoki, N.M. Phytochemical Characterization and Biopesticidal Activity of Aqueous Plant Extracts Against French Bean thrips and Aphids. Master’s Thesis, University of Nairobi, Nairobi, Kenya, 2023. [Google Scholar]
  87. Goya, L.; de Pascual-Teresa, S. Effects of polyphenol-rich foods on chronic diseases. Nutrients 2023, 15, 4134. [Google Scholar] [CrossRef] [PubMed]
  88. Zhu, Y.; Zhang, J.; Wang, C.; Zheng, T.; Di, S.; Wang, Y.; Fei, W.; Liang, W.; Wang, L. Ameliorative effect of ethanolic echinacea purpurea against hyperthyroidism-induced oxidative stress via AMRK and PPAR signal pathway using transcriptomics and network pharmacology analysis. Int. J. Mol. Sci. 2022, 24, 187. [Google Scholar] [CrossRef] [PubMed]
  89. Cong, W.; Tello, E.; Simons, C.T.; Peterson, D.G. Identification of non-volatile compounds that impact flavor disliking of whole wheat bread made with aged flours. Molecules 2022, 27, 1331. [Google Scholar] [CrossRef] [PubMed]
  90. Ali, A.; Bashmil, Y.M.; Cottrell, J.J.; Suleria, H.A.R.; Dunshea, F.R. Lc-ms/ms-qtof screening and identification of phenolic compounds from australian grown herbs and their antioxidant potential. Antioxidants 2021, 10, 1770. [Google Scholar] [CrossRef] [PubMed]
  91. Hu, Y.; Schnaubelt, M.; Chen, L.; Zhang, B.; Hoang, T.; Lih, T.M.; Zhang, Z.; Zhang, H. MS-PyCloud: A cloud computing-based pipeline for proteomic and glycoproteomic data analyses. Anal. Chem. 2024, 96, 10145–10151. [Google Scholar] [CrossRef] [PubMed]
  92. Liu, L.F.; Li, M.; Mao, R.S. Design and implementation of mass spectrometer database. In Proceedings of the 13th International Beam Instrumentation Conference, Beijing, China, 9–13 September 2024; pp. 439–443. [Google Scholar]
  93. Alaribe, S.C.; Titilayo, B.E.; Ikpatt, U.N.; Oladipupo, A.R.; Alabi, F.A.; Ikwugbado, R.I. Computational methods for advanced mass spectrometry—A review. Phys. Sci. Rev. 2025. [Google Scholar] [CrossRef]
  94. Agarwal, V.; Bajpai, M. Imaging and Non-imaging Analytical Techniques Used for Drug Nanosizing and their Patents: An Overview. Recent Pat. Nanotechnol. 2024, 18, 494–518. [Google Scholar] [CrossRef] [PubMed]
  95. Bonatto, C.C.; Silva, L.P. A MALDI-TOF mass spectrometry-based approach for molecular profiling of leaves from pasture and feed forages species. Biochem. Syst. Ecol. 2021, 94, 104215. [Google Scholar] [CrossRef]
  96. Mejía-Morales, C.; Rodríguez-Macías, R.; Salcedo-Pérez, E.; Zamora-Natera, J.F.; Rodríguez-Zaragoza, F.A.; Molina-Torres, J.; Délano-Frier, J.P.; Zañudo-Hernández, J. Contrasting metabolic fingerprints and seed protein profiles of Cucurbita foetidissima and C. Radicans fruits from feral plants sampled in central Mexico. Plants 2021, 10, 2451. [Google Scholar] [CrossRef] [PubMed]
  97. Greene, L.A.; Isaac, I.; Gray, D.E.; Schwartz, S.A. Streamlining plant sample preparation: The use of high-throughput robotics to process echinacea samples for biomarker profiling by MALDI-TOF mass spectrometry. J. Biomol. Tech. 2007, 18, 238–244. [Google Scholar] [PubMed]
  98. Hicks, J.C.E.; Dávila, A.A.C.; Villarruel, D.S.-A. Insights into Nuclear Magnetic Resonance Spectroscopy as a Tool for Metabolic Markers in Pre-and PostHarvest Sustainable Production of Food Crops. In Food Security, Safety, and Sustainability; Apple Academic Press: Palm Bay, FL, USA, 2025; pp. 145–163. [Google Scholar]
  99. Brigante, F.I.; Solovyev, P.; Bontempo, L. Nuclear magnetic resonance applications in Fermented Foods and Plant-based beverages: Challenges and opportunities. Food Rev. Int. 2024, 40, 3370–3397. [Google Scholar] [CrossRef]
  100. Batiha, G.E.-S.; Shaheen, H.M.; Elhawary, E.A.; Mostafa, N.M.; Eldahshan, O.A.; Sabatier, J.-M. Phytochemical constituents, folk medicinal uses, and biological activities of genus angelica: A review. Molecules 2022, 28, 267. [Google Scholar] [CrossRef] [PubMed]
  101. Abdul-Jalil, T.Z.; Ibrahim, R.M.; Nassir, Z.S. Extraction, isolation and structure elucidation of two phenolic acids from aerial parts of celery and coriander. Biomed. Pharmacol. J. 2023, 16, 2315–2332. [Google Scholar] [CrossRef]
  102. Wang, Z.-F.; You, Y.-L.; Li, F.-F.; Kong, W.-R.; Wang, S.-Q. Research progress of NMR in natural product quantification. Molecules 2021, 26, 6308. [Google Scholar] [CrossRef] [PubMed]
  103. Otify, A.M.; Serag, A.; Porzel, A.; Wessjohann, L.A.; Farag, M.A. NMR Metabolome-based classification of Cymbopogon species: A prospect for phyto-equivalency of its different accessions using chemometric tools. Food Anal. Methods 2022, 15, 2095–2106. [Google Scholar] [CrossRef]
  104. Fabry, P.; Weber, S.; Teipel, J.; Richling, E.; Walch, S.G.; Lachenmeier, D.W. Quantitative NMR Spectrometry of Phenylpropanoids, including Isoeugenol in Herbs, Spices, and Essential Oils. Foods 2024, 13, 720. [Google Scholar] [CrossRef] [PubMed]
  105. Zhao, J.; Wang, M.; Lee, J.; Ali, Z.; Khan, I.A. Characterization, Differentiation, and Adulteration Detection of Peppermint Essential Oil: An NMR Approach. J. Pharm. Biomed. Anal. 2025, 263, 116941. [Google Scholar] [CrossRef] [PubMed]
  106. Alberts, P.S.F.; Meyer, J.J.M. Integrating chemotaxonomic-based metabolomics data with DNA barcoding for plant identification: A case study on south-east African Erythroxylaceae species. South Afr. J. Botany. 2022, 146, 174–186. [Google Scholar] [CrossRef]
  107. Sawhney, G.; Sharma, M.; Agrawal, N. DNA-RNA Barcoding and Gene Sequencing. In Computation in BioInformatics: Multidisciplinary Applications; Wiley: Hoboken, NJ, USA, 2021; pp. 165–227. [Google Scholar]
  108. Khan, S. Genome-based Barcoding for the Authentication of Medicinal Plants and Their Products. In Omics Studies of Medicinal Plants; CRC Press: Boca Raton, FL, USA, 2023; pp. 123–136. [Google Scholar]
  109. Letsiou, S.; Madesis, P.; Vasdekis, E.; Montemurro, C.; Grigoriou, M.E.; Skavdis, G.; Moussis, V.; Koutelidakis, A.E.; Tzakos, A.G. DNA Barcoding as a Plant Identification Method. Appl. Sci. 2024, 14, 1415. [Google Scholar] [CrossRef]
  110. Gostel, M.R.; Kress, W.J. The expanding role of DNA barcodes: Indispensable tools for ecology, evolution, and conservation. Diversity 2022, 14, 213. [Google Scholar] [CrossRef]
  111. Caesar, L.K.; Montaser, R.; Keller, N.P.; Kelleher, N.L. Metabolomics and genomics in natural products research: Complementary tools for targeting new chemical entities. Nat. Prod. Rep. 2021, 38, 2041–2065. [Google Scholar] [CrossRef] [PubMed]
  112. Peltomaa, E.; Asikainen, H.; Blomster, J.; Pakkanen, H.; Rigaud, C.; Salmi, P.; Taipale, S. Phytoplankton group identification with chemotaxonomic biomarkers: In combination they do better. Phytochemistry 2023, 209, 113624. [Google Scholar] [CrossRef] [PubMed]
  113. Diaz, M.A.L.; Del Fueyo, G.M.; D’Angelo, J.A.; Carrizo, M.A. How much does Chemotaxonomy help to resolve the overrepresented Cycadolepis Saporta (Bennettitales)? A case study of the Cretaceous of Patagonia, Argentina. Rev. Palaeobot. Palynol. 2021, 293, 104489. [Google Scholar] [CrossRef]
  114. Lau, H.; Tan, L.H.; Ee, L.Y.; Dayal, H.; Lim, S.Y.; Liu, F.; Li, S.F.Y. Application of 1H-NMR-and LC-MS based Metabolomic analysis for the evaluation of celery preservation methods. LWT 2022, 169, 113938. [Google Scholar] [CrossRef]
  115. Lee, N.; Yoo, H.; Yang, H. Cluster analysis of medicinal plants and targets based on multipartite network. Biomolecules 2021, 11, 546. [Google Scholar] [CrossRef] [PubMed]
  116. Zhan, C.; Shen, S.; Yang, C.; Liu, Z.; Fernie, A.R.; Graham, I.A.; Luo, J. Plant metabolic gene clusters in the multi-omics era. Trends Plant Sci. 2022, 27, 981–1001. [Google Scholar] [CrossRef] [PubMed]
  117. Tang, D.; Wei, F.; Cai, Z.; Wei, Y.; Khan, A.; Miao, J.; Wei, K. Analysis of codon usage bias and evolution in the chloroplast genome of Mesona chinensis Benth. Dev. Genes Evol. 2021, 231, 1–9. [Google Scholar] [CrossRef] [PubMed]
  118. Li, G.; Lin, P.; Wang, K.; Gu, C.-C.; Kusari, S. Artificial intelligence-guided discovery of anticancer lead compounds from plants and associated microorganisms. Trends Cancer 2022, 8, 65–80. [Google Scholar] [CrossRef] [PubMed]
  119. Srivastava, M.; Nandan, S.; Zaidi, A.; Samani, A.S.; Shukla, V.; Aslam, H.; Srivastava, A.; Maurya, P.; Khan, M.A.; Khan, M.F. Artificial Intelligence Driven Applications in Analytical Chemistry, Drug Discovery, and Food Science: Advancements, Outlook, and Challenges. ChemistrySelect 2025, 10, e202404446. [Google Scholar] [CrossRef]
  120. Dey, B.; Ferdous, J.; Ahmed, R.; Hossain, J. Assessing deep convolutional neural network models and their comparative performance for automated medicinal plant identification from leaf images. Heliyon 2024, 10, e23655. [Google Scholar] [CrossRef] [PubMed]
  121. Zhong, C.; Li, L.; Wang, Y.-Z. Applications of chemical fingerprints and machine learning in plant ecology: Recent progress and future perspectives. Microchem. J. 2024, 206, 111447. [Google Scholar] [CrossRef]
  122. Chi, J.; Shu, J.; Li, M.; Mudappathi, R.; Jin, Y.; Lewis, F.; Boon, A.; Qin, X.; Liu, L.; Gu, H. Artificial intelligence in metabolomics: A current review. TrAC Trends Anal. Chem. 2024, 178, 117852. [Google Scholar] [CrossRef] [PubMed]
  123. Hosseini, M.; Pereira, D.M. The chemical space of terpenes: Insights from data science and AI. Pharmaceuticals 2023, 16, 202. [Google Scholar] [CrossRef] [PubMed]
  124. Kang, M.; Ko, E.; Mersha, T.B. A roadmap for multi-omics data integration using deep learning. Brief. Bioinform. 2022, 23, bbab454. [Google Scholar] [CrossRef] [PubMed]
  125. Vahabi, N.; Michailidis, G. Unsupervised multi-omics data integration methods: A comprehensive review. Front. Genet. 2022, 13, 854752. [Google Scholar] [CrossRef] [PubMed]
  126. Sarkar, J.; Singh, R.; Chandel, S. Understanding LC/MS-Based Metabolomics: A Detailed Reference for Natural Product Analysis. Proteom. Clin. Appl. 2025, 19, e202400048. [Google Scholar] [CrossRef] [PubMed]
  127. Bajorath, J.; Chávez-Hernández, A.L.; Duran-Frigola, M.; Fernández-de Gortari, E.; Gasteiger, J.; López-López, E.; Maggiora, G.M.; Medina-Franco, J.L.; Méndez-Lucio, O.; Mestres, J. Chemoinformatics and artificial intelligence colloquium: Progress and challenges in developing bioactive compounds. J. Cheminformatics 2022, 14, 82. [Google Scholar] [CrossRef] [PubMed]
  128. Qin, Y.; Li, C.; Shi, X.; Wang, W. MLP-based regression prediction model for compound bioactivity. Front. Bioeng. Biotechnol. 2022, 10, 946329. [Google Scholar] [CrossRef] [PubMed]
  129. Ishfaq, M.; Aamir, M.; Ahmad, F.; M Mebed, A.; Elshahat, S. Machine learning-assisted prediction of the biological activity of aromatase inhibitors and data mining to explore similar compounds. ACS Omega 2022, 7, 48139–48149. [Google Scholar] [CrossRef] [PubMed]
  130. Pérez Santín, E.; Rodríguez Solana, R.; González García, M.; García Suárez, M.D.M.; Blanco Díaz, G.D.; Cima Cabal, M.D.; Moreno Rojas, J.M.; López Sánchez, J.I. Toxicity prediction based on artificial intelligence: A multidisciplinary overview. Wiley WIREs Comput. Mol. Sci. 2021, 11, e1516. [Google Scholar] [CrossRef]
  131. Filimonov, D.A.; Druzhilovskiy, D.S.; Lagunin, A.A.; Gloriozova, T.A.; Rudik, A.V.; Dmitriev, A.V.; Pogodin, P.V.; Poroikov, V.V. Computer-aided prediction of biological activity spectra for chemical compounds: Opportunities and limitations. Biomed. Chem. Res. Methods 2018, 1, e00004. [Google Scholar] [CrossRef]
  132. Dial, K. Democratising Deep Learning In Microbial Metabolites Research. Ph.D. Thesis, McMaster University, Hamilton, ON, Canada, 2023. [Google Scholar]
  133. Soylu, N.N.; Sefer, E. DeepPTM: Protein post-translational modification prediction from protein sequences by combining deep protein language model with vision transformers. Curr. Bioinform. 2024, 19, 810–824. [Google Scholar] [CrossRef]
  134. Moldwin, A.; Shehu, A. Foundation Models for AI-Enabled Biological Design. arXiv 2025, arXiv:2505.11610. [Google Scholar]
  135. Singh, R. Chemotaxonomy of medicinal plants: Possibilities and limitations. In Natural Products and Drug Discovery; Elsevier: Amsterdam, The Netherlands, 2018; pp. 119–136. [Google Scholar]
  136. Spring, O. Chemotaxonomy based on metabolites from glandular trichomes. Adv. Bot. Res. 2000, 31, 153–174. [Google Scholar]
  137. Alston, R.E.; Mabry, T.J.; Turner, B.L. Perspectives in Chemotaxonomy: Studies of secondary compounds in plants may provide knowledge of phylogenetic relationships. Science 1963, 142, 545–552. [Google Scholar] [CrossRef] [PubMed]
  138. Ramawat, K.G. Biodiversity and Chemotaxonomy; Springer: Berlin/Heidelberg, Germany, 2019; Volume 24. [Google Scholar]
  139. Wink, M.; Botschen, F.; Gosmann, C.; Schäfer, H.; Waterman, P.G. Chemotaxonomy seen from a phylogenetic perspective and evolution of secondary metabolism. In Annual Plant Reviews Volume 40: Biochemistry of Plant Secondary Metabolism; Wiley: Hoboken, NJ, USA, 2010; pp. 364–433. [Google Scholar]
  140. Issaq, H.J.; Van, Q.N.; Waybright, T.J.; Muschik, G.M.; Veenstra, T.D. Analytical and statistical approaches to metabolomics research. J. Sep. Sci. 2009, 32, 2183–2199. [Google Scholar] [CrossRef] [PubMed]
  141. Huie, C.W. A review of modern sample-preparation techniques for the extraction and analysis of medicinal plants. Anal. Bioanal. Chem. 2002, 373, 23–30. [Google Scholar] [CrossRef] [PubMed]
  142. Miralles, A.; Bruy, T.; Wolcott, K.; Scherz, M.D.; Begerow, D.; Beszteri, B.; Bonkowski, M.; Felden, J.; Gemeinholzer, B.; Glaw, F. Repositories for taxonomic data: Where we are and what is missing. Syst. Biol. 2020, 69, 1231–1253. [Google Scholar] [CrossRef] [PubMed]
  143. Kumar, A.; Kumar, S.; Komal; Ramchiary, N.; Singh, P. Role of traditional ethnobotanical knowledge and indigenous communities in achieving sustainable development goals. Sustainability 2021, 13, 3062. [Google Scholar] [CrossRef]
  144. Campbell, M.M.; Sederoff, R.R. Variation in lignin content and composition (mechanisms of control and implications for the genetic improvement of plants). Plant Physiol. 1996, 110, 3. [Google Scholar] [CrossRef] [PubMed]
  145. Polatoglu, K. “Chemotypes”–a fact that should not be ignored in natural product studies. Nat. Prod. J. 2013, 3, 10–14. [Google Scholar] [CrossRef]
  146. Hegeman, A.D. Plant metabolomics—Meeting the analytical challenges of comprehensive metabolite analysis. Brief. Funct. Genom. 2010, 9, 139–148. [Google Scholar] [CrossRef] [PubMed]
  147. Wolfender, J.-L.; Marti, G.; Thomas, A.; Bertrand, S. Current approaches and challenges for the metabolite profiling of complex natural extracts. J. Chromatogr. A 2015, 1382, 136–164. [Google Scholar] [CrossRef] [PubMed]
  148. Rosita, A.S.; Shivani, J.; Saji, A.; Sundar, M.B.; Begum, T.N.; Thilagavathi, G.; Sumithra, A.; Anand, A.V. Bioinformatics Tools for Research on Plant Secondary Metabolites. In Biotechnology, Multiple Omics, and Precision Breeding in Medicinal Plants; CRC Press: Boca Raton, FL, USA, 2025; pp. 177–195. [Google Scholar]
  149. Marco-Ramell, A.; Palau-Rodriguez, M.; Alay, A.; Tulipani, S.; Urpi-Sarda, M.; Sanchez-Pla, A.; Andres-Lacueva, C. Evaluation and comparison of bioinformatic tools for the enrichment analysis of metabolomics data. BMC Bioinform. 2018, 19, 1. [Google Scholar] [CrossRef] [PubMed]
  150. Zammit, G.; Zammit, M.G.; Buttigieg, K.G. Emerging technologies for the discovery of novel diversity in cyanobacteria and algae and the elucidation of their valuable metabolites. Diversity 2023, 15, 1142. [Google Scholar] [CrossRef]
  151. Nazarenko, D.V.; Kharyuk, P.V.; Oseledets, I.V.; Rodin, I.A.; Shpigun, O.A. Machine learning for LC–MS medicinal plants identification. Chemom. Intell. Lab. Syst. 2016, 156, 174–180. [Google Scholar] [CrossRef]
  152. Pomyen, Y.; Wanichthanarak, K.; Poungsombat, P.; Fahrmann, J.; Grapov, D.; Khoomrung, S. Deep metabolome: Applications of deep learning in metabolomics. Comput. Struct. Biotechnol. J. 2020, 18, 2818–2825. [Google Scholar] [CrossRef] [PubMed]
  153. Afendi, F.M.; Okada, T.; Yamazaki, M.; Hirai-Morita, A.; Nakamura, Y.; Nakamura, K.; Ikeda, S.; Takahashi, H.; Altaf-Ul-Amin, M.; Darusman, L.K. KNApSAcK family databases: Integrated metabolite–plant species databases for multifaceted plant research. Plant Cell Physiol. 2012, 53, e1. [Google Scholar] [CrossRef] [PubMed]
  154. Lephatsi, M.M. Metabolomics and 4IR Technologies for Natural Products Research: Characterization of the Phytochemistry and Biochemistry of Helichrysum Species with Potential Anti-Cancer Activities. Ph.D. Thesis, University of Johannesburg, Johannesburg, South Africa, 2024. [Google Scholar]
  155. James, S.A.; Soltis, P.S.; Belbin, L.; Chapman, A.D.; Nelson, G.; Paul, D.L.; Collins, M. Herbarium data: Global biodiversity and societal botanical needs for novel research. Appl. Plant Sci. 2018, 6, e1024. [Google Scholar] [CrossRef] [PubMed]
  156. Jamieson, C.S.; Misa, J.; Tang, Y.; Billingsley, J.M. Biosynthesis and synthetic biology of psychoactive natural products. Chem. Soc. Rev. 2021, 50, 6950–7008. [Google Scholar] [CrossRef] [PubMed]
  157. Yao, X.; Wuzhang, K.; Peng, B.; Chen, T.; Zhang, Y.; Liu, H.; Li, L.; Fu, X.; Tang, K. Engineering the expression of plant secondary metabolites-genistein and scutellarin through an efficient transient production platform in Nicotiana benthamiana L. Front. Plant Sci. 2022, 13, 994792. [Google Scholar] [CrossRef] [PubMed]
  158. Mohammadi, S.; Saberidokht, B.; Subramaniam, S.; Grama, A. Scope and limitations of yeast as a model organism for studying human tissue-specific pathways. BMC Syst. Biol. 2015, 9, 96. [Google Scholar] [CrossRef] [PubMed]
  159. Hao, D.-C.; Xiao, P.-G. Genomics and evolution in traditional medicinal plants: Road to a healthier life. Evol. Bioinform. 2015, 11, EBO-S31326. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Taxonomical identification methods used by researchers in previous and current eras.
Figure 1. Taxonomical identification methods used by researchers in previous and current eras.
Plants 14 02234 g001
Figure 2. Applications of chemotaxonomy in herbal and medicinal plant sciences.
Figure 2. Applications of chemotaxonomy in herbal and medicinal plant sciences.
Plants 14 02234 g002
Figure 3. An overview of current trends in chemotaxonomy.
Figure 3. An overview of current trends in chemotaxonomy.
Plants 14 02234 g003
Figure 4. Potential of AI tools in chemotaxonomical identification of medicinal plants.
Figure 4. Potential of AI tools in chemotaxonomical identification of medicinal plants.
Plants 14 02234 g004
Figure 5. Future directions in chemotaxonomical identification of medicinal plants.
Figure 5. Future directions in chemotaxonomical identification of medicinal plants.
Plants 14 02234 g005
Table 1. Primary and secondary metabolites and their role in plant health.
Table 1. Primary and secondary metabolites and their role in plant health.
Metabolite TypeMetabolite Class Occurrence in Plant PartMetabolite RoleReferences
Primary metaboliteCarbohydratesLeavesEnergy source,
essential for respiration.
[30,36]
Primary metaboliteAmino AcidsLeaves, RootsBuilding blocks of proteins, crucial for plant growth.[30,36]
Primary metaboliteFatty AcidsSeeds, LeavesVital for membrane structure and energy storage.[30,36]
Primary metaboliteChlorophyllLeavesKey for photosynthesis, converting light into energy.[30,36]
Secondary metaboliteAlkaloidsRoots, SeedsDefensive compounds, deter herbivores and pathogens.[37,38]
Secondary metaboliteFlavonoidsFlowers, LeavesProvide UV protection, antioxidant properties, and pigmentation.[37,38]
Secondary metaboliteTerpenoidsLeaves, RootsInvolved in plant defense. [37,38]
Secondary metabolitePhenolics, TanninsRoots, LeavesPlay roles in defense, antioxidation, and stress response.[37,38]
Table 2. A comparative review of morphological and chemotaxonomical identification of medicinal plants.
Table 2. A comparative review of morphological and chemotaxonomical identification of medicinal plants.
FeatureMorphological
Identification
Chemotaxonomical
Identification
Citations
Basis of classificationObservable physical traits (leaf shape, flower structure, stem, etc.)Chemical composition, mainly secondary metabolites and other biochemical markers[44,45]
Attributes for examinationExternal features (e.g., leaves, flowers)Secondary metabolites and primary compounds[44,46]
Ecological impactHigh (traits may vary due to climate, soil, etc.)Low (compounds are more stable)
Tools requiredMicroscope, visual inspectionChromatography, spectroscopy[44,46]
Resolutionprecise due to phenotypic
Often limited at intraspecific level (varieties, subspecies)
Can distinguish species and intraspecific taxa[44,45,46]
Part usedLeaf shape, flower color, stem structureAlkaloids, flavonoids, composition of plant part (terpenoids, amino acids)[44,46]
Use in modern taxonomyFoundational, and widely used in conjunction with molecular methodsWidely used in conjunction with molecular methods[44,46]
Speed and accessibilityRelatively quick and low cost, can be carried out in the fieldMore time-consuming and costly, requires laboratory equipment[44,45,46]
Cryptic speciesPhenotype plasticity
Difficulties in identification of cryptic Species
Requires specialized equipment and expertise
More effective; can detect biochemical differences in cryptic species
[44,46]
Table 3. Various analytical techniques used in chemotaxonomy.
Table 3. Various analytical techniques used in chemotaxonomy.
Analytical TechniqueTypical UsesAccuracy/PrecisionTypes of Secondary MetabolitesCitations
UV-Vis SpectroscopyQuantification Moderate accuracy, ideal for fast and non-destructive quantification.Flavonoids, phenolic compounds, carotenoids, alkaloids[51]
FTIRIdentification of functional groups and molecular structuresGood resolution for functional group identification. Lower sensitivity compared to MS-based techniques.Terpenoids, alkaloids, flavonoids, phenolic acids, lipids[52]
HPLCSeparation and quantification of compounds, particularly in mixturesHigh accuracy in separating complex mixtures. Precision depends on column and mobile phase.Alkaloids, flavonoids, phenolic acids, glycosides, terpenoids[53]
GCMSIdentifying and quantifying volatile compounds, especially in complex mixturesHigh sensitivity and precision for volatile organic compounds, good for trace analysis.Volatile terpenes, essential oils, fatty acids, aldehydes[53,54]
LCMS-QTOFComprehensive profiling of metabolites and complex biomoleculesVery high sensitivity and accuracy, capable of accurate molecular mass determination, used for complex samples.Alkaloids, flavonoids, peptides, lipids, steroids, phenolic compounds[53,55]
MALDI-TOF MSHigh-throughput analysis of biomolecules, especially proteins and peptidesHigh sensitivity for large biomolecules like proteins, peptides, and lipids. Excellent for high-throughput applications.Peptides, proteins, lipids, alkaloids[53,55]
NMRStructural elucidation, identification of compounds, and quantification in small to medium-sized moleculesHigh accuracy for molecular structure determination. Limited sensitivity compared to MS techniques, but excellent for structural analysis.Alkaloids, flavonoids, terpenoids, phenolic compounds, saponins[53]
Table 4. Limitations of chemotaxonomy with possible alternate solutions.
Table 4. Limitations of chemotaxonomy with possible alternate solutions.
LimitationDescriptionPossible AlternateCitations
Variability in Secondary Metabolite ProfilesVariability due to environmental, genetic, or developmental factors.Use DNA barcoding or meta barcoding as an alternative.[135]
Standardization IssuesNo standardized methodology for metabolite analysis.Standardize techniques like mass spectrometry or NMR.[135,136]
Lack of Comprehensive DatabasesChemotaxonomic databases are often incomplete.Collaborate to build comprehensive chemotaxonomic databases.[135,137]
Accessibility and High Costs of Analytical TechniquesHigh costs and specialized expertise needed for advanced techniques.Utilize portable, low-cost devices for on-site analysis.[135]
Challenges in Chemotaxonomic IdentificationOverlapping chemical profiles make accurate identification difficult.Use multi-omics approaches for more accurate identification.[27,135,138]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Amin, A.; Park, S. Chemotaxonomy, an Efficient Tool for Medicinal Plant Identification: Current Trends and Limitations. Plants 2025, 14, 2234. https://doi.org/10.3390/plants14142234

AMA Style

Amin A, Park S. Chemotaxonomy, an Efficient Tool for Medicinal Plant Identification: Current Trends and Limitations. Plants. 2025; 14(14):2234. https://doi.org/10.3390/plants14142234

Chicago/Turabian Style

Amin, Adnan, and SeonJoo Park. 2025. "Chemotaxonomy, an Efficient Tool for Medicinal Plant Identification: Current Trends and Limitations" Plants 14, no. 14: 2234. https://doi.org/10.3390/plants14142234

APA Style

Amin, A., & Park, S. (2025). Chemotaxonomy, an Efficient Tool for Medicinal Plant Identification: Current Trends and Limitations. Plants, 14(14), 2234. https://doi.org/10.3390/plants14142234

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop