Next Article in Journal
Isomeric 3-Pyridinylmethylcoumarins Differ in Erk1/2-Inhibition and Modulation of BV2 Microglia-Mediated Neuroinflammation
Previous Article in Journal
Umbrella Refinement of Ensembles—An Alternative View of Ensemble Optimization
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Innovative Strategy for Untargeted Mass Spectrometry Data Analysis: Rapid Chemical Profiling of the Medicinal Plant Terminalia chebula Using Ultra-High-Performance Liquid Chromatography Coupled with Q/TOF Mass Spectrometry–Key Ion Diagnostics–Neutral Loss Filtering

1
School of Ethnic Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
2
School of Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
*
Authors to whom correspondence should be addressed.
Molecules 2025, 30(11), 2451; https://doi.org/10.3390/molecules30112451 (registering DOI)
Submission received: 7 April 2025 / Revised: 27 May 2025 / Accepted: 30 May 2025 / Published: 3 June 2025

Abstract

:
Structural characterization of natural products in complex herbal extracts remains a major challenge in phytochemical analysis. In this study, we present a novel post-acquisition data-processing strategy—key ion diagnostics–neutral loss filtering (KID-NLF)—combined with ultra-high-performance liquid chromatography–quadrupole time-of-flight mass spectrometry (UPLC-Q/TOF-MS) for systematic profiling of the medicinal plant Terminalia chebula. The strategy consists of four main steps. First, untargeted data are acquired in negative electrospray ionization (ESI) mode. Second, a genus-specific diagnostic ion database is constructed by leveraging characteristic fragment ions (e.g., gallic acid, chebuloyl, and HHDP groups) and conserved substructures. Third, MS/MS data are high-resolution filtered using key ion diagnostics and neutral loss patterns (302 Da for HHDP; 320 Da for chebuloyl). Finally, structures are elucidated via detailed spectral analysis. The methanol extract of T. chebula was separated on a C18 column using a gradient of acetonitrile and 0.1% aqueous formic acid within 33 min. This separation enabled detection of 164 compounds, of which 47 were reported for the first time. Based on fragmentation pathways and diagnostic ions (e.g., m/z 169 for gallic acid, m/z 301 for ellagic acid, and neutral losses of 152, 302, and 320 Da), the compounds were classified into three major groups: gallic acid derivatives, ellagitannins (containing HHDP, chebuloyl, or neochebuloyl moieties), and triterpenoid glycosides. KID-NLF overcomes key limitations of conventional workflows—namely, isomer discrimination and detection of low-abundance compounds—by exploiting genus-specific structural signatures. This strategy demonstrates high efficiency in resolving complex polyphenolic and triterpenoid profiles and enables rapid annotation of both known and novel metabolites. This study highlights KID-NLF as a robust framework for phytochemical analysis in species with high chemical complexity. It also paves the way for applications in quality control, drug discovery, and mechanistic studies of medicinal plants.

Graphical Abstract

1. Introduction

The exploration of medicinal plants has gained significant momentum globally, driven by their long-established therapeutic history and untapped potential for drug discovery. A central aspect of this effort is the thorough analysis of the chemical composition of medicinal plants, as their efficacy, safety, and quality are inherently tied to their phytochemical profiles [1]. High-resolution mass spectrometry (HRMS), particularly when combined with ultra-high-performance liquid chromatography (UPLC), has become a cornerstone technology for analyzing complex plant extracts, facilitating the rapid identification of both known and novel compounds [2]. However, some limitations remain. For instance, the peak capacity is limited, making it challenging to fulfill the separation requirements of chemical components in the complex systems of medicinal plants [3]. Traditional identification strategies, which rely on chromatographic retention behavior and mass spectrometry cleavage rules, are constrained by a limited number of reference substances and inadequate structural coverage in dedicated databases [4].
Current strategies for LC-MS/MS data processing are typically classified into targeted and untargeted approaches [5]. Targeted methods rely on predefined compound libraries and fragmentation pathways, offering high specificity but limited capacity for discovering novel compounds. In contrast, untargeted methods capture MS/MS spectra for all detectable ions, providing a comprehensive chemical profile but resulting in large datasets that require advanced post-acquisition processing. In recent years, feature-based molecular networking (FBMN) [6] and ion identity molecular networking (IIMN) have enhanced metabolite grouping by integrating retention time and admixture data [7]. Additionally, tools such as SIRIUS 5, powered by deep learning, have increased annotation accuracy through fragment tree calculations [8]. However, these general strategies remain inadequate for highly specific structures. These include excimer ion matching errors, automatic matching of endogenous cleavage fragments to other components, and the low recognition accuracy of complex additive ions [9].
Terminalia chebula, known as the “king of Tibetan medicine”, has been used for thousands of years in traditional Chinese and Ayurvedic medicine [10]. T. chebula is used in the treatment of asthma, bronchitis, hepatitis, dyspepsia, eye diseases, and hoarseness and to promote hair growth [11]. The flesh of the plant has been used to treat diarrhea, leprosy, and edema [12]. It improves appetite, reduces cholesterol and blood pressure, strengthens the immune system, prevents aging, and enhances resistance to infections [13]. In clinical practice, the therapeutic effects of specific preparations can be tailored to treat various diseases and optimized by combining them based on distinct cold and heat symptoms [14]. T. chebula contains a diverse array of chemical constituents, primarily including phenolic acids, tannins, triterpenoids, and flavonoids [15]. In T. chebula, 33% of the total phytoconstituents are hydrolysable tannins, with variation between 20 and 50%. These tannins contain phenolic carboxylic acids such as gallic acid, as well as gallotannins. Ellagitannins including punacalagin, casuarinin, corilagin, and terchebulin as well as others such as chebulanin, neochebulinic, chebulagic, and chebulinic acids are also present [16]. However, the chemical complexity of T. chebula poses a unique analytical challenge. Its ellagic tannins often contain multiple configurations, including HHDP, chebulyl, or neochebulyl groups. Numerous isomers with the same molecular weight exist, and their fragment pathways are complex. Most components still lack systematic annotation, which significantly hinders the establishment of quality control standards and the investigation of the material basis for its efficacy. Therefore, addressing the challenge of component analysis in T. chebula can reveal the chemical nature of its therapeutic potential and offer insights into the metabolomic analysis of similar complex plants.
To overcome the above limitations, combined with the additive ions that are easily formed by the T. chebula components and the characteristic fragments, key ion diagnostics–neutral loss filtering (KID-NLF) is presented: a post-acquisition data processing approach that leverages characteristic fragment ions and neutral loss patterns of target compounds. KID-NLF constructs a genus-specific diagnostic ion database based on known T. chebula metabolites and their fragmentation profiles. This enables the prioritization of structurally relevant ions in complex, untargeted MS/MS data, thereby improving both the speed and accuracy of compound identification. The approach consists of two key components: (1) key ion diagnosis (KID), which utilizes high-resolution MS/MS data to identify diagnostic fragment ions specific to gallic acid derivatives, ellagitannins, and triterpenoids; and (2) neutral loss filtering (NLF), which systematically detects characteristic mass losses (e.g., 302 Da for HHDP, 320 Da for chebuloyl) to isolate structurally related compounds. This approach addresses challenges in isomer discrimination and low-abundance compound detection by exploiting the unique structural signatures of T. chebula constituents. It offers a robust framework for profiling even the most complex plant extracts.
In this study, KID-NLF was applied to analyze methanol extracts of T. chebula using UPLC-Q/TOF-MS. The method enabled the rapid identification of 164 compounds, 47 of which are newly reported. KID-NLF overcomes limitations of existing workflows by combining targeted fragmentation analysis with genus-specific database curation. This provides a transformative strategy for phytochemical profiling, especially in species rich in polyphenols and triterpenoids. The strategy enhances understanding of T. chebula’s chemical diversity and provides a scalable framework for quality control and drug discovery in medicinal plant research.

2. Results and Discussion

2.1. Research Strategy

Despite observing only a few major peaks in the UHPLC/UV analysis of T. chebula, the enlarged chromatogram revealed numerous minor compounds (Figure 1). To conduct a comprehensive analysis of these compounds, a 70% methanol extract was examined using UPLC-Q-TOF/MS in a non-targeted method (without specifying the parent ion). Operating in (—)-ESI mode, the analysis detected numerous compounds, facilitating the acquisition of key quasi-molecular ions with enhanced sensitivity. During the scan, all parent ions underwent analysis in MSE mode at a collision energy ranging from 20 to 40 eV. Using these settings, satisfactory secondary mass spectrometry data were obtained.
Subsequently, the data underwent processing using targeted KID-NLF efficiently identifying the parent ion and its respective daughter ion structure. The fundamental structure of T. chebula compounds, gallic acid, features similar skeletons and substitution patterns that facilitate the generation of KID and NLF in MS/MS spectra. KID-NLF were identified from the MS/MS spectra of reference substances, leading to the creation of a database cataloguing ions related to all reported Terminalia compound structures. High-resolution KID were next extracted via UNIFI software (Version 1.9) for MS diagnosis and MS/MS spectral data filtering, achieving rapid and efficient structural identification. Ultimately, further determination of the identified compounds’ structures was accomplished through in-depth analysis of their high-resolution MS and MS/MS spectra.
In summary, the analytical strategy comprises four key steps: (1) acquisition of high-resolution MS and MS/MS data via UPLC-Q-TOF/MS; (2) creation of a diagnostic ion database informed by MS/MS fragments, similar skeletons, and substitution patterns of reported T. chebula compounds; (3) identification of precursor ions and significant fragments using high-resolution diagnostic ions and neutral loss filtering; and (4) further elucidation of target compound structures through detailed MS and MS/MS spectral analysis (Figure 2). According to the KID-NLF strategy, a total of 164 components were identified (Figure 3).

2.2. Establishing a Diagnostic Ion Database

The process for establishing the database is illustrated in Figure 2A. Initially, the literature was reviewed to summarize and classify T. chebula compounds into three main structural types: gallic acid and its derivatives, ellagic acid and its derivatives, and terpenoids. Subsequently, characteristic diagnostic ions were identified for each type of compound.
For gallic acid and its derivatives, diagnosis involved identifying quasi-molecular ions, including [M–2H]2− and [M–H] ions. These compounds frequently feature multiple galloyl groups or exhibit neutral loss of gallic acid, resulting in ions such as [M–H–152] and [M–H–170]. Consequently, the accurate masses of [M–H–152], [M–H–170], and potential derivatives were utilized to develop a diagnostic ion database.
T. chebula contains ellagic acid and its derivatives, typically comprising HHDP (302 Da), chebuloyl (320 Da), neoche (338 Da), THDP (292 Da), DHHDP (318 Da), flavgallonyl (452 Da), and Gallagyl (602 Da) groups. These groups form complex combinations, possibly including HHDP alongside groups such as chebuloyl, neoche, THDP, DHHDP, Gallagyl, or flavgallonyl (Figure 4). In a similar manner, a diagnostic ion database for T. chebula ellagic acid compounds was created using the accurate masses of potential derivatives.
Terpenoids typically yield abundant deprotonated molecular ions [M–H] in the primary mass spectrometer, with some compounds also producing [2M–H] ions. Terpenoids in T. chebula commonly attach to sugar groups like glucose (Glc), galloyl, and glucoheptonic acid via various substitution methods. Aglycone fragments, which are relatively stable, typically lose sugar and galloyl segments (152 Da, 162 Da, 208 Da) under standard voltage. Analysis of terpenoid cleavage fragments reveals two primary types of aglycones, 503 Da and 487 Da, each with various configurations. Ultimately, the diagnostic ion database for T. chebula terpenoids was developed using the accurate masses of potential sugar derivatives.

2.3. Chemical Composition Analysis of T. chebula by Key Ion Diagnostics and Neutral Loss Filtering Using UPLC-Q-TOF/MS

Mass spectrometry analysis of gallic tannin or glucose gallate frequently reveals the neutral loss of multiple galloyl groups and gallic esters, as evidenced by the observation of fragments such as [M–H–152] and [M–H–170]. These characteristic neutral loss fragments serve as valuable indicators for deducing the molecular structure. Additionally, the primary mass spectrum typically displays high-intensity [M–2H]2− peaks for macromolecular components containing multiple galloyl groups, while derivatives with fewer galloyl groups primarily generate [M–H] or [M+HCOO] adduct ions. However, misinterpretation may occur wherein the [M–2H]2− peak is incorrectly identified as [M–H], leading to the misidentification of the true [M–H] peak as [2M–H]. To mitigate the potential for such misjudgments, the secondary mass spectrum is generally employed for confirmation. A comparative analysis of the primary and secondary mass spectra reveals that fragments exhibiting higher intensity in the primary mass spectrum often disappear in the secondary mass spectrum, indicating their status as [M–2H]2− characteristic peaks. Concurrently, fragments that exhibit reduced intensity in the secondary mass spectrum can be more reliably assigned as [M–H]. This analytical approach significantly enhances the reliability of structural analysis for the target compound by effectively reducing the risk of misidentification.
By using the UPLC-Q-TOF/MS instrument, (Waters Corp., Manchester, UK) the T. chebula extract could be analyzed within 33 min, and the data were processed by KID-NLF. As a result, a total of 164 compounds were rapidly identified (Figure 2 and Figure 3 and Table 1). They could be divided into three groups according to their structural types and MS/MS fragmentation pathways.

2.3.1. Gallic Acid Derivatives

In T. chebula, the primary compounds of gallic acid and its derivatives consist of simple gallic acid components and their acyl esters. The main constituents identified are gallic acid (8) and methyl gallate (39). These compounds are characterized by multiple phenolic hydroxyl and carboxyl groups, which render them susceptible to the loss of molecules such as CO2, CO, and H2O during cleavage. A relatively high content of gallic acid is present in myrobalan, with two significant ion peaks observed at m/z 169 and m/z 125 in the primary mass spectrum. Based on the peak elution time and ionic strength of the compounds, m/z 169 has been confirmed as the deprotonated molecular ion [M–H], while m/z 125 is identified as the main fragment ion resulting from in-source fragmentation, with the molecular formula C7H6O5. Further analysis indicates that the primary secondary fragment is the ion at m/z 125.0238 [M–H–CO2], formed by the removal of a carboxyl group (CO2, Δm = 44). By comparing with the mass spectrometry data of the reference substance, peak 8 has been confirmed as gallic acid. The primary mass spectrum of peak 39 exhibits a prominent abundance at m/z 183, which is identified as a deprotonated molecular ion [M–H], corresponding to the molecular formula C8H8O5. In the secondary mass spectrum, in addition to the m/z 169 and m/z 125 fragments of gallic acid, characteristic fragments m/z 168 [M–H–CH3] and m/z 124.0160 [M–H–CH3−CO2] have also been detected. Consequently, peak 39 has been confirmed as methyl gallate.
Simple galloyl ester compounds are mainly divided into three categories: gallic acid bound to glucose (Glc), gallic acid bound to quinic acid, and gallic acid bound to shikimic acid. In this study, 38 gallotannin components were identified. These components were classified according to the number of galloyl groups attached to the Glc molecules. The categories include mono-galloyl glucose (1, 6, 7, 9, 15), di-galloyl glucose (23, 30, 32, 36, 37, 40, 41, 44, 45, 49), tri-galloyl glucose (51, 55, 57, 60, 72, 73, 75, 77, 82, 85), tetra-galloyl glucose (80, 108, 112, 114, 120), and penta-galloyl glucose (130). Characteristic neutral losses were observed as galloyl and gallic acid moieties were gradually removed (Figure 5).
The mass spectrum of penta-galloyl glucose (m/z 939) demonstrated a sequential loss of galloyl groups, resulting in the formation of tetra-galloyl glucose (m/z 787), tri-galloyl glucose (m/z 635), di-galloyl glucose (m/z 483), and mono-galloyl glucose (m/z 331). Additionally, the primary intermediate ions observed for di-galloyl glucose and mono-galloyl glucose were m/z 271 and m/z 211, respectively. These fragment ions were attributed to the continuous loss of -CHOH groups from the glucose moiety, indicating that mono-galloyl glucose undergoes fragmentation to yield [M–H–60] and [M–H–60–60].
Based on the number and structure of galloyl and cinnamoyl groups, a series of isomers can be distinguished, and their peaks can be clearly identified. These compounds are categorized as cinnamoyl-mono-galloyl glucose (136, 140, 142, 144), cinnamoyl-di-galloyl glucose (143, 146, 152), and cinnamoyl-tri-galloyl glucose (157). In the mass spectrum, cinnamoyl-tri-galloyl glucose (m/z 765) exhibits a gradual loss of the gallic acid component, leading to the formation of cinnamoyl-di-galloyl glucose (m/z 595), cinnamoyl-mono-galloyl glucose (m/z 425), and cinnamoyl glucose (m/z 255) (Figure 6). Additionally, characteristic diagnostic ions of m/z 169, attributed to gallic acid, are observed in the mass spectrometry analysis. Ion fragments of m/z 125 are generated through the neutral loss of carboxyl groups (CO2, Δm = 44), while m/z 103 is formed by the loss of carboxyl groups (CO2, Δm = 44) from cinnamic acid, serving as a distinctive fragment for cinnamic acid. For the position isomers, the calculated lipophilicity parameter (ClogP) was used to estimate the retention time of isomers in the reversed-phase column as the basis for differentiation. Generally, compounds with a larger ClogP value would retain longer. The structures of these isomers were ultimately assigned by combining peak times with calculated ClogP values (Table S1).
Another type of simple galloyl ester is formed by the combination of gallic acid and shikimic acid. Based on the number of galloyl groups, these compounds can be categorized as mono-galloyl shikimic acid, di-galloyl shikimic acid, and tri-galloyl shikimic acid. In this study, a total of seven related compounds were identified and classified according to the number of galloyl groups connected to shikimic acid, including mono-galloyl shikimic acid (22, 25, 26), di-galloyl shikimic acid (48, 61, 65), and tri-galloyl shikimic acid (99). Prominent characteristic fragment ions were observed in the mass spectrum through the sequential elimination of galloyl and shikimic acid moieties. Tri-galloyl shikimic acid (m/z 629) demonstrated continuous mass loss of the galloyl moiety, leading to the formation of di-galloyl shikimic acid (m/z 477) and mono-galloyl shikimic acid (m/z 325). A fragment ion of m/z 169 was detected for all components and identified as [M–H] of gallic acid, resulting in a fragment of m/z 125 through the neutral loss of a carboxyl group (CO2, Δm = 44). In addition, fragment ions of m/z 155 and m/z 137 were attributed to the loss of water molecules (H2O, Δm = 18) from shikimic acid. Similarly, further removal of the carboxyl group (CO2, Δm = 44) after water loss resulted in fragment ions of m/z 111 and m/z 93, which are considered characteristic fragments of shikimic acid. Finally, by analyzing the arrangement of galloyl groups at different positions on shikimic acid, a series of isomers was identified (Figure 7). The assignment of each peak was determined by examining peak times and ClogP values (Table S1).
The final type of simple galloyl ester is formed by the combination of gallic acid and quinic acid. This compound typically produces a characteristic fragment at m/z 191. While components with two or three galloyl groups combined with quinic acid may exist, none were identified in this study. A total of three isomers of mono-galloyl quinic acid were identified. In the primary mass spectrum, peaks 5, 11, and 17 all displayed ion peaks at m/z 389 and m/z 343. Notably, the intensity of peak 17 was significantly higher than that of peaks 5 and 11, approximately five times greater. Additionally, peak 17 presented fragments at m/z 709 and m/z 687. In the secondary mass spectrum, the ion peak intensities of m/z 709, m/z 687, and m/z 389 were all reduced. Based on these observations, it was concluded that m/z 709 corresponds to the [2M+Na–2H] ion peak, m/z 687 corresponds to the [2M–H] ion peak, and m/z 389 represents the adduct ion [M+HCOO], with the corresponding molecular formula being C16H14O10. Fragment ion peaks of m/z 191 [quinic acid–H] and m/z 169 [gallic acid–H] can be generated through the neutral loss of quinic acid or gallic acid. By incorporating relevant literature, the order of the peaks was further clarified, leading to the identification of peak 5 as 3-galloyl quinic acid, peak 11 as 5-galloyl quinic acid, and peak 17 as 4-galloyl quinic acid.

2.3.2. Ellagitannins

Ellagitannins are a significant class of polyphenolic compounds containing one or more hexahydroxydiphenoyl (HHDP) groups or their oxidized forms, such as dehydrohexahydroxydiphenoyl (DHHDP) and chebuloyl (Che). Upon hydrolysis, they yield stable ellagic acid. In negative ion mode, ellagic acid tannins exhibit characteristic fragment ion losses, including HHDP (302 Da), DHHDP (318 Da), and chebuloyl (320 Da) (Figure 4).

Ellagic Acid and Its Simple Derivatives

The simple derivatives of ellagic acid primarily consist of ellagic acid conjugated with rhamnose to form glycosides at the 1-position, with additional hydroxyl groups on the rhamnose linked to multiple galloyl groups. MS analysis reveals that peaks 134, 137, and 138 display identical ions at m/z 375 and m/z 751 in the primary MS. The disappearance of m/z 375 ions in the secondary MS suggests that m/z 375 corresponds to the [M–2H]2 ion peak. The m/z 751 ion is identified as the [M–H] ion peak, corresponding to the molecular formula C34H24O20. In the secondary mass spectrum, fragment ions were observed at m/z 599 [M–H–152], m/z 581 [M–H–170], m/z 449 [M–H–302], m/z 411 [M–H–170–170], and m/z 301 [M–H–152–152–146]/[ellagic acid–H], generated by the sequential loss of galloyl, gallic, and ellagic acid groups. The polarity was assessed using ClogP values (Table S1), leading to the identification of peak 134 as 4-O-(2″,3″-di-O-galloyl-α-L-rhamnosyl) ellagic acid, peak 137 as 4-O-(2″,4″-di-O-galloyl-α-L-rhamnosyl) ellagic acid and peak 138 as 4-O-(3″,4″-di-O-galloyl-α-rhamnopyranosyl) ellagic acid. Similar cleavage patterns were observed for peaks 102 and 132, where peak 102 was identified as Eschweilenol C and peak 132 as 4-O-(4″-O-galloyl-α-rhamnopyranosyl) ellagic acid. Additionally, ellagic acid can directly conjugate with gallic acid. In the primary mass spectrum of peak 29, ion peaks were observed at m/z 469 and m/z 939. The intensity of the m/z 939 ion peak diminished in the secondary mass spectrum, indicating that m/z 939 corresponds to the [2M–H] ion peak, while m/z 469 corresponds to the [M–H] ion peak, with the molecular formula C21H10O13. Fragment ion peaks at m/z 425 [M–H–CO2], m/z 407 [M–H–CO2−H2O], and m/z 299 [M–H–gallic acid] were generated by the loss of CO2 and H2O. Peak 29 was identified as Valoneic acid dilactone. Terminalia chebula contains a significant amount of free ellagic acid. Numerous studies have focused on the quantification of ellagic acid using HPLC/UPLC. Peak 88 was identified as ellagic acid through comparison with reference standards.

Simple Tannins Containing a Single HHDP Group

Simple tannins containing a single HHDP group are primarily composed of compounds with one HHDP group and may include one or more attached galloyl groups. Based on the number of galloyl groups, these compounds can be classified into three types: mono-galloyl-HHDP glucose (10, 18, 38, 50, 71), di-galloyl-HHDP glucose (59, 68, 76, 110, 117), and tri-galloyl-HHDP glucose (81, 91, 97, 104, 107, 135). In mass spectrometry, these compounds exhibit a characteristic cleavage pattern, initially losing the HHDP group, followed by sequential losses of galloyl groups. The mass spectrum shows a characteristic neutral loss of 302 Da, indicative of HHDP group cleavage. For instance, tri-galloyl-HHDP glucose (m/z 937) first loses the HHDP group to generate tri-galloyl glucose (m/z 635), which then fragments following the cleavage pathway typical for tri-galloyl glucose. Similarly, mono-galloyl-HHDP glucose and di-galloyl-HHDP glucose exhibit comparable fragmentation patterns. Additionally, this class of compounds also produces a characteristic fragment at m/z 275, primarily generated by the elimination of the HHDP group, followed by the loss of a galloyl group and subsequently glucose.

Chebulic Acid and Its Simple Derivatives

The primary chebulic acids found in Terminalia chebula are neochebulic acid (3), chebulic acid (4), and isochebulic acid (12). These compounds are rich in phenolic hydroxyl and carboxyl groups, making them prone to losing CO2, H2O, and other groups during mass spectrometry fragmentation. In the primary mass spectrum, ion peaks at m/z 711, m/z 355, and m/z 337 correspond to peaks 3, 4, and 12, respectively. These ion peaks are also observed in the secondary mass spectrum. The m/z 711 peak is presumed to be the [2M–H] ion, m/z 355 as the [M–H] ion, and m/z 337 as a fragment with the molecular formula C14H12O11. During pyrolysis, the sequential loss of H2O and CO2 generates fragment ion peaks at m/z 293 [M–H–H2O–CO2], m/z 249 [M–H–H2O–2CO2], and m/z 205 [M–H–H2O–3CO2]. The polarity was assessed using ClogP values (Table S1), and peak order was established by comparison with reference standards. Consequently, peak 3 was identified as isochebulidic acid, peak 4 as chebulidic acid, and peak 12 as neochebulidic acid. In the primary mass spectrum, peaks 21 and 28 exhibit ion peaks at m/z 369 and m/z 739, respectively. In the secondary mass spectrum, the intensity of the m/z 739 ion peak diminishes, suggesting that m/z 739 corresponds to a [2M–H] ion, while m/z 369 corresponds to a [M–H] ion, with the molecular formula C15H14O11. During pyrolysis, these compounds produce fragment ion peaks at m/z 351 [M–H–H2O], m/z 325 [M–H–CO2], and m/z 307 [M–H–H2O−CO2] through the sequential loss of H2O and CO2. Peak 21 was identified as 7′-O-methylchebulate and peak 28 as 6′-O-methylchebulate.

Simple Tannins Containing a Single Chebuloyl or Neoche Group

In this study, the primary types of simple tannin-like components within the chebuloyl, neoche, and methylneoche groups were identified through mass spectrometry. These components primarily consist of a chebuloyl (320 Da), neoche (338 Da), or methylneoche (352 Da) group and their related derivatives, potentially containing multiple galloyl groups. The molecular weight difference between the chebuloyl and neoche groups is equivalent to one H2O molecule (18 Da). The key distinction between these groups lies in the attachment: the chebuloyl group is linked to two glycohydroxyl groups, while the neoche group is linked to only one. Based on the number of galloyl groups, these compounds can be further categorized as mono galloyl-neoche glucose (24, 27, 33, 35, 42, 52, 53, 56, 62, 63, 66), bisgalloyl-chebuloyl glucose (78, 92, 98), and trigaloyl-chebuloyl/neoche glucose (83, 94, 100, 106, 111, 122, 124, 125, 126, 127, 128, 129). These compounds exhibited neutral losses of 320 Da, 338 Da, or 352 Da in the mass spectrum. This indicates that these compounds are prone to losing chebuloyl, neoche, or methylneoche groups during mass spectrometric analysis. Specifically, the fragmentation of trigaloyl-chebuloyl/neoche glucose results in the loss of the chebuloyl/neoche group, forming trigaloyl glucose (m/z 635), which then undergoes further cleavage following the pattern typical of trigaloyl glucose. Likewise, monogalloyl-chebuloyl/neoche glucose displayed a comparable fragmentation pattern.

Tannins Containing HHDP, Chebuloyl, Neoche, and Other Groups

Terminalia chebula contains various tannins, which typically include groups such as HHDP (302 Da), chebuloyl (320 Da), neoche (338 Da), THDP (292 Da), DHHDP (318 Da), flavogallonyl (452 Da), and Gallagyl (602 Da). The combinations of these groups are complex and varied, potentially including HHDP along with other groups such as chebuloyl, neoche, THDP, DHHDP, Gallagyl, or flavogallonyl. Different components can be distinguished and identified based on the characteristic neutral loss of these groups and their related fragments.
In mass spectrometry, peaks 19, 20, 31, 43, and 84 display identical ions at m/z 541 and 1083 in the primary mass spectrum. The m/z 541 ion disappears in the secondary mass spectrum and is identified as [M–2H]2, while m/z 1083 is identified as the [M–H] ion with the molecular formula C48H28O30. The secondary mass spectrum reveals that these compounds generate major fragment ions at m/z 601 [M–H–HHDP–glucose] and m/z 300 [ellagic acid–H], indicating significant neutral losses of HHDP and glucose. Peaks 19 and 20 also exhibit characteristic fragments at m/z 451 [Flavogallonic acid–H–H2O]. Peak 19 is identified as Punicacortein C and peak 20 as Punicacortein D based on the literature comparison. Peaks 31 and 43 are characterized by m/z 781 [M–H–HHDP] fragments. The retention time and fragmentation pathway of peaks 31 and 43 were confirmed to correspond to Punicalagin-α and Punicalagin-β; the fragmentation pathway of Punicalagin is showed in Figure 8. Peak 84 is also characterized by m/z 449 [M–H–HHDP–glucose–galloyl], identified as T. chebula based on its retention time. The ions at m/z 542 and 1085 are observed in the primary mass spectrum, while m/z 542 disappears in the secondary spectrum. These are identified as [M–2H]2 and m/z 1085 as [M–H] with the molecular formula C48H30O30. Through the loss of HHDP and galloyl groups, the fragment ions m/z 783 [M–H–HHDP], m/z 631 [M–H–HHDP–galloyl], and characteristic fragments at m/z 451 [Flavogallonic acid–H–H2O] are generated. Based on ClogP values (Table S1) and the literature, peak 34 was identified as Rhoipteleanin G and peak 46 as Terflavin A. The ions at m/z 494 and 989 are observed in the primary mass spectrum, while m/z 494 disappears in the secondary spectrum. These ions are identified as [M–2H]2 and m/z 989 as [M–H], with the molecular formula C41H34O29. The compound first loses a neoche group, forming the m/z 651 [M–H–neoche] fragment, and then loses a gallic acid to generate the m/z 481 [M–H–neoche–gallic acid] fragment. The characteristic fragment at m/z 337 [neochebulic acid–H–H2O] is generated, and peak 58 is identified as Carpinusnin.
Peaks 64, 67, 70, 74, 79, 87, 101, 109, 118, and 121 display ions at m/z 485 and 971 in the primary mass spectrum. The m/z 485 ion disappears in the secondary mass spectrum, while m/z 971 is identified as an [M–H] ion with the molecular formula C41H32O28. These compounds are detected in the secondary mass spectrum through the loss of groups such as gallic acid, HHDP, and neoche. The fragment ions observed include m/z 801 [M-H-170], m/z 669 [M–H–302], m/z 633 [M–H–338], m/z 499 [M–H–170–302], m/z 463 [M–H–338–302], m/z 337 [neochebulic acid–H–H2O], and m/z 301 [ellagic acid–H]. Based on the chemical structure of components in Terminalia chebula, HHDP is likely attached to the 3,6, 4,6, or 2,3 hydroxyl groups of glucose, while neoche groups generally do not attach to the 1-hydroxyl group of glucose. Using ClogP values (Table S1) to assess polarity, the following isomer structures were inferred and confirmed: Peak 64 corresponds to 4-galloyl-6-neoche-2,3-HHDP-glucose and peak 67 to 1-galloyl-2-neoche-4,6-HHDP-glucose. Peak 70 is identified as 1-galloyl-3-neoche-4,6-HHDP-glucose and peak 74 as 1-galloyl-2-neoche-3,6-HHDP-glucose. Peak 79 corresponds to 1-galloyl-4-neoche-3,6-HHDP-glucose and peak 87 to 1-galloyl-4-neoche-2,3-HHDP-glucose. Peak 101 is identified as 2-galloyl-3-neoche-4,6-HHDP-glucose and peak 109 as 1-galloyl-6-neoche-2,3-HHDP-glucose. Peak 118 corresponds to 2-galloyl-4-neoche-3,6-HHDP-glucose and peak 121 to 6-galloyl-4-neoche-2,3-HHDP-glucose.
In the primary mass spectrum, the ion signal at m/z 953 was observed in peaks 86, 90, 95, 105, 119, and 131, while the ion at m/z 476 was detected in peaks 86, 90, 95, 105, 119, and 123, but not at peak 131. In the secondary mass spectrum, the signal at m/z 476 disappeared. Thus, m/z 476 was inferred to be the [M–2H]2 ion peak, and m/z 953 was identified as the [M–H] ion peak. The molecular formula was determined to be C41H30O27. Further analysis of the secondary mass spectrum indicates that the fragment patterns of peaks 86, 90, 95, 105, 119, and 123 are very similar. The main fragment ions observed include m/z 783 [M–H–170], m/z 651 [M–H–302], m/z 633 [M–H–320], m/z 481 [M–H–302–170], m/z 463 [M–H–320–170], m/z 337 [chebuloyl–H], m/z 331 [M–H–320–302], m/z 319 [M–H–302–170–162], and m/z 301 [ellagic acid–H]. These fragment ions primarily result from the loss of groups such as gallic acid, chebuloyl, HHDP, and glucose. Based on the fragment ion information, six isomers were identified. Peak 105 was identified as chebulagic acid by comparing its retention time and intensity with those of a reference standard; the fragmentation pathway of chebulagic acid is showed in Figure 9. By assessing the polarity of each peak using ClogP values (Table S1), the following structures were assigned: Peak 86 corresponds to 1-O-galloyl-3,4-chebuloyl-2,6-HHDP-D-glucose, and peak 90 corresponds to 1-O-galloyl-3,6-chebuloyl-2,4-HHDP-D-glucose. Peak 95 was assigned as 1-O-galloyl-4,6-chebuloyl-3,3-HHDP-D-glucose and peak 119 as 1-O-galloyl-2,6-chebuloyl-3,4-HHDP-D-glucose. Peak 123 was assigned as 1-O-galloyl-2,3-chebuloyl-4,6-HHDP-D-glucose. The secondary fragments of peak 131 mainly include m/z 935.0799 [M–H–H2O], m/z 917.0695 [M–H–2H2O], m/z 635.0896 [M–H–DHHDP], m/z 617.0781 [M–H–H2O– DHHDP], m/z 465.0671 [M–H–DHHDP–gallic acid], and m/z 316.9932 [DHHDP–H]. The loss of DHHDP (318 Da) is the main neutral loss characteristic of the compound, suggesting that peak 131 is terchebin.
Peaks 113 and 115 display identical ions at m/z 492 and 985 in the primary mass spectrum. The m/z 492 ion disappears in the secondary mass spectrum, indicating that m/z 492 corresponds to the [M–2H]2 ion, while m/z 985 is the [M–H] ion, with the molecular formula determined as C42H34O28. In the secondary mass spectrum, fragment ions were observed at m/z 815 [M–H–170], m/z 683 [M–H–302], m/z 633 [M–H–352], m/z 513 [M–H–302–170], m/z 463 [M–H–352–170], m/z 351 [6′-O-methyl neochebulic acid-H2O-H], and m/z 301 [ellagic acid–H]. These ions are generated through the sequential loss of gallic acid, HHDP, and 6′-O-methyl neochebuloyl. Based on the location of the galloyl group, two isomers were deduced, and their polarity was determined using ClogP values (Table S1). Peak 113 corresponds to 1-O-galloyl-3,6-HHDP-4-6′-methyl neochebuloyl-glucose, while peak 115 corresponds to 2-O-galloyl-3,6-HHDP-4-6′-methyl neochebuloyl-glucose.
Peaks 89, 93, 96, 103, 116, and 133 display identical ions at m/z 462 and 925 in the primary mass spectrum. The ion at m/z 462 disappears in the secondary mass spectrum, indicating that it corresponds to the [M–2H]2 ion, while m/z 925 is assigned to the [M–H] ion. The molecular formula is determined to be C40H30O26. In the secondary mass spectrum, fragment ions were observed at m/z 773 [M–H–152], m/z 633 [M–H–292], m/z 481 [M–H–292–152], and m/z 465 [M–H–292–170], resulting from the sequential loss of galloyl, THDP, and other groups. Based on the positions of the galloyl and THDP groups, six isomers were deduced, and their polarity was determined using ClogP values (Table S1). Peak 89 was identified as 1-O-galloyl-3,4-THDP-2,6-HHDP-D-glucose. Peak 93 was identified as 1-O-galloyl-2,4-THDP-6,6-HHDP-D-glucose (Phyllanthusiin C) and peak 96 as 1-O-galloyl-3,6-THDP-2,4-HHDP-D-glucose. Peak 103 was identified as 1-O-galloyl-4,6-HHDP-2,3-HHDP-D-glucose, while peak 116 was identified as 1-O-galloyl-2,3-HHDP-4,6-HHDP-D-glucose. Peak 133 was identified as 1-O-galloyl-2,6-HHDP-3,4-HHDP-D-glucose.

2.3.3. Terpenoids

Terpenoids generate abundant deprotonated molecular ions [M–H] in primary mass spectrometry, with some also forming [2M–H] ions. These characteristics facilitate the identification of excimer ions and the determination of their molecular formulas. The saponins in Terminalia chebula are typically linked to glucose (Glc), galloyl, and glucoheptonic acid through various substitution patterns. Aglycone fragments are relatively stable, primarily losing sugar and galloyl fragments (152, 162, 208 Da) under normal voltages. By analyzing the fragment ions of saponins, aglycones can be classified into 503 Da and 487 Da groups, each with multiple core configurations.
Peaks 139, 149, and 153 exhibit a high-intensity ion signal at m/z 711 in the primary mass spectrum, identified as the [M–H] ion peak corresponding to the molecular formula C37H60O13. In the secondary mass spectrum, the fragment at m/z 503 [M–H–208] displayed a high-intensity signal, suggesting that it resulted from the loss of glucoheptonic acid. Based on the saponin characteristics, the fragment at m/z 503 is attributed to the loss of glucoheptonic acid. Peak 139 was identified as Arjungenin-24-O-glucoheptonic acid, and peak 149 as Madecassic acid-24-O-glucoheptonic acid. Peak 153 was identified as Terminolic acid-24-O-glucoheptonic acid. Peaks 141, 145, and 147 exhibit strong ion signals at m/z 817 in the primary mass spectrum. Combined with secondary mass spectrometry, m/z 817 is identified as the [M–H] ion peak, corresponding to the molecular formula C43H62O15. The secondary mass spectrum reveals intense fragment ions at m/z 655 [M–H–162] and m/z 503 [M–H–162–152], likely resulting from the loss of glucose and galloyl groups, as indicated by the characteristic fragments at m/z 503. Peak 141 was identified as Quercotriterpenoside I (Arjungenin-24-galloyl-28-glucose), peak 145 as Madecassic acid-24-galloyl-28-glucose, and peak 147 as Terminolic acid-24-galloyl-28-glucose. Peaks 158, 159, and 161 exhibit similar ion signals at m/z 503, 549, and 1007 in the primary mass spectrum. The m/z 1007 peak is identified as the [2M–H] ion, while the m/z 549 peak corresponds to the [M+HCOO] ion, with a molecular formula of C30H48O6. Peak 158 was identified as Arjungenin, peak 159 as Madecassic acid, and peak 161 as Terminolic acid.
Peaks 148, 151, and 155 appear as [M–H] ion peaks at m/z 801, corresponding to the molecular formula C43H62O14. The intense fragments at m/z 639 [M–H–162] and m/z 487 [M–H–162–152] in the secondary mass spectrum suggest that the characteristic fragments at m/z 487 result from the loss of glucose and gallic acid groups. Peak 148 was identified as Rotundic acid-24-galloyl-28-glucose and peak 151 as Asiatic acid-24-galloyl-28-glucose. Peak 155 was identified as Arjunolic acid-24-galloyl-28-glucose. Peaks 150, 154, and 156 all appear as [M–H] ion peaks at m/z 695, with the molecular formula identified as C37H60O12. In the secondary mass spectrum, an intense fragment at m/z 487 [M–H–208] was observed, which was attributed to the loss of glucoheptonic acid. Peak 150 was identified as Rotundic acid-24-O-glucoheptonic acid. Peak 154 was identified as Asiatic acid-24-O-glucoheptonic acid and peak 156 as Arjunolic acid-24-O-glucoheptonic acid. Peak 160 exhibits a strong ion signal at m/z 639, corresponding to the molecular formula C37H52O9. In the secondary mass spectrum, a fragment at m/z 487 [M–H–152] was detected, attributed to the loss of a galloyl group. Peak 160 was identified as 23-galloyl-arjunolic acid. Peaks 162, 163, and 164 show ion peaks at m/z 487 and 975 in the primary mass spectrum. The m/z 975 peak is identified as the [2M–H] ion, while m/z 487 corresponds to the [M–H] ion, with a molecular formula of C30H48O5. Peak 162 was identified as Rotundic acid. Peak 163 was identified as Asiatic acid and peak 164 as Arjunolic acid.

2.3.4. Other Components

Besides the previously mentioned ingredients, four additional compounds (2, 16, 47, 54, 69) have been identified in T. chebula. Details regarding their chemical compositions are provided in the accompanying table.

2.4. The Applicability of the KID-NLF Strategy

The KID-NLF strategy has established identification methods for various medicinal plants by summarizing the research of Moilanen et al. [60] and examining the mass spectrometry rules in T. chebula. Employing this strategy allows for the identification of gallic acid derivatives, ellagitannins, and triterpenoids by precisely recognizing quasi-molecular ion peaks and their related secondary characteristic fragments. In addition to T. chebula, plants like Phyllanthus emblica [61] and Punica granatum [62] (rich in tannins) as well as Panax ginseng, P. quinquefolium, and P. notoginseng (containing triterpenoids) [63] also adhere to this identification rule. Further research has shown that iridoid glycosides, phenolic acids, and flavonoids are compatible with this strategy. For instance, iridoid glycosides dipsanosides A and B demonstrate specific mass spectrometry features, such as m/z 1519.519 [M+HCOO], 1473.514 [M–H], 759.252 [M+2HCOO]2−, and 736.249 [M–2H]2− as adduct ions. Similarly, the phenolic acids isochlorogenic acid A, B, and C display peaks at m/z 1029.229 [2M–H], 537.100 [M+Na–2H], and 515.118 [M–H], and the flavonoids hyperoside and isoquercitrin show ions at m/z 949.164 [2M+Na–2H], 927.182 [2M–H], and 463.089 [M–H] [64,65]. Notably, alkaloids generate primarily [M+H]+ type quasi-molecular ions and lack distinctive adduct patterns, making them incompatible with the KID-NLF strategy’s rule system for mass spectrometry. This finding indicates the need for methodological validation based on component type when applying this strategy. Consequently, it is advised to undertake methodological validation according to component type when implementing this strategy and to formulate a specific identification strategy for alkaloids.

3. Experimental

3.1. Chemicals, Reagents, and Plant Materials

Deionized water was prepared using a Millipore Q purification system (Rephile, Shanghai, China). HPLC-grade acetonitrile, methanol, and formic acid were obtained from Sigma-Aldrich (Milwaukee, WI, USA). The crude medicinal materials derived from dried, pitted mature fruits of T. chebula were procured from Chengdu in 2023. The author identified these materials, and the specimens were deposited in the laboratory of the author. Reference products including shikimic acid (2), gallic acid (8), punicalagin α (31), punicalagin β (43), corilagin (71), ellagic acid (88), chebulagic acid (105), chebulinic acid (127), and 1,2,3,4,6-penta-O-galloyl-β-D-glucose (130) were purchased from Yuanye (Shanghai, China), with purity exceeding 98% as determined by HPLC analysis.

3.2. Sample Preparation

Approximately 1 g of pitted and dried fruit powder of T. chebula powder, sieved through a No. 3 mesh, was accurately weighed and transferred into a stoppered conical flask. Subsequently, 50 mL of 70% (v/v) methanol was added with precision. The sealed flask was weighed and then subjected to ultrasonic treatment at 250 W and 40 kHz for 30 min. After being allowed to cool, the flask was reweighed and any weight loss compensated with 70% methanol. The supernatant was centrifuged at 13,000 rpm for 15 min and filtered through a 0.22 μm microporous filter membrane.

3.3. UHPLC Analysis

UHPLC analysis was conducted using a Shimadzu LC-40 system (Shimadzu Corporation, Kyoto, Japan), which was equipped with an SPD-M40 detector. A Shim-pack GIST-C18 column (2.1 × 100 mm, 2 μm) was utilized. The mobile phase consisted of 0.2% phosphoric acid aqueous solution (A) and acetonitrile (B). A gradient elution profile was employed as follows: 0–3 min, 3–6% B; 3–4 min, 6–6% B; 4–5 min, 6–7% B; 5–6 min, 7–7% B; 6–7 min, 7–15% B; 7–10 min, 15–15% B; 10–11 min, 15–21% B; 11–23 min, 21–21% B; 23–26 min, 21–33% B; 26–28 min, 33–35% B; 28–40 min, 35–35%. Detection wavelengths were monitored at 254 nm. The flow rate was set at 0.4 mL/min, injection volume at 2 μL, and column temperature at 30 °C.

3.4. UPLC-Q-TOF/MS Analysis

UPLC analysis was carried out using a 100 mm × 2.1 mm, 1.7 μm Waters Acquity UPLCR BEH C18 column (Waters Corporation, Milford, MA, USA). The mobile phase comprised acetonitrile (A) and water (B), each containing 0.1% formic acid. The linear gradient program was set as follows: from 2% to 4% B over 0–2 min; from 4% to 5% B over 2–4 min; from 5% to 6% B over 4–11 min; from 6% to 9% B over 11–17 min; from 9% to 13% B over 17–21 min; maintained at 13% B over 21–27 min; from 13% to 60% B over 27–32 min; from 60% to 99% B over 32–33 min; maintained at 99% B over 33–38 min; from 99% to 1% B over 38–40 min. The flow rate was set at 0.4 mL/min, injection volume at 1 μL, and column temperature at 40 °C.
The Waters SYNAPT G2HDMS system with an ion source was employed for electrospray ionization (ESI). Scanning was conducted in negative (ESI) ion mode, using nitrogen as the atomization and conical gas. The source temperature was set at 100 °C, and the cone gas flow rate was maintained at 40 L/h. The desolvation temperature was held at 350 °C, and the gas flow rate was 800 L/h. Further MS settings included a sampling cone voltage of 40 V, extraction cone voltage of 4 V, capillary voltage of 2.5 kV, scan time of 0.3 s, inter scan time of 0.02 s, and a mass-to-charge ratio (m/z) ranging from 100 to 1200. Leucine-enkephalin (200 pg/mL) flowing at 10 μL/min was employed to calibrate the mass number m/z 554.2615. Data processing was conducted using MassLynx V4.2 and UNIFI software (Version 1.9) from Waters Corporation, Milford, MA, USA.

4. Conclusions

In conclusion, a post-acquisition LC-MS data processing strategy, key ion diagnostics–neutral loss filtering (KID-NLF), can effectively identify the structure of the natural products responsible for the herbal extract. In this study, a total of 164 compounds were identified by UPLC-Q-TOF/MS technique and KID-NLF strategy screening in 33 min running time, 47 of which were reported for the first time. This study provides a powerful strategy for rapid profiling of chemical constituents of herbal medicines.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/molecules30112451/s1, Table S1: The LogP and CLogP values of the chemical components in T. chebula were identified by UPLC-Q-TOF/MS.

Author Contributions

J.Y.: writing—original draft preparation, drawing preparation, table arrangement; X.Z.: drawing preparation, table arrangement; Y.H.: drawing preparation, table arrangement; Y.Z.: funding acquisition, conceptualization, supervision. C.T.: conceptualization, writing—review and editing, supervision. All authors have read and agreed to the published version of the manuscript.

Funding

The authors gratefully acknowledge the financial support from the National Key Research and Development Program of China (No. 2023YFC3504400, 2023YFC3504401, 2023YFC3504402), the Natural Science Foundation of Sichuan Province (2025ZNSFSC1821), and the “Xinglin Scholars” Program of Chengdu University of TCM (CCYB2022009).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare that they have no competing financial interests or personal relationships that may have influenced the work reported in this study.

References

  1. Klein-Junior, L.C.; de Souza, M.R.; Viaene, J.; Bresolin, T.M.B.; de Gasper, A.L.; Henriques, A.T.; Heyden, Y.V. Quality Control of Herbal Medicines: From Traditional Techniques to State-of-the-art Approaches. Planta Med. 2021, 87, 964–988. [Google Scholar] [CrossRef] [PubMed]
  2. Li, X.; Wen, D.; He, Y.; Liu, Y.; Han, F.; Su, J.; Lai, S.; Zhuang, M.; Gao, F.; Li, Z. Progresses and Prospects on Glucosinolate Detection in Cruciferous Plants. Foods 2024, 13, 4141. [Google Scholar] [CrossRef] [PubMed]
  3. Ye, C.X.; Huang, W.J.; Qin, S.Y.; Mai, Z.Y.; Wang, L.Q.; Li, S.S.; Xu, H.R.; Wang, D.H.; Yan, S.K.; Xiao, X. Application Progress of 2D-LC in Analysis and Quality Control of Traditional Chinese Medicine. Chem. Reag. 2022, 44, 1103–1110. [Google Scholar] [CrossRef]
  4. Hong, L.L.; Wang, H.D.; Xu, X.Y.; Hu, W.D.; Liu, J.Y.; Wang, X.Y.; Gao, X.M.; Yang, W.Z. Systematic characterization and identification of the chemical constituents of the Schisandra chinensis decoction based on a hybrid scanning technique of UHPLC/IM-QTOF-MS. Acta Pharm. Sin. 2024, 59, 678–692. [Google Scholar] [CrossRef]
  5. Qiao, X.; Li, R.; Song, W.; Miao, W.J.; Liu, J.; Chen, H.B.; Guo, D.A.; Ye, M. A targeted strategy to analyze untargeted mass spectral data: Rapid chemical profiling of Scutellaria baicalensis using ultra-high performance liquid chromatography coupled with hybrid quadrupole orbitrap mass spectrometry and key ion filtering. J. Chromatogr. A 2016, 1441, 83–95. [Google Scholar] [CrossRef]
  6. Chen, J.; Wang, X.; Xin, X.; Zheng, Y.; Hou, F.; Li, C.; Guo, L.; Wang, L. Comprehensive comparison of two colour varieties of Perillae folium by UHPLC-Q-TOF/MS analysis combining with feature-based molecular networking. Food Chem. 2025, 463, 141293. [Google Scholar] [CrossRef]
  7. Schmid, R.; Petras, D.; Nothias, L.F.; Wang, M.X.; Aron, A.T.; Jagels, A.; Tsugawa, H.; Rainer, J.; Garcia-Aloy, M.; Dührkop, K.; et al. Ion identity molecular networking for mass spectrometry-based metabolomics in the GNPS environment. Nat. Commun. 2021, 12, 3832. [Google Scholar] [CrossRef]
  8. Duhrkop, K.; Fleischauer, M.; Ludwig, M.; Aksenov, A.A.; Melnik, A.V.; Meusel, M.; Dorrestein, P.C.; Rousu, J.; Bocker, S. SIRIUS 4: A rapid tool for turning tandem mass spectra into metabolite structure information. Nat. Methods 2019, 16, 299–302. [Google Scholar] [CrossRef]
  9. Wang, C.; Zhang, H.; Wang, X.; Wang, X.; Li, X.; Li, C.; Wang, Y.; Zhang, M. Comprehensive Review on Fruit of Terminalia chebula: Traditional Uses, Phytochemistry, Pharmacology, Toxicity, and Pharmacokinetics. Molecules 2024, 29, 5547. [Google Scholar] [CrossRef]
  10. Dhar, A.; Maurya, S.K.; Mishra, A.; Singh, G.K.; Singh, M.K.; Seth, A. Preliminary Screening of a Classical Ayurvedic Formulation for Anticonvulsant Activity. Anc. Sci. Life 2016, 36, 28–34. [Google Scholar] [CrossRef]
  11. Nigam, M.; Mishra, A.P.; Adhikari-Devkota, A.; Dirar, A.I.; Hassan, M.M.; Adhikari, A.; Belwal, T.; Devkota, H.P. Fruits of Terminalia chebula Retz.: A review on traditional uses, bioactive chemical constituents and pharmacological activities. Phytother. Res. 2020, 34, 2518–2533. [Google Scholar] [CrossRef] [PubMed]
  12. Gao, H.; Lu, H.; Fang, N.; Su, J.; Li, R.; Wang, W.; Zhang, Y. The potential of Terminalia chebula in alleviating mild cognitive impairment: A review. Front. Pharmacol. 2024, 15, 1484040. [Google Scholar] [CrossRef]
  13. Kim, H.L.; Choi, B.K.; Yang, S.H. Terminalia chebula Medicinal Uses: A Review of in vitro and in vivo Studies. Biotechnol. Bioprocess Eng. 2022, 27, 703–713. [Google Scholar] [CrossRef]
  14. Zhang, Y.X.; Zhang, Y.S.; Zhang, G.X.; Sun, Y.; Yang, W.L.; Zhang, C.Y. Research Progress and Application Prospects of Medicinal and Edible Tibetan Medicine Terminalia chebula. Chin. Wild Plant Res. 2024, 43, 49–59. [Google Scholar] [CrossRef]
  15. Tiwari, M.; Barooah, M.S. A comprehensive review on the ethno-medicinal and pharmacological properties of Terminalia chebula fruit. Phytochem. Rev. 2024, 23, 125–145. [Google Scholar] [CrossRef]
  16. Singh, A.; Bajpai, V.; Kumar, S.; Sharma, K.R.; Kumar, B. Profiling of Gallic and Ellagic Acid Derivatives in Different Plant Parts of Terminalia arjuna by HPLC-ESI-QTOF-MS/MS. Nat. Prod. Commun. 2016, 11, 239–244. [Google Scholar] [CrossRef]
  17. Kaunda, J.S.; Zhang, Y.J. Two New Phenolic Constituents from the Stems of Euphorbia griffithii. Nat. Prod. Bioprospect. 2019, 9, 405–410. [Google Scholar] [CrossRef]
  18. Li, K.; Han, X.; Li, R.; Xu, Z.; Pan, T.; Liu, J.; Li, B.; Wang, S.; Diao, Y.; Liu, X. Composition, Antivirulence Activity, and Active Property Distribution of the Fruit of Terminalia chebula Retz. J. Food Sci. 2019, 84, 1721–1729. [Google Scholar] [CrossRef]
  19. Ding, G.; Liu, Y.Z.; Yan, Z.; Lu, Y.R.; Sheng, L.S. Two new isomers of chebulic acid from Terminalia chebula. J. Chin Pharma Univ. 2000, 32, 333–335. [Google Scholar]
  20. Lee, D.Y.; Kim, H.W.; Yang, H.; Sung, S.H. Hydrolyzable tannins from the fruits of Terminalia chebula Retz and their α-glucosidase inhibitory activities. Phytochemistry 2017, 137, 109–116. [Google Scholar] [CrossRef]
  21. Bouchet, N.; Levesque, J.; Blond, A.; Bodo, B.; Poisset, J.L. 1,3-di-O-galloylquinic acid from Guiera senegalensis. Phytochemistry 1996, 42, 189–190. [Google Scholar] [CrossRef]
  22. Ahn, M.J.; Kim, C.Y.; Lee, J.S.; Kim, T.G.; Kim, S.H.; Lee, C.K.; Lee, B.B.; Shin, C.G.; Huh, H.; Kim, J. Inhibition of HIV-1 integrase by galloyl glucoses from Terminalia chebula and flavonol glycoside gallates from Euphorbia pekinensis. Planta Med. 2002, 68, 457–459. [Google Scholar] [CrossRef] [PubMed]
  23. Li, Y.J.; He, F.Q.; Zhao, H.H.; Li, Y.; Chen, J. Screening and identification of acetylcholinesterase inhibitors from Terminalia chebula fruits by immobilized enzyme on cellulose filter paper coupled with ultra-performance liquid chromatography-quadrupole time-of-flight mass spectrometry and molecular docking. J. Chromatogr. A 2022, 1663, 462784. [Google Scholar] [CrossRef] [PubMed]
  24. Liang, C.C.; Zhang, F.Q.; Chen, J. Screening and characterization of cosmetic efficacy components of Terminalia chebula based on biological activity-guided methodology. Biomed. Chromatogr. 2024, 38, e5974. [Google Scholar] [CrossRef]
  25. Tang, Y.; Yin, L.; Zhang, Y.; Huang, X.; Zhao, F.; Cui, X.; Shi, L.; Xu, L. Study on anti-inflammatory efficacy and correlative ingredients with pharmacodynamics detected in acute inflammation rat model serum from Caulis Lonicerae japonicae. Phytomedicine 2016, 23, 597–610. [Google Scholar] [CrossRef]
  26. Saha, S.; Imran, I.B.B. Sensitive lanthanide label array method for rapid fingerprint analysis of plant polyphenols based on time-resolved luminescence. Anal. Methods 2019, 11, 5044–5054. [Google Scholar] [CrossRef]
  27. Ajala, O.S.; Jukov, A.; Ma, C.M. Hepatitis C virus inhibitory hydrolysable tannins from the fruits of Terminalia chebula. Fitoterapia 2014, 99, 117–123. [Google Scholar] [CrossRef]
  28. Chen, H.F.; Zhang, C.; Yao, Y.; Li, J.M.; Du, W.D.; Li, M.L.; Wu, B.; Yang, S.L.; Feng, Y.L.; Zhang, W.G. Study on anti-hyperuricemia effects and active ingredients of traditional Tibetan medicine TongFengTangSan (TFTS) by ultra-high-performance liquid chromatography coupled with quadrupole time-of-flight mass spectrometry. J. Pharm. Biomed. Anal. 2019, 165, 213–223. [Google Scholar] [CrossRef]
  29. Wang, Z.; Zhang, Y.; Fu, Z.; Jin, T.; Zhao, C.; Zhao, M. A comprehensive strategy for quality evaluation of Changan powder by fingerprinting combined with rapid qualitative and quantitative multi-ingredients profiling. Phytochem. Anal. 2024, 35, 840–859. [Google Scholar] [CrossRef]
  30. Han, N.; Ye, Q.; Guo, Z.; Liang, X. Metabolomics analysis of differential chemical constituents and α-glucosidase inhibiting activity of Phyllanthus urinaria L. root, stem, leaf and fruit. Nat. Prod. Res. 2023, 37, 642–645. [Google Scholar] [CrossRef]
  31. Tanaka, T.; Jiang, Z.H.; Kouno, I. Structures and biogenesis of rhoipteleanins, ellagitannins formed by stereospecific intermolecular C-C oxidative coupling, isolated from Rhoiptelea chiliantha. Chem. Pharm. Bull. 1997, 45, 1915–1921. [Google Scholar] [CrossRef]
  32. Jha, A.K.; Sit, N. Effect of ultrasound, microwave, and enzymatically pre-treated Terminalia chebula pulp on extraction of bioactive compounds using supercritical CO2. Sustain. Chem. Pharm. 2023, 33, 101098. [Google Scholar] [CrossRef]
  33. Li, C.W.; Dong, H.J.; Cui, C.B. The synthesis and antitumor activity of twelve galloyl glucosides. Molecules 2015, 20, 2034–2060. [Google Scholar] [CrossRef] [PubMed]
  34. de Oliveira, F.M.G.; Lyrio, M.V.V.; Filgueiras, P.R.; de Castro, E.V.R.; Kuster, R.M. ESI(-)FT-ICR MS for the determination of best conditions for producing extract abundant in phenolic compounds from leaves of E. uniflora and FTIR-PCA as a sample screening method. Anal. Methods 2024, 16, 3701–3713. [Google Scholar] [CrossRef] [PubMed]
  35. Zheng, Y.; Zhao, L.; Yi, J.; Cai, S. Effects and Mechanisms of Rhus chinensis Mill. Fruits on Suppressing RANKL-Induced Osteoclastogenesis by Network Pharmacology and Validation in RAW264.7 Cells. Nutrients 2022, 14, 1020. [Google Scholar] [CrossRef]
  36. Zheng, W.; Zhang, M.; Zhao, Y.; Miao, K.; Jiang, H. NMR-based metabonomic analysis on effect of light on production of antioxidant phenolic compounds in submerged cultures of Inonotus obliquus. Bioresour. Technol. 2009, 100, 4481–4487. [Google Scholar] [CrossRef]
  37. Li, M.; Zhang, Y.; Yan, J.; Ding, F.; Chen, C.; Zhong, S.; Li, M.; Zhu, Y.; Yue, P.; Li, P.; et al. Comparative Metabolomic Analysis Reveals the Differences in Nonvolatile and Volatile Metabolites and Their Quality Characteristics in Beauty Tea with Different Extents of Punctured Leaves by Tea Green Leafhopper. J. Agric. Food Chem. 2023, 71, 16233–16247. [Google Scholar] [CrossRef]
  38. Xu, M.; Zhu, H.T.; Cheng, R.R.; Wang, D.; Yang, C.R.; Tanaka, T.; Kouno, I.; Zhang, Y.J. Antioxidant and hyaluronidase inhibitory activities of diverse phenolics in Phyllanthus emblica. Nat. Prod. Res. 2016, 30, 2726–2729. [Google Scholar] [CrossRef]
  39. Chang, C.W.; Lin, M.T.; Lee, S.S.; Liu, K.C.; Hsu, F.L.; Lin, J.Y. Differential inhibition of reverse transcriptase and cellular DNA polymerase-alpha activities by lignans isolated from Chinese herbs, Phyllanthus myrtifolius Moon, and tannins from Lonicera japonica Thunb and Castanopsis hystrix. Antiviral. Res. 1995, 27, 367–374. [Google Scholar] [CrossRef]
  40. Zhang, X.R.; Qiao, Y.J.; Zhu, H.T.; Kong, Q.H.; Wang, D.; Yang, C.R.; Zhang, Y.J. Multiple in vitro biological effects of phenolic compounds from Terminalia chebula var. tomentella. J. Ethnopharmacol. 2021, 275, 114135. [Google Scholar] [CrossRef]
  41. Przybylska, D.; Kucharska, A.Z.; Cybulska, I.; Sozański, T.; Piórecki, N.; Fecka, I. Cornus mas L. Stones: A Valuable By-Product as an Ellagitannin Source with High Antioxidant Potential. Molecules 2020, 25, 4646. [Google Scholar] [CrossRef] [PubMed]
  42. Shimozu, Y.; Kuroda, T.; Tsuchiya, T.; Hatano, T. Structures and Antibacterial Properties of Isorugosins H-J, Oligomeric Ellagitannins from Liquidambar formosana with Characteristic Bridging Groups between Sugar Moieties. J. Nat. Prod. 2017, 80, 2723–2733. [Google Scholar] [CrossRef] [PubMed]
  43. Liu, J.B.; Ding, Y.S.; Zhang, Y.; Chen, J.B.; Cui, B.S.; Bai, J.Y.; Lin, M.B.; Hou, Q.; Zhang, P.C.; Li, S. Anti-inflammatory Hydrolyzable Tannins from Myricaria bracteata. J. Nat. Prod. 2015, 78, 1015–1025. [Google Scholar] [CrossRef]
  44. Sehgal, R.; Sharma, A.K.; Singh, B.J.; Saini, R.V.; Saini, A.K.; Beniwal, V. Augmenting the antioxidant, anti-bacterial and anti-carcinogenic potential of Terminalia chebula and Terminalia bellirica after tannin acyl hydrolase mediated biotransformation. Biocatal. Agric. Biotechnol. 2024, 56, 103045. [Google Scholar] [CrossRef]
  45. Avula, B.; Wang, Y.H.; Isaac, G.; Yuk, J.; Wrona, M.; Yu, K.; Khan, I.A. Metabolic Profiling of Hoodia, Chamomile, Terminalia Species and Evaluation of Commercial Preparations Using Ultrahigh-Performance Liquid Chromatography Quadrupole-Time-of-Flight Mass Spectrometry. Planta Med. 2017, 83, 1297–1308. [Google Scholar] [CrossRef]
  46. Takemoto, M.; Kawamura, Y.; Hirohama, M.; Yamaguchi, Y.; Handa, H.; Saitoh, H.; Nakao, Y.; Kawada, M.; Khalid, K.; Koshino, H.; et al. Inhibition of protein SUMOylation by davidiin, an ellagitannin from Davidia involucrata. J. Antibiot. 2014, 67, 335–338. [Google Scholar] [CrossRef]
  47. Haramiishi, R.; Okuyama, S.; Yoshimura, M.; Nakajima, M.; Furukawa, Y.; Ito, H.; Amakura, Y. Identification of the characteristic components in walnut and anti-inflammatory effect of glansreginin A as an indicator for quality evaluation. Biosci. Biotechnol. Biochem. 2020, 84, 187–197. [Google Scholar] [CrossRef]
  48. Spinelli, L.V.; Anzanello, M.J.; Areze da Silva Santos, R.; Carboni Martins, C.; Freo Saggin, J.; Aparecida Silva Da Silva, M.; Rodrigues, E. Uncovering the phenolic diversity of Guabiju fruit: LC-MS/MS-based targeted metabolomics approach. Food Res. Int. 2023, 173, 113236. [Google Scholar] [CrossRef]
  49. Pfundstein, B.; El Desouky, S.K.; Hull, W.E.; Haubner, R.; Erben, G.; Owen, R.W. Polyphenolic compounds in the fruits of Egyptian medicinal plants (Terminalia bellerica, Terminalia chebula and Terminalia horrida): Characterization, quantitation and determination of antioxidant capacities. Phytochemistry 2010, 71, 1132–1148. [Google Scholar] [CrossRef]
  50. Engström, M.T.; Arvola, J.; Nenonen, S.; Virtanen, V.T.J.; Leppä, M.M.; Tähtinen, P.; Salminen, J.P. Structural Features of Hydrolyzable Tannins Determine Their Ability to Form Insoluble Complexes with Bovine Serum Albumin. J. Agric. Food Chem. 2019, 67, 6798–6808. [Google Scholar] [CrossRef]
  51. Liu, L.; Jia, W.; Jiang, S.; Zhang, G.; Zhao, J.; Xu, J.; Wang, L.; Wu, D.; Tao, J.; Yue, H.; et al. Inhibitory activities and rules of plant gallotannins with different numbers of galloyl moieties on sucrase, maltase and α-amylase in vitro and in vivo. Phytomedicine 2023, 120, 155063. [Google Scholar] [CrossRef]
  52. Pradeep, S.; Patil, S.M.; Dharmashekara, C.; Jain, A.; Ramu, R.; Shirahatti, P.S.; Mandal, S.P.; Reddy, P.; Srinivasa, C.; Patil, S.S.; et al. Molecular insights into the in silico discovery of corilagin from Terminalia chebula as a potential dual inhibitor of SARS-CoV-2 structural proteins. J. Biomol. Struct. Dyn. 2023, 41, 10869–10884. [Google Scholar] [CrossRef] [PubMed]
  53. Kashiwada, Y.; Nonaka, G.; Nishioka, I.; Lee, K.J.; Bori, I.; Fukushima, Y.; Bastow, K.F.; Lee, K.H. Tannins as potent inhibitors of DNA topoisomerase II in vitro. J. Pharm. Sci. 1993, 82, 487–492. [Google Scholar] [CrossRef] [PubMed]
  54. Quan, S.; Wen, M.; Xu, P.; Chu, C.; Zhang, H.; Yang, K.; Tong, S. Efficient screening of pancreatic lipase inhibitors from Rheum palmatum by affinity ultrafiltration-high-performance liquid chromatography combined with high-resolution inhibition profiling. Phytochem. Anal. 2024, 35, 540–551. [Google Scholar] [CrossRef] [PubMed]
  55. Lee, D.Y.; Yang, H.; Kim, H.W.; Sung, S.H. New polyhydroxytriterpenoid derivatives from fruits of Terminalia chebula Retz. and their α-glucosidase and α-amylase inhibitory activity. Bioorg. Med. Chem. Lett. 2017, 27, 34–39. [Google Scholar] [CrossRef]
  56. Xu, P.; Xiang, H.; Wen, W.; Quan, S.; Qiu, H.; Chu, C.; Tong, S. Application of two-dimensional reversed phase countercurrent chromatography × high-performance liquid chromatography to bioactivity-guided screening and isolation of α-glucosidase inhibitors from Rheum palmatum L. J. Chromatogr. A 2024, 1717, 464667. [Google Scholar] [CrossRef]
  57. Xu, Y.; Zhang, L.; Wang, Q.; Luo, G.; Gao, X. An integrated strategy based on characteristic fragment filter supplemented by multivariate statistical analysis in multi-stage mass spectrometry chromatograms for the large-scale detection and identification of natural plant-derived components in rat: The rhubarb case. J. Pharm. Biomed. Anal. 2019, 174, 89–103. [Google Scholar] [CrossRef]
  58. Kim, M.S.; Lee, D.Y.; Sung, S.H.; Jeon, W.K. Anti-cholinesterase activities of hydrolysable tannins and polyhydroxytriterpenoid derivatives from Terminalia chebula Retz. fruit. Rec. Nat. Prod. 2018, 13, 284–289. [Google Scholar] [CrossRef]
  59. Wang, Z.F.; Sun, W.Y.; Yu, D.H.; Zhao, Y.; Xu, H.M.; He, Y.F.; Li, H.J. Rotundic acid enhances the impact of radiological toxicity on MCF-7 cells through the ATM/p53 pathway. Int. J. Oncol. 2018, 53, 2269–2277. [Google Scholar] [CrossRef]
  60. Moilanen, J.; Sinkkonen, J.; Salminen, J.P. Characterization of bioactive plant ellagitannins by chromatographic, spectroscopic and mass spectrometric methods. Chemoecology 2013, 23, 165–179. [Google Scholar] [CrossRef]
  61. Avula, B.; Wang, Y.H.; Wang, M.; Shen, Y.H.; Khan, I.A. Simultaneous determination and characterization of tannins and triterpene saponins from the fruits of various species of Terminalia and Phyllantus emblica using a UHPLC-UV-MS method: Application to triphala. Planta Med. 2013, 79, 181–188. [Google Scholar] [CrossRef] [PubMed]
  62. Hernández-Corroto, E.; Marina, L.; García, C. Extraction and identification by high resolution mass spectrometry of bioactive substances in different extracts obtained from pomegranate peel. J. Chromatogr. A 2019, 1594, 82–92. [Google Scholar] [CrossRef] [PubMed]
  63. Mi, Y.G.; Xu, X.Y.; Hong, L.L.; Jiang, M.T.; Chen, B.X.; Li, X.H.; Wang, H.D.; Zou, Y.D.; Zhao, X.; Li, Z.; et al. Comparative Characterization of the Ginsenosides from Six Panax Herbal Extracts and Their In Vitro Rat Gut Microbial Metabolites by Advanced Liquid Chromatography-Mass Spectrometry Approaches. J. Agric. Food Chem. 2023, 71, 9391–9403. [Google Scholar] [CrossRef] [PubMed]
  64. Tang, C.; Li, H.J.; Fan, G.; Kuang, T.T.; Meng, X.L.; Zou, Z.M.; Zhang, Y. Network pharmacology and UPLC-Q-TOF/MS studies on the anti-arthritic mechanism of Pterocephalus hookeri. Trop. J. Pharm. Res. 2018, 17, 1095–1110. [Google Scholar] [CrossRef]
  65. Geng, Z.; Wang, J.; Zhu, L.; Yu, X.; Zhang, Q.; Li, M.; Hu, B.; Yang, X. Metabolomics provide a novel interpretation of the changes in flavonoids during sea buckthorn (Hippophae rhamnoides L.) drying. Food. Chem. 2023, 413, 135598. [Google Scholar] [CrossRef] [PubMed]
Figure 1. UHPLC/UV chromatograms of Terminalia chebula extract (254 nm).
Figure 1. UHPLC/UV chromatograms of Terminalia chebula extract (254 nm).
Molecules 30 02451 g001
Figure 2. Construction of key ion database (A) and the identification of 164 chemical constituents in T. chebula using the KID-NLF strategy (B). The red dashed box marks components compliant with the KID strategy, the purple dashed box denotes those not only adhering to the KID strategy but also aligning with the NLF rules.
Figure 2. Construction of key ion database (A) and the identification of 164 chemical constituents in T. chebula using the KID-NLF strategy (B). The red dashed box marks components compliant with the KID strategy, the purple dashed box denotes those not only adhering to the KID strategy but also aligning with the NLF rules.
Molecules 30 02451 g002
Figure 3. The representative base peak intensity (BPI) chromatogram of T. chebula in negative ion mode. The box and arrow denote the enlarged section.
Figure 3. The representative base peak intensity (BPI) chromatogram of T. chebula in negative ion mode. The box and arrow denote the enlarged section.
Molecules 30 02451 g003
Figure 4. The main neutral loss fragment in T. chebula.
Figure 4. The main neutral loss fragment in T. chebula.
Molecules 30 02451 g004
Figure 5. Schematic putative fragmentation pattern of gallotannins. The positions of galloyl groups on the sugar structure are indicated by numbers in circular shapes of different colors. The number of galloyl groups in glycoside is shown by red dashed boxes.
Figure 5. Schematic putative fragmentation pattern of gallotannins. The positions of galloyl groups on the sugar structure are indicated by numbers in circular shapes of different colors. The number of galloyl groups in glycoside is shown by red dashed boxes.
Molecules 30 02451 g005
Figure 6. Schematic putative fragmentation pattern of galloyl derivatives of cinnamic acid. The positions of galloyl groups on the sugar structure are indicated by numbers in circular shapes of different colors. The number of galloyl groups in glycoside is shown by red dashed boxes.
Figure 6. Schematic putative fragmentation pattern of galloyl derivatives of cinnamic acid. The positions of galloyl groups on the sugar structure are indicated by numbers in circular shapes of different colors. The number of galloyl groups in glycoside is shown by red dashed boxes.
Molecules 30 02451 g006
Figure 7. Schematic putative fragmentation pattern of galloyl derivatives of shikimic acid. The positions of galloyl groups on the shikimic acid structure are indicated by numbers in circular shapes of different colors. The number of galloyl groups in shikimic acid is shown by red dashed boxes.
Figure 7. Schematic putative fragmentation pattern of galloyl derivatives of shikimic acid. The positions of galloyl groups on the shikimic acid structure are indicated by numbers in circular shapes of different colors. The number of galloyl groups in shikimic acid is shown by red dashed boxes.
Molecules 30 02451 g007
Figure 8. The putative fragmentation pathway of Punicalagin.
Figure 8. The putative fragmentation pathway of Punicalagin.
Molecules 30 02451 g008
Figure 9. The putative fragmentation pathway of chebulagic acid.
Figure 9. The putative fragmentation pathway of chebulagic acid.
Molecules 30 02451 g009
Table 1. Characterization of chemical constituents in T. chebula by UPLC-Q-TOF/MS. (a Isolated from T. chebula previously; b not isolated from T. chebula previously; c a newly discovered ingredient. Gallic acid derivatives; ellagitannins; triterpenoids; others. * Identified by reference standards; # MS/MS self-built library matching. In MS/MS fragment ions, bold characters with horizontal lines are KID, and bold characters only are NLF.).
Table 1. Characterization of chemical constituents in T. chebula by UPLC-Q-TOF/MS. (a Isolated from T. chebula previously; b not isolated from T. chebula previously; c a newly discovered ingredient. Gallic acid derivatives; ellagitannins; triterpenoids; others. * Identified by reference standards; # MS/MS self-built library matching. In MS/MS fragment ions, bold characters with horizontal lines are KID, and bold characters only are NLF.).
No.IdentificationFormulatR (min)Experimental (m/z)AdductsError (ppm)MS/MS Fragment Ions (m/z)
1 b,#,♠3-O-galloyl-glucose [17]C13H16O100.71331.0662[M–H]−0.9663.1391, 271.0435, 211.0234, 169.0134, 125.0235
2 a,*,♦Shikimic acid [18]C7H10O50.75173.0445[M–H]−2.9155.0342, 137.0235, 93.0341
3 a,#,♣Neochebulic acid [19]C14H12O110.76355.0298[M–H]−0.8711.0679, 337.0191, 293.0295, 249.0401, 205.0497
4 a,#,♣Chebulic acid [20]C14H12O110.93355.0303[M–H]0.6711.0682, 337.0194, 293.0297, 249.0402, 205.0501
5 b,#,♠3-galloylquinic acid [21]C14H16O101.05343.0661[M–H]−1.2389.0355, 191.0572, 169.0141, 125.0240
6 a,#,♠1-O-galloyl-glucose [22]C13H16O101.17331.0665[M–H]0.0663.1403, 271.0457, 211.0239, 169.0140, 125.0238
7 a,#,♠6-O-galloyl-glucose [20]C13H16O101.38331.0666[M–H]0.3663.1408, 271.0455, 211.0244, 169.0135, 125.0238
8 a,*,♠Gallic acid [22]C7H6O51.53169.0139[M–H]1.2125.0238
9 a,#,♠2-O-galloyl-glucose [23]C13H16O101.55331.0665[M–H]0.0663.1407, 271.0455, 211.0244, 169.0134, 125.0236
10 a,#,♣Gemin D [20]C27H22O181.59633.0726[M–H]−0.3463.0544, 300.9984, 275.0194, 169.0135, 125.0234
11 b,#,♠5-galloylquinic acid [21]C14H16O101.72343.0662[M–H]−0.9389.0356, 191.0542, 169.0138, 125.0236
12 a,#,♣Isochebulic acid [19]C14H12O111.78355.0299[M–H]−0.6711.0686, 337.0196, 293.0303, 249.0401, 205.0499
13 a,#,♣Punicalin α [24]C34H22O221.78781.0529[M–H]0.6600.9893, 448.9792, 300.9988
14 a,#,♣Punicalin β [24]C34H22O221.85781.0524[M–H]−0.4600.9888, 448.9778, 300.9974
15 a,#,♠4-O-galloyl-glucose [23]C13H16O101.88331.06663[M–H]−0.6663.1405, 271.0455, 211.0242, 169.0132, 125.0232
16 b,#,♦Caffeic acid 3,4-O-Di glucuronide [25]C21H24O161.93531.0993[M–H]1.31063.2059, 355.0307, 337.0200, 179.0710, 161.0603, 135.0446
17 b,#,♠4-galloylquinic acid [21]C14H16O102.19343.0665[M–H]0709.0888, 687.1402, 389.0350, 191.0535, 169.0132, 125.0238
18 b,#,♣Isostrictinin [26]C27H22O182.19633.0728[M–H]0.2463.0517, 300.9981, 275.0190, 169.0136, 125.0235
19 a,#,♣Punicacortein C [20]C48H28O302.331083.0585[M–H]−0.2541.0242, 1065.0491, 1021.0580, 600.9891, 499.0722, 300.9983, 169.0136, 125.0235
20 a,#,♣Punicacortein D [20]C48H28O302.521083.0599[M–H]1.1541.0250, 1065.0491, 1021.0573, 600.9891, 499.0722, 300.9983, 169.0136, 125.0235
21 a,#,♣7′-O-methyl chebulate [20]C15H14O112.57369.0460[M–H]0.5739.1009, 351.0338, 325.0563, 307.0460
22 a,#,♠4-O-galloyl-shikimic acid [20]C14H14O92.74325.0557[M–H]−0.9651.1190, 307.0451, 173.0440, 169.0136, 155.0342, 125.0235
23 a,#,♠1,4-di-O-galloyl-β-D-glucose [23]C20H20O142.78483.0778[M–H]0.6967.1592, 331.0670, 313.0560, 211.0242, 193.0137, 169.0135, 125.0234
24 a,#,♣Chebumeinin A [27]C27H26O202.78669.0943[M–H]0.6691.0753, 517.0834, 499.0733, 337.0200, 293.0300, 249.0397, 205.0500
25 a,#,♠5-O-galloyl-shikimic acid [20]C14H14O92.92325.0559[M–H]−0.3651.1192, 307.0451, 173.0440, 169.0136, 155.0342, 125.0235
26 a,#,♠3-O-galloyl-shikimic acid [20]C14H14O93.04325.0559[M–H]−0.3651.1185, 307.0451, 173.0440, 169.0136, 155.0342, 125.0235
27 a,#,♣Chebumeinin B [27]C27H26O203.21669.0939[M–H]0.0691.0773, 517.0753, 499.0724, 337.0219, 293.0300, 249.0402, 205.0500
28 a,#,♣6′-O-methyl chebulate [20]C15H14O113.45369.0453[M–H]−1.4739.0991, 351.0316, 325.0558, 307.0438
29 a,#,♣Valoneic acid dilactone [28]C21H10O133.57469.0050[M–H]−0.6939.0164, 425.0144, 407.0038, 300.9969, 299.9902, 169.0137, 125.0237
30 a,#,♠2,4-di-O-galloyl-β-D-glucose [29]C20H20O143.77483.0780[M–H]1.0967.1611, 331.0666, 313.0543, 211.0245, 193.0131, 169.0134, 125.0236
31 a,*,♣Punicalagin α [20]C48H28O303.841083.0583[M–H]−0.4541.0252, 781.0526, 600.9891, 300.9983, 169.0136, 125.0235
32 b,#,♠3,4-di-O-galloyl-β-D-glucose [30]C20H20O143.92483.0771[M–H]−0.8967.1619, 331.0651, 313.0544, 271.0446, 211.0243, 193.0128, 169.0134, 125.0235
33 a,#,♣Phyllanemblinin D [24]C27H26O204.10669.0940[M–H]0.1691.0770, 517.0822, 499.0734, 337.0210, 293.0293, 249.0396, 205.0501
34 b,#,♣Rhoipteleanin G [31]C48H30O304.391085.0745[M–H]0.1542.0319, 783.0687, 631.0569, 450.9943, 300.9985
35 a,#,♣Phyllanemblinin F [20]C27H26O204.45669.0939[M–H]0.0691.0776, 499.0734, 337.0196, 293.0295, 249.0397, 205.0500
36 a,#,♠2,6-di-O-galloyl-β-D-glucose [32]C20H20O144.59483.0778[M–H]0.6967.1616, 331.0658, 313.0567, 271.0459, 211.0241, 193.0138, 169.0134, 125.0239
37 a,#,♠4,6-di-O-galloyl-β-D-glucose [23]C20H20O144.92483.0776[M–H]0.2967.1622, 331.0672, 313.0565, 271.0455, 211.0243, 193.0134, 169.0134, 125.0236
38 a,#,♣Strictinin [26]C27H22O185.08633.0728[M–H]0.0463.0482, 300.9982, 275.0190, 169.0135, 125.0237
39 a,#,♠Methyl gallate [20]C8H8O55.41183.0290[M–H]−1.6169.0134, 168.0053, 125.0230, 124.0160
40 b,#,♠1,2-di-O-galloyl-β-D-glucose [33]C20H20O145.69483.0776[M–H]0.2967.1614, 331.0665, 313.0554, 271.0451, 211.0243, 193.0136, 169.0134, 125.0237
41 a,#,♠2,3-di-O-galloyl-β-D-glucose [23]C20H20O146.01483.0776[M–H]0.2967.1624, 331.0657, 313.0558, 271.0454, 211.0242, 193.0136, 169.0133, 125.0237
42 a,#,♣Phyllanemblinin E [20]C27H26O206.23669.0933[M–H]−0.9691.0761, 499.0725, 337.0203, 293.0300, 249.0397, 205.0497
43 a,*,♣Punicalagin β [20]C48H28O306.431083.0586[M–H]−0.1541.0252, 781.0526, 600.9891, 300.9983, 169.0136
44 a,#,♠1,6-di-O-galloyl-β-D-glucose [20]C20H20O146.79483.0780[M–H]1.0967.1631, 331.0668, 313.0558, 271.0455, 211.0242, 193.0136, 169.0134, 125.0237
45 a,#,♠3,6-di-O-galloyl-β-D-glucose [20]C20H20O147.21483.0782[M–H]1.4967.1636, 331.0666, 313.0560, 271.0458, 211.0243, 169.0135, 125.0238
46 a,#,♣Terflavin a [20]C48H30O307.731085.0756[M–H]1.1542.0331, 783.0696, 631.0578, 450.9946, 300.9987
47 a,#,♦Brevifolin carboxylic acid [20]C13H8O87.73291.0145[M–H]1.4337.0199, 247.0232, 203.0342
48 a,#,♠3,4-di-O-galloylshikimic acid [29]C21H18O138.03477.0672[M–H]0.6499.0485, 325.0577, 307.0452, 169.0135, 137.0237, 125.0238
49 a,#,♠1,3-di-O-galloyl-β-D-glucose [32]C20H20O148.21483.0777[M–H]0.4967.1615, 331.0655, 313.0557, 271.0454, 211.0240, 169.0136, 125.0237
50 b,#,♣Hippomanin A [34]C27H22O188.35633.0732[M–H]0.6463.0542, 300.9984, 275.0194, 169.0138, 125.0237
51 b,#,♠1,2,4-tri-O-galloyl-β-D-glucose [35]C27H24O188.57635.0885[M–H]0.2317.0391, 657.0715, 483.0771, 465.0676, 313.0557, 295.0457, 169.0135, 125.0235
52 a,#,♣Amlaic acid [32]C27H24O199.48651.0836[M–H]0.3325.0374, 633.0739, 481.0624, 337.0210, 319.0083, 275.0190, 169.0135
53 a,#,♣Methyl neochebulanin [20](4-O-methyl neochebulate-1-O-galloyl-glucose)C28H28O209.48683.1100[M–H]0.6705.0916, 341.0500, 513.0892, 351.0351, 307.0457, 263.0559, 219.0293, 204.0395,
54 b,#,♦Phelligridin J [36]C13H6O89.98288.9984[M–H]1.4245.0086
55 b,#,♠1,3,4-tri-O-galloyl-β-D-glucose [37]C27H24O1810.19635.0891[M–H]1.1657.0726, 317.0398, 483.0783, 465.0667, 313.0562, 295.0447, 169.0134, 125.0237
56 c,♣2-O-methyl neochebulate-1-O-galloyl-glucoseC28H28O2010.48683.1098[M–H]0.3705.0890, 341.0493, 513.0911, 351.0357, 307.0466, 263.0541, 219.0296, 204.0395
57 a,#,♠1,2,6-tri-O-galloyl-β-D-glucose [23]C27H24O1810.90635.0886[M–H]0.3657.0710, 317.0399, 483.0784, 465.0669, 313.0558, 295.0454, 169.0134, 125.0239
58 b,#,♣Carpinusnin [38]C41H34O2911.10989.1116[M–H]0.8494.0514, 651.0831, 481.0624, 337.0194
59 a,#,♣Tercatain [20]C34H26O2211.56785.0833[M–H]−0.5807.0775, 392.0367, 615.0608, 483.0765, 463.0505, 445.0401, 300.9980
60 a,#,♠3,4,6-tri-O-galloyl-β-D-glucose [20]C27H24O1811.69635.0884[M–H]0657.0705, 317.0396, 483.0775, 465.0668, 313.0557, 295.0449, 169.0133, 125.0237
61 b,#,♠3,5-di-O-galloylshikimic acid [39]C21H18O1311.92477.0671[M–H]0.4499.0492, 325.0577, 307.0466, 169.0133, 137.0235, 125.0236
62 a,#,♣Chebulanin [20]C27H24O1912.17651.0833[M–H]−0.2325.0369, 633.0728, 481.0620, 337.0202, 319.0089, 275.0195, 169.0136, 125.0236
63 c,♣6-O-methyl neochebulate-1-O-galloyl-glucoseC28H28O2012.17683.1091[M–H]−0.7705.0886, 341.0510, 513.0881, 351.0343, 307.0459, 263.0560, 219.0291, 204.0392
64 c,♣4-galloyl-6-neochebuloyl-2,3-HHDP-glucoseC41H32O2812.17971.1006[M–H]0.4485.0460, 953.0898, 935.0793, 801.0782, 669.0914, 633.0727, 499.0726, 463.0513, 337.0202, 300.9981
65 b,#,♠4,5-di-O-galloylshikimic acid [39]C21H18O1312.89477.0672[M–H]0.6499.0485, 325.0577, 307.0452, 169.0135, 137.0237, 125.0238
66 c,♣3-O-methyl neochebulate-1-O-galloyl-glucoseC28H28O2013.02683.1088[M–H]−1.2705.0922, 341.0499, 513.0967, 351.0341, 307.0438, 263.0558, 219.0274, 204.0385
67 c,♣1-galloyl-2-neochebuloyl-4,6-HHDP-glucoseC41H32O2813.29971.1009[M–H]0.7485.0450, 953.0893, 935.0793, 801.0742, 669.0939, 633.0729, 499.0729, 463.0508, 337.0195, 300.9981
68 a,#,♣Tellimagrandin I [20]C34H26O2213.32785.0848[M–H]1.4807.0668, 392.0373, 633.0729, 483.0779, 463.0504, 445.0413, 300.9980
69 a,#,♦Urolithin M5 [40]C13H8O713.32275.0195[M–H]1.1551.0461, 257.0085, 229.0139, 201.0183
70 c,♣1-galloyl-3-neochebuloyl-4,6-HHDP-glucoseC41H32O2813.67971.1014[M–H]1.2485.0457, 953.0887, 935.0786, 801.0779, 669.0923, 633.0719, 499.0723, 463.0506, 337.0191, 300.9980
71 a,*,♣Corilagin [20]C27H22O1814.07633.0724[M–H]−0.6463.0514, 300.9985, 275.0189, 169.0135, 125.0235
72 a,#,♠1,4,6-tri-O-galloyl-β-D-glucose [23]C27H24O1815.17635.0880[M–H]−0.6317.0391, 657.0696, 483.0782, 465.0675, 313.0558, 295.0453, 169.0134, 125.0236
73 b,#,♠2,3,4-tri-O-galloyl-β-D-glucose [41]C27H24O1815.71635.0881[M–H]−0.5317.0393, 657.0696, 483.0773, 465.0668, 313.0555, 295.0451, 169.0133, 125.0238
74 c,♣1-galloyl-2-neochebuloyl-3,6-HHDP-glucoseC41H32O2815.96971.1000[M–H]−0.2485.0450, 953.0904, 935.0748, 801.0808, 669.1053, 633.0721, 499.0756, 463.0502, 337.0193, 300.9983
75 b,#,♠2,4,6-tri-O-galloyl-β-D-glucose [42]C27H24O1816.16635.0881[M–H]−0.5317.0395, 657.0709, 483.0776, 465.0690, 313.0558, 295.0443, 169.0134, 125.0240
76 b,#,♣1,3-di-O-galloyl-4,6-HHDP-glucose [43]C34H26O2216.25785.0848[M–H]1.4807.0668, 392.0373, 633.0729, 483.0779, 463.0504, 445.0413, 300.9980
77 a,#,♠1,3,6-tri-O-galloyl-β-D-glucose [22]C27H24O1816.68635.0872[M–H]−1.9317.0391, 657.0693, 483.0772, 465.0667, 313.0555, 295.0449, 169.0134, 125.0234
78 a,#,♣1,3-di-O-galloyl-2,4-chebuloyl-D-glucose [44]C34H28O2317.10803.0942[M–H]−0.1825.0756, 401.0424, 785.0836, 633.0723, 589.0815,533.0569, 483.0198, 313.0563
79 a,#,♣Neochebulagic acid [20](1-galloyl-4-neochebuloyl-3,6-HHDP-glucose)C41H32O2817.57971.0999[M–H]−0.3485.0457, 953.0826, 935.0793, 801.0682, 669.1024, 633.0735, 499.0777, 463.0483, 337.0194, 300.9984
80 a,#,♠1,2,3,6-tetra-O-galloyl-β-D-glucose [20]C34H28O2217.62787.0999[M–H]0.6809.0811, 393.0450, 635.0889, 617.0783, 483.0782, 465.0676, 447.0562, 313.0555, 295.0450
81 a,#,♣Tellimagrandin II [18]C41H30O2617.82937.0959[M–H]1.3468.0425, 767.0801, 635.0894, 465.0661, 313.0561, 300.9982
82 b,#,♠2,3,6-tri-O-galloyl-β-D-glucose [35]C27H24O1817.97635.0879[M–H]−0.8317.0401, 657.0699, 483.0765, 465.0678, 313.0568, 295.0454, 169.0136, 125.0237
83 c,♣1,2,3-tri-O-galloyl-4-neochebuloyl-D-glucoseC41H34O2818.15973.1159[M–H]0.1486.0532, 803.0942, 635.0881, 633.0729, 483.0779, 465.0667, 463.0511, 337.0193, 300.9985
84 a,#,♣Terchebulin [45]C48H28O3019.001083.0604[M–H]1.6541.0252, 781.0580, 600.9910, 448.9790, 300.9985, 169.0136, 125.0236
85 a,#,♠1,2,3-tri-O-galloyl-β-D-glucose [23]C27H24O1819.24635.0880[M–H]−0.6657.0691, 317.0393, 483.0773, 465.0668, 313.0558, 295.0443, 169.0133, 125.0238
86 c,♣1-O-galloyl-3,4-chebuloyl-2,6-HHDP-D-glucoseC41H30O2719.65953.0919[M–H]2.4476.0416, 783.0739, 651.0840, 633.0735, 481.0611, 463.0504, 337.0199, 331.0665, 319.0095, 300.9988
87 c,♣1-galloyl-4-neochebuloyl-2,3-HHDP-glucoseC41H32O2819.92971.1003[M–H]0.1485.0459, 953.0898, 935.0773, 801.0789, 669.0975, 633.0717, 499.0704, 463.0513, 337.0197, 300.9984
88 a,*,♣Ellagic acid [20]C14H6O820.04300.9987[M–H]1.0283.9958, 257.0087, 229.0122, 201.0182, 185.0236, 173.0223, 145.0288, 117.0326
89 c,♣1-O-galloyl-3,4-THDP-2,6-HHDP-D-glucoseC40H30O2620.13925.0954[M–H]0.8462.0426, 773.0887, 755.0728, 633.0735, 463.0518, 300.9986
90 c,♣1-O-galloyl-3,6-chebuloyl-2,4-HHDP-D-glucoseC41H30O2720.27953.0895[M–H]−0.1476.0406, 783.0743, 651.0848, 633.0744, 481.0629, 463.0519, 337.0198, 331.0643, 319.0092, 300.9991
91 b,#,♣Punicafolin [38]C41H30O2620.29937.0961[M–H]1.5468.0435, 767.0743, 635.0884, 465.0688, 313.0545, 300.9987
92 c,♣1,6-di-O-galloyl-2,4-chebuloyl-D-glucoseC34H28O2320.37803.0957[M–H]1.7401.0431, 825.0756, 785,0836, 633.0723, 589.0815, 533.0569, 483.0198, 313.0563
93 b,#,♣Phyllantusiin C [30]C40H30O2620.37925.0955[M–H]−0.3462.0449, 773.0849, 755.0762, 633.0731, 463.0514, 300.9985
94 a,♣1,3,6-tri-O-galloyl-4-neochebuloyl-D-glucose [29]C41H34O2820.50973.1162[M–H]0.4486.0536, 803.0956, 635.0879, 633.0724, 483.0755, 465.0685, 463.0514, 337.0192, 300.9984
95 c,♣1-O-galloyl-4,6-chebuloyl-2,3-HHDP-D-glucoseC41H30O2720.60953.0893[M–H]−0.3476.0412, 783.0712, 651.0827, 633.0726, 481.0653, 463.0526, 337.0191, 331.0658, 319.0093, 300.9987
96 c,♣1-O-galloyl-3,6-THDP-2,4-HHDP-D-glucoseC40H30O2620.77925.0959[M–H]1.3462.0430, 773.0862, 755.0744, 633.0735, 463.0521, 300.9988
97 b,#,♣Davidiin [46]C41H30O2620.86937.0964[M–H]1.8468.0435, 767.0731, 635.0885, 465.0673, 313.0563, 300.9987
98 c,♣3,6-di-O-galloyl-2,4-chebuloyl-D-glucoseC34H28O2320.90803.0961[M–H]2.2825.0776, 401.0432, 785.0864, 633.0743, 589.0851, 533.0582, 483.0163, 313.0558
99 c,♠3,4,5-tri-O-galloyl shikimic acidC28H22O1820.90629.0789[M–H]1.6477.0655, 459.0561, 325.0501, 307.0443, 289.0357, 169.0138
100 c,♣1,2,6-tri-O-galloyl-4-neochebuloyl-D-glucoseC41H34O2820.94973.1168[M–H]1.0486.0546, 803.0958, 635.0894, 633.0729, 483.0772, 465.0677, 463.0529, 337.0198, 300.9986
101 c,♣2-galloyl-3-neochebuloyl-4,6-HHDP-glucoseC41H32O2820.94971.1013[M–H]1.1485.0463, 953.0900, 935.0801, 801.0813, 669.0943, 633.0731, 499.0749, 463.0515, 337.0198, 300.9986
102 a,#,♣Eschweilenol C [20]C20H16O1221.01447.0573[M–H]2.0895.1224, 300.9984
103 c,♣1-O-galloyl-4,6-THDP-2,3-HHDP-D-glucoseC40H30O2621.07925.0956[M–H]1.0462.0424, 773.0867, 755.0684, 633.0731, 463.0518, 300.9983
104 c,♣1,3,4-tri-O-galloyl-2,6-HHDP-glucoseC41H30O2621.10937.0964[M–H]1.8468.0437, 767.0753, 635.0886, 465.0679, 313.0557, 300.9991
105 a,*,♣Chebulagic acid [22]C41H30O2721.15953.0896[M–H]1.7476.0415, 783.0693, 651.0840, 633.0735, 481.0627, 463.0518, 337.0198, 331.0667, 319.0092, 300.9990
106 c,♣2,3,6-tri-O-galloyl-4-neochebuloyl-D-glucoseC41H34O2821.29973.1172[M–H]1.4486.0541, 803.0955, 635.0891, 633.07235, 483.0763, 465.0677, 463.0523, 337.0198, 300.9992
107 b,#,♣Pterocarinin C [47]C41H30O2621.34937.0959[M–H]1.3468.0432, 767.0742, 635.0887, 465.0670, 313.0559, 300.9989
108 a,#,♠1,2,3,4-tetra-O-galloyl-β-D-glucose [23]C34H28O2221.35787.0999[M–H]0.6809.0834, 393.0457, 635.0889, 617.0782, 483.0770, 465.0676, 447.0555, 313.0556, 295.0450
109 c,♣1-galloyl-6-neochebuloyl-2,3-HHDP-glucoseC41H32O2821.39971.1014[M–H]1.2485.0464, 953.0909, 935.0782, 801.0809, 669.1044, 633.0738, 499.0724, 463.0503, 337.0197, 300.9990
110 b,#,♣1,6-di-O-galloyl-2,3-HHDP-glucose [48]C34H26O2221.47785.0848[M–H]1.4807.0668, 392.0373, 633.0729, 483.0779, 463.0504, 445.0413, 300.9980
111 c,♣1,2,3-tri-O-galloyl-4-methyl neochebuloyl-glucoseC42H36O2821.74987.1318[M–H]0.3493.0620, 817.1097, 635.0881, 465.0668, 351.0352, 295.0450
112 a,#,♠1,2,4,6-tetra-O-galloyl-β-D-glucose [23]C34H28O2221.85787.0994[M–H]0.0809.0809, 393.0447, 635.0881, 617.0773, 483.0768, 465.0674, 447.0560, 313.0555, 295.0451
113 a,#,♣1-O-galloyl-3,6-HHDP-4-6′ methyl neochebuloyl-glucose [49]C42H34O2821.94985.1156[M–H]−0.2492.0533, 815.0938, 683.0846, 633.0719, 513.0870, 463.0513, 351.0342, 300.9983
114 a,#,♠1,3,4,6-tetra-O-galloyl-β-D-glucose [20]C34H28O2222.00787.0995[M–H]0.1809.0819, 393.0447, 635.0874, 617.0775, 483.0771, 465.0668, 447.0558, 313.0556, 295.0449
115 c,♣2-O-galloyl-3,6-HHDP-4-6′ methyl neochebuloyl-glucoseC42H34O2822.13985.1154[M–H]−0.4492.0534, 815.0939, 683.0846, 633.0718, 513.0867, 463.0512, 351.0346, 300.9983
116 c,♣1-O-galloyl-2,3-THDP-4,6-HHDP-D-glucoseC40H30O2622.20925.0941[M–H]−0.6462.0423, 773.0825, 755.0762, 633.0716, 463.0508, 300.9982
117 a,#,♣1,2-di-O-galloyl-4,6-HHDP-glucose [50]C34H26O2222.29785.0833[M–H]−0.5807.0775, 392.0367, 615.0608, 483.0765, 463.0505, 445.0401, 300.9980
118 c,♣2-galloyl-4-neochebuloyl-3,6-HHDP-glucoseC41H32O2822.49971.1006[M–H]0.4485.0472, 953.0893, 935.0806, 801.0945, 669.0826, 633.0731, 499.0716, 463.0505, 337.0193, 300.9982
119 c,♣1-O-galloyl-2,6-chebuloyl-3,4-HHDP-D-glucoseC41H30O2722.65953.0883[M–H]−1.4476.0383, 783.0655, 651.0851, 633.0722, 481.0618, 463.0501, 337.0194, 331.0634, 319.0079, 300.9979
120 b,#,♠2,3,4,6-tetra-O-galloyl-β-D-glucose [51]C34H28O2222.66787.0986[M–H]−1.0809.0819, 393.0447, 635.0871, 617.0767, 483.0764, 465.0662, 447.0559, 313.0554, 295.0446
121 c,♣6-galloyl-4-neochebuloyl-2,3-HHDP-glucoseC41H32O2822.84971.0999[M–H]−0.3485.0458, 953.0894, 935.0800, 801.0879, 669.0791, 633.0726, 499.0721, 463.0498, 337.0190, 300.9981
122 c,♣1,2,3-tri-O-galloyl-4,6-neochebuloyl-glucoseC41H32O2723.03955.1048[M–H]−0.5477.0480, 785.0828, 617.0756, 465.0661, 447.0555, 337.0194
123 c,♣1-O-galloyl-2,3-chebuloyl-4,6-HHDP-D-glucoseC41H30O2723.76953.0882[M–H]−1.5476.0395, 783.0699, 651.0839, 633.0712, 481.0641, 463.0492, 337.0190, 331.0685, 319.0087, 300.9981
124 c,♣1,2,4-tri-O-galloyl-3,6-neochebuloyl-glucoseC41H32O2723.82955.1045[M-H]−0.8477.0481, 785.0828, 617.0765, 465.0669, 447.0562, 337.0189
125 a,#,♣1,3,6-tri-O-galloyl-4-methyl neochebuloyl-glucose [20]C42H36O2823.93987.1302[M–H]1.1493.0611, 817.1092, 635.0872, 465.0662, 351.0343, 295.0445
126 c,♣1,2,6-tri-O-galloyl-4-methyl neochebuloyl-glucoseC42H36O2824.36987.1313[M–H]−0.2493.5631, 817.1094, 635.0876, 465.0673, 351.0338, 295.0440
127 a,*,♣Chebulinic acid [20](1,3,6-tri-O-galloyl-2,4-chebuloyl-glucose)C41H32O2724.57955.1054[M–H]0.1477.0486, 785.0839, 635.0873, 617.0771, 465.0668, 447.0558, 337.0190
128 c,♣2,3,6-tri-O-galloyl-4-methyl neochebuloyl-glucoseC42H36O2825.47987.1324[M–H]0.9493.0623, 817.1107, 635.0883, 465.0674, 351.0348, 295.0458
129 c,♣1,4,6-tri-O-galloyl-2,3-neochebuloyl-glucoseC41H32O2725.47955.1041[M–H]−1.3477.0491, 785.0850, 617.0781, 465.0674, 447.0564, 337.0197
130 a,*,♠1,2,3,4,6-penta-O-galloyl-β-D-glucose [22]C41H32O2626.73939.1113[M–H]1.0469.0515, 787.1005, 769.0900, 617.0784, 599.0676, 465.0677, 447.0568, 429.0460, 313.0565, 295.0456, 277.0352, 259.0245, 169.0136, 125.0237
131 a,#,♣Terchebin [52]C41H30O2728.19953.0901[M–H]0.5935.0799, 917.0695, 617.0781, 465.0671, 316.9932, 295.0452
132 a,#,♣4-O-(4″-O-galloyl-α-rhamnopyranosyl) ellagic acid [49]C27H20O1628.94599.0673[M–H]0.01199.1431, 621.0486, 447.0560, 429.0441, 300.9985
133 c,♣1-O-galloyl-2,6-THDP-3,4-HHDP-D-glucoseC40H30O2629.04925.0959[M–H]1.3462.0432, 773.0815, 755.0752, 633.0731, 463.0506, 300.9982
134 a,#,♣4-O-(2″,3″-di-O-galloyl-α-L-rhamnosyl) ellagic acid [20]C34H24O2029.31751.0782[M–H]−0.1375.0342, 599.0668, 581.0553, 449.0718, 411.0366, 300.9985
135 b,#,♣Nupharin A [53]C41H30O2629.35937.0955[M–H]0.9468.0433, 767.0732, 635.0894, 465.0674, 313.0573, 300.9988
136 c,♠1-O-cinnamoyl-6-O-galloyl-glucoseC22H22O1129.39461.1084[M–H]0313.0558, 169.0135, 147.0436
137 a,#,♣4-O-(2″,4″-di-O-galloyl-α-L-rhamnosyl) ellagic acid [20]C34H24O2029.42751.0782[M–H]−0.1375.0342, 599.0668, 581.0553, 449.0718, 411.0366, 300.9985
138 a,#,♣4-O-(3″,4″-di-O-galloyl-α-rhamnopyranosyl) ellagic acid [49]C34H24O2029.50751.0776[M–H]−0.9375.0338, 599.0660, 581.0549, 449.0713, 411.0292, 300.9977
139 c,♥Arjungenin-24-O-glucoheptonic acidC37H60O1329.59711.3952[M–H]−0.6503.3379
140 b,#,♠1-O-galloyl-2-O-cinnamoyl-glucose [54]C22H22O1129.73461.1092[M–H]1.7923.2247, 313.0561, 169.0133, 147.0443
141 a,#,♥Quercotriterpenoside I [55]C43H62O1529.73817.4018[M–H]1.0655.3484, 503.3366
142 b,#,♠1-O-galloyl-6-O-cinnamoyl-glucose [54]C22H22O1129.81461.1077[M–H]−1.5313.0562, 169.0132, 147.0443
143 a,#,♠1,2-O-galloyl-6-O-cinnamoyl-glucose [20]C29H26O1529.82613.1190[M–H]−0.5465.0682, 461.1077, 313.0560, 169.0131, 147.0443
144 b,#,♠1-O-cinnamoyl-2-O-galloyl-glucose [56]C22H22O1129.95461.10978[M–H]−1.3313.0549, 169.0132, 147.0434
145 c,♥Madecassic acid-24-galloyl-28-glucoseC43H62O1530.13817.4008[M–H]−0.2655.3470, 503.3369
146 b,#,♠1-O-cinnamoyl-2,6-O-galloyl-glucose [57]C29H26O1530.19613.1190[M–H]−0.5465.0667, 461.1085, 313.0558, 169.0131, 147.0441
147 a,#,♥Terminolic acid-24-galloyl-28-glucose [56]C43H62O1530.27817.3998[M–H]−1.5655.3473, 503.3361
148 c,♥Rotundic acid-24-galloyl-28-glucoseC43H62O1430.30801.4066[M–H]0.6639.3536, 487.3401
149 c,♥Madecassic acid-24-O-glucoheptonic acidC37H60O1330.34711.3954[M–H]−0.3503.3372
150 c,♥Rotundic acid-24-O-glucoheptonic acidC37H60O1230.37695.3983[M–H]−3.5487.3413
151 c,♥Asiatic acid-24-galloyl-28-glucoseC43H62O1430.39801.4073[M–H]1.5639.3538, 487.3424
152 a,#,♠1,6-O-galloyl-2-O-cinnamoyl-glucose [20]C29H26O1530.39613.1200[M–H]1.1465.0645, 461.1104, 313.0572, 169.0134, 147.0451
153 c,♥Terminolic acid-24-O-glucoheptonic acidC37H60O1330.43711.3954[M–H]−0.3503.3377
154 c,♥Asiatic acid-24-O-glucoheptonic acidC37H60O1230.47695.4011[M–H]0.6487.3428
155 c,♥Arjunolic acid-24-galloyl-28-glucoseC43H62O1430.51801.4060[M–H]1–0.1639.3539, 487.3427
156 c,♥Arjunolic acid-24-O-glucoheptonic acidC37H60O1230.57695.4019[M–H]0.6487.3432
157 a,#,♠1,2,3-tri-O-galloyl-6-O-cinnamoyl-β- D-glucose [20]C36H30O1930.57765.1310[M–H]0.9811.4279, 787.1131, 635.0932, 617.0841, 613.1192, 595.1093, 443.0975
158 a,#,♥Arjungenin [40]C30H48O631.12503.3384[M–H]2.21007.6838, 549.3438
159 a,#,♥Madecassic acid [28]C30H48O631.56503.3387[M–H]2.81007.6841, 549.3439
160 a,#,♥23-galloyl-arjunolic acid [58]C37H52O931.71639.3547[M–H]2.2487.3378
161 a,#,♥Terminolic acid [55]C30H48O631.77503.3387[M–H]2.81007.6847, 549.3439
162 b,#,♥Rotundic acid [59]C30H48O531.90487.3437[M–H]2.9975.6937
163 a,#,♥Asiatic acid [32]C30H48O532.09487.3438[M–H]3.1975.6949
164 a,#,♥Arjunolic acid [18]C30H48O532.35487.3434[M–H]2.3975.6979
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yu, J.; Zhao, X.; He, Y.; Zhang, Y.; Tang, C. An Innovative Strategy for Untargeted Mass Spectrometry Data Analysis: Rapid Chemical Profiling of the Medicinal Plant Terminalia chebula Using Ultra-High-Performance Liquid Chromatography Coupled with Q/TOF Mass Spectrometry–Key Ion Diagnostics–Neutral Loss Filtering. Molecules 2025, 30, 2451. https://doi.org/10.3390/molecules30112451

AMA Style

Yu J, Zhao X, He Y, Zhang Y, Tang C. An Innovative Strategy for Untargeted Mass Spectrometry Data Analysis: Rapid Chemical Profiling of the Medicinal Plant Terminalia chebula Using Ultra-High-Performance Liquid Chromatography Coupled with Q/TOF Mass Spectrometry–Key Ion Diagnostics–Neutral Loss Filtering. Molecules. 2025; 30(11):2451. https://doi.org/10.3390/molecules30112451

Chicago/Turabian Style

Yu, Jia, Xinyan Zhao, Yuqi He, Yi Zhang, and Ce Tang. 2025. "An Innovative Strategy for Untargeted Mass Spectrometry Data Analysis: Rapid Chemical Profiling of the Medicinal Plant Terminalia chebula Using Ultra-High-Performance Liquid Chromatography Coupled with Q/TOF Mass Spectrometry–Key Ion Diagnostics–Neutral Loss Filtering" Molecules 30, no. 11: 2451. https://doi.org/10.3390/molecules30112451

APA Style

Yu, J., Zhao, X., He, Y., Zhang, Y., & Tang, C. (2025). An Innovative Strategy for Untargeted Mass Spectrometry Data Analysis: Rapid Chemical Profiling of the Medicinal Plant Terminalia chebula Using Ultra-High-Performance Liquid Chromatography Coupled with Q/TOF Mass Spectrometry–Key Ion Diagnostics–Neutral Loss Filtering. Molecules, 30(11), 2451. https://doi.org/10.3390/molecules30112451

Article Metrics

Back to TopTop