Next Article in Journal
Assay-Dependent Variability of Antioxidant Responses in Hop Extracts: Implications for Cross-Study Comparability and Antioxidant Interpretation
Previous Article in Journal
Star-like Cobalt Sulfide Nanoarrays Coupled with Fe Single-Atom Catalyst as Binder-Free Integrated Cathodes for Efficient and Robust Seawater Zinc–Air Batteries
Previous Article in Special Issue
Chemical Profiling of Aboveground and Underground Parts of Pterocephalus hookeri by Integrated FBMN, Untargeted LC-MS Metabolomics, and PAD-DESI-MSI
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Dual-Mode Native Mass Spectrometry Screening Identifies Ginsenoside Ligands of 6-Hydroxymethyl-7,8-Dihydropterin Pyrophosphokinase (HPPK)

1
Institute for Biomedicine and Glycomics, Griffith University, Southport, Gold Coast, QLD 4215, Australia
2
Centre for Cell Factories and Biopolymers, Institute for Biomedicine and Glycomics, Griffith University, Southport, Gold Coast, QLD 4215, Australia
3
Center for Global Infectious Disease Research, Seattle Children’s Research Institute, Seattle, WA 98109, USA
4
School of Environment and Science, Griffith University, Nathan, Brisbane, QLD 4111, Australia
*
Author to whom correspondence should be addressed.
Molecules 2026, 31(12), 2065; https://doi.org/10.3390/molecules31122065
Submission received: 30 April 2026 / Revised: 6 June 2026 / Accepted: 9 June 2026 / Published: 12 June 2026
(This article belongs to the Special Issue Application of Mass Spectrometry Techniques in Analytical Chemistry)

Abstract

Identification of ligands targeting essential enzymes in Mycobacterium species remains an important strategy for anti-tuberculosis drug discovery. Here, a native mass spectrometry approach was employed using pooled 100-compound mixtures, enabling the direct detection of intact HPPK–ligand complexes in solution. Dual-mode MS acquisitions (low collision energy for complex detection and high collision energy for ligand confirmation), combined with an automated data analysis workflow, ensured robust identification of binding events from these complex samples. This strategy led to the identification of several HPPK-binding small molecules, all belonging to the dammarane triterpene glycoside (ginsenoside) class. Subsequent analysis of the hits revealed clear structure–affinity relationships, highlighting how specific aglycone modifications and glycosylation patterns influence binding to HPPK. Our findings expand the known chemical space of HPPK ligands and demonstrate the utility of native MS-based screening coupled with automated data analysis to uncover new ligand scaffolds for challenging enzyme targets.

1. Introduction

Tuberculosis (TB) remains a global health emergency [1]. In 2024 alone, an estimated 10.7 million new cases of TB were recorded worldwide, with approximately 1.23 million deaths, making Mycobacterium tuberculosis (Mtb) the leading cause of mortality from a single infectious agent [2]. The rise of multidrug-resistant TB (MDR-TB) has further underscored the urgent need for novel therapeutic strategies beyond the current first-line drug regimen [3,4]. A key component of these efforts is the identification of new drug targets in Mycobecterium species and the discovery of new chemical scaffolds capable of modulating those targets [5,6].
One promising area of focus is the folate biosynthesis pathway in Mycobacterium, which is essential for bacterial growth and absent in humans [7,8]. Traditional antifolate drugs (e.g., sulfonamides targeting dihydropteroate synthase and trimethoprim targeting dihydrofolate reductase) have been effective antibiotics, but emerging resistance has limited their use in modern TB therapy [9,10]. 6-Hydroxymethyl-7,8-dihydropterin pyrophosphokinase (HPPK), encoded by the folK gene (Rv3606c), catalyzes the ATP-dependent transfer of a pyrophosphate group to 6-hydroxymethyl-7,8-dihydropterin [11,12,13]. Like other upstream folate enzymes, HPPK is essential to Mtb’s viability yet has no human counterpart, making it an appealing selective drug target [14,15]. Despite this, Mtb HPPK and related upstream enzymes have so far remained underexplored relative to the well-studied downstream targets DHPS and DHFR [16,17]. Targeting such a unique enzyme could circumvent existing resistance mechanisms and expand the arsenal against TB [18].
In the search for new inhibitors or ligands of HPPK, natural products present a rich source of chemical diversity [19]. Natural product libraries offer a wide variety of complex scaffolds that differ markedly from synthetic libraries and may reveal novel binding chemotypes [20,21]. However, conventional HTS of pure compound libraries typically tests one compound per well using an indirect functional readout, a process that is relatively slow, resource-intensive, and prone to ambiguous outcomes or false signals due to assay interference or nonspecific compound effects [22,23].
Affinity selection mass spectrometry (ASMS) has addressed some of these limitations by enabling pooled-library screening and MS-based ligand identification, and it is now a well-established strategy for ligand discovery [24]. In typical ASMS workflows, protein–ligand mixtures are subjected to separation steps to isolate bound species, followed by identification of the associated ligands. While highly effective for identifying binders from complex mixtures, this workflow involves multiple processing steps, and binding is indirectly inferred from the presence of retained ligands after separation.
Native mass spectrometry (native MS) is a label-free, fast, and accurate method that permits the direct observation of non-covalent and covalent protein–ligand complexes [25]. The technique relies on non-denaturing electrospray-ionization (ESI) to recognize multi-charged proteins in their near-native states and is fast emerging as an advanced structural biology tool in drug discovery [26]. In the resulting mass spectrum, the difference between the mass-to-charge ratio (Δ m/z) for the protein–ligand complex and the unbound protein ions multiplied by the charge state (z) directly affords the molecular weight of the bound ligand (MWligand = Δ m/z × z). The specificity of the interaction allows the molecular weight (MW) determination of ligands in complex mixtures, such as natural product extracts/fractions [27,28], pooled compound libraries [29] and pooled fragment libraries [30].
Beyond hit identification, native MS provides direct information on binding stoichiometry, complex heterogeneity, oligomeric states, and relative binding affinities [31]. Native MS can also provide estimates of apparent binding affinities by quantifying the relative abundances of apo and ligand-bound protein species observed in the mass spectrum. For protein–small molecule interactions, it is generally assumed that ligand binding does not substantially alter the ionization behavior of the protein, allowing relative signal intensities to serve as an approximation of binding populations in solution [32]. Under these assumptions, native MS has been widely applied for affinity measurements and ranking of ligand-binding strengths [33]. Recent developments in instrumentation, such as ultra-high mass range Orbitrap platforms [34], ion mobility spectrometry [35], and surface-induced dissociation [36], have further expanded the analytical capability of native MS for the characterization of increasingly large and heterogeneous biomolecular assemblies.
To date, the majority of native MS ligand discovery studies have focused on soluble proteins, including enzymes, transcription factors, viral proteins, and protein–protein interaction targets, owing to their relatively straightforward sample preparation and gas-phase stability. However, significant progress has recently been made in extending native MS to membrane proteins, which remain highly challenging targets in drug discovery [37,38]. Advances in detergent selection [39], amphipols [40], nanodiscs [41], and other membrane mimetic systems have enabled the preservation and transfer of intact membrane protein complexes into the gas phase while retaining bound lipids and ligands. These developments have facilitated native MS studies of G protein-coupled receptors (GPCRs) [42,43], ion channels [44,45], and other pharmaceutically important membrane proteins, allowing direct interrogation of ligand binding, lipid interactions, allosteric modulation, and conformational stability.
Native MS has also emerged as a valuable platform for studying higher-order protein assemblies and dynamic interactions in increasingly complex biological systems [46]. Applications now include characterization of PROTAC-mediated ternary complexes [47,48], protein–nucleic acid assemblies [49], intact viral particles [50], and endogenous ligand binding directly from partially purified biological samples [51,52]. Together, these advances position native MS as a versatile and rapidly evolving technology for ligand discovery and mechanistic investigation across a broad range of therapeutic targets.
In this study, we applied a high-throughput native MS screening approach to identify potential ligands of Mycobacterium smegmatis HPPK from a large, structurally diverse natural product collection. M. smegmatis HPPK was used as a non-pathogenic surrogate enzyme for initial hit finding because it is an Rv3606c ortholog with substantial sequence conservation to M. tuberculosis HPPK, with approximately 62% amino-acid identity and 73% positive similarity over aligned residues (Figure S1) [53]. Importantly, key catalytic residues and active-site features of HPPK are conserved between the two enzymes, supporting preservation of the catalytic mechanism and ligand-binding environment.
By directly observing HPPK–small molecule complexes under physiologically relevant conditions, we aimed to uncover previously unrecognized ligand scaffolds for HPPK and gain insight into the structural features that govern these binding interactions. This target-based, label-free screening strategy allowed us to rapidly highlight promising HPPK binders for further characterization, demonstrating an effective route to probe the ligandability of a challenging anti-TB enzyme target.

2. Results

2.1. A Dual-Mode Native Mass Spectrometry (MS) Workflow

To identify ligands targeting HPPK, a native MS-based screening approach was applied to an in-house natural product library comprising 1890 compounds from extensive isolation efforts [54,55,56]. The library consists of structurally diverse natural products, predominantly within a molecular weight range of 200–800 Da (Figure 1A). The chemical diversity of the library was further evaluated using a Morgan fingerprint-based UMAP projection (Figure 1B) [57,58]. Morgan fingerprints encode molecular substructures and connectivity patterns into numerical representations, enabling structural similarity analysis across large compound datasets [59]. Dimensionality reduction by UMAP revealed a broad distribution of compounds throughout the chemical space, indicating substantial structural diversity within the library and supporting its suitability for ligand discovery screening. Compounds were randomly grouped into 19 pools (up to 100 compounds per pool). Pools containing fewer compounds were supplemented with DMSO to maintain constant volume and uniform ligand concentrations across all pools.
Prior to screening, the structural integrity and ionization behavior of recombinant M. smegmatis HPPK were evaluated under native MS conditions. The protein at 1 μM exhibited a narrow charge state distribution centered at 7+ and 8+, consistent with a compact, folded conformation (Figure 2A). The deconvoluted molecular mass of the apo-protein was 20,205 Da, in agreement with the theoretical mass calculated from the amino acid sequence, confirming the preservation of native-like structure during analysis.
8-Mercaptoguanine (8-MG, Figure 2B) is a previously reported ligand and inhibitor of bacterial HPPK enzymes, particularly against the Staphylococcus aureus HPPK (SaHPPK), where it was shown to bind within the pterin/substrate-binding pocket with a reported K d of approximately 11–13 μM and an IC50 of ~41 μM [60]. Structural and biophysical studies further demonstrated that the thiol group of 8-MG contributes significantly to binding affinity through interactions within the conserved substrate-binding region of HPPK [12]. In the present study, 8-MG was employed as a positive-control ligand to evaluate the binding competence of recombinant M. smegmatis HPPK under native MS conditions (Figure 2A). Direct observation of intact HPPK–8-MG complexes confirmed that the purified protein remained functionally folded and capable of ligand engagement during native MS analysis. Native MS titration experiments further indicated an apparent binding affinity in the low micromolar range, with an estimated K d of approximately 16.2 μM under the experimental conditions used (Figure 2C).
To enable high-throughput screening of complex mixtures, we developed a dual-mode native MS workflow for ligand detection and identification in pooled natural product libraries. In each experiment, HPPK (10 μM) was incubated with the compound pools (10 μM per compound; total ligand concentration 1 mM), followed by direct infusion into the mass spectrometer (Figure 3).
Native MS provides the sensitivity and resolution required to detect protein–ligand complexes in multiplexed mixtures. However, the low concentration of individual ligands results in weak complex signals, while the high total ligand load can induce ion suppression, further reducing signal intensity. In addition, high protein charge states compress m/z spacing between complexes, limiting confident assignment of bound species based solely on intact masses (e.g., a 5 Da mass difference at 10+ corresponds to only 0.5 m/z units).
To address these challenges, a dual-mode acquisition strategy combining low- and high-energy collision-induced dissociation (CID) was employed. Low CID preserves intact protein–ligand complexes for detection of binding events and estimation of relative binding affinities. The exact voltage for low CID represents a balance between maintaining complex integrity and maximizing protein signal intensity for detection. Based on method optimization with a positive control ligand, CID voltages in the range of 5–10 V did not disrupt the protein–ligand complex; 10 V was selected to provide a stronger protein signal while still preserving intact complexes. In contrast, high CID is expected to induce dissociation, releasing ligands as singly charged ions with broader m/z spacing and improved signal-to-noise ratios, enabling confident ligand identification. The high-CID range was determined using the positive control and prior studies [45,61], where voltages of 25–30 V resulted in efficient dissociation of bound ligands while maintaining the protein in a native-like state.
Importantly, the intensity of released ligand ions is influenced by ionization efficiency and therefore does not reliably reflect binding strength, whereas the intensity of intact protein–ligand complexes under low CID provides a more direct measure of apparent relative affinity under fixed screening conditions. By integrating both modes—complex detection for quantification and ligand release for identification—this workflow enables confident ligand assignment while retaining comparative information on intact complex formation. This approach is particularly well-suited for screening structurally diverse libraries, where binding is often weak and spectral complexity is high.
The relationship between intact complex abundance and apparent relative affinity is most informative when binding is not saturated and complex peaks are well resolved. According to our previous work, the natural-product ligands screened here showed weak binding responses, and 1:1 protein/ligand screening at 10 µM is therefore expected to remain sensitive to differences in binding strength.

2.2. Automated Data Analysis

To handle the high data volume and complexity of the 100-compound pooled screening experiments, we implemented a custom automated data analysis pipeline (Figure 4). This workflow was necessary to ensure consistent, high-throughput processing of the dual-mode native MS data across ~2000 compounds. Manual interpretation of such multiplexed spectra would be impractical and prone to error, especially given the weak signals and significant ion suppression observed in 100-ligand mixtures. By automating key steps of spectral processing and analysis, the pipeline systematically flags putative protein–ligand complexes in the low-collision energy (intact complex) datasets and assigns corresponding ligand identities using the high-collision energy (ligand-release) datasets, providing a robust foundation for hit identification.
Raw data from both low- and high-CID acquisitions were first processed using a peak-picking filter to extract centroided spectra, followed by conversion to mzML format using MSConvert for automated analysis. Centroided spectra from multiple scans were aligned to generate an apex spectrum for each sample.
For low-CID datasets, protein charge states were assigned based on the deconvoluted protein mass obtained from the protein-only spectrum, supported by the isotopic distribution of HPPK, and the two most intense charge states were selected for downstream analysis. Protein–ligand complexes were identified as newly emerged peaks adjacent to the apo-protein signals. Consistent mass differences observed across these charge states were used to define putative binding events. These mass differences were matched against the ligand database within a tolerance of ±2 Da, and each observed complex was assigned to a single ligand candidate based on the closest mass match to minimize ambiguity in highly multiplexed mixtures. Binding ratios were calculated as the relative intensity of the protein–ligand complex compared to the total protein signal (free and bound).
For high-CID datasets, protein–ligand complexes were dissociated, and released ligand ions were detected with reduced charge states and increased m/z separation. The molecular weights derived from the low-CID datasets were used to generate the expected adduct forms ([M + H]+, [M + Na]+, [M + K]+), and peaks were searched within the corresponding m/z regions in the high-CID spectra. The most intense peaks in these regions were selected and matched against the ligand database based on minimal mass difference.
Due to the limited resolution of intact protein–ligand complexes under native MS conditions, particularly for weak-binding ligands, mass differences derived from low-CID data may deviate from theoretical values. In contrast, released ligand ions detected under high-CID conditions provide improved mass accuracy and facilitate more confident matching to the ligand database. Integration of these complementary datasets enables reconciliation of complex-derived mass differences with directly observed ligand signals, thereby improving the reliability of ligand identification.
The workflow supports batch processing, automated binding ratio calculation, and visualization of results. Consistent detection of low-intensity complexes across dominant charge states was considered sufficient evidence for inclusion as true binding events.

2.3. Screening Results

Using the dual-mode native MS screening workflow in combination with automated data analysis, hit ligands from the screened natural product library were identified, using the binding ratio threshold of 1% in low-energy CID datasets. The binding threshold was low as the screening was done in an extremely complex pool of 100 compounds; ligand competition and signal suppression from high ligand concentrations could potentially influence the binding ratios.
A total of 14 ligands were detected to form protein–ligand complexes with HPPK (Figures S2 and S3, Table S1). Using a hit rate of 0.25%, the top five ligands were selected as initial hits (Table 1), all of which—pseudoginsenoside F11, ginsenoside Rb1, ginsenoside F3, ginsenoside Re, and ginsenoside Rb3—belong to the ginsenoside family. Among these, pseudoginsenoside F11 exhibited the highest complex intensity, with a binding ratio approaching 2%. The remaining four ginsenosides showed binding ratios ranging from approximately 1.0% to 1.5%.
However, in the 100-compound pooled screen, weak complex signals, ionization competition, and partial spectral overlap can affect apparent binding ratios. Therefore, binding ratios from the pooled screen were used primarily for hit detection and prioritization rather than definitive affinity ranking. Individual confirmation experiments, performed without other pool components with cleaner spectra need to be performed to compare relative binding strength among confirmed hits.

2.4. Hit Validation

The identified hits were re-evaluated individually against HPPK at a protein–ligand ratio of 1:2.5 to validate their binding under non-competitive conditions (Figure 5). The resulting binding ratios are summarized in Figure 6 and Table 2. All five hits exhibited increased binding ratios in the individual measurements compared to the initial mixture screening.
Among them, ginsenoside F3 (3) showed the most pronounced enhancement, with more than a tenfold increase in binding ratio, reaching 12.49% under individual conditions. The remaining four compounds displayed binding in the order of ginsenoside Rb1 (7, 5.35%), ginsenoside Re (9, 4.56%), pseudoginsenoside F11 (11, 4.23%), and ginsenoside Rb3 (13, 3.87%).
In addition to the five initial hits identified from pooled screening, 20 additional ginsenosides were individually evaluated for HPPK binding at a protein-to-ligand ratio of 1:2.5 (Figure 4, Table 2 and Figure S4). Binding complexes were detected for the majority of compounds tested, while seven ginsenosides showed no detectable complex formation under the conditions used.
Among all individually tested compounds, the highest binding ratios were observed for (S)-ginsenoside Rh1 (1, 14.40%) and ginsenoside Rg2 (2, 13.66%), both of which exceeded that of ginsenoside F3 (3, 12.49%), the strongest ligand identified from the initial screening. Two additional ginsenosides, ginsenoside Rf (4, 11.35%) and ginsenoside F1 (5, 10.08%), also showed binding ratios above 10%. Notably, all of these highest-affinity ligands belong to the protopanaxatriol (PPT)-type subclass.
A clear structure–activity relationship was observed between the PPT- and protopanaxadiol (PPD)-type ginsenosides. The key structural difference between PPT- and PPD-type ginsenosides lies at the C-6 position. PPT-type ginsenosides possess a hydroxyl group at C-6, which is frequently substituted by glycosidic moieties, whereas PPD-type ginsenosides lack oxygenation at this position. Among the 18 ginsenosides for which binding was detected, 11 were PPT-type compounds, excluding ginsenoside Ro, which does not belong to the dammarane-type scaffold. More importantly, the strongest binders were exclusively PPT-type ginsenosides. By contrast, the PPD-type ginsenosides generally exhibited substantially weaker binding, and six of the seven non-binding ginsenosides belonged to this subclass.
These results indicate that the oxygenation at C-6 is an important structural determinant for HPPK recognition. This feature may enhance binding by providing an additional polar interaction site, either directly through the C-6 hydroxyl group or indirectly through a glycosidic substituent attached at this position.
The degree of glycosylation also appears to play an important role in modulating binding. The six strongest binders (16) all contain only one or two glycosidic substituents at the C-6 or C-20 positions, whereas more highly glycosylated ginsenosides, such as ginsenoside Rb1 (7, four sugars), ginsenoside Ro (8, three sugars), and ginsenoside Re (9, three sugars), exhibited reduced binding.
Increased glycosylation is expected to enhance overall polarity and steric bulk, which may limit the ability of the aglycone core to adopt a binding-competent conformation and reduce favorable interactions with the protein surface. Partially deglycosylated ginsenosides likely provide a more balanced presentation of polar and hydrophobic features, enabling more effective engagement with HPPK.
To further examine the role of physicochemical properties in binding, the cLogP values (calculated by DataWarrior v06.01.00) of all tested ginsenosides were analyzed. The strongest ligands (16) exhibited intermediate cLogP values ranging from 2.1 to 4.0. In contrast, weaker binders (718) generally showed lower cLogP values, with the exception of (R)-ginsenoside Rh1 (10) and (R)-ginsenoside Rg2 (14), which share identical cLogP values with their corresponding C-20 epimers. The lowest cLogP values were observed for Rb1 (7, −0.7), Ro (8, 0.2), Rb3 (13, 0.4), Rc (16, −0.2), and Rb2 (18, −0.1), all of which exhibited binding ratios below 6%.
Notably, most of the non-binding ginsenosides were predominantly associated with higher cLogP values, including ginsenoside Rg6 (20, 4.1), ginsenoside Rk1 (21, 4.0), and both (S)- and (R)-ginsenoside Rh2 (2425, 4.8). These data suggest that an optimal range of hydrophobicity is required for productive interaction with HPPK. Compounds with low cLogP values are likely too polar, which may limit effective interaction with the protein surface, whereas highly hydrophobic ginsenosides may suffer from reduced solubility and/or an unfavourable presentation of polar functional groups required for binding.
A clear dependence on the C-20 configuration was also observed. Among the tested ginsenosides, there were four pairs of stereoisomers differing only at C-20, including two PPD-type pairs, (S/R)-ginsenoside Rg3 (22 and 23) and (S/R)-ginsenoside Rh2 (24 and 25), which did not exhibit detectable binding to HPPK in either configuration. For the two PPT-type pairs, (S)-ginsenoside Rh1 (1, 14.40%) and (S)-ginsenoside Rg2 (2, 13.66%) showed substantially stronger binding than their corresponding 20(R) counterparts, (R)-ginsenoside Rh1 (10, 4.44%) and (R)-ginsenoside Rg2 (14, 3.62%), corresponding to 3.1-fold and 3.6-fold differences, respectively.
The orientation at C-20 alters the spatial arrangement and conformational flexibility of the side chain, thereby modulating the three-dimensional presentation of the C-20 substituent and neighboring functional groups. This, in turn, can significantly influence the positioning of glycosidic moieties and polar functionalities relative to the protein surface. Previous studies have demonstrated that 20(S) and 20(R) epimers of ginsenosides often exhibit markedly different biological activities across a range of systems, including modulating lipid mediator profiles [62], antioxidant, anti-cancer [63], anti-inflammatory [64], and regenerative effects [65], consistent with differences in their ability to engage in productive molecular interactions.

3. Discussion

In this study, native MS was applied to screen a natural product library for ligands binding to HPPK. A dual-mode acquisition strategy was implemented, combining low-CID conditions for detection of intact protein–ligand complexes with high-CID conditions for ligand release and identification. This design enables complementary readouts from a single experiment, allowing both binding detection and ligand assignment in highly multiplexed samples. In addition, the development of an automated Python-based data analysis workflow ensures consistent processing of complex datasets and supports scalable application of this approach to larger libraries and higher-throughput screening campaigns.
Screening results revealed a clear enrichment of ginsenoside-related compounds among the detected binders. Analysis of this compound class showed a consistent structure–binding relationship, where binding strength is influenced by scaffold type, glycosylation pattern, and stereochemistry. In particular, PPT-type ginsenosides exhibited stronger binding compared to PPD-type analogues, and ligands with limited and strategically positioned glycosylation showed more favourable interactions. Small variations in sugar substitution may alter ligand orientation or introduce steric effects that affect how the molecule interacts with the protein surface, leading to distinct binding behaviors among closely related structures [66,67]. These observations indicate that HPPK recognition is sensitive to both the physicochemical properties and three-dimensional presentation of ginsenoside ligands, highlighting this class as a structurally informative set for probing protein–ligand interactions.
Although the identified ginsenosides exhibited clear and structure-dependent binding to HPPK, none of the tested compounds showed significant inhibition of M. smegmatis growth at concentrations up to 100 μM. Similar discrepancies between enzyme inhibition and antibacterial activity have also been reported for previously described high-affinity HPPK inhibitors, which show strong inhibition in biochemical assays but limited antibacterial effects in bacterial systems [68]. The high intracellular concentration of ATP and the highly optimized catalytic mechanism of the enzyme may limit the effectiveness of small-molecule inhibitors [69,70]. For an ATP-dependent enzyme such as HPPK, functional inhibition would require a ligand to bind with sufficient affinity and orientation to interfere with ATP- or substrate-dependent catalysis. In addition, cellular activity depends on compound uptake, retention, and stability within mycobacterial cells, which may be challenging for highly glycosylated ginsenosides.
The ability of these detected ginsenosides to bind HPPK in a structure-dependent manner suggests that they engage defined regions of the protein surface, even if these interactions do not directly interfere with catalysis. This may reflect the challenge of targeting ATP-dependent enzymes, where functional inhibition requires precise engagement of the catalytic machinery. In this context, ginsenosides provide a useful starting point for further investigation of ligand recognition, and may serve as a basis for future optimization toward functionally active compounds. More broadly, the results demonstrate that native MS-based screening can uncover structurally coherent ligand classes and provide insight into molecular recognition processes.
Future studies incorporating orthogonal biophysical methods such as surface plasmon resonance (SPR) and isothermal titration calorimetry (ITC) will be valuable for defining the binding affinities and binding modes of the identified ginsenosides in greater detail. Such studies will further clarify the molecular basis underlying the observed structure-dependent binding behaviour and support the development of optimized ligands targeting HPPK.

4. Materials and Methods

4.1. Ligand Preparation

8-Mercaptoguanine (8-MG) was purchased from Combi-Blocks, Inc. (San Diego, CA, USA).
The Quinn−Liu natural product library (~1890 compounds) was plated by Compounds Australia (Griffith University, QLD, Australia) into 19 pools of up to 100 compounds each (with any pools containing fewer than 100 compounds supplemented with equivalent volumes of DMSO to maintain consistent total volume and concentration). All compounds in each pool were present at equal nominal concentration (50 µM per compound in 100% DMSO stocks).

4.2. Protein Expression and Purification

Recombinant 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase (Rv3606c ortholog, HPPK) from Mycobacterium smegmatis was supplied as a purified protein by the Seattle Structural Genomics Center for Infectious Diseases (SSGCID, Seattle, WA, USA; protein ID MysmA.01570.a.B1.PS02122). The protein corresponded to full-length M. smegmatis HPPK/FolK residues 1–178 and contained an N-terminal His-tag sequence (MAHHHHHH) preceding the native sequence. The recombinant protein was expressed in E. coli BL21(DE3) Rosetta/BL21(DE3)R3 Rosetta using auto-induction medium and purified by His-tag-based Ni2+ affinity chromatography followed by size-exclusion chromatography. The protein was stored under −80 °C in a buffer containing 20 mM HEPES (pH 7.0), 300 mM NaCl, 5% (v/v) glycerol, and 1 mM tris(2-carboxyethyl)phosphine (TCEP).

4.3. Sample Preparation for Native MS

The protein was buffer-exchanged into 300 mM ammonium acetate (pH ~ 7.5, Sigma-Aldrich, St. Louis, MO, USA) using an Amersham NAP-5 desalting column (Cytiva, Marlborough, MA, USA) to remove non-volatile salts and prepare the protein for native MS. The protein concentration in all native MS binding experiments was 10 µM.
To prepare screening samples for native MS, a 10 µL aliquot of the DMSO stock solution from a 100-compound pool was dried under vacuum using a freeze dryer to remove DMSO. The dried compound mixture was reconstituted in 1 µL of methanol with careful mixing and visual for any obvious undissolved material. The sample was then incubated with 49 µL of HPPK solution for 1 h. This resulted in a final screening mixture containing HPPK at 10 µM and each individual compound at approximately 10 µM, with a total ligand concentration of approximately 1 mM and 2% methanol.
For individual ligand validation experiments, selected hit compounds (along with additional related ginsenosides) were obtained as 500 µM DMSO stock solutions from Compounds Australia, Griffith University (Nathan, QLD, Australia) in 96-well plates. An aliquot of each compound (2.5 µL) was similarly dried, reconstituted in 1 µL of methanol, and incubated with 49 µL of 10 µM HPPK solution for 1 h prior to native MS analysis. All experiments were performed in duplicate.

4.4. Mass Spectrometry Instrument Control and Data Acquisition

Screening measurements were performed on a Bruker SolariX XR 12T FT-ICR mass spectrometer (Bruker Daltonics, Bremen, Germany). Samples were directly infused using a 500 μL Hamilton syringe mounted on the integrated syringe pump at a flow rate of 120 μL/h. The capillary voltage was set to 3500 V, with an end-plate offset of −800 V. Drying gas was delivered at a flow rate of 4 L/min, with a nebulizer pressure of 3 bar and a temperature of 200 °C. Source optics voltages were configured as follows: capillary exit 200 V, deflector plate 220 V, funnel 1 at 150 V, and skimmer 1 at 30 V. The optical transfer frequency was set to 4 MHz with a time-of-flight (TOF) of 1.0 ms. Collision-induced dissociation (CID) voltages of 10 V (low-CID) and 25–30 V (high-CID) were applied in separate acquisitions for each sample. Each mass spectrum was obtained by summing 16 transients, with each transient consisting of 1 million data points. Mass spectra were recorded in positive ion mode over an m/z range of 100–6000. Instrument control and data acquisition were performed using Bruker SolariX control software (ftmsControl), version 2.0 (Bruker Daltonics, Bremen, Germany).
Ligand validation and additional ginsenosides were analysed on a Q Exactive UHMR mass spectrometer (Thermo Fisher Scientific, Waltham, MA, USA) equipped with a Nanospray Flex ion source. Typical parameters were as follows: spray voltage 1.8 kV, capillary temperature 220 °C, S-lens RF level 200, AGC target fixed, maximum injection time 200 ms, and in-source trapping enabled. Ion optics were configured with a source DC offset of 21 V, desolvation voltage of 0 V, injection flatapole DC of 5 V, inter-flatapole lens of 4 V, bent flatapole DC of 2 V, transfer multipole DC of 0 V, and a C-trap entrance lens inject voltage of 1.8 V. RF amplitudes were set to 150 V for the injection flatapole, 300 V for the bent flatapole, 250 V for the transfer multipole and HCD cell, and 2300 V for the C-trap. The trapping gas pressure was set to 3.0 (arbitrary units), and the C-trap charge detector support was disabled. Data were typically acquired with 5 microscans at a scan rate of approximately 0.8 scans/s. Raw data were processed using FreeStyle software, version 1.8 (Thermo Fisher Scientific, USA).

4.5. Data Analysis

Mass spectrometry screening data from Bruker FTMS files were processed using a custom automated pipeline implemented in Python version 3.14 (Code S1).
First, data files from each acquisition mode (low-CID and high-CID/HCD) were centroided and converted to an open format (mzML) using the MSConvert tool (ProteoWizard). For each sample and each collision condition, multiple individual scans were aligned and combined (averaged) to generate a single representative “apex” spectrum to improve detection of low-intensity signals. Peak assignment and hit identification were then carried out by comparing the low-CID and high-CID spectra.
In the low-CID (intact complex) spectra, the charge states of the apo-HPPK ion signals were identified using the known mass (20,205 Da) and isotopic distribution of HPPK (as determined from a separate protein-only spectrum). The two most intense HPPK charge states were used for analysis of binding. New peaks corresponding to protein–ligand complex ions were detected as reproducible signals at higher m/z than the apo-HPPK peaks. The mass differences between each putative complex and the apo-protein (for each charge state) were calculated and compared against the molecular weights of compounds in the screened ligand pool. A mass tolerance of ±2 Da was used for matching experimental mass differences to entries in the ligand library database. Only those mass shifts that appeared consistently across at least two independent charge states were considered evidence of a binding event.
In parallel, the high-CID (ligand-release) spectra were analyzed to confirm the identities of the bound ligands: the expected m/z values of potential ligand adduct ions ([M + H]+, [M + Na]+, and [M + K]+ species) were calculated for each candidate hit from the low-CID analysis, and the high-CID spectra were searched for corresponding peaks. Detection of a matching ligand ion in the high-CID data, within the defined mass tolerance, was used to verify the assignment of each protein-bound mass shift to a specific ligand.
Binding ratios for HPPK–ligand complexes were computed from the low-CID spectra as the intensity of the ligand-bound protein peak divided by the total intensity of protein (free HPPK plus HPPK bound to that ligand) for the chosen charge state(s).

4.6. Titration of 8-MG Against HPPK by Native MS

8-MG was prepared at ten concentrations ranging from 1 to 80 μM in 1 μL methanol and incubated with 49 μL of HPPK solution for 1 h at room temperature prior to native MS analysis. All experiments were performed in triplicate.
Deconvolution of native MS spectra acquired from all instruments was carried out using UniDec (University of Oxford, UK). Charge-state-resolved intensities of apo and ligand-bound protein were then extracted from deconvoluted spectra.
The dissociation constant ( K d ) values, defined as
K d = P × [ L ] [ P L ] = ( [ P ] 0 P L ) × ( [ L ] 0 P L ) [ P L ]
where P , [ L ] , [ P ] 0 , [ L ] 0 and P L represent the concentrations of the free protein, free ligand, total protein, total ligand and protein–ligand complex, respectively.
The ratio of protein–ligand complex ( P L ) was determined using the following equation:
P L P 0 =   n I [ P L ] n + n ( I P n + + I P L n + )
where I [ P L ] n + is the intensity of the P L at charge state n, I [ P ] n + is the intensity of the apo-protein at charge state n, and the summation accounts for all observed charge states of the protein.
A binding curve was then generated by plotting ligand concentration ( [ L ] 0 ) against the percentage of P L . Non-linear regression was performed in GraphPad Prism version 11 using the One-site-Specific binding model to obtain the fitted K d values.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/molecules31122065/s1, Figure S1: Pairwise sequence alignment and conserved motif annotation of HPPK from Mycobacterium tuberculosis and M. smegmatis; Figure S2: Native MS screening spectra for HPPK-binding ligands acquired under low-CID conditions (10 V); Figure S3: Native MS screening spectra for HPPK-binding ligands acquired under high-CID conditions (25–30 V); Figure S4: Native MS spectra of 20 additional ginsenosides (25 μM) against HPPK (10 μM); Table S1: Automated data analysis results from dual-mode native MS pooled screening of HPPK against 100-compound natural product pools under low-CID and high-CID conditions; Code S1: Python script for automated analysis of integrated dual-mode native MS datasets.

Author Contributions

Conceptualization, X.X., R.J.Q., B.H.A.R., P.J.M. and M.L.; methodology, X.X. and P.J.M.; software, X.X.; validation, X.X.; resources, M.L.; data curation, X.X.; writing—original draft preparation, X.X. and M.L.; writing—review and editing, X.X., R.J.Q., B.H.A.R., P.J.M. and M.L.; funding acquisition, M.L. All authors have read and agreed to the published version of the manuscript.

Funding

This project is supported by a 2022 National Health and Medical Research Council (NHMRC) Investigator Grant Emerging Leadership 1 (EL1) 2017517.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Dataset available on request from the corresponding author.

Acknowledgments

X.X. acknowledges award of a GU International Postgraduate Research Scholarship and a GU Postgraduate Research Scholarship. B.H.A.R. received funding from the Australian Research Council (ARC Discovery Projects DP200100874 and DP220102236). The purified protein was provided by the Seattle Structural Genomics Center for Infectious Disease (SSGCID; www.SSGCID.org, accessed on 11 June 2026) which is supported by Federal Contract No. 75N93022C00036 from the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services. The authors acknowledge the facilities and the scientific and technical assistance of Compounds Australia and the Ramaciotti Australian Native Mass Spectrometry Platform for Health Discoveries, Griffith University. The authors thank Wendy Loa for mass spectrometry instrumentation support.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
HPPK6-hydroxymethyl-7,8-dihydropterin pyrophosphokinase
TBTuberculosis
MtbMycobacterium tuberculosis
MDR-TBmultidrug-resistant TB
MSmass spectrometry
CIDcollision-induced dissociation
PPTprotopanaxatriol
PPDprotopanaxadiol
SSGCIDSeattle Structural Genomics Center for Infectious Diseases
TOFtime-of-flight

References

  1. Crombie, I.K. Where They Come from, Why They Spread, and When They End. In Major Pandemics; Palgrave Macmillan Singapore: Singapore, 2025. [Google Scholar]
  2. WHO. WHO Global Tuberculosis Report 2025; WHO: Geneva, Switzerland, 2025. [Google Scholar]
  3. Omoteso, O.A.; Fadaka, A.O.; Walker, R.B.; Khamanga, S.M. Innovative Strategies for Combating Multidrug-Resistant Tuberculosis: Advances in Drug Delivery Systems and Treatment. Microorganisms 2025, 13, 722. [Google Scholar] [CrossRef] [PubMed]
  4. Zhang, S.; Cheng, J.; Tang, Y. Drug resistance mechanisms in Mycobacterium tuberculosis infection and challenges in vaccine development. Front. Pharmacol. 2026, 17, 1762214. [Google Scholar] [CrossRef]
  5. Xu, M.; Kastner, D.W.; Luo, W.; Li, F.S.; Muller, P.; Sun, Y.; Huang, W.; Glinkerman, C.M.; Guempel, M.; Kulik, H.J.; et al. Mammalian-like steroidogenesis in plants gives rise to endocrine-mimetic cardenolides. Sci. Adv. 2026, 12, eaeb5460. [Google Scholar] [CrossRef] [PubMed]
  6. Grosjean, H.; Biggin, P.C. Developments and challenges in hit progression within fragment-based drug discovery. Nat. Commun. 2026, 17, 2226. [Google Scholar] [CrossRef]
  7. Vassiliades, S.V.; Borges, L.G.; Giarolla, J.; Parise-Filho, R. Folate Pathway Inhibitors, An Underestimated and Underexplored Molecular Target for New Anti-tuberculosis Agents. Mini Rev. Med. Chem. 2023, 23, 1711–1732. [Google Scholar] [CrossRef] [PubMed]
  8. Hajian, B.; Scocchera, E.; Shoen, C.; Krucinska, J.; Viswanathan, K.; G-Dayanandan, N.; Erlandsen, H.; Estrada, A.; Mikusova, K.; Kordulakova, J.; et al. Drugging the Folate Pathway in Mycobacterium tuberculosis: The Role of Multi-targeting Agents. Cell Chem. Biol. 2019, 26, 781–791.e6. [Google Scholar] [CrossRef] [PubMed]
  9. Wrobel, A.; Arciszewska, K.; Maliszewski, D.; Drozdowska, D. Trimethoprim and other nonclassical antifolates an excellent template for searching modifications of dihydrofolate reductase enzyme inhibitors. J. Antibiot. 2020, 73, 5–27. [Google Scholar] [CrossRef]
  10. Jacobowski, A.C.; Boleti, A.P.A.; Cruz, M.V.; Santos, K.; de Andrade, L.R.M.; Frihling, B.E.F.; Migliolo, L.; Paiva, P.M.G.; Teodoro, P.E.; Teodoro, L.P.R.; et al. Combating Antimicrobial Resistance: Innovative Strategies Using Peptides, Nanotechnology, Phages, Quorum Sensing Interference, and CRISPR-Cas Systems. Pharmaceuticals 2025, 18, 1119. [Google Scholar] [CrossRef]
  11. Shi, G.; Shaw, G.X.; Zhu, F.; Tarasov, S.G.; Ji, X. Bisubstrate inhibitors of 6-hydroxymethyl-7,8-dihydropterin pyrophosphokinase: Transition state analogs for high affinity binding. Bioorg. Med. Chem. 2021, 29, 115847. [Google Scholar] [CrossRef]
  12. Dennis, M.L.; Pitcher, N.P.; Lee, M.D.; DeBono, A.J.; Wang, Z.C.; Harjani, J.R.; Rahmani, R.; Cleary, B.; Peat, T.S.; Baell, J.B.; et al. Structural Basis for the Selective Binding of Inhibitors to 6-Hydroxymethyl-7,8-dihydropterin Pyrophosphokinase from Staphylococcus aureus and Escherichia coli. J. Med. Chem. 2016, 59, 5248–5263. [Google Scholar] [CrossRef]
  13. Jongkon, N.; Gleeson, D.; Gleeson, M.P. Elucidation of the Catalytic Mechanism of 6-Hydroxymethyl-7,8-dihydropterin Pyrophosphokinase using QM/MM Calculations. Org. Biomol. Chem. 2018, 16, 6239–6249. [Google Scholar] [CrossRef]
  14. Minato, Y.; Thiede, J.M.; Kordus, S.L.; McKlveen, E.J.; Turman, B.J.; Baughn, A.D. Mycobacterium tuberculosis folate metabolism and the mechanistic basis for para-aminosalicylic acid susceptibility and resistance. Antimicrob. Agents Chemother. 2015, 59, 5097–5106. [Google Scholar] [CrossRef]
  15. Yogavel, M.; Nettleship, J.E.; Sharma, A.; Harlos, K.; Jamwal, A.; Chaturvedi, R.; Sharma, M.; Jain, V.; Chhibber-Goel, J.; Sharma, A. Structure of 6-hydroxymethyl-7,8-Dihydropterin Pyrophosphokinase-Dihydropteroate Synthase from Sheds Light on Drug Resistance. J. Biol. Chem. 2018, 293, 14962–14972. [Google Scholar] [CrossRef] [PubMed]
  16. Shaw, G.X.; Li, Y.; Shi, G.; Wu, Y.; Cherry, S.; Needle, D.; Zhang, D.; Tropea, J.E.; Waugh, D.S.; Yan, H.; et al. Structural enzymology and inhibition of the bi-functional folate pathway enzyme HPPK-DHPS from the biowarfare agent Francisella tularensis. FEBS J. 2014, 281, 4123–4137. [Google Scholar] [CrossRef] [PubMed]
  17. Hoarau, M.; Suwanakitti, N.; Varatthan, T.; Thiabma, R.; Rattanajak, R.; Charoensetakul, N.; Redman, E.K.; Khotavivattana, T.; Vilaivan, T.; Yuthavong, Y.; et al. Assay Development and Identification of the First Plasmodium falciparum 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase Inhibitors. Molecules 2022, 27, 3515. [Google Scholar] [CrossRef] [PubMed]
  18. Falcão, V.C.d.A.; Czeczot, A.d.M.; Alam, M.M.; Park, K.-H.P.; Heo, J.; Woo, M.; Camini, A.M.; Timmers, L.F.S.M.; Shum, D.; Perelló, M.A.; et al. Identification and characterization of new structural scaffolds modulating the activity of Mycobacterium tuberculosis dihydroneopterin aldolase (FolB) in vitro. Front. Trop. Dis. 2024, 5, 1402321. [Google Scholar] [CrossRef]
  19. Han, J.; Liu, X.; Zhang, L.; Quinn, R.J.; Feng, Y. Anti-mycobacterial natural products and mechanisms of action. Nat. Prod. Rep. 2022, 39, 77–89. [Google Scholar] [CrossRef]
  20. Pye, C.R.; Bertin, M.J.; Lokey, R.S.; Gerwick, W.H.; Linington, R.G. Retrospective analysis of natural products provides insights for future discovery trends. Proc. Natl. Acad. Sci. USA 2017, 114, 5601–5606. [Google Scholar] [CrossRef]
  21. Gagare, S.; Patil, P.; Jain, A. Natural product-inspired strategies towards the discovery of novel bioactive molecules. Future J. Pharm. Sci. 2024, 10, 55. [Google Scholar] [CrossRef]
  22. Doyle, S.K.; Pop, M.S.; Evans, H.L.; Koehler, A.N. Advances in Discovering Small Molecules to Probe Protein Function in a Systems Context. Curr. Opin. Chem. Biol. 2016, 30, 28–36. [Google Scholar] [CrossRef]
  23. Auld, D.S.; Inglese, J.; Dahlin, J.L. Assay Interference by Aggregation. In Assay Guidance Manual; Markossian, S., Grossman, A., Baskir, H., Arkin, M., Auld, D., Austin, C., Baell, J., Brimacombe, K., Chung, T.D.Y., Coussens, N.P., et al., Eds.; Bethesda: Rockville, MD, USA, 2004. [Google Scholar]
  24. Prudent, R.; Lemoine, H.; Walsh, J.; Roche, D. Affinity selection mass spectrometry speeding drug discovery. Drug Discov. Today 2023, 28, 103760. [Google Scholar] [CrossRef]
  25. Tamara, S.; den Boer, M.A.; Heck, A.J.R. High-Resolution Native Mass Spectrometry. Chem. Rev. 2022, 122, 7269–7326. [Google Scholar] [CrossRef]
  26. Gavriilidou, A.F.M.; Sokratous, K.; Yen, H.-Y.; De Colibus, L. High-Throughput Native Mass Spectrometry Screening in Drug Discovery. Front. Mol. Biosci. 2022, 9, 837901. [Google Scholar] [CrossRef] [PubMed]
  27. Nguyen, G.T.H.; Bennett, J.L.; Liu, S.; Hancock, S.E.; Winter, D.L.; Glover, D.J.; Donald, W.A. Multiplexed Screening of Thousands of Natural Products for Protein-Ligand Binding in Native Mass Spectrometry. J. Am. Chem. Soc. 2021, 143, 21379–21387. [Google Scholar] [CrossRef] [PubMed]
  28. Elnaas, A.R.; Grice, D.; Han, J.Y.; Feng, Y.J.; Capua, A.D.; Mak, T.; Laureanti, J.A.; Buchko, G.W.; Myler, P.J.; Cook, G.; et al. Discovery of a Natural Product That Binds to the Protein Rv1466 Using Native Mass Spectrometry. Molecules 2020, 25, 2384. [Google Scholar] [CrossRef]
  29. Quinn, R.J.; Mak, T.; Littler, D.R.; Rossjohn, J.; Liu, M. Discovery of Anti-SARS-CoV-2 Nsp9 Binders from Natural Products by a Native Mass Spectrometry Approach. J. Nat. Prod. 2023, 86, 2630–2637. [Google Scholar] [CrossRef] [PubMed]
  30. Vu, H.; Pedro, L.; Mak, T.; McCormick, B.; Rowley, J.; Liu, M.M.; Di Capua, A.; Williams-Noonan, B.; Pham, N.B.; Pouwer, R.; et al. Fragment-Based Screening of a Natural Product Library against 62 Potential Malaria Drug Targets Employing Native Mass Spectrometry. ACS Infect. Dis. 2018, 4, 431–444. [Google Scholar] [CrossRef]
  31. Gu, Y.S.; Liu, M.M.; Quinn, R.J. Metabolite-protein interactions: Native mass spectrometry and collision induced affinity selection mass spectrometry in natural product screening. Front. Anal. Sci. 2022, 2, 1014017. [Google Scholar] [CrossRef]
  32. Kitova, E.N.; El-Hawiet, A.; Schnier, P.D.; Klassen, J.S. Reliable determinations of protein-ligand interactions by direct ESI-MS measurements. Are we there yet? J. Am. Soc. Mass Spectrom. 2012, 23, 431–441. [Google Scholar] [CrossRef]
  33. Karch, K.R.; Snyder, D.T.; Harvey, S.R.; Wysocki, V.H. Native Mass Spectrometry: Recent Progress and Remaining Challenges. Annu. Rev. Biophys. 2022, 51, 157–179. [Google Scholar] [CrossRef]
  34. Grgic, A.; Nagornov, K.O.; Kozhinov, A.N.; Michael, J.A.; Anthony, I.G.M.; Tsybin, Y.O.; Heeren, R.M.A.; Ellis, S.R. Ultrahigh-Mass Resolution Mass Spectrometry Imaging with an Orbitrap Externally Coupled to a High-Performance Data Acquisition System. Anal. Chem. 2023, 96, 794–801. [Google Scholar] [CrossRef] [PubMed]
  35. Christofi, E.; Barran, P. Ion Mobility Mass Spectrometry (IM-MS) for Structural Biology: Insights Gained by Measuring Mass, Charge, and Collision Cross Section. Chem. Rev. 2023, 123, 2902–2949. [Google Scholar] [CrossRef] [PubMed]
  36. Snyder, D.T.; Harvey, S.R.; Wysocki, V.H. Surface-induced Dissociation Mass Spectrometry as a Structural Biology Tool. Chem. Rev. 2022, 122, 7442–7487. [Google Scholar] [CrossRef] [PubMed]
  37. Keener, J.E.; Zhang, G.Z.; Marty, M.T. Native Mass Spectrometry of Membrane Proteins. Anal. Chem. 2021, 93, 583–597. [Google Scholar] [CrossRef]
  38. Oluwole, A.; Shutin, D.; Bolla, J.R. Mass spectrometry of intact membrane proteins: Shifting towards a more native-like context. Essays Biochem. 2023, 67, 201–213. [Google Scholar] [CrossRef]
  39. Kumar, S.; Stover, L.; Wang, L.; Bahramimoghaddam, H.; Zhou, M.; Russell, D.H.; Laganowsky, A. Native Mass Spectrometry of Membrane Protein-Lipid Interactions in Different Detergent Environments. Anal. Chem. 2024, 96, 16768–16776. [Google Scholar] [CrossRef]
  40. Le, J.; Loo, J.A. Native Top-Down Mass Spectrometry Characterization of Model Integral Membrane Protein Bacteriorhodopsin. J. Am. Soc. Mass Spectr. 2025, 36, 961–968. [Google Scholar] [CrossRef]
  41. Deedwania, A.; Wang, Y.; Robinson, C.V.; Bolla, J.R. Native mass spectrometry of membrane proteins reconstituted in peptidiscs. RSC Chem. Biol. 2026, 7, 120–128. [Google Scholar] [CrossRef]
  42. Kim, D.; Liu, W.J.; Viner, R.; Cherezov, V. Native mass spectrometry prescreening of G protein-coupled receptor complexes for cryo-EM structure determination. Structure 2024, 32, 2206–2219. [Google Scholar] [CrossRef]
  43. Tajiri, M.; Imai, S.; Konuma, T.; Shimamoto, K.; Shimada, I.; Akashi, S. Evaluation of Drug Responses to Human βAR Using Native Mass Spectrometry. ACS Omega 2023, 8, 24544–24551. [Google Scholar] [CrossRef]
  44. Gu, Y.S.; Liu, M.M.; Ma, L.L.; Quinn, R.J. Identification of Ligands for Ion Channels: TRPM2. Chembiochem 2024, 25, e202300790. [Google Scholar] [CrossRef]
  45. Gu, Y.S.; Liu, M.M.; Ma, L.L.; Quinn, R.J. Advancing Kir4.2 Channel Ligand Identification through Collision-Induced Affinity Selection Mass Spectrometry. ACS Chem. Biol. 2024, 19, 763–773. [Google Scholar] [CrossRef] [PubMed]
  46. Liu, R.J.; Xia, S.J.; Li, H.L. Native top-down mass spectrometry for higher-order structural characterization of proteins and complexes. Mass Spectrom. Rev. 2023, 42, 1876–1926. [Google Scholar] [CrossRef] [PubMed]
  47. Sternicki, L.M.; Nonomiya, J.; Liu, M.M.; Mulvihill, M.M.; Quinn, R.J. Native Mass Spectrometry for the Study of PROTAC GNE-987-Containing Ternary Complexes. Chemmedchem 2021, 16, 2206–2210. [Google Scholar] [CrossRef] [PubMed]
  48. Beveridge, R.; Kessler, D.; Rumpel, K.; Ettmayer, P.; Meinhart, A.; Clausen, T. Native Mass Spectrometry Can Effectively Predict PROTAC Efficacy. ACS Central Sci. 2020, 6, 1223–1230. [Google Scholar] [CrossRef]
  49. He, X.M.; Chen, X.Y.; Wang, Y.S. Mass Spectrometry for Assessing Protein-Nucleic Acid Interactions. Anal. Chem. 2023, 95, 115–127. [Google Scholar] [CrossRef]
  50. Cotham, V.C.; Wang, S.H.; Li, N. An Online Native Mass Spectrometry Approach for Fast, Sensitive, and Quantitative Assessment of Adeno-Associated Virus Capsid Content Ratios. J. Am. Soc. Mass Spectr. 2024, 35, 1567–1575. [Google Scholar] [CrossRef]
  51. Rogawski, R.; Sharon, M. Characterizing Endogenous Protein Complexes with Biological Mass Spectrometry. Chem. Rev. 2022, 122, 7386–7414. [Google Scholar] [CrossRef]
  52. Gan, J.R.; Ben-Nissan, G.; Arkind, G.; Tarnavsky, M.; Trudeau, D.; Garcia, L.N.; Tawfik, D.S.; Sharon, M. Native Mass Spectrometry of Recombinant Proteins from Crude Cell Lysates. Anal. Chem. 2017, 89, 4398–4404. [Google Scholar] [CrossRef]
  53. Shi, G.; Shaw, G.; Li, Y.; Wu, Y.; Yan, H.; Ji, X. Bisubstrate analog inhibitors of 6-hydroxymethyl-7,8-dihydropterin pyrophosphokinase: New lead exhibits a distinct binding mode. Bioorg. Med. Chem. 2012, 20, 4303–4309. [Google Scholar] [CrossRef]
  54. Han, J.Y.; Liu, X.T.; Zhang, L.X.; Van Voorhis, W.C.; Quinn, R.J.; Liu, M.M. Rapid Discovery of Antimicrobial and Antimalarial Agents from Natural Product Fragments. Separations 2024, 11, 194. [Google Scholar] [CrossRef]
  55. Han, J.Y.; Liu, X.T.; Zhang, L.X.; Quinn, R.J.; Liu, M.M. Nuclear Magnetic Resonance Fingerprinting and Principal Component Analysis Strategies Lead to Anti-Tuberculosis Natural Product Discovery from Actinomycetes. Antibiotics 2025, 14, 108. [Google Scholar] [CrossRef]
  56. Liu, M.M.; Han, J.Y.; Feng, Y.J.; Guymer, G.; Forster, P.; Quinn, R.J. Antimicrobial Benzyltetrahydroisoquinoline-Derived Alkaloids from the Leaves of Doryphora aromatica. J. Nat. Prod. 2021, 84, 676–682. [Google Scholar] [CrossRef]
  57. McInnes, L.; Healy, J. UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. arXiv 2018, arXiv:1802.03426. [Google Scholar]
  58. Sharma, R.; Saghapour, E.; Chen, J.Y. An NLP-based technique to extract meaningful features from drug SMILES. iScience 2024, 27, 109127. [Google Scholar] [CrossRef]
  59. Zhou, H.Y.; Skolnick, J. Utility of the Morgan Fingerprint in Structure-Based Virtual Ligand Screening. J. Phys. Chem. B 2024, 128, 5363–5370. [Google Scholar] [CrossRef]
  60. Chhabra, S.; Barlow, N.; Dolezal, O.; Hattarki, M.K.; Newman, J.; Peat, T.S.; Graham, B.; Swarbrick, J.D. Exploring the Chemical Space around 8-Mercaptoguanine as a Route to New Inhibitors of the Folate Biosynthesis Enzyme HPPK. PLoS ONE 2013, 8, e59535. [Google Scholar] [CrossRef]
  61. Xue, X.; Quinn, R.J.; Rehm, B.H.A.; Day, C.J.; Jennings, M.P.; McBroom, J.; Littler, D.R.; Liu, M. Advancing Collision-Induced Affinity Selection Mass Spectrometry for Quantitative Ligand Analysis in Complex Mixtures. Anal. Chem. 2026, 98, 7247–7261. [Google Scholar] [CrossRef]
  62. Bruggink, V.; Gutjahr, C.; Decker, A.; Engelbrecht, H.; Beekmann, U.; Kralisch, D.; Werner, M.; Schädel, P.; Jordan, P.M.; Werz, O.; et al. Ginsenosides from Panax ginseng modulate lipid mediator profiles in human leukocytes by interference with cellular 5-lipoxygenase activity. Biochem. Pharmacol. 2025, 236, 116882. [Google Scholar] [CrossRef]
  63. Lee, S.Y.; Kim, G.T.; Roh, S.H.; Song, J.S.; Kim, H.J.; Hong, S.S.; Kwon, S.W.; Park, J.H. Proteomic analysis of the anti-cancer effect of 20S-ginsenoside Rg3 in human colon cancer cell lines. Biosci. Biotechnol. Biochem. 2009, 73, 811–816. [Google Scholar] [CrossRef]
  64. Wang, C.; Liu, J.; Deng, J.; Wang, J.; Weng, W.; Chu, H.; Meng, Q. Advances in the chemistry, pharmacological diversity, and metabolism of 20(R)-ginseng saponins. J. Ginseng Res. 2020, 44, 14–23. [Google Scholar] [CrossRef]
  65. Liu, J.; Shiono, J.; Shimizu, K.; Yu, H.; Zhang, C.; Jin, F.; Kondo, R. 20(R)-ginsenoside Rh2, not 20(S), is a selective osteoclastgenesis inhibitor without any cytotoxicity. Bioorg. Med. Chem. Lett. 2009, 19, 3320–3323. [Google Scholar] [CrossRef]
  66. Bennett, J.L.; Nguyen, G.T.H.; Donald, W.A. Protein-Small Molecule Interactions in Native Mass Spectrometry. Chem. Rev. 2022, 122, 7327–7385. [Google Scholar] [CrossRef]
  67. Leslie, K.G.; Berry, S.S.; Miller, G.J.; Mahon, C.S. Sugar-Coated: Can Multivalent Glycoconjugates Improve upon Nature’s Design? J. Am. Chem. Soc. 2024, 146, 27215–27232. [Google Scholar] [CrossRef]
  68. Czeczot, A.M.; Roth, C.D.; Ducati, R.G.; Pissinate, K.; Rambo, R.S.; Timmers, L.; Abbadi, B.L.; Macchi, F.S.; Pestana, V.Z.; Basso, L.A.; et al. 8-Mercaptoguanine-based inhibitors of Mycobacterium tuberculosis dihydroneopterin aldolase: Synthesis, in vitro inhibition and docking studies. J. Enzym. Inhib. Med. Chem. 2021, 36, 847–855. [Google Scholar] [CrossRef]
  69. Wolfe, L.M.; Veeraraghavan, U.; Idicula-Thomas, S.; Schurer, S.; Wennerberg, K.; Reynolds, R.; Besra, G.S.; Dobos, K.M. A chemical proteomics approach to profiling the ATP-binding proteome of Mycobacterium tuberculosis. Mol. Cell Proteom. 2013, 12, 1644–1660. [Google Scholar] [CrossRef]
  70. Zhang, X.; Zhao, R.; Qi, Y.; Yan, X.; Qi, G.; Peng, Q. The progress of Mycobacterium tuberculosis drug targets. Front. Med. 2024, 11, 1455715. [Google Scholar] [CrossRef]
Figure 1. (A) Molecular weight distribution of the natural product library; (B) Structural similarity map of the screened natural product library generated from RDKit Morgan fingerprints (ECFP4, radius = 2, 2048 bits) and visualized by UMAP using Tanimoto/Jaccard distance. Each black dot represents one compound.
Figure 1. (A) Molecular weight distribution of the natural product library; (B) Structural similarity map of the screened natural product library generated from RDKit Morgan fingerprints (ECFP4, radius = 2, 2048 bits) and visualized by UMAP using Tanimoto/Jaccard distance. Each black dot represents one compound.
Molecules 31 02065 g001
Figure 2. (A) Native MS spectra of M. smegmatis HPPK (1 μM) in the absence (top) and presence of 8-mercaptoguanine (8-MG, 10 μM). Apo HPPK signals are highlighted in blue, while HPPK–ligand complexes are highlighted in orange. (B) Chemical structure of 8-MG. (C) Titration curves of 8-MG at various concentrations against M. smegmatis HPPK (1 μM).
Figure 2. (A) Native MS spectra of M. smegmatis HPPK (1 μM) in the absence (top) and presence of 8-mercaptoguanine (8-MG, 10 μM). Apo HPPK signals are highlighted in blue, while HPPK–ligand complexes are highlighted in orange. (B) Chemical structure of 8-MG. (C) Titration curves of 8-MG at various concentrations against M. smegmatis HPPK (1 μM).
Molecules 31 02065 g002
Figure 3. Native MS screening workflow. HPPK (10 μM) was incubated with natural product pools. Samples were analyzed under dual CID conditions: low CID (10 V) to preserve intact protein–ligand complexes and high CID (25–30 V) to induce ligand release. Hit identification was based on combined analysis of both datasets.
Figure 3. Native MS screening workflow. HPPK (10 μM) was incubated with natural product pools. Samples were analyzed under dual CID conditions: low CID (10 V) to preserve intact protein–ligand complexes and high CID (25–30 V) to induce ligand release. Hit identification was based on combined analysis of both datasets.
Molecules 31 02065 g003
Figure 4. Python-based data analysis workflow for dual-mode native MS screening of ligand libraries.
Figure 4. Python-based data analysis workflow for dual-mode native MS screening of ligand libraries.
Molecules 31 02065 g004
Figure 5. Native MS validation of individual binding of the five initial ginsenoside hits (25 μM) to HPPK (10 μM). The top panel shows HPPK alone, displaying the 7+, 8+, and 9+ charge states. The lower panels show spectra acquired in the presence of pseudoginsenoside F11, ginsenoside Rb1, ginsenoside F3, ginsenoside Re, and ginsenoside Rb3, respectively. Apo HPPK signals are highlighted in blue, while HPPK–ligand complexes are highlighted in orange.
Figure 5. Native MS validation of individual binding of the five initial ginsenoside hits (25 μM) to HPPK (10 μM). The top panel shows HPPK alone, displaying the 7+, 8+, and 9+ charge states. The lower panels show spectra acquired in the presence of pseudoginsenoside F11, ginsenoside Rb1, ginsenoside F3, ginsenoside Re, and ginsenoside Rb3, respectively. Apo HPPK signals are highlighted in blue, while HPPK–ligand complexes are highlighted in orange.
Molecules 31 02065 g005
Figure 6. Structures of ginsenosides 125. The protopanaxatriol (PPT) and protopanaxadiol (PPD) skeletons are highlighted in red and blue, respectively.
Figure 6. Structures of ginsenosides 125. The protopanaxatriol (PPT) and protopanaxadiol (PPD) skeletons are highlighted in red and blue, respectively.
Molecules 31 02065 g006
Table 1. Top five initial hits identified from native MS screening of HPPK.
Table 1. Top five initial hits identified from native MS screening of HPPK.
Initial HitsExpected MW (Da)Calculated MW (Da) Under Low-CID ConditionsDetected Adduct Ions Under High-CID ConditionsBinding Ratios (%)
[M + H]+[M + Na]+[M + K]+
Pseudoginsenoside F11801.01801.43802.5024824.4843 *840.45761.89
Ginsenoside Rb11109.291108.551109.60891131.5910 *1147.58651.36
Ginsenoside F3770.99770.39771.4890 *793.4710809.44481.12
Ginsenoside Re947.15947.09-970.5422 *986.53731.09
Ginsenoside Rb31079.271079.931079.59791101.5804 *1117.55461.02
* The most intense peak selected for downstream analysis.
Table 2. Structural features of the 25 ginsenosides tested for HPPK binding.
Table 2. Structural features of the 25 ginsenosides tested for HPPK binding.
HitsNameClassC-3C-6C-20Number of SugarscLogPBinding Ratios (%)
1S-ginsenoside Rh1PPTOHGlcOH14.014.40 ± 0.11
2Ginsenoside Rg2PPTOHGlc–RhaOH23.013.66 ± 0.69
3Ginsenoside F3PPTOHOHGlc–Ara(p)22.712.49 ± 0.17
4Ginsenoside RfPPTOHGlc–GlcOH22.111.35 ± 0.54
5Ginsenoside F1PPTOHOHGlc14.010.08 ± 1.41
6Pseudoginsenoside RT5PPTOHGlc20,24-epoxy12.38.32 ± 0.49
7Ginsenoside Rb1PPDGlc-GlcHGlc–Glc4−0.75.35 ± 0.03
8Ginsenoside Ro*GlcA-GlcHH30.25.09 ± 0.06
9Ginsenoside RePPTOHGlc–RhaGlc31.24.56 ± 0.33
10R-ginsenoside Rh1PPTOHGlcOH14.04.44 ± 0.18
11Pseudoginsenoside F11PPTOHGlc–Rha20,24-epoxy21.44.23 ± 0.19
12Ginsenoside Rg1PPTOHGlcGlc22.13.98 ± 0.25
13Ginsenoside Rb3PPDGlc-GlcHGlc–Xyl40.43.87 ± 0.22
14R-ginsenoside Rg2PPTOHGlc–RhaOH23.03.62 ± 0.08
15Ginsenoside F2PPDGlcHGlc23.03.60 ± 0.08
16Ginsenoside RcPPDGlc-GlcHGlc–Ara(f)4−0.23.46 ± 0.09
17Ginsenoside RdPPDGlc-GlcHGlc31.12.65 ± 0.18
18Ginsenoside Rb2PPDGlc-GlcHGlc–Ara(p)4−0.12.29 ± 0.04
19Ginsenoside CKPPDOHHGlc14.80
20Ginsenoside Rg6PPTOHGlc–Rha#24.10
21Ginsenoside Rk1PPDGlc-GlcH#24.00
22S-ginsenoside Rg3PPDGlc-GlcHOH23.00
23R-ginsenoside Rg3PPDGlc-GlcHOH23.00
24S-ginsenoside Rh2PPDGlcHOH14.80
25R-ginsenoside Rh2PPDGlcHOH14.80
* Ginsenoside Ro: oleanane-type (oleanolic acid-based), not PPD/PPT. # Dehydrated side chain.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xue, X.; Quinn, R.J.; Rehm, B.H.A.; Myler, P.J.; Liu, M. Dual-Mode Native Mass Spectrometry Screening Identifies Ginsenoside Ligands of 6-Hydroxymethyl-7,8-Dihydropterin Pyrophosphokinase (HPPK). Molecules 2026, 31, 2065. https://doi.org/10.3390/molecules31122065

AMA Style

Xue X, Quinn RJ, Rehm BHA, Myler PJ, Liu M. Dual-Mode Native Mass Spectrometry Screening Identifies Ginsenoside Ligands of 6-Hydroxymethyl-7,8-Dihydropterin Pyrophosphokinase (HPPK). Molecules. 2026; 31(12):2065. https://doi.org/10.3390/molecules31122065

Chicago/Turabian Style

Xue, Xinru, Ronald J. Quinn, Bernd H. A. Rehm, Peter J. Myler, and Miaomiao Liu. 2026. "Dual-Mode Native Mass Spectrometry Screening Identifies Ginsenoside Ligands of 6-Hydroxymethyl-7,8-Dihydropterin Pyrophosphokinase (HPPK)" Molecules 31, no. 12: 2065. https://doi.org/10.3390/molecules31122065

APA Style

Xue, X., Quinn, R. J., Rehm, B. H. A., Myler, P. J., & Liu, M. (2026). Dual-Mode Native Mass Spectrometry Screening Identifies Ginsenoside Ligands of 6-Hydroxymethyl-7,8-Dihydropterin Pyrophosphokinase (HPPK). Molecules, 31(12), 2065. https://doi.org/10.3390/molecules31122065

Article Metrics

Back to TopTop