GC-MS, LC-MS/MS, Docking and Molecular Dynamics Approaches to Identify Potential SARS-CoV-2 3-Chymotrypsin-Like Protease Inhibitors from Zingiber officinale Roscoe

This study aims to identify and isolate the secondary metabolites of Zingiber officinale using GC-MS, preparative TLC, and LC-MS/MS methods, to evaluate the inhibitory potency on SARS-CoV-2 3 chymotrypsin-like protease enzyme, as well as to study the molecular interaction and stability by using docking and molecular dynamics simulations. GC-MS analysis suggested for the isolation of terpenoids compounds as major compounds on methanol extract of pseudostems and rhizomes. Isolation and LC-MS/MS analysis identified 5-hydro-7, 8, 2′-trimethoxyflavanone (9), (E)-hexadecyl-ferulate (1), isocyperol (2), N-isobutyl-(2E,4E)-octadecadienamide (3), and nootkatone (4) from the rhizome extract, as well as from the leaves extract with the absence of 9. Three known steroid compounds, i.e., spinasterone (7), spinasterol (8), and 24-methylcholesta-7-en-3β-on (6), were further identified from the pseudostem extract. Molecular docking showed that steroids compounds 7, 8, and 6 have lower predictive binding energies (MMGBSA) than other metabolites with binding energy of −87.91, −78.11, and −68.80 kcal/mole, respectively. Further characterization on the single isolated compound by NMR showed that 6 was identified and possessed 75% inhibitory activity on SARS-CoV-2 3CL protease enzyme that was slightly different with the positive control GC376 (77%). MD simulations showed the complex stability with compound 6 during 100 ns simulation time.


Introduction
COVID-19, caused by the SARS-CoV-2 virus, is a global pandemic that has negatively impacted human life in this recent time. As of July 2021, approximately 196 million people have been infected, and 4.2 million have died from this disease [1]. The absence of medicine has encouraged the application of several synthetic drugs to be repurposed for combating the virus replication, such as hydroxy chloroquine and remdesivir. However, the attention for adverse side effect prompts us to find drugs that are effective and selective in inhibiting the replication of the virus [2,3]. To date, several targets of SARS-CoV-2 virus have been identified, such as 3 chymotrypsin-like protease, papain-like protease, RNA dependent RNA polymerase, and spike-glycoprotein, which have afforded significant

Isolation and LC-MS/MS Identification of Isolates from Z. officinale n-Hexane Extract
The n-hexane extracts of Z. officinale leaves, pseudostems and rhizomes were fractionated on a vacuum-liquid chromatography (VLC) column, with silica gel 60 and eluted by a different polarity of solvents from n-hexane-dichloromethane-ethyl acetate to methanol. The VLC fractions having the same profiles were combined each other to collect 21, 17, and 18 fractions from leaves, pseudostems, and rhizomes extract, respectively. Based on the identification of terpenoids compounds (assigned by the purplish color after spraying with Liebermann-Burchard reagent), the fraction number 14 (leaves), number 7 (pseudostems), and number 46 (rhizomes) were further isolated using preparative-TLC on silica gel GF254 to possess a single isolated compound from each fraction and then characterized them using LC-MS/MS (Supplementary Materials).

Isolation and LC-MS/MS Identification of Isolates from Z. officinale n-Hexane Extract
The n-hexane extracts of Z. officinale leaves, pseudostems and rhizomes were fractionated on a vacuum-liquid chromatography (VLC) column, with silica gel 60 and eluted by a different polarity of solvents from n-hexane-dichloromethane-ethyl acetate to methanol. The VLC fractions having the same profiles were combined each other to collect 21, 17, and 18 fractions from leaves, pseudostems, and rhizomes extract, respectively. Based on the identification of terpenoids compounds (assigned by the purplish color after spraying with Liebermann-Burchard reagent), the fraction number 14 (leaves), number 7 (pseudostems), and number 46 (rhizomes) were further isolated using preparative-TLC on silica gel GF 254 to possess a single isolated compound from each fraction and then characterized them using LC-MS/MS (Supplementary Materials).

Molecular Docking
The LC-MS/MS identified compounds from Z. officinale n-hexane extracts (leaves, pseudostems, and rhizomes) were then subjected for molecular docking simulations to predict the potential compounds that can inhibit the SARS-CoV-2 3 CL protease. The optimized structures were docked to the viral protease binding site, which can be seen in Table 2. It showed that three known steroid compounds identified from the pseudostem part have lower predictive binding energies of molecular mechanics-generalized Born surface area (MMGBSA) than other compounds, including the co-crystallized ligand, and indinavir as the positive controls. Interestingly, 7, 8, and 6 exhibited binding energies with the value of −87.41, −78.11, and −68.80 kcal/mol, respectively, much lower than two positive controls, including baicalein (−47.14 kcal/mol) and remdesivir (−68.55 kcal/mol). Although indinavir showed binding energy slightly lower (−76.44 kcal/mol) than 6, it is higher than 7 and 8, indicating that compounds 6-8 are at least comparable with the positive controls in binding to the 3CLpro. The more negative value of this energy will exhibit the lower free energy along with the stronger binding. Therefore, the isolated steroid compounds were suggested for further purification and NMR analysis to elucidate the molecular structure, as well as to confirm the activity on SARS-CoV-2 3CL protease enzyme. SARS-CoV-2 3CL protease in complex with baicalein was chosen as the protein model based on the characteristic of this such ligand having more drug-like structure than peptidomimetic compound, which is commonly used as a protease inhibitor. Re-docking analysis of this co-crystallized nonpeptidomimetic inhibitor on the binding site of the viral protease (PDB 6m2n) represents the similar pose with the reported X-ray crystallography, in which hydrogen bonding (H-bond) interaction was found between the carbonyl group of baicalein and Glu166, as well as the multiple H-bond between the two phenolic hydroxyl groups and Gly143. Hydrophobic interactions were found between the free phenyl ring of baicalein with Met49, Cys44, Pro52, and Tyr54 ( Figure 2D) [22,23]. The steroid compounds showed binding modes mimicking the baicalein by polarly interacting with Glu166 and Gly143. The unique interaction was found only in the hydroxyl group of 8 through H-bond interaction with Thr190 ( Figure 2B). Meanwhile, the only carboxyl group of 6 interacts via H-bond with Cys44 ( Figure 2C). Definitely, there is no H-bond interaction found on 7 complexes with the 3CLpro. However, hydrophobic interaction was observed on the residues of Val 42, Cys 44, Leu 167, and Pro 168, giving an extra affinity to bind with the 3CLpro (Figure 2A).
of baicalein and Glu166, as well as the multiple H-bond between the two phenolic hydroxyl groups and Gly143. Hydrophobic interactions were found between the free phenyl ring of baicalein with Met49, Cys44, Pro52, and Tyr54 ( Figure 2D) [22,23]. The steroid compounds showed binding modes mimicking the baicalein by polarly interacting with Glu166 and Gly143. The unique interaction was found only in the hydroxyl group of 8 through H-bond interaction with Thr190 ( Figure 2B). Meanwhile, the only carboxyl group of 6 interacts via H-bond with Cys44 ( Figure 2C). Definitely, there is no H-bond interaction found on 7 complexes with the 3CLpro. However, hydrophobic interaction was observed on the residues of Val 42, Cys 44, Leu 167, and Pro 168, giving an extra affinity to bind with the 3CLpro (Figure 2A).

NMR Analysis and SARS-CoV-2 3CL Protease Inhibitory Activity Verification
The steroid compounds isolated from the pseudostem part of Z. officinale were purified by successive TLC preparative and underwent NMR analysis that led to structure elucidation of 6, based on 1 H and 13 C-NMR spectral data that can be seen in Table 3.
Compound 6 was isolated as a colorless powder. Its molecular mass was based on LC-MS/MS analysis with m/z at 399.36180 [M + H] + and molecular formula of C 28 H 46 O. The 13 C NMR data (Table 3) supported the chemical structure with 28 carbon atom signals categorized by Distortionless Enhancement by Polarization Transfer (DEPT) experiment into six methyls, ten methylenes, eight methines, and four quaternary carbons. Six degrees of unsaturation, calculated from the molecular formula, were attributed to a carbonyl group (δ C 204.6 ppm) assigned for C-3, a vinylic system at C-7 and C-8 (δ C 140.1 and 123.7 ppm), and four rings of steroidal skeleton. The 1 H NMR (Table 3) showed six methyl signals characteristic for cholesterol-type steroids skeleton, at δ H : 0.70 and 1.17 ppm, assigned for two methyls at C-18 and C-19, and four doublet signals for a methyl at δ H 0.79, 0.81, 0.90, and 1.00 ppm for C-28, C-21, C-26, and C-27, respectively. A broad singlet at δ H 5.71 was for olefinic proton H-7. From the 1 H-1 H COSY correlation, a spin system between H-C1 and H-C2 was observed and connected by long-range C-H correlation (HMBC) between Me-19 (δ H 1.17, br s) and C-1 (35.6), C-9 (53.8), and C-10 (38.6) established the closing of ring A. The connection with ring B was supported by long-range C-H correlation (HMBC) between H-C7 (δ H 5.71, s) and C-6 (32.9) and C-10 (38.6). The connection to the ring C was proven by the 1   Compound 6 was tested on SARS-CoV-2 enzyme inhibition and found the percentage of inhibition as 75% at the concentration of 200 µg/mL (Figure 4). This in vitro assay used the foster resonance energy transfer (FRET) principle, worked by measuring the fluorescence of the donor bead in the fluorogenic substrate upon cleaving under proteolysis by the 3CLpro [24]. The fluorescence will be reduced by the presence of an inhibitor, describing the inhibitory activity of such an inhibitor toward the protease. Biological activity verification showed that this compound demonstrated enzymatic inhibition by 75% (500 µM) against the SARS-CoV-2 3CLpro enzyme. Although it is still less potent than the positive control (GC376, 77% at 100 µM), there is still a hope to proceed a series of Compound 6 was tested on SARS-CoV-2 enzyme inhibition and found the percentage of inhibition as 75% at the concentration of 200 µ g/mL ( Figure 4). This in vitro assay used the foster resonance energy transfer (FRET) principle, worked by measuring the fluorescence of the donor bead in the fluorogenic substrate upon cleaving under proteolysis by the 3CLpro [24]. The fluorescence will be reduced by the presence of an inhibitor, describing the inhibitory activity of such an inhibitor toward the protease. Biological activity verification showed that this compound demonstrated enzymatic inhibition by 75% (500 µ M) against the SARS-CoV-2 3CLpro enzyme. Although it is still less potent than the positive control (GC376, 77% at 100 µ M), there is still a hope to proceed a series of concentration of compound 6 to calculate its real IC50. Unfortunately, due to our resource limitations, we have not performed this experiment yet.

Molecular Dynamics (MD) Simulation
MD simulations were performed to further analyze the complex stability of steroid compound 6 with the viral protease active site. The result can be seen in Figure 5. 6 was found to be relatively stable during 100 ns of the simulation times by distinction of RMSD complex ligand-protein and Cα protein below 3 Å.  Compound 6 was tested on SARS-CoV-2 enzyme inhibition and found the percentage of inhibition as 75% at the concentration of 200 µ g/mL (Figure 4). This in vitro assay used the foster resonance energy transfer (FRET) principle, worked by measuring the fluorescence of the donor bead in the fluorogenic substrate upon cleaving under proteolysis by the 3CLpro [24]. The fluorescence will be reduced by the presence of an inhibitor, describing the inhibitory activity of such an inhibitor toward the protease. Biological activity verification showed that this compound demonstrated enzymatic inhibition by 75% (500 µ M) against the SARS-CoV-2 3CLpro enzyme. Although it is still less potent than the positive control (GC376, 77% at 100 µ M), there is still a hope to proceed a series of concentration of compound 6 to calculate its real IC50. Unfortunately, due to our resource limitations, we have not performed this experiment yet.

Molecular Dynamics (MD) Simulation
MD simulations were performed to further analyze the complex stability of steroid compound 6 with the viral protease active site. The result can be seen in Figure 5. 6 was found to be relatively stable during 100 ns of the simulation times by distinction of RMSD complex ligand-protein and Cα protein below 3 Å.

Molecular Dynamics (MD) Simulation
MD simulations were performed to further analyze the complex stability of steroid compound 6 with the viral protease active site. The result can be seen in Figure 5. 6 was found to be relatively stable during 100 ns of the simulation times by distinction of RMSD complex ligand-protein and Cα protein below 3 Å. Figure 6A shows the residue interactions of viral protease with the steroid compound 6. It shows that H-bond between carboxyl group of 6 and Cys44 was retained during the simulation time of 100 ns. Meanwhile, hydrophobic interaction keep occurs with Met49, Met165, Leu167, Pro168, and Ala191. Figure 6B shows how 6 interacted mostly through H-bonds with Tyr54 and Cys44. Furthermore, Root Mean Square Fluctuation (RMSF) was used to evaluate the stability of the ligand with the specific amino acid residues of the SARS-CoV-2 3CLpro catalytic site, which are His41 and Cys145 [23]. Figure 6C shows that 6 might bind to catalytic sites both His41 and Cys145 with RMSF 0.755 Å and 0.880 Å, respectively. The lowest RMSF with the specific amino acid residues could reflect the most stable interaction leading to the marker of such amino acids in the 3CLpro catalytic site.  Figure 6A shows the residue interactions of viral protease with the steroid compound 6. It shows that H-bond between carboxyl group of 6 and Cys44 was retained during the simulation time of 100 ns. Meanwhile, hydrophobic interaction keep occurs with Met49, Met165, Leu167, Pro168, and Ala191. Figure 6B shows how 6 interacted mostly through H-bonds with Tyr54 and Cys44. Furthermore, Root Mean Square Fluctuation (RMSF) was used to evaluate the stability of the ligand with the specific amino acid residues of the SARS-CoV-2 3CLpro catalytic site, which are His41 and Cys145 [23]. Figure 6C shows that 6 might bind to catalytic sites both His41 and Cys145 with RMSF 0.755 Å and 0.880 Å , respectively. The lowest RMSF with the specific amino acid residues could reflect the most stable interaction leading to the marker of such amino acids in the 3CLpro catalytic site.
(A)   Figure 6A shows the residue interactions of viral protease with the steroid compound 6. It shows that H-bond between carboxyl group of 6 and Cys44 was retained during the simulation time of 100 ns. Meanwhile, hydrophobic interaction keep occurs with Met49, Met165, Leu167, Pro168, and Ala191. Figure 6B shows how 6 interacted mostly through H-bonds with Tyr54 and Cys44. Furthermore, Root Mean Square Fluctuation (RMSF) was used to evaluate the stability of the ligand with the specific amino acid residues of the SARS-CoV-2 3CLpro catalytic site, which are His41 and Cys145 [23]. Figure 6C shows that 6 might bind to catalytic sites both His41 and Cys145 with RMSF 0.755 Å and 0.880 Å, respectively. The lowest RMSF with the specific amino acid residues could reflect the most stable interaction leading to the marker of such amino acids in the 3CLpro catalytic site. (A)

Discussion
Many bioactive compounds from ginger has been identified so far, mostly con tuted by phenolics and terpenoids compounds [25]. The GC-MS analysis of methanol tract of Z. officinale identified terpenoids as major compounds in the pseudostems rhizomes parts. Meanwhile, in leaves, the terpenoids were found as the second after f acids. Among the terpenoids, zerumbone was detected with the high percentage 36.04%, 15.25%, and 2.25% on rhizomes, pseudostems, and leaves, respectively. Ter noids, including steroids, were reported to possess potential antiviral activities, such anti-hepatitis B, anti-HIV-1, hepatitis C, and anti-Herpes simplex virus [26][27][28]. Sev studies also suggest the potency of terpenoids/steroids as SARS-CoV-2 3CL protease hibitors [29][30][31][32][33]. Based on both chromatographic and spectroscopic data, the isolated co pounds from our study were confirmed having terpenoids/steroids structure, such a and 6.
The prediction for anti-SARS-CoV-2 on the identified compounds was performed in silico methods employing docking molecular as a fast and less time-consuming strat in the drug discovery process. Interestingly, steroid compounds 6, 7, and 8 were found be more potential in inhibiting SARS-CoV-2 3CL protease enzymes than others. To d there is no report for the presence of the steroid compound 6, 7, and 8 on Z. officinale tract. Compound 6 was reported as main sterol of Polyporus sulfureus. Meanwh

Discussion
Many bioactive compounds from ginger has been identified so far, mostly constituted by phenolics and terpenoids compounds [25]. The GC-MS analysis of methanol extract of Z. officinale identified terpenoids as major compounds in the pseudostems and rhizomes parts. Meanwhile, in leaves, the terpenoids were found as the second after fatty acids. Among the terpenoids, zerumbone was detected with the high percentage of 36.04%, 15.25%, and 2.25% on rhizomes, pseudostems, and leaves, respectively. Terpenoids, including steroids, were reported to possess potential antiviral activities, such as anti-hepatitis B, anti-HIV-1, hepatitis C, and anti-Herpes simplex virus [26][27][28]. Several studies also suggest the potency of terpenoids/steroids as SARS-CoV-2 3CL protease inhibitors [29][30][31][32][33]. Based on both chromatographic and spectroscopic data, the isolated compounds from our study were confirmed having terpenoids/steroids structure, such as 2 and 6.
The prediction for anti-SARS-CoV-2 on the identified compounds was performed by in silico methods employing docking molecular as a fast and less time-consuming strategy in the drug discovery process. Interestingly, steroid compounds 6, 7, and 8 were found to be more potential in inhibiting SARS-CoV-2 3CL protease enzymes than others. To date, there is no report for the presence of the steroid compound 6, 7, and 8 on Z. officinale extract. Compound 6 was reported as main sterol of Polyporus sulfureus. Meanwhile, compound 7 and 8 were reported from Samanea saman [34,35]. For biological activity, only compound 8 was reported to possess an antiproliferative effect on HeLa and RAW 264.7 cervical cancer cell lines [36]. The ability of steroids to inhibit the protease SARS-CoV-2 enzyme was also reported by Narkedhe et al. (2020), in which β-sitosterol, a common steroid from plant, was proposed by computational docking as inhibitor for main protease SARS-CoV-2. However, in vitro examination on SARS-Coronavirus 3CLpro showed weak inhibition with the IC 50 of 1210 µM [32,37]. Further purification and characterization of the isolated compound by utilizing NMR analysis was successful in elucidating the chemical structure of 6. Biological activity verification showed that this compound demonstrated enzymatic inhibition by 75% against the SARS-CoV-2 3CLpro enzyme, slightly less than the positive control (GC376). Molecular dynamics simulation during 100 ns showed that 6 was more stable in complex with the viral protease. Its H-bond interactions mediated by water molecule with the His41 and Cys145 were retained during the 100 ns simulation which are in line with the reference ligand. This enzymatic inhibition supported the insight on molecular mechanism predicted by molecular docking and molecular dynamics simulations results suggesting that this approach can be used for discovering potential drug compounds from nature. The inhibitory potency of other isolates and the extract must be further tested and continued by in vivo and clinical studies to support the application as medicine for COVID-19 disease.

General
TLC aluminium sheets 20 × 20 cm silica gel 60 F254, silica gel 60 (Merck, Darmstadt, Germany), and pre-coated TLC glass plates SIL G-25 UV254, 0.25 mm silica gel (Sigma, St. Louis, MO, USA) were used for thin layer chromatography analysis, vacuum liquid column chromatography, and preparative TLC, respectively. Spots on TLC were visualized by using spraying reagent of Liebermann Burchard for terpenoid/steroid detection. Shimadzu QP-2010 Gas Chromatograph Mass Spectrometer Ultra (Shimadzu, Kyoto, Japan) was used for GC-MS analysis. Acquity UPLC I-Class System with the XEVO G2-XS QTof Mass Spectrometer (Waters, Milford, MA, USA) was used for LC-MS/MS analysis, and NMR JEOL ECZ-500 and Variant Unity INOVA-500 Spectrometer (Agilent Technologies, Santa Carla, CA, USA) were used to elucidate the structural compounds. The nuclear magnetic resonance (NMR) was recorded at 500 MHz for 1 H and 150 MHz for 13 C. Chemical shifts are given in δ (ppm) relative to TMS as internal standard, and deuterated chloroform was used as the solvent. The SARS-CoV-2 3CLpro #78042 assay kit was purchased from BPS Bioscience Inc., (San Diego, CA, USA).

Plant Material
The Z. officinale plant was collected from Banggai Regency, Central Sulawesi, Indonesia, in December 2019. The plant was identified in Sulawesi Biodiversity Unit-Tadulako University (Herbarium Celebense) with the voucher specimen number 118/UN.28.UPT-SDHS/LK/2019 and deposited at the Herbarium.

Extraction and Isolation
The whole part of the plant (1290 g of leaves, 310 g of pseudostem, and 368 g of rhizomes) was extracted by maceration method using methanol (14 L) for 3-5 days. The collected filtrate was evaporated to obtain viscous methanolic extract (18.58 g for leaves, 27.52 g for pseudostem, and 72.9 g for rhizome). The methanolic extract of each part of the plant (10 g) was further separated by applying liquid-liquid extraction using n-hexane:water (1:1) and successively continued by using ethyl acetate:water (1:1) to separately collect the nhexane, soluble, and insoluble ethyl acetate fractions. The n-hexane fraction (each 3 g) was poured into a vacuum column chromatography system packed by silica gel (60-120 mesh). Gradient system of solvent, starting from n-hexane 100%, n-hexane/dichloromethane mixture, dichloromethane/ethyl acetate mixture and ended by ethyl acetate 100%, ethyl acetate/methanol, and then methanol 100%, was used to elute the crude fractions. A total of 78 fractions for leaf, 153 fractions for pseudostem, and 105 fractions for rhizome were collected. TLC analysis was used to identify the similar fractions, affording 21 fractions for leaf, 17 fractions for pseudostem, and 18 fractions for rhizome. The fraction 14 (leaves, 260 mg), 7 (pseudostems, 230 mg), and 46 (rhizomes, 320 mg), which was found to contain terpenoid/steroids, further isolated using preparative TLC with n-hexane:ethyl acetate (1:9) as a mobile phase. Isolation of fraction 14 (leaves) generated chromatogram where the third band with R f 0.46 (brown color with sulfuric acid-methanol) was taken to give the first isolate as colorless powder (10.5 mg). Likewise, isolation of fraction 7 (pseudostem) shows R f 0.50 (brown color with sulfuric acid-methanol) was taken to give the second isolate as colorless powder (15.2 mg). Finally, isolation of fraction 46 (rhizome) demonstrates the first band with R f 0.46 (brown color with sulfuric acid-methanol) was taken to give the third isolate as colorless powder (12.4 mg), as well. All isolated compounds were further analyzed by using LC-MS/MS. The isolated compound from fraction 7 (pseudostem) was further purified by successive preparative TLC using n-hexane:chloroform (5:2) as the mobile phase. The single spot at TLC (R f 0.71) was collected to give compound 6 (3.99 mg; yield 1.73%).

GC-MS Analysis
Methanol extract of leaves, pseudostems, and rhizomes of Z. officinale were analyzed by GC-MS (Shimadzu QP-2010 Gas Chromatograph Mass Spectrometer Ultra), equipped by an autosampler AOC-20i and capillary column (SH-Rxi-5Sil MS) with the diameter 30 m × 0.25 mm × 0.25 µm. Helium was used as carrier gas (1.0 mL/min), with temperature injection of 250 • C; splitless mode; a column oven temperature of 70 • C at the beginning and held for 2 min, and then ramped to 200 • C at the rate of 10 • C/min and end temperature 280 • C and held for 9 min at the rate 5 • C/min; an MS ion source temperature of 200 • C, and an interface temperature of 280 • C, were set. The secondary metabolites were identified by comparing the experimental molecular mass spectra and base peak of each chromatogram with the Wiley and NIST database libraries.

LC-MS/MS Analysis
Each of the isolated compounds (1 mg) was dissolved in 1 mL of methanol (LC-MS chromasolv ® grade). One microliter aliquot of the sample was injected into the column. Formic acid 0.1% (v/v) in water and acetonitrile plus formic acid 0.1% (v/v) were used as gradient solvent for eluting the sample that initially started by ratio 95:5, continued by ratio 60:40 from 1.00 to 8.00 min, ratio 0:100 from 8.00 to 13.00 min, and at the end with the ratio 95:5. The flowing rate is 0.3 mL/min. The instrument was set as follows: acquisition time 0.00-16.00 min, start mass 50.00-1200.00 m/z, scan time 0.100 s, low CE 6 eV, high CE 10-40 eV, cone voltage 30 V, collision energy 6 eV; acquisition mode ESI (+), capillary voltage 2 kV, source temperature 120 • C, desolvation temperature 500 • C, cone gas flow 50 L/h, and desolvation gas flow 1000 L/h, sample temperature 20 • C, and column temperature 40 • C. The data obtained were processed by using UNIFI software (version 1.8, Waters Corporation, Milford, MA, USA) with a screening solution workflow, which helped in automated data processing for reporting the positive identifications. The result was compared with database that collected more than 1200 compounds based on chemical structure, molecular formula, and molecular mass from various web-based resources.

Molecular Docking
For the molecular docking, the protein used was the crystal structures of SARS-CoV-2 3CL protease with the pdb code 6m2n [23]. The protein was optimized by Protein Preparation Wizard module in Maestro Schrödinger 2020-3 software (Schrödinger, New York, NY, USA) [38,39]. The missing hydrogens were added during optimization process, and partial charges were also assigned using OPLS_2005 forcefield. Moreover, hydrogens and heavy atoms in protein were prepared in restrained minimization state. The 2D structures of Z. officinale compounds identified from LC-MS/MS analysis were converted to 3D structures using Maestro Schrödinger 2020-3 by LigPrep Module and OPLS_2005 forcefield with pH adjusted 7.4 via Epik [40]. LigPrep facilitated the protonation, tautomeric, and ionization states of each compound, as well as correct proper bond orders. In order to specify the docking region, the grid box was distinguished by selecting co-crystallized ligand of protease receptor to maintain that the center of docked compounds is in a similar dimension with the binding box. Docking protocol was run in extra precision (XP) mode through Glide using OPLS_2005 forcefield with flexible ligand and rigid receptor conditions [41,42]. To evaluate the potential of each ligand as 3CL protease inhibitor of SARS-CoV-2, molecular mechanics-generalized Born surface area (MM-GBSA) was used for scoring the docked pose of the ligand [21,43,44].

SARS-CoV-2 3CL Protease In Vitro Inhibition Assay
The assay buffer was prepared by adding 12 µL 0.5 M DTT for a total of 6 mL of assay buffer, which then would be furtherly used. The enzyme and the substrate were each diluted separately by adding, respectively, 3.95 mL and 950 µL of the previously prepared assay buffer (with DTT). The sample for the assay was prepared by dissolving it in DMSO at 100-fold concentration than the final or required concentration in a 96-well microplate. The final concentration of the sample was 200 µg/mL~500 µM. Inhibitor control used in this assay was a peptidomimetic called GC376 diluted in 200 µL water for 500 µM solution. The assay was carried by adding them in the following order: 30 µL of enzyme (5 ng/µL), required volume of sample or inhibitor (GC376), and assay buffer (with DTT) (if necessary) to a total volume of 40 µL. The initial mixture was incubated for 30 min at 25 • C with slow shaking, and then followed by the addition of 10 µL of substrate (250 µM) for a mixture with the final volume of 50 µL. The mixtures were then incubated overnight and measured with Synergy HTX-3 Multi-mode Reader (Winooski, VT, USA) at 360/460 nm.

Molecular Dynamics Simulation
A molecular dynamics (MD) study was conducted using the Desmond module in Schrödinger software (Version) [45][46][47]. Before commencing the MD process, the system was constructed by selecting ligand-protein complexes and submerged them into an SPC (simple point charge) water box at 10 Å. Moreover, to the system was added Counter ions (33 Na + , and 29 Cl − ions), to neutralize charges; additionally, Salts ions (sodium and chloride) were also set to 0.15 M to approximate physiological conditions. The MD simulation was conducted in NPT conditions (temperature 300 K and pressure 1.63 bar) for 50 ns with recording intervals set to 1.2 ps for energy and 20 ps for trajectory. Afterward, the MD simulations were run with the OPLS_2005 forcefield.

Conclusions
GC-MS, LC-MS/MS analysis, and docking molecular simulations were successfully used to identify the potential compounds from Zingiber officinale plant with the activity as inhibitor for SARS-CoV-2 3CL protease enzyme. Compound 6, 7, and 8 were steroid class compounds, found in pseudostem part, that showed low values of predictive binding energy (MMGBSA). Further purification and NMR characterization led to the structure of 6 that showed inhibitory activity 75%, slightly less than the positive control GC376 (77%). Further molecular dynamics simulation showed that 6 was found to be more stable during 100 ns molecular dynamics simulation.