Next Article in Journal
Identification of Exosome-Related Genes Associated with Prognosis and Immune Infiltration Features in Head-Neck Squamous Cell Carcinoma
Previous Article in Journal
Emergent Inflammatory Markers and Echocardiographic Indices in Patients with Bronchial Asthma
Previous Article in Special Issue
Simulated Docking Predicts Putative Channels for the Transport of Long-Chain Fatty Acids in Vibrio cholerae
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

In Silico Identification of Potential Inhibitors of the SARS-CoV-2 Main Protease among a PubChem Database of Avian Infectious Bronchitis Virus 3CLPro Inhibitors

Univ Lyon, INSA Lyon, Université Claude Bernard Lyon 1, CNRS, CPE-Lyon, ICBMS, UMR 5246, Institut de Chimie et de Biochimie Moléculaires et Supramoléculaires, Bâtiment Lederer, 1 Rue Victor Grignard, F-69622 Villeurbanne, France
*
Author to whom correspondence should be addressed.
Biomolecules 2023, 13(6), 956; https://doi.org/10.3390/biom13060956
Submission received: 26 April 2023 / Revised: 31 May 2023 / Accepted: 2 June 2023 / Published: 7 June 2023

Abstract

:
Remarkable structural homologies between the main proteases of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and the avian infectious bronchitis virus (IBV) were revealed by comparative amino-acid sequence and 3D structural alignment. Assessing whether reported IBV 3CLPro inhibitors could also interact with SARS-CoV-2 has been undertaken in silico using a PubChem BioAssay database of 388 compounds active on the avian infectious bronchitis virus 3C-like protease. Docking studies of this database on the SARS-CoV-2 protease resulted in the identification of four covalent inhibitors targeting the catalytic cysteine residue and five non-covalent inhibitors for which the binding was further investigated by molecular dynamics (MD) simulations. Predictive ADMET calculations on the nine compounds suggest promising pharmacokinetic properties.

Graphical Abstract

1. Introduction

Potential drugs able to fight the coronavirus disease 2019 (COVID-19) are still very scarce [1]. Several compounds have been reported to be active on the spike protein, the RNA-dependent RNA polymerase, the 3C-like (3CLpro, or Main protease, MPro), and papain-like (PLpro) viral proteases [2], and repurposing approaches were also released [3,4,5]. Bioinformatics and molecular docking techniques applied to available databases and XRD protein structures are essential tools for the discovery of new compounds and therapies effective against COVID-19 [6,7,8,9,10]. Among possible targets, the main protease (Mpro) is the most studied one owing to its crucial role in the virus replication process [11,12,13], and either non-covalent Mpro inhibitors [14] or covalent ones targeting the catalytic cysteine residue [15,16,17,18,19,20] have been described.
The purpose of the present study is to contribute to the identification of other potential drugs using bioinformatics tools, namely homology modeling, molecular docking, XRD structures, and available databases of molecules related to two severe acute respiratory syndrome coronavirus (SARS-CoV) main proteases, the avian infectious bronchitis virus (IBV) 3CLPro and the SARS-CoV-2 MPro. The PubChem BioAssay database collects and describes biological screenings and other assays, including the protocols and the structures of active (and inactive) compounds [21,22]. The idea is to start with a PubChem database of inhibitors active against the avian IBV 3CLPro and look for compounds that could also interact with the SARS-CoV-2 MPro. The bioassay AID 1706, deposited in 2009 by The Scripps Research Institute Molecular Screening Center, is part of the project “Summary of probe development efforts to identify inhibitors of the SARS coronavirus 3C-like protease (3CLPro)” from the avian infectious bronchitis virus [23,24,25,26,27]. The collection of 290,726 samples was tested for the inhibition of the peptide cleavage by 3CLPro using a fluorescent peptide. Among the 405 compounds active on the enzyme, 388 have their 3D structures directly retrievable from the database and can be used for further in silico investigations.
The structural similarity between the 3C-like proteases in coronaviruses supports the interest in undertaking docking studies of compounds active on the avian IBV within the active site of the SARS-CoV-2 MPro. With the crystal structures of the SARS-CoV-2 MPro in complex with a covalent inhibitor [28] and of the IBV main protease [29] being both available, protein structure homology and docking studies of 3CLPro inhibitors to specifically target the SARS-CoV-2 MPro active site were performed. Both non-covalent and covalent potential inhibitors were considered. Covalent inhibitors are less frequently studied because they involve an electrophilic functional group (reacting with a nucleophilic residue in the target active site), which may increase toxicity; however, recent work has tended to re-examine covalent inhibition [30,31,32]. SARS-CoV-2 MPro possesses a catalytic cysteine (Cys145) able to form a covalent bond with inhibitors [33].
The assay AID 1706 was retrieved from PubChem BioAssay, and the active compounds available were downloaded as SDF files. Two approaches were then applied: (i) a search for potential covalent inhibitors by selecting, among the starting library of 388 available compounds, those possessing an electrophilic moiety able to form a covalent bond with the Cys145 residue; and (ii) a search for potential non-covalent inhibitors by docking the library within the active site of the SARS-CoV-2 MPro protein (Figure 1). The stability of the complex of the protein and the potential non-covalent inhibitors was assessed by molecular dynamics (MD), and the predictive ADMET properties were examined.

2. Materials and Methods

2.1. Homology Studies

The PDB structures of 3C-like proteases of avian IBV (PDB code 2Q6F) [29] and of SARS-CoV-2 (PDB code 6Y2F) [28] were downloaded from the PDB database. The multiple sequence alignment was obtained using the T-COFFEE server [34], with the corresponding FASTA file retrieved from the NCBI database. The ESPript server [35] used with the clustal W file generated the multiple sequence alignment. The PyMOL software generated the 3D-structural alignment between the two proteins and the comparison of the two active sites with divergent amino acids in the surroundings of the catalytic dyad shown in magenta.

2.2. Docking Studies

2.2.1. Docking Studies for Covalent Inhibition

The library of compounds active on the main protease of the avian IBV 3CLPro was obtained from the PubChem Bioassay (code AID 1706, https://pubchem.ncbi.nlm.nih.gov/bioassay/1706 (accessed on 5 June 2023)). A set of 388 compounds (out of 405) with their structures fully retrievable were downloaded as an SDF file. This library was then scrutinized to search for compounds bearing a soft electrophilic moiety, resulting in a sub-library of 40 compounds (Table S1, Supplementary Information). Each of them was then docked within the MPro active site of SARS-CoV-2, centered on the β-keto amide inhibitor, using the Arguslab software v4.0.1 (ArgusLaB 4.0.1; WA planetaria Software LLC: Seattle, WA, USA, 2004) [36,37] with the Argusdock engine and default parameters. The docking results of the 40 compounds were analyzed by focusing on the distance between the sulfur of the cysteine protein active site and the electrophilic center, leading to the selection of four compounds showing a distance <4 Å [16]. The binding modes of these four compounds were examined with PyMOL, and flexible docking experiments were performed using a genetic algorithm engine implemented in Arguslab to corroborate the docking results obtained with the Argusdock engine (Figure S1A, Supplementary Information).

2.2.2. Docking Studies for Non-Covalent Inhibition

Each of the 388 compounds from the SDF library (downloaded from https://pubchem.ncbi.nlm.nih.gov/bioassay/1706 (accessed on 5 June 2023)) were docked within the MPro active site of SARS-CoV-2, centered on the β-keto amide inhibitor, using Arguslab software v4.0.1 with the Argusdock engine and default parameters. Compounds were then ranked according to their docking score (Table S2), and the corresponding binding modes of the top 5 compounds were examined using PyMOL. The hydrogen bond networks were generated using LigPlot + v2.2.4 [38]. For the five selected compounds, flexible docking experiments with a genetic algorithm engine implemented in Arguslab were performed to corroborate the docking results obtained with the Argusdock engine (Figure S1B).

2.2.3. Molecular Dynamics Studies

The protein-ligand complexes with each of the five best non-covalent potential inhibitors were minimized using the minimization module of MOE 2019 with default parameters. Molecular dynamics simulations were then performed in triplicate on the complexes using the dynamics module of MOE with the AMBER10-EHT force field and R-Field implicit solvation model (dielectric constant ε r = 80), based on the Amber 10 force field for proteins and the Extended Hückel Theory for the ligands [39,40,41]. The Nosé-Poincaré-Andersen (NPA), a sensitive and precise method, was used. The system was equilibrated at 300 K for 50 ps, and 500 ps production was performed, with all other parameters set as default [42,43]. The same study was conducted on the protein alone. Conformational sampling was set up, with each ps leading to 500 frames (Figure S2). Further calculations were performed with the non-covalent inhibitors for 5 ns (NPA method, equilibration at 300 K for 100 ps and 5000 ps production). Conformational sampling was set up with each 10 ps, leading to 500 frames. Trajectories were visualized with VMD, and Calpha RMSD (root-mean-square deviation) fluctuations were analyzed. Conformational samples from each 1000 ps were superimposed to provide an overview of the binding modes and identify the important amino acids bound to each ligand through hydrogen bonds. The hydrogen bond networks, with the distances to each atom, were generated and examined using LigPlot + v2.2.4 [38]. Conformational samples, i.e., protein-ligand complexes from each 1000 ps, were retrieved and submitted as pdb files to the Prodigy webserver to evaluate ΔGBinding [44,45]. The consistency of the method was assessed using the weakest binder, CID 2057165 (Table S2).

3. Results and Discussion

3.1. Homology Studies

The structural homology between the main proteases of the avian IBV and of SARS-CoV-2 was investigated by molecular modeling. The amino-acid sequence alignment of the two proteins revealed important similarities and a fully conserved His41 and Cys145 catalytic dyad (SARS-CoV-2 numbering) (Figure 2A) [26].
The 3D structural alignment clearly showed a similar overall secondary structure, with superimposed α-helixes and β-sheets, although some differences in the tertiary structure were observed for the unstructured parts of the two proteins, with a calculated Calpha RMSD value of 1.770 Å (Figure 2B). Importantly, significant superimposition of the residues of the catalytic dyad and an identical 3D structure of the catalytic domain are observed.
Although examination of the active sites of the two proteins revealed similarities in the amino acid sequences and conformations, some differences were also identified (Figure 3).
In particular, the histidine 41 environment differs, with Met49, Gln189, Met165, Thr25, and Asn142 for SARS-CoV-2 and Lys45, Glu187, Leu163, Asn25, and Ala140 in IBV. Furthermore, the catalytic cysteine (145 in SARS-CoV-2 or 143 in IBV) is neighbored by the residues Asn142 and Thr25 in SARS-CoV-2 and Ala140 and Asn25 in IBV.

3.2. Docking Studies of the 388 Active Compounds from the PubChem BioAssay AID 1706 within the SARS-CoV-2 Protease Active Site

Considering the structural homologies and differences between the two proteins, particularly in their active site, the PubChem BioAssay AID 1706 (IBV) was used as a starting collection to look for potential covalent inhibitors and non-covalent inhibitors directed at the SARS-CoV-2 MPro. The docking method has been validated in our previous study using three covalent inhibitors co-crystallized with SARS-CoV-2 MPro (PDB codes 5RHF, 5REN, and 5REK) [16], with results consistent with the reported crystallographic data [46].

3.2.1. Docking Studies Looking for Covalent Inhibitors

Covalent inhibition of SARS-CoV-2 MPro requires the formation of a covalent bond by the nucleophilic attack of an electrophilic ligand by the thiol group of Cys145 of the active site. Visual inspection of the 388 active compounds present in the bioassay led to the selection of 40 compounds with an electrophilic functional group, including chloroacetyl derivatives, acrylamides, benzonitriles, nitriles, and 2-cyanoacetamides. Docking studies of this sub-library of 40 compounds were analyzed in order to determine whether the distance between the electrophilic residue and the SH group is consistent with the formation of a covalent bond [16]. Four compounds are able, firstly, to bind within the active site of the SARS-CoV-2 MPro and, secondly, to react with the cysteine residue due to a distance between the sulfur and the electrophilic center of less than 4 Å, have been identified (Figure 4).
Three of the four compounds (CID 1154427, CID 4868361, and CID 4961646) are structurally related to chloroacetamide, while the fourth one is cyanovinyl benzamide (CID 843322 [47]). These compounds interact tightly within the active site with one to four hydrogen bonds, and their electrophilic center clearly lies in the vicinity of the sulfur atom of the cysteine residue (Table 1). The compounds CID 843322 and CID 1154427 interact tightly with the catalytic dyad, while the two other compounds interact with the residues Ser144 and Gly143, in addition to the nucleophilic residue Cys145.

3.2.2. Docking Studies Looking for Non-Covalent Inhibitors

For the non-covalent inhibition of SARS-CoV-2 MPro, the 388 active compounds on the avian IBV retrieved from the PubChem database were submitted to docking experiments within the active site and ranked according to their calculated docking score (Table S2). The five best binders, with calculated docking scores ranging from −9.92 to −9.54 kcal/mol, are structurally related to hydrazine (CID 1632360), thiazole (CID 4586109), benzotriazole (CID 645492 or CID 654498), or diaminophenyl (CID 2193552) derivatives (Figure 5).
The binding modes of these top five compounds were then studied in more detail. All five interact tightly within the active site, with between one and five H-bonds, and all are located in the close vicinity of the catalytic dyad His41 and Cys145, often interacting with the cysteine residue itself (Table 2).

3.3. Molecular Dynamics Studies (Non-Covalent Inhibitors)

The stability of the complexes formed between the protein and the non-covalent potential inhibitors was then investigated by molecular dynamics. First, molecular dynamics simulations were performed for 500 ps in triplicate (Figure S2), showing that the ligands remained tightly bound to the active site over the simulation time. Extension of the simulation time to 5 ns confirmed the stability of the complexes, with Calpha RMSD values fluctuating between 1 Å and 2 Å, except for the compound 4586109 (1.5 Å to 2.5 Å) (Figure 6).
The superimposition of conformational samples taken every 1000 ps (six representative samples over the 500 generated) indicates that the different compounds remain bound to the active site during the simulation (Figure 7). To identify the amino acids involved in the interaction with the ligands via hydrogen bonds, six representative samples were examined for each compound using LigPlot+ (European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK) [38] (Table 3). The number of residues interacting with the ligand (maximal interactions with five residues at the same time) ranges from three to eight. Some residues are involved in the binding of all five ligands, notably Glu166, already found to be a key residue in other bioinformatic studies [48], and His41 of the catalytic dyad with compound 2193552. Calculations of the RMSD of the ligands also show fluctuations of four ligands (all except for CID 4586109) within the 1 to 2 Å range, supporting good stability of the active site—ligand complexes. For CID 4586109, both protein and ligand RMSD calculations consistently suggest significantly lower complex stability.
In Figure 6, there are also plotted the ratios between the number of H-bonds over the maximum number of possible H-bonds (both values obtained using the H-bond plugin of VMD) for each ligand, which show remarkable persistence of the H-bond network contribution to the stability of the complexes.
The protein-ligand binding free energies and the H-bond networks during the MD simulation were further investigated by evaluating the ΔGBinding of the complexes collected each 1000 ps and by visualization of these complexes (Table 4).
The ΔGBinding values were evaluated using the Prodigy webserver for protein-ligand complexes [44,45]. The values ranging from −10.63 to −8.7 kcal/mol indicate strong affinities. and are consistent with the docking simulations (see Table 2). For comparison, the binding free energy value for the weakest binder CID 2057165 of the study (Table S2) calculated by the same method is consistently much higher, with a ΔGBinding value of −5.05 kcal/mol. The compound 2193552 develops several interactions with many amino acids (from 3 to 6) and displays the best ΔGBinding values, remaining constant during the MD simulations. Similar observations for all ligands showed consistency with the docking studies. At least one residue of the catalytic dyad is always involved in H-bond interactions with all 5 ligands over the MD simulation duration, in particular Cys145 for CID 1632360, CID 645492, CID 2193552, and His41 for CID 2193552. The persistence of the H-bonds over the MD simulation duration can be estimated by observing which and when the concerned residues enter or leave the H-bonding network (Table 4, Figure 7).

3.4. ADMET Studies

The ADMET properties of the nine (four covalent and five non-covalent) potential inhibitors, predicted using SwissADME [49], pkCSM [50], and ProTox-II [51], are listed in Table 5. Regarding bioavailability, all nine molecules respect Lipinski’s “rule of five” [52] and, partially, Veber’s rule [53]. All identified derivatives except CID 2193552 have very good predicted overall intestinal absorption, with high Caco-2 permeability (log Papp > 0.9) and intestinal absorption over 90%. Although the predicted steady-state volume of distribution (VDss) and the fraction unbound to serum proteins are both low, the compounds seem unlikely to cross the blood-brain barrier (BBB) and reach the central nervous system (CNS). Most compounds are also not likely to be metabolized by the two main isoforms of cytochrome P450. Finally, almost no compound showed predicted hepato-, immuno-, or cytotoxicity. Anticipated carcinogenicity could be observed for three covalent and three non-covalent potential inhibitors). Overall, the nine compounds exhibit satisfactory predicted pharmacokinetic properties, adequate for being considered as potential drug candidates for COVID-19 infection treatment.

4. Conclusions

The investigation of the homology between SARS-CoV-2 and the avian IBV main proteases has revealed a similar overall secondary structure, a fully conserved His41 and Cys145 catalytic dyad, and some differences within other residues of their active sites, justifying the search for SARS-CoV-2 MPro among reported IBV 3CLPro inhibitors. A library of bioactive substances on the avian IBV protease (PubChem BioAssay AID 1706) was thus used to look for covalent and non-covalent inhibitors by docking studies within the active site of SARS-CoV-2 MPro. Four covalent candidate inhibitors (including chloroacetyl derivatives, acrylamides, benzonitriles, nitriles, and 2-cyanoacetamides) and five non-covalent (hydrazine, thiazole, benzotriazole, and diaminophenyl derivatives) potential inhibitors have been identified and their binding modes investigated. The covalent ones interact tightly with the active site through several H-bonds and locate their electrophilic function at the reactive distance of the SH group of Cys145. The non-covalent potential inhibitors also bind tightly with the active site residues and are located in close proximity to the catalytic dyad. Molecular dynamics simulations confirmed that these ligands remain bound to the active site of the protein. Protein-ligand binding free energies appear consistent with the docking studies and highlight the residues responsible for most of the binding, notably the catalytic dyad, which develops key interactions with all identified ligands. Further, ADMET predictions showed calculated pharmacokinetic properties that are consistent with possible use as drug candidates. Overall, this bioinformatics study contributes to the identification of new bioactive compounds with possible activity against COVID-19.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biom13060956/s1, Figure S1. Docking experiments using a genetic algorithm. To corroborate the binding modes obtained using the Argusdock engine (in cyan color), docking experiments with a genetic algorithm engine implemented in Arguslab was achieved leading to the binding modes in magenta color. As depicted in the different figures, the binding modes obtained with the two methods are consistent for all compounds with only some differences for the compound 843322. A. Covalent inhibitors. B. Non-covalent inhibitors; Figure S2. Calpha RMSD obtained by molecular dynamics simulations during 500 ps for the protein without ligand and for all reversible complexes. Molecular dynamics simulations were performed for 500 ps in triplicate to ascertain that the complexes between the protein and the potential reversible inhibitors were stable. It could be noted that the different compounds remain tightly bound to the active site during the simulation; Table S1. Compound CID of the 40 potential covalent inhibitors; Table S2. Summary of results in order of docking score (kcal/mol).

Author Contributions

Conceptualization, L.S.; Methodology, L.S., T.B. and Y.Q.; Supervision, L.S.; Writing—original draft, L.S., T.B. and Y.Q.; Writing—review and editing, L.S., T.B. and Y.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data can be found in the manuscript and in the Supplementary Materials.

Acknowledgments

The authors are grateful to CNRS and MESRI for financial support and to MESRI for a scholarship to T.B. The authors thank Valerie James for improving the English of the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Pillaiyar, T.; Meenakshisundaram, S.; Manickam, M. Recent discovery and development of inhibitors targeting coronaviruses. Drug Discov. Today 2020, 25, 668–688. [Google Scholar] [CrossRef]
  2. Menéndez, J.C. Approaches to the Potential Therapy of COVID-19: A General Overview from the Medicinal Chemistry Perspective. Molecules 2022, 27, 658. [Google Scholar] [CrossRef]
  3. Castillo-Garit, J.A.; Cañizares-Carmenate, Y.; Pham-The, H.; Pérez-Doñate, V.; Torrens, F.; Pérez-Giménez, F. A Review of Computational Approaches Targeting SARS-CoV-2 Main Protease to the Discovery of New Potential Antiviral Compounds. Curr. Top. Med. Chem. 2023, 23, 3–16. [Google Scholar] [CrossRef] [PubMed]
  4. Suryavanshi, H.; Chaudhari, R.D.; Patil, V.; Majumdar, S.; Debnath, S.; Biswas, G. Design, synthesis and docking study of Vortioxetine derivatives as a SARS-CoV-2 main protease inhibitor. DARU J. Pharm. Sci. 2022, 30, 139–152. [Google Scholar] [CrossRef] [PubMed]
  5. Mslati, H.; Gentile, F.; Perez, C.; Cherkasov, A. Comprehensive Consensus Analysis of SARS-CoV-2 Drug Repurposing Campaigns. J. Chem. Inf. Model. 2021, 61, 3771–3788. [Google Scholar] [CrossRef] [PubMed]
  6. Kitchen, D.B.; Decornez, H.; Furr, J.R.; Bajorath, J. Docking and scoring in virtual screening for drug discovery: Methods and applications. Nat. Rev. Drug Discov. 2004, 3, 935–949. [Google Scholar] [CrossRef]
  7. Zhang, S.; Amahong, K.; Sun, X.; Lian, X.; Liu, J.; Sun, H.; Lou, Y.; Zhu, F.; Qiu, Y. The miRNA: A small but powerful RNA for COVID-19. Brief. Bioinform. 2021, 22, 1137–1149. [Google Scholar] [CrossRef]
  8. Zhang, S.; Amahong, K.; Zhang, C.; Li, F.; Gao, J.; Qiu, Y.; Zhu, F. RNA–RNA interactions between SARS-CoV-2 and host benefit viral development and evolution during COVID-19 infection. Brief. Bioinform. 2022, 23, bbab397. [Google Scholar] [CrossRef]
  9. Gimeno, A.; Mestres-Truyol, J.; Ojeda-Montes, M.J.; Macip, G.; Saldivar-Espinoza, B.; Cereto-Massagué, A.; Pujadas, G.; Garcia-Vallvé, S. Prediction of Novel Inhibitors of the Main Protease (M-pro) of SARS-CoV-2 through Consensus Docking and Drug Reposition. Int. J. Mol. Sci. 2020, 21, 3793. [Google Scholar] [CrossRef]
  10. Yamamoto, K.Z.; Yasuo, N.; Sekijima, M. Screening for Inhibitors of Main Protease in SARS-CoV-2: In Silico and In Vitro Approach Avoiding Peptidyl Secondary Amides. J. Chem. Inf. Model 2022, 62, 350–358. [Google Scholar] [CrossRef]
  11. Mengist, H.M.; Dilnessa, T.; Jin, T. Structural Basis of Potential Inhibitors Targeting SARS-CoV-2 Main Protease. Front. Chem. 2021, 9, 622898. [Google Scholar] [CrossRef] [PubMed]
  12. Citarella, A.; Scala, A.; Piperno, A.; Micale, N. SARS-CoV-2 Mpro: A Potential Target for Peptidomimetics and Small-Molecule Inhibitors. Biomolecules 2021, 11, 607. [Google Scholar] [CrossRef] [PubMed]
  13. Agost-Beltrán, L.; de la Hoz-Rodríguez, S.; Bou-Iserte, L.; Rodríguez, S.; Fernández-de-la-Pradilla, A.; González, F.V. Advances in the Development of SARS-CoV-2 Mpro Inhibitors. Molecules 2022, 27, 2523. [Google Scholar] [CrossRef] [PubMed]
  14. Antonopoulou, I.; Sapountzaki, E.; Rova, U.; Christakopoulos, P. Inhibition of the main protease of SARS-CoV-2 (M(pro)) by repurposing/designing drug-like substances and utilizing nature’s toolbox of bioactive compounds. Comput. Struct. Biotechnol. J. 2022, 20, 1306–1344. [Google Scholar] [CrossRef]
  15. Liu, S.; Zheng, Q.; Wang, Z. Potential covalent drugs targeting the main protease of the SARS-CoV-2 coronavirus. Bioinformatics 2020, 36, 3295–3298. [Google Scholar] [CrossRef]
  16. Soulère, L.; Barbier, T.; Queneau, Y. Docking-based virtual screening studies aiming at the covalent inhibition of SARS-CoV-2 M(Pro) by targeting the cysteine 145. Comput. Biol. Chem. 2021, 92, 107463. [Google Scholar] [CrossRef]
  17. Ton, A.T.; Gentile, F.; Hsing, M.; Ban, F.; Cherkasov, A. Rapid Identification of Potential Inhibitors of SARS-CoV-2 Main Protease by Deep Docking of 1.3 Billion Compounds. Mol. Inform. 2020, 39, e2000028. [Google Scholar] [CrossRef] [Green Version]
  18. Wang, J. Fast Identification of Possible Drug Treatment of Coronavirus Disease-19 (COVID-19) through Computational Drug Repurposing Study. J. Chem. Inf. Model. 2020, 60, 3277–3286. [Google Scholar] [CrossRef] [Green Version]
  19. Mishra, B.; Ballaney, P.; Saha, G.; Shinde, A.; Banerjee, S.; Thimmakondu, V.S.; Aduri, R. An in silico discovery of potential 3CL protease inhibitors of SARS-CoV-2 based upon inactivation of the cysteine 145-Histidine 41 catalytic dyad. J. Biomol. Struct. Dyn. 2022, 41, 3167–3186. [Google Scholar] [CrossRef]
  20. Awoonor-Williams, E.; Abu-Saleh, A.A.A. Covalent and non-covalent binding free energy calculations for peptidomimetic inhibitors of SARS-CoV-2 main protease. Phys. Chem. 2021, 23, 6746–6757. [Google Scholar] [CrossRef]
  21. Xie, X.Q. Exploiting PubChem for Virtual Screening. Expert Opin. Drug Discov. 2010, 5, 1205–1220. [Google Scholar] [CrossRef] [Green Version]
  22. Wang, Y.; Xiao, J.; Suzek, T.O.; Zhang, J.; Wang, J.; Zhou, Z.; Han, L.; Karapetyan, K.; Dracheva, S.; Shoemaker, B.A.; et al. PubChem’s BioAssay Database. Nucleic Acids Res. 2012, 40, D400–D412. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Mohapatra, S.; Nath, P.; Chatterjee, M.; Das, N.; Kalita, D.; Roy, P.; Satapathi, S. Repurposing therapeutics for COVID-19: Rapid prediction of commercially available drugs through machine learning and docking. PLoS ONE 2020, 15, e0241543. [Google Scholar] [CrossRef]
  24. Alves, V.M.; Bobrowski, T.; Melo-Filho, C.C.; Korn, D.; Auerbach, S.; Schmitt, C.; Muratov, E.N.; Tropsha, A. QSAR Modeling of SARS-CoV M(pro) Inhibitors Identifies Sufugolix, Cenicriviroc, Proglumetacin, and other Drugs as Candidates for Repurposing against SARS-CoV-2. Mol. Inform. 2021, 40, e2000113. [Google Scholar] [CrossRef] [PubMed]
  25. Delijewski, M.; Haneczok, J. AI drug discovery screening for COVID-19 reveals zafirlukast as a repurposing candidate. Med. Drug Discov. 2021, 9, 100077. [Google Scholar] [CrossRef] [PubMed]
  26. Nand, M.; Maiti, P.; Joshi, T.; Chandra, S.; Pande, V.; Kuniyal, J.C.; Ramakrishnan, M.A. Virtual screening of anti-HIV1 compounds against SARS-CoV-2: Machine learning modeling, chemoinformatics and molecular dynamics simulation based analysis. Sci. Rep. 2020, 10, 20397. [Google Scholar] [CrossRef] [PubMed]
  27. Santana, M.V.S.; Silva-Jr, F.P. De novo design and bioactivity prediction of SARS-CoV-2 main protease inhibitors using recurrent neural network-based transfer learning. BMC Chem. 2021, 15, 8. [Google Scholar] [CrossRef]
  28. Zhang, L.; Lin, D.; Sun, X.; Curth, U.; Drosten, C.; Sauerhering, L.; Becker, S.; Rox, K.; Hilgenfeld, R. Crystal structure of SARS-CoV-2 main protease provides a basis for design of improved alpha-ketoamide inhibitors. Science 2020, 368, 409–412. [Google Scholar] [CrossRef] [Green Version]
  29. Xue, X.; Yu, H.; Yang, H.; Xue, F.; Wu, Z.; Shen, W.; Li, J.; Zhou, Z.; Ding, Y.; Zhao, Q.; et al. Structures of two coronavirus main proteases: Implications for substrate binding and antiviral drug design. J. Virol. 2008, 82, 2515–2527. [Google Scholar] [CrossRef] [Green Version]
  30. Dalton, S.E.; Campos, S. Covalent Small Molecules as Enabling Platforms for Drug Discovery. Chembiochem 2020, 21, 1080–1100. [Google Scholar] [CrossRef]
  31. Ghosh, A.K.; Samanta, I.; Mondal, A.; Liu, W.R. Covalent Inhibition in Drug Discovery. Chemmedchem 2019, 14, 889–906. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Singh, J.; Petter, R.C.; Baillie, T.A.; Whitty, A. The resurgence of covalent drugs. Nat. Rev. Drug Discov. 2011, 10, 307–317. [Google Scholar] [CrossRef] [PubMed]
  33. Jin, Z.; Du, X.; Xu, Y.; Deng, Y.; Liu, M.; Zhao, Y.; Zhang, B.; Li, X.; Zhang, L.; Peng, C.; et al. Structure of M(pro) from SARS-CoV-2 and discovery of its inhibitors. Nature 2020, 582, 289–293. [Google Scholar] [CrossRef] [Green Version]
  34. Notredame, C.; Higgins, D.G.; Heringa, J. T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 2000, 302, 205–217. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Robert, X.; Gouet, P. Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Res. 2014, 42, W320–W324. [Google Scholar] [CrossRef] [Green Version]
  36. Thompson, M.A. ArgusLaB 4.0.1; WA planetaria Software LLC: Seattle, WA, USA, 2004. [Google Scholar]
  37. Oda, A.; Okayasu, M.; Kamiyama, Y.; Yoshida, T.; Takahashi, O.; Matsuzaki, H. Evaluation of docking accuracy and investigations of roles of parameters and each term in scoring functions for protein–ligand docking using ArgusLab software. Bull. Chem. Soc. Jpn. 2007, 80, 1920–1925. [Google Scholar] [CrossRef]
  38. Laskowski, R.A.; Swindells, M.B. LigPlot+: Multiple ligand-protein interaction diagrams for drug discovery. J. Chem. Inf. Model 2011, 51, 2778–2786. [Google Scholar] [CrossRef]
  39. Case, D.A.; Darden, T.A.; Cheatham, T.E.; Simmerling, C.L.; Wang, J.; Duke, R.E.; Luo, R.; Crowley, M.; Walker, R.C.; Zhang, W. Amber 10; University of California: San Francisco, CA, USA, 2008. [Google Scholar]
  40. Gerber, P.R.; Müller, K. MAB, a generally applicable molecular force field for structure modelling in medicinal chemistry. J. Comput. Aided Mol. Des. 1995, 9, 251–268. [Google Scholar] [CrossRef] [PubMed]
  41. Hoffmann, R. An extended Hückel theory. I. hydrocarbons. J. Chem. Phys. 1963, 39, 1397–1412. [Google Scholar] [CrossRef]
  42. Sturgeon, J.B.; Laird, B.B. Symplectic algorithm for constant-pressure molecular dynamics using a Nosé–Poincaré thermostat. J. Chem. Phys. 2000, 112, 3474–3482. [Google Scholar] [CrossRef] [Green Version]
  43. Khelfaoui, H.; Harkati, D.; Saleh, B.A. Molecular docking, molecular dynamics simulations and reactivity, studies on approved drugs library targeting ACE2 and SARS-CoV-2 binding with ACE2. J. Biomol. Struct. Dyn. 2021, 39, 7246–7262. [Google Scholar] [CrossRef]
  44. Kurkcuoglu, Z.; Koukos, P.I.; Citro, N.; Trellet, M.E.; Rodrigues, J.; Moreira, I.S.; Roel-Touris, J.; Melquiond, A.S.J.; Geng, C.; Schaarschmidt, J.; et al. Performance of HADDOCK and a simple contact-based protein-ligand binding affinity predictor in the D3R Grand Challenge 2. J. Comput. Aided Mol. Des. 2018, 32, 175–185. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Vangone, A.; Schaarschmidt, J.; Koukos, P.; Geng, C.; Citro, N.; Trellet, M.E.; Xue, L.C.; Bonvin, A. Large-scale prediction of binding affinity in protein-small ligand complexes: The PRODIGY-LIG web server. Bioinformatics 2019, 35, 1585–1587. [Google Scholar] [CrossRef] [Green Version]
  46. Douangamath, A.; Fearon, D.; Gehrtz, P.; Krojer, T.; Lukacik, P.; Owen, C.D.; Resnick, E.; Strain-Damerell, C.; Aimon, A.; Ábrányi-Balogh, P.; et al. Crystallographic and electrophilic fragment screening of the SARS-CoV-2 main protease. Nat. Commun. 2020, 11, 5047. [Google Scholar] [CrossRef]
  47. Grygorenko, O.O.; Volochnyuk, D.M.; Vashchenko, B.V. Emerging Building Blocks for Medicinal Chemistry: Recent Synthetic Advances. Eur. J. Org. Chem. 2021, 2021, 6478–6510. [Google Scholar] [CrossRef]
  48. Yoshino, R.; Yasuo, N.; Sekijima, M. Identification of key interactions between SARS-CoV-2 main protease and inhibitor drug candidates. Sci. Rep. 2020, 10, 12493. [Google Scholar] [CrossRef]
  49. Daina, A.; Michielin, O.; Zoete, V. SwissADME: A free web tool to evaluate pharmacokinetics, drug-likeness and medicinal chemistry friendliness of small molecules. Sci. Rep. 2017, 7, 42717. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. Pires, D.E.; Blundell, T.L.; Ascher, D.B. pkCSM: Predicting Small-Molecule Pharmacokinetic and Toxicity Properties Using Graph-Based Signatures. J. Med. Chem. 2015, 58, 4066–4072. [Google Scholar] [CrossRef]
  51. Banerjee, P.; Eckert, A.O.; Schrey, A.K.; Preissner, R. ProTox-II: A webserver for the prediction of toxicity of chemicals. Nucleic Acids Res. 2018, 46, W257–W263. [Google Scholar] [CrossRef] [Green Version]
  52. Lipinski, C.A.; Lombardo, F.; Dominy, B.W.; Feeney, P.J. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv. Drug Deliv. Rev. 2001, 46, 3–26. [Google Scholar] [CrossRef] [PubMed]
  53. Veber, D.F.; Johnson, S.R.; Cheng, H.Y.; Smith, B.R.; Ward, K.W.; Kopple, K.D. Molecular properties that influence the oral bioavailability of drug candidates. J. Med. Chem. 2002, 45, 2615–2623. [Google Scholar] [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions, and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions, or products referred to in the content.
Figure 1. Two approaches to investigating either the covalent or the non-covalent inhibition of the SARS-CoV-2 main protease, starting from the PubChem 3CLPro BioAssay AID 1706.
Figure 1. Two approaches to investigating either the covalent or the non-covalent inhibition of the SARS-CoV-2 main protease, starting from the PubChem 3CLPro BioAssay AID 1706.
Biomolecules 13 00956 g001
Figure 2. (A) Sequence alignment of the two main proteases from the avian IBV and SARS-CoV-2. The conserved residues are framed and highlighted in red (fully conserved) or written in red (partially conserved). The figure was generated using ESPript 3.0 [35]. The secondary structure of the main protease of SARS-CoV-2 is also indicated. (B) 3D structural alignment of the two proteins (Calpha RMSD value 1.770 Å) with secondary structure showing the catalytic dyad for both proteins (the SARS-CoV-2 MPro is colored in cyan).
Figure 2. (A) Sequence alignment of the two main proteases from the avian IBV and SARS-CoV-2. The conserved residues are framed and highlighted in red (fully conserved) or written in red (partially conserved). The figure was generated using ESPript 3.0 [35]. The secondary structure of the main protease of SARS-CoV-2 is also indicated. (B) 3D structural alignment of the two proteins (Calpha RMSD value 1.770 Å) with secondary structure showing the catalytic dyad for both proteins (the SARS-CoV-2 MPro is colored in cyan).
Biomolecules 13 00956 g002
Figure 3. Comparison between the main protease active site of SARS-CoV-2 (A,C) and avian IBV (B,D). (A,B): entire active sites; and (C,D): active sites simplified by hiding the residues common to the two proteins. The main differences for amino acids close to the catalytic dyad are indicated in magenta.
Figure 3. Comparison between the main protease active site of SARS-CoV-2 (A,C) and avian IBV (B,D). (A,B): entire active sites; and (C,D): active sites simplified by hiding the residues common to the two proteins. The main differences for amino acids close to the catalytic dyad are indicated in magenta.
Biomolecules 13 00956 g003
Figure 4. Structure of IBV 3CLPro inhibitors identified as potential covalent inhibitors of SARS-CoV-2 MPro.
Figure 4. Structure of IBV 3CLPro inhibitors identified as potential covalent inhibitors of SARS-CoV-2 MPro.
Biomolecules 13 00956 g004
Figure 5. Structure of the five IBV 3CLPro inhibitors identified as non-covalent inhibitors of SARS-CoV-2 MPro.
Figure 5. Structure of the five IBV 3CLPro inhibitors identified as non-covalent inhibitors of SARS-CoV-2 MPro.
Biomolecules 13 00956 g005
Figure 6. Calpha RMSD obtained by molecular dynamics simulations during 5 ns for all non-covalent complexes (in blue) and H-bonds proportion during the MD simulations calculated as the number of H-bonds per frames divided by the maximum of H-bond number obtained in a given frame (in orange).
Figure 6. Calpha RMSD obtained by molecular dynamics simulations during 5 ns for all non-covalent complexes (in blue) and H-bonds proportion during the MD simulations calculated as the number of H-bonds per frames divided by the maximum of H-bond number obtained in a given frame (in orange).
Biomolecules 13 00956 g006
Figure 7. Superimposition of conformational samples collected every 1000 ps for non-covalent inhibitor complexes during molecular dynamics simulations.
Figure 7. Superimposition of conformational samples collected every 1000 ps for non-covalent inhibitor complexes during molecular dynamics simulations.
Biomolecules 13 00956 g007
Table 1. 2D and 3D binding modes of the four potential covalent inhibitors within the SARS-CoV-2 MPro active site (Cys145 is indicated in magenta in the 3D representation).
Table 1. 2D and 3D binding modes of the four potential covalent inhibitors within the SARS-CoV-2 MPro active site (Cys145 is indicated in magenta in the 3D representation).
Compound CID—
H-Bonds and Interactions with Amino Acids
Binding Modes
Biomolecules 13 00956 i001Biomolecules 13 00956 i002
Biomolecules 13 00956 i003Biomolecules 13 00956 i004
Biomolecules 13 00956 i005Biomolecules 13 00956 i006
Biomolecules 13 00956 i007Biomolecules 13 00956 i008
Table 2. Binding modes within the SARS-CoV-2 MPro active site of compounds ranked in the top 5, shown as the results of docking experiments for 388 out of the 405 active compounds on the avian IBV protease.
Table 2. Binding modes within the SARS-CoV-2 MPro active site of compounds ranked in the top 5, shown as the results of docking experiments for 388 out of the 405 active compounds on the avian IBV protease.
Compound CID—
H-Bonds and Interactions with Amino Acids
Binding Modes
and Docking Score (kcal/mol)
Biomolecules 13 00956 i009Biomolecules 13 00956 i010
Biomolecules 13 00956 i011Biomolecules 13 00956 i012
Biomolecules 13 00956 i013Biomolecules 13 00956 i014
Biomolecules 13 00956 i015Biomolecules 13 00956 i016
Biomolecules 13 00956 i017Biomolecules 13 00956 i018
Table 3. Average ligands RMSD with hydrogen atoms and identification of important amino acids bound to the ligands via hydrogen bonds.
Table 3. Average ligands RMSD with hydrogen atoms and identification of important amino acids bound to the ligands via hydrogen bonds.
Compounds CIDLigand RMSD (Å)Residues Interacting via H-Bonds during the MD Simulations
16323601.602Asn142, Gly143, Glu166
6454921.087Met49, Ser144, Gln189
21935521.851His41, Met49, Asn51, Glu166, Asp187, Thr190, Gln192, Gln189
45861093.344Gly143, Ser144, His163, Glu166, Gln189,
6544981.717Met49, Glu166, Gln189,
Table 4. ΔGBinding (kcal/mol) of complexes collected each 1000 ps (residues involved in hydrogen bonds are indicated in brackets).
Table 4. ΔGBinding (kcal/mol) of complexes collected each 1000 ps (residues involved in hydrogen bonds are indicated in brackets).
Compounds CID163236064549221935524586109654498
ΔGBinding t = 0−9.16
(Gly143)
−9.21
(Gln189, Met49, Cys145)
−10.23
(Glu166, Thr190, Gln192)
−9.42
(Gly143, Ser144, His163, Glu166)
−8.52
(Met49, Gln189)
ΔGBinding t = 1000−9.34
(Gly143, Glu166)
−9.24
(Gln189)
−9.95
(His41, Glu166, Asp187, Gln189, Thr190, Gln192)
−9.91
(Gly143, Ser144, His163, Gln189)
−9.08
(Met49, Glu166, Gln189)
ΔGBinding t = 2000−9.47
(Gly143)
−9.08
(Gln189)
−10.28
(His41, Glu166, Asp187, Gln189, Thr190, Gln192)
−9.89
(Gly143, Ser144, His163, Gln189)
−9.17
(Cys145)
ΔGBinding t = 3000−9.37
(Asn142)
−9.28
(Met49)
−10.33
(His41, Glu166, Gln189, Thr190, Gln192)
−10.20
(Gly143, Ser144, His163, Gln189)
−8.70
(Glu166, Gln189)
ΔGBinding t = 4000−9.16
(Cys145)
−9.09
(Cys145)
−10.63
(Met49
Glu166, Thr190, Gln192)
−9.79
(Gly143
Ser144, Cys145, His163)
−9.05
(Met49, Glu166, Gln189)
ΔGBinding t = 5000−9.12
(Glu166)
−9.22
(Ser144)
−10.52
(Met49, Asn51,
Glu166, Thr190, Gln192)
−9.82
(Gly143
Ser144, His163, Gln189)
−9.36
(Glu166, Gln189)
Table 5. Predicted ADMET properties for identified potential MPro inhibitors.
Table 5. Predicted ADMET properties for identified potential MPro inhibitors.
PropertyUnitCID 843322CID 1154427CID 4868361CID 4961646CID 1632360CID 645492CID 2193552CID 4586109CID 654498
Molecular weightg/mol255.10356.80303.74292.72429.51499.58476.52474.59485.56
LogP 2.552.922.322.273.703.603.924.893.22
H-bond donors 112031311
H-bond acceptors 232435645
Rotatable bonds 36741111111011
PSAÅ252.8962.4061.4458.9587.30121.50113.96124.04121.50
Caco2 permeabilitylog Papp in 10–6 cm/s1.3241.1331.3071.3150.7930.7390.6261.040.83
Intestinal absorption (human)% Absorbed91.93796.6491.27996.93793.66394.39370.34493.42193.919
VDss (human)log L/kg0.011−0.076−0.021−0.21−0.240.166−1.842−0.7140.117
Fraction unbound (human)Fu0.3190.03800.340.0240.08600.2780.072
BBB permeabilitylog BB0.079−0.0770.2440.122−0.662−0.429−1.359−0.996−0.435
CNS permeabilitylog PS−2.781−2.35−2.088−2.877−2.191−2.239−2.896−2.257−2.308
CYP2D6 inhibitorYes/NoNoNoNoNoNoNoNoNoNo
CYP3A4 inhibitorYes/NoNoNoNoNoYesYesNoNoYes
Total Clearancelog ml/min/kg−0.0540.086−0.0690.1890.2410.035−0.0450.6240.091
HepatotoxicityYes/No (probability %)No (61)No (71)No (54)Yes (51)No (59)No (58)No (75)No (57)No (58)
CarcinogenicityYes/No (probability %)No (59)Yes (52)Yes (66)Yes (66)Yes (56)Yes (52)No (69)No (53)Yes (52)
ImmunotoxicityYes/No (probability %)No (99)Yes (87)No (99)No (98)No (99)No (99)No (99)No (99)No (99)
MutagenicityYes/No (probability %)Yes (55)Yes (52)No (55)No (57)No (52)Yes (55)No (79)No (65)Yes (56)
CytotoxicityYes/No (probability %)No (76)Yes (57)No (72)No (68)No (77)No (61)No (60)No (66)No (63)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Soulère, L.; Barbier, T.; Queneau, Y. In Silico Identification of Potential Inhibitors of the SARS-CoV-2 Main Protease among a PubChem Database of Avian Infectious Bronchitis Virus 3CLPro Inhibitors. Biomolecules 2023, 13, 956. https://doi.org/10.3390/biom13060956

AMA Style

Soulère L, Barbier T, Queneau Y. In Silico Identification of Potential Inhibitors of the SARS-CoV-2 Main Protease among a PubChem Database of Avian Infectious Bronchitis Virus 3CLPro Inhibitors. Biomolecules. 2023; 13(6):956. https://doi.org/10.3390/biom13060956

Chicago/Turabian Style

Soulère, Laurent, Thibaut Barbier, and Yves Queneau. 2023. "In Silico Identification of Potential Inhibitors of the SARS-CoV-2 Main Protease among a PubChem Database of Avian Infectious Bronchitis Virus 3CLPro Inhibitors" Biomolecules 13, no. 6: 956. https://doi.org/10.3390/biom13060956

APA Style

Soulère, L., Barbier, T., & Queneau, Y. (2023). In Silico Identification of Potential Inhibitors of the SARS-CoV-2 Main Protease among a PubChem Database of Avian Infectious Bronchitis Virus 3CLPro Inhibitors. Biomolecules, 13(6), 956. https://doi.org/10.3390/biom13060956

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop