Abstract
In December 2019, the Coronavirus disease-2019 (COVID-19) virus emerged in Wuhan, China. The first resolved COVID-19 crystal structure (main protease) has been developed and various repurposing activities are in process. In this study, a knowledge gap in relation to COVID-19, with the previously known fatal Coronavirus (CoV) epidemics, Severe acute respiratory syndrome (SARS) and Middle East respiratory syndrome (MERS) CoVs, is covered by the investigation of sequence statistics, molecular modelling, virtual screening, docking, and sequence comparison statistics of the COVID-19 main protease. COVID-19 main protease Mpro formed a sequence similarity group with SARS CoV that was distant from MERS CoV. The identity % was 96 and 51 for COVID-19/SARS and COVID-19/MERS CoV sequence comparisons, respectively. We used molecular docking and a molecular interaction approach to identify small-molecules that bind to the isolated Viral S-protein at its host receptor region. These molecules have good solubility and pharmacodynamics properties. They also obey Lipinski’s rule, which makes them promising compounds to pursue further biochemical and cell-based assays to explore their potential for use against COVID-19. We hypothesize that the top score identified molecules that may be used to limit viral recognition of host cells and/or disrupt host-virus interactions. A ranked list of selected compounds is given that can be tested experimentally.
1. Introduction
On the penultimate day of 2019, health officials at the Wuhan Municipal Health Commission (Hubei Province, Wuhan, China) reported an occurrence of concentrated pneumonia in the city of Wuhan. Shortly after reporting the outbreak, the Chinese Centre for Disease Control (Chinese CDC) and local Chinese health workers determined that the cause of the outbreak was a novel coronavirus, i.e., nCov-2019 [,,]. On 11 March 2020, WHO declared it as a pandemic. The symptoms of Coronavirus disease-2019 (COVID-19) infection are mild respiratory symptoms and a fever that occurs on an average of 5–6 days after infection (mean incubation period 5–6 days, range 1–14 days) [,,]. The current treatment options are use of antivirals and antimalarials. The first available crystal structure of COVID-19 proteins was Mpro, which was published in February 2020 (Protien data bank (PDB ID) 6lu7). In this study, the first virtual screening study against the first known COVID-19 was performed. The obtained results will help in identifying some potential inhibitors to combat the recent dangerous COVID-19. We propose to use food grade dyes that could acts as a treatment option in case of COVID-19 patients. We have used computational methods, e.g., molecular docking, to evaluate the activity as well as the interactions.
2. Materials and Methods
2.1. Retrieval of Mpro Sequences
The NCBI GenBank or GISAID (Available Online: https://www.gisaid.org/ (accessed on 10 October 2020)) were used to obtain the COVID-19 sequences. SARS Coronavirus (CoV) and MERS CoV sequences were obtained from the GenBank [,].
2.2. Sequence Alignment and Multiple Sequence Comparisons
Pairwise and multiple sequence comparisons of Mpro were done using CLC genomics software (Qiagen Inc., USA). The sequence comparison matrix was generated, including the number of gaps, number of different residues, and identity %.
Sequences alignments of Mpro were from SARS CoV, MERS CoV, and COVID-19.
|
| Identities 294/306 (96%) |
| SARS Mpro 2AMD SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIR 65 |
| COVID-19 Mpro YP_009725301 ...................................................................................................V........................S.................................... 60 |
| SARS Mpro 2AMD KSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQTFSVLACYNG 125 |
| COVID-19 Mpro YP_009725301........N............................................................................V.K.........A........................................................... 120 |
| SARS Mpro 2AMD SPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGK 185 |
| COVID-19 Mpro YP_009725301..............................................................F.................................................................................................N 180 |
| SARS Mpro 2AMD FYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYE 245 |
| COVID-19 Mpro YP_009725301...........................................................................V....................................................................................... 240 |
| SARS Mpro 2AMD PLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGSTILEDEFTPFDVVRQC 305 |
| COVID-19 Mpro YP_009725301..............................................................S.................................................AL............................................. 300 |
| SARS Mpro 2AMD SGVTFQ 311 |
| COVID-19 Mpro YP_009725301 ...... 306 |
|
| Identities 157/310 (51%) |
| MERS Mpro 5C3N SGLVKMSHPSGDVEACMVQVTCGSMTLNGLWLDNTVWCPRHVMCPADQLSDPNYDALLIS 60 |
| COVID-19 Mpro YP_009725301.........FR........AF...........K........G...............TT..............DV.......Y............I.......TSEDMLN.........ED..........R 60 |
| MERS Mpro 5C3N MTNHSFSVQKHIGAPANLRVVGHAMQGTLLKLTVDVANPSTPAYTFTTVKPGAAFSVLAC 120 |
| COVID-19 Mpro YP_009725301 KS.......N......L.......---AGNVQ........I.......S.......NCV........K.......T........K.......K......K......VRIQ.......QT..... 117 |
| MERS Mpro 5C3N YNGRPTGTFTVVMRPNYTIKGSFLCGSCGSVGYTKEGSVINFCYMHQMELANGTHTGSAF 180 |
| COVID-19 Mpro YP_009725301...........S.......S........VYQCA...........F............N............FNIDYDCVS..........H........PT......V.......A.......TDL 177 |
| MERS Mpro 5C3N DGTMYGAFMDKQVHQVQLTDKYCSVNVVAWLYAAILNGCAWFVKPNRTSVVSFNEWALAN 240 |
| COVID-19 Mpro YP_009725301 E.......NF........P.......V....R....TA....AAG.....TTIT......L............VI.....DR.....LNRFT....TLND.....LV....MKY 237 |
| MERS Mpro 5C3N QFTEFVGTQSVDM---LAVKTGVAIEQLLYAIQQLY-TGFQGKQILGSTMLEDEFTPEDV 296 |
| COVID-19 Mpro YP_009725301 NY-.....PLTQDH......ILGP.....SAQ......I.....VLDMCASLKE.....LQN.....MN.....RT...........AL..............F.. 296 |
| MERS Mpro 5C3N NMQIMGVVMQ 306 |
| COVID-19 mpro YP_009725301 VR.CS..TF. 306 |
2.3. Docking
The structure of COVID-19 virus Mpro in complex with N3 provides a model for identifying lead inhibitors to target COVID-19 virus Mpro through in silico screening. We used a molecular docking approach to predict the binding energy and inhibition constants of various food grade dyes under study [,]. We docked our ligands into the main protease of COVID-19 and screened them for their activity against COVID-19.
2.4. Predictive ADME Studies
Predictive Absorption, Distribution, Metabolism, Excretion (ADME) studies were performed by using SWISS tools*, an online tool that requires the structure or the SMILES for calculating the parameters.
The test compounds were built within the window by using the drawing tools of the online server, otherwise SMILES could be directly copied instead of drawing the structures []. To assure a drug-like pharmacokinetic profile in rational drug designing, predictive ADME calculations are done on the basis of Lipinski’s rule of five.
2.5. Toxicity
The toxicity of the molecules were predicted by using Toxtree [], a free offline tool available for the prediction of toxicity. It requires the SMILES format of structures to calculate the toxicity.
The SMILES format of the compounds were pasted in the chemical identifier bar, and then their toxicity was estimated on the basis of creamer rules. The compounds were categorized into three classes, i.e., Low (Class I), Intermediate (Class II), and High (Class III).
3. Results and Discussions
3.1. Docking
The PDB ID of protein used was 6LU7, which was retrieved from the protein data bank. The validation of the model that was performed redocked the internal ligand/inhibitor into the active site of the macromolecule. The individual ligands were then prepared in Auto Dock 4.2.6 software, as per standard protocols, and docking was carried out. The results are listed below Table 1 and Figure 1.
Table 1.
List of ligands with binding energy and inhibition constants.

Figure 1.
Docking interactions: (a) Orange B, (b) Cochineal Red A, (c) Erythrosine, (d) Laccaic acid A, (e) Laccaic acid B.
3.2. Predictive ADME Studies
Analysis of all the compounds was done for the physicochemically and pharmacokinetically important descriptors using SWISS tools. In order to predict the drug-alike properties of molecules, these major descriptors were required:
- ⮚
- Molecular weight (mol MW) (150–650)
- ⮚
- Octanol/water partition coefficient (Log Po/w) (−2–6.5)
- ⮚
- Hydrogen bond donor (≤5)
- ⮚
- Hydrogen bond acceptor (≤10)
- ⮚
- Human oral absorption percentage (≥80% is high, ≤25% is poor)
The entire set of compounds showed appreciable values for the properties analyzed, as well as exhibited drug-like aspects based on Lipinski’s rule of five. The results are summarized in Table 2.
Table 2.
SWISS ADME for compounds DG01-15.
3.3. Toxicity
Toxicity prediction of the compounds is necessary before further development. The toxicity is predicted by using Craemer rules. It categorizes the compounds into the classes, i.e., Low (Class I), Intermediate (Class II), and High (Class III), depending on its toxicity index. The categories are based on different thresholds of toxicological concern, as follows:
- ⮚
- Class I—1800 (30 µg/kg bw/d)
- ⮚
- Class II—540 (9 µg/kg bw/d)
- ⮚
- Class III—90 (1.5 µg/kg bw/d)
The results are summarized in Table 3.
Table 3.
Toxicity of the compounds DG01-15.
From the ADME studies, it was found that only a few compounds followed all the parameters for being a suitable drug candidate, but all the other compounds violated the parameters by a few factors, which, on further modifications, can be modified to promising drug candidates. The toxicity studies suggest that the therapeutic range of some compounds is very narrow, whereas some have wide therapeutic ranges, and these can be modified as per the purpose. The modifications required can be taken as a future perspective to develop these compounds as promising drug candidates.
4. Conclusions
Researchers are now focusing mainly on synthetic protease inhibitors, but natural compounds have always been found to be better than their synthetic counterparts. As natural chemists, we tried to focus on untouched natural drugs that could provide better drug therapies in the future. As per our study, the sequence identity % was 96 and 51 for COVID-19/SARS and COVID-19/MERS CoV, respectively. Docking studies revealed that Orange B (−10.35 kcal/mol) and Cochineal Red A (−9.52 kcal/mol) had the best binding affinity with the receptor. They had low GI absorption but showed no BLOOD BRAIN BARRIER (BBB) permeation activity. They obeyed the Lipinski rule and bioavailability score was 0.11 and showed drug-like aspects. Cochineal Red A was classified under Low Class I toxicity. Erythrosine, Laccaic Acid A, Laccaic Acid B, Azorubine, and Quinoline yellow also had a comparable binding affinity. These two molecules/compounds proved to be a good inhibitor against the COVID-19 main protease. Further MD simulation studies can be performed to mimic the interaction of the molecules with the receptor. These molecules can further be studied for their in vitro and in vivo activity. This work may be able to pave a new path for the development of potential drugs using food grade dyes and for the selection of compounds, as well as designing new scaffolds or novel combinatorial libraries of analogs/derivatives; however, before coming to any outcome of an in-silico study, proper in-vitro and in-vivo research works should be performed.
Author Contributions
Authors contributed equally: Conceptualization, P.D. and M.G.; Software P.D.; Supervision M.G. and A.M.; Manuscript writing P.D., M.G., A.M. and J.A.S. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
The authors would like to thank the Head, Department of Pharmaceutical Sciences and Technology, BIT, Mesra for providing the research facilities for performing the current study.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Xu, X.; Chen, P.; Wang, J.; Feng, J.; Zhou, H.; Li, X.; Zhong, W.; Hao, P. Evolution of the novel coronavirus from the ongoing Wuhan outbreak and modeling of its spike protein for risk of human transmission. Sci. China Life Sci. 2020, 63, 457–460. [Google Scholar] [CrossRef] [PubMed]
- Peiris, J.S.M.; Lai, S.T.; Poon, L.L.M.; Guan, Y.; Yam, L.Y.C.; Lim, W.; Nicholls, J.; Yee, W.K.S.; Yan, W.W.; Cheung, M.T.; et al. Coronavirus as a possible cause of severe acute respiratory syndrome. Lancet 2003, 361, 1319–1325. [Google Scholar] [CrossRef]
- Li, W.; Wong, S.-K.; Li, F.; Kuhn, J.H.; Huang, I.-C.; Choe, H.; Farzan, M. Animal Origins of the Severe Acute Respiratory Syndrome Coronavirus: Insight from ACE2-S-Protein Interactions. J. Virol. 2006, 80, 4211–4219. [Google Scholar] [CrossRef] [PubMed]
- Zaki, A.; Van Boheemen, S.; Bestebroer, T.; Osterhaus, A.; Fouchier, R. Isolation of a Novel Coronavirus from a Man with Pneumonia in Saudi Arabia. N. Engl. J. Med. 2012, 367, 1814–1820. [Google Scholar] [CrossRef] [PubMed]
- Desenclos, J.C.; Van der Werf, S.; Bonmarin, I.; Levy-Bruhl, D.; Yazdanpanah, Y.; Hoen, B. Introduction of SARS in France, March–April, 2003. Emerg. Infect. Dis. 2004, 10, 195. [Google Scholar] [CrossRef] [PubMed]
- Guarner, J. Three Emerging Coronaviruses in Two Decades. Am. J. Clin. Pathol. 2020, 153, 420–421. [Google Scholar] [CrossRef] [PubMed]
- Wan, Y.; Shang, J.; Graham, R.; Baric, R.S.; Li, F. Receptor Recognition by the Novel Coronavirus from Wuhan: An Analysis Based on Decade-Long Structural Studies of SARS Coronavirus. J. Virol. 2020, 94, e00127-20. [Google Scholar] [CrossRef] [PubMed]
- Hilgenfeld, R. From SARS to MERS: Crystallographic studies on coronaviral pro- teases enable antiviral drug design. FEBS J. 2014, 281, 4085–4096. [Google Scholar] [CrossRef] [PubMed]
- Kandeel, M.; Altaher, A.A.; Alnazawi, M. Molecular Dynamics and Inhibition of MERS CoV Papain-like Protease by Small Molecule Imidazole and Aminopurine Derivatives. Lett. Drug Des. Discov. 2019, 16, 584–591. [Google Scholar] [CrossRef]
- Li, Y.-H.; Hu, C.-Y.; Wu, N.-P.; Yao, H.; Li, L. Molecular Characteristics, Functions, and Related Pathogenicity of MERS-CoV Proteins. Engineering 2019, 5, 940–947. [Google Scholar] [CrossRef] [PubMed]
- Available online: http://www.swissadme.ch (accessed on 15 October 2020).
- Patlewicz, G.; Jeliazkova, N.; Safford, R.J.; Worth, A.P.; Aleksiev, B. An evaluation of the implementation of the Cramer classification scheme in the Toxtree software. SAR QSAR Environ. Res. 2008, 19, 495–524. [Google Scholar] [CrossRef] [PubMed]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).