Abstract
Cancer is a life-threatening disease and is the second leading cause of death worldwide. Although many drugs are available for the treatment of cancer, survival outcomes are very low. Hence, rapid development of newer anticancer agents is a prime focus of the medicinal chemistry community. Since the recent past, computational methods have been extensively employed for accelerating the drug discovery process. In view of this, in the present study we performed 2D-QSAR (Quantitative Structure-Activity Relationship) analysis of a series of compounds reported with potential anticancer activity against breast cancer cell line MCF7 using QSARINS software. The best four models exhibited a r2 value of 0.99. From the generated QSAR equations, a series of pyrimidine-coumarin-triazole conjugates were designed and their MCF7 cell inhibitory activities were predicted using the QSAR equations. Furthermore, molecular docking studies were carried out for the designed compounds using AutoDock Vina against dihydrofolate reductase (DHFR), colchicine and vinblastine binding sites of tubulin, the key enzyme targets in breast cancer. The most active compounds identified through these computational studies will be useful for synthesizing and testing them as prospective novel anti-breast cancer agents.
1. Introduction
Cancer is the second leading cause of death worldwide [1]. For decades, conventional cytotoxic chemotherapy has been a key component of advanced cancer treatment in the cancer therapeutic arsenal [2]. However, only a minor improvement in survival rates has been achieved. Recent anticancer drug development is heavily reliant on drug targets, such as proteins, enzymes and receptors, and mechanism-based drug discovery would considerably accelerate the process. [3,4,5]. Various targets reported for anti-cancer activities include ribonucleotide reductase [6], estrogen receptors (ERs) [7,8], aromatase enzymes [9], type I and type II topoisomerases [10], microtubules [11] and dihydrofolate reductase, among others. Although many targets are well-known and validated, still they offer various opportunities.
Dihydrofolate reductase (DHFR) and tubulin proteins of microtubules are of great interest to medicinal chemists, since the inhibition of these sites is an important action of the marketed anticancer drugs methotrexate and vinca alkaloids. The inhibition of DHFR is imparted by compounds with antibacterial [12,13,14], antimalarial [15,16], antifungal [17] and anticancer effects [18,19]. Further, DHFR is an excellent template for enzyme selectivity and antiproliferative effects of antifolates for cancer chemotherapy [20]. DHFR enzyme converts dihydrofolate to tetrahydrofolate by means of NADPH in microbial and eukaryotic cells [21]. Accordingly, it is tangled in the combination of crude material for cell expansion, in both prokaryotic and eukaryotic cells. Trimethoprim (TMP), (2,4-diamino-5-(3′,4′,5′-trimethoxybenzyl) pyrimidine) is the renowned dihydrofolate reductase inhibitor used in urinary tract infections [22,23,24]. Tubulin is an important protein involved in cell division. It contains the α- and β- families, which polymerize to produce microtubules during cell division. These subunits are highly conserved and ubiquitous in eukaryotic cells. Microtubules are manipulated into separate daughter chromosomes during mitosis by rapid construction and disassembly. They are crucial for cellular replication. Furthermore, microtubules support cell morphology and material transport. The interference of microtubules causes cell death by apoptosis. Hence, the mitotic microtubules are novel platforms for cancer chemotherapy. Anticancer drugs halt cell division during mitosis, slowing cell proliferation by inhibiting or promoting tubulin polymerization. Vinca alkaloids such as vincristine and vinblastine, in addition to colchicine and paclitaxel, are well-known tubulin-interactive anticancer drugs [25,26,27,28].
The major challenges with the drugs acting on single targets are drug resistance and innumerable side effects [29,30]. Molecular hybridization technique maces the active pharmacophores with a linker that could simultaneously address more than one sole target. This is a very useful approach for the design of novel drugs against a complex disease such as cancer. In drug development, one of the rational and successful methods is the quantitative structure-activity relationship, which is a crucial step in the development and optimization of lead compounds, and consequently to improving their biological activity. Natural products bind with biomolecular drug targets more readily, as they are the metabolites of living organisms. Thus, they serve as an ideal resource for drug development. It is obvious to choose coumarin, a widely distributed secondary metabolite in plant kingdom and pyrimidine, a common component of nucleic acids, for the present study. Previously, Mohit Sandhuja et al. reported the anticancer activity for uracil–coumarin-based bifunctional molecular hybrids connected with 1,2,3-triazole moiety [31]. By considering the above facts and the report by Sandhuja et al., in the present investigation we utilized QSAR and conducted a detailed QSAR study using “QSARINS” software [32,33,34] for accelerating the drug discovery process for the identification of novel lead compounds with anti-breast cancer activity.
2. Results
2.1. D-QSAR
2.1.1. Development of Model and Descriptors
Mohit Sandhuja et al. synthesized a series of uracil–coumarin conjugates and reported their anticancer activity [31]. This series was selected for carrying out 2D-QSAR studies in the present study. Table 1 indicates the compounds synthesized by Mohit Sandhuja. The anticancer activity was reported in IC50 by Mohit Sandhuja. They were converted to pIC50 for carrying out 2D-QSAR studies and included in Table 1.
Table 1.
GI50 (mM) values of synthesized hybrids.
2.1.2. Generation of 2D-QSAR Models
For practical use of the QSAR model, robustness (cross validated performance) and predictability (external predictive capacity) are the parameters that determine the superiority of the QSAR model. Four models were selected for QSAR study.
Model 1
pIC50 = −4.3799 + 0.8065(MAXDP) + 14.2488(SIC0) + 35.9491(JGI7)
ntr = 18, npred = 10, R2 = 0.9700, R2adj = 0.9635, R2−R2 adj = 0.0064, LOF = 0.0060, RMSEtr = 0.0515, MAEtr = 0.0422, RSStr = 0.0478, CCCtr = 0.9848, s = 0.0584, F = 150.7739, Q2 LOO = 0.9495, Q2LMO = 0.9449, R2Yscr = 0.1763, Q2Yscr = −0.3864, RMSEcv = 0.0668, MAEcv = 0.0549, PRESScv = 0.0804, CCCcv = 0.6865, R2ext = 0.0983, MAEext = 0.0856, PRESSext = 0.0966, RMSEext = 0.0983, CCCext = 0.6865, Q2 F1 =0.4530, Q2 F2 =0.4375, Q3 F3 = 0.8908 (Equation (1)).
Model 2
pIC50 = 17.6885 – 0.0831 (AATS7s) − 3.6745 (MATS2c) − 8.2998 (SpMin3_Bhi) + 0.2438(MDEC-33)
Ntr = 20, npred = 6, R2 = 0. 0.9883, R2 adj = 0.9851, R2−R2 adj = 0.0031 LOF = 0.0016, RMSEtr = 0.0236, MAEtr = 0.0192, RSStr = 0.0112, CCCtr = 0.9941, s = 0.0273, F = 315.6856, Q2 LOO = 0.9796, Q2LMO = 0.9740, R2Yscr = 0.2090, Q2Yscr = −0.4425, RMSEcv = 0.0312, MAEcv = 0.0256, PRESScv = 0.0194, CCCcv = 0.9897, R2ext = 0.9726, MAEext = 0.0448, PRESSext = 0.0121, RMSEext = 0.0450, CCCext = 0.9754, Q2 F1 = 0.9698, Q2 F2 = 0.432, Q3 F3 = 0.9575 (Equation (2)).
Model 3
pIC50 = 18.2893 − 3.1558 (MATS2c) − 8.6164 (SpMin3_Bhi) − 0.3147 (ETA_EtaP_F) + 0.2471 (MDEC-33)
ntr = 20, npred = 6, R2 = 0.9875, R2adj = 0.9842, R2–R2 adj = 0.0033, LOF = 0.0016.
RMSE tr = 0.0244 MAE tr = 0.0198 RSS tr = 0.0119 CCC tr = 0.9937s = 0.0281F = 296.9651 Q2loo = 0.9789 RMSE cv = 0.0316 MAE cv 0.0260 PRESS cv = 0.0200 CCC cv = 0.9894 Q2LMO = 0.9738 R2Yscr = 0.2095 Q2Yscr = −0.4288 RMSE ext = 0.0527 MAE ext = 0.0512 PRESS ext = 0.0167 R2ext = 0.9665 Q2-F1 = 0.9586 Q2-F2 = 0.9220 Q2-F3 = 0.9416CCC ext: 0.9677 (Equation (3)).
Model 4
MATS2c and ETA_EtaP_F were included in equation 3 along with MATS2c, SpMin3_Bhi MDEC-33. MATS2c and ETA_EtaP_F and contributed negatively to the activity.
pIC50 = 20.1355 − 2.8197 (MATS2c) − 9.3329 (SpMin3_Bhi) − 0.3066 (IC1) + 0.2710 (MDEC-33)
R2 = 0.9874, R2adj = 0.9840, R2-R2adj = 0.0034, LOF = 0.0017, RMSE tr = 0.0245, MAEtr = 0.0199, RSS tr = 0.0120, CCC tr = 0.9937, s = 0.0283, F = 293.8200, Q2loo = 0.9777, R2-Q2loo: 0.0097, RMSE cv = 0.0326, MAE cv = 0.0270, PRESS cv = 0.0212, CCC cv = 0.9888.
Q2LMO = 0.9733, R2Yscr = 0.2098, Q2Yscr = −0.4428, RMSE ext = 0.0488, MAE ext = 0.0433 PRESS ext = 0.0143, R2ext = 0.9703, Q2-F1 = 0.9644, Q2-F2 = 0.9330, Q2-F3 = 0.9499 CCC ext = 0.9720 (Equation (4)).
IC1 parameter is newly included in model 4 and it had a negative contribution to the activity.
2.1.3. Validation of 2D-QSAR Models
The correlation matrix for the best model (model 2) obtained is provided in Table 2.
Table 2.
Descriptor correlation matrix for the best model.
A scatter plot of data set compounds obtained from experimental values and the final model equation is presented in Figure 1. Figure 2 represents the LMO scatter plot. The Y-scramble plot of Kxy versus R2 Yscr and Q2 Yscr is shown in Figure 3, which shows that correlation coefficients of the final model are much higher than those after endpoint scrambling, and a broken relationship can be evidenced between structure and responses.
pIC50 = 18.2893 − 3.1558 (MATS2c) − 8.6164 (SpMin3_Bhi) − 0.3147 (ETA_EtaP_F) + 0.2471 (MDEC-33)
Figure 1.
Scatter plot of data set compounds obtained from experimental values and the final model equation.
Figure 2.
The LMO scatter plot (plot of Kxy vs. Q2 LMO).
Figure 3.
Y-scramble plot (plot of Kxy vs. R2 and Q2 LOO from Y-scrambling procedure).
The applicability domain of the model was explained by William’s plot, standardized residuals versus leverage values shown in Figure 4, and it illustrates the prediction and expression. William’s plot indicated that all the molecules are located in the applicability domain of the model, with leverage values lower than the warning h* of 0.750.
Figure 4.
William’s plot of the best model. The dashed lines are the cut-off 3σ and the warning value of HAT (h*, 0.750).
From the four models, model 2 was found to be the best equation. The predicted activities calculated from the best model were found to be closest to observed activities (Table 3). Twelve new compounds were designed using ChemSketch software. The novelty of the compounds was confirmed by Scifinder search (Table 4).
Table 3.
pKi values of the original data set predicted by the best model equation.
Table 4.
Structures of the predicted compounds a–l.
Physicochemical properties of designed compounds were predicted using the PaDEL descriptor. Their anticancer activity against MCF7 cells was predicted using the generated QSAR models.
The novel designed compounds exhibited very high inhibitory activity compared to the most active compound of the original data set. The most active compound in the original data set had an IC50 value of 1.55. Its pIC50 value was found to be 5.809. All the predicted compounds exhibited pIC50 values of more than 9.0 except one compound which had a pIC50 value of 8.007. The predicted activities against MCF7 cells using the generated QSAR equation for the compounds g, h, i and j were found to be 7.9, 7.80, 8.59 and 8.33, respectively (Table 5).
Table 5.
pKi values of designed compounds predicted by the best model equation.
2.2. Docking Studies
A molecular docking study was performed in order to study possible interactions between the protein complex and the ligand for the designed compounds. In the present manuscript, we attempted to study the interactions of novel pyrimidine-coumarin-triazole hybrids with DHFR. In order to study the binding efficacy of all the designed compounds a–l, molecular docking studies were performed in the binding pockets of S. aureus dihydrofolate reductase [PDB ID:3SRQ]. All the designed compounds exhibited better binding scores ranging from −6.9 to −9.3 kcal/mol. Binding energies of compounds which target colchicine binding site (PDB ID 1SA0) and vinblastine binding site of tubulin (PDB ID 5J2T) and Staphylococcus aureus dihydrofolate reductase [PDB ID: 3SRQ] are presented in Table 6. Docking interactions of the best active compounds are shown in Figure 5, Figure 6, Figure 7, Figure 8 and Figure 9 whereas Figure 10 and Figure 11 represents the binding interactions of designed compounds b and h on the vinblastine binding site of tubulin and DHFR of human proteins.
Table 6.
Binding energies of compounds targeting colchicine binding site (PDB ID 1SA0) and vinblastine binding site of tubulin (PDB ID 5J2T) and S. aureus dihydrofolate reductase [PDB ID: 3SRQ].
Figure 5.
Docking interactions of designed compound g with DHFR enzyme.
Figure 6.
Docking interactions of designed compound h with DHFR enzyme.
Figure 7.
Docking interactions of designed compound i with DHFR enzyme.
Figure 8.
Docking interactions of designed compound k with colchicine binding site of tubulin enzyme.
Figure 9.
Docking interactions of designed compound b with vinblastine binding site of tubulin enzyme.
Figure 10.
Docking interactions of designed compound b with vinblastine binding site of tubulin enzyme of human.
Figure 11.
Docking interactions of the designed compound h with DHFR enzyme of human.
2.3. Effects of Predicted Compounds on Nuclear Signaling Pathways were Predicted Using ProTox-II
The compounds were found to be active against three nuclear signaling pathways, namely aromatase, estrogen receptor alpha and estrogen receptor ligand-binding domain (Table 7).
Table 7.
Effect of predicted compounds on nuclear signaling pathways predicted using ProTox-II.
2.4. In Silico Studies Using SwissADME Pathways
The prediction of key physicochemical, pharmacokinetic, drug-like and related parameters for one or multiple molecules can be performed by SwissADME. Thus, the SwissADME studies for the designed compounds were carried out (Table 8).
Table 8.
SwissADME results of the predicted compounds.
3. Discussion
In the present study, a series of compounds reported by Mohit Sandhuja et al. were taken for QSAR analysis. In QSAR model 1 MAXDP is a dimensionless maximal electrotopological positive variation, which correlates the molecule’s electrophilicity and is a measure of the electronic distribution in the topological graph. MAXDP contributes positively to the activity. In previous research, also MAXDP contributed positively to the anticancer activity of coumarin analogs [35]. Information indices are best associated with cytotoxic activity [36]. A positive contribution to the activity was observed for structural information content index (neighborhood symmetry of 0-order) contributes JGI7. The topological charge parameter also contributes positively to the activity. Since two outliers (A2 and A5) were observed in William’s plot, they were removed and QSAR models were again generated. In model 2, AATS7s, MATS2c and SpMin3_Bhi contributed negatively to anticancer activity while MDEC-33 contributed positively. Similar values were found for Q2 F1, Q2F2 and Q2F3, along with elevated CCC (concordance correlation coefficient) parameter values (Figure 1). The results clearly indicate that the best model obtained was not by chance and truly there is a relationship between structures of pyrimidine-coumarin-triazole based trifunctional molecular hybrid analogs with corresponding MCF 7 cell line inhibitory activity.
In model 3, no outliers in William’s plot were observed. The scatter plot of the experimental versus the calculated MCF7 cell line inhibitory activities of pyrimidine-coumarin-triazole based trifunctional molecular hybrids is shown in Figure 1; it shows that predicted values are similar to corresponding experimental values. Figure 2 describes the correlation between the resulting Kxy (the inter-correlation among descriptors and response) versus Q2 LMO of the final model, which shows the LMO parameter values were around the model parameters, meaning the model is robust and stable.
IC1 parameter was newly included in model 4 and it had a negative contribution to the activity. The applicability domain of the model was explained by William’s plot, standardized residuals versus leverage values shown in Figure 4, and it illustrates the prediction and expression. William’s plot indicated that all the molecules are located in the applicability domain of the model with leverage values lower than the warning h* of 0.750.
All four models showed good statistical values for the training group with R2 values greater than 0.9 (equation 1, R2 = 0.9700; equation 2, R2 = 0.9883; equation 3, R2 = 0.9875; equation 4, R2 = 0.9874). The cross-validated Q2 must be higher for the models to be statistically significant. In all four models Q2 value was more than 0.9 (equation 1, Q2 = 0.9495; equation 2, Q2 = 0.9796; equation 3, Q2 = 0.9789; equation 4, Q2 = 0.9777,). The difference between R2 and Q2 should not be more than 0.3. In all four generated models the difference between R2 and Q2 was found to be 0.009. All of these parameters suggest that the generated QSAR equations have good predictive power. The predicted activities calculated from the best model were found to be close to observed activities.
Twelve new compounds were designed using ChemSketch software. The novelty of the compounds was confirmed by Scifinder. Physicochemical properties of the designed compounds were predicted using the PaDEL descriptor. Their anticancer activity against MCF7 cells was predicted using the generated QSAR models.
The novel designed compounds exhibited very high inhibitory activity compared to the most active compound of the original data set. The most active compound in the original data set had an IC50 value of 1.55. Its pIC50 value was found to be 5.809. All the predicted compounds exhibited pIC50 values of more than 9.0 except one compound which had a pIC50 value of 8.007. The predicted activities against MCF7 cells using the generated QSAR equation for compounds g, h, i and j were found to be 7.9, 7.80, 8.59 and 8.33, respectively. Hence, it was proven that all the predicted compounds are found to be more active against MCF7 cells and could serve as lead compounds for treating breast cancer.
To study the binding efficacy of all the designed compounds a–l, molecular docking studies were performed in the binding pockets of S. aureus dihydrofolate reductase [PDB ID:3SRQ]. All the designed compounds exhibited better binding scores, ranging from −6.9 to −9.3 kcal/mol. A dock score with a high negative value represents minimum binding energy for the formation of the complex between protein and ligand. The docking study reveals that introduction of an electron-withdrawing group at the second and fourth positions of the pyrimidine ring had a better affinity of the ligand to the protein DHF reductase complex. These compounds exhibited H bond interaction with Ile X-51 and Leu X-21. The Ile formed an H bond with NH of triazole while Leu formed an H bond with the trifluoromethyl group of the pyrimidine nucleus. The two trifluoromethyl groups also formed halogen bonds with Trp X-23, Leu X-21 and Ser X-50. Phe X-93 forms pi-pi stackings with the coumarin nucleus. Ala X-8, Val X-32, Leu X-21, IsoLeu X-51 and Lys 46 are bonded through alkyl interaction. Gly X-20 and His X-24 are linked with the nucleus through van der Waals attraction. Compounds g, h, i and j showed excellent binding energies. Their binding energy values were found to be −8.7, −9.0, −9.1 and −8.6, respectively. The binding affinity of novel compounds towards colchicine binding sites of tubulin were in the range of −7.4 to −9.8. The designed compound k showed the highest affinity with a binding value of −9.8.
Conventional hydrogen bonds are formed between designed compound k and Leu-248, Gly-179, Asn-143 and Gln-11 groups of enzymes at distances of 5.40, 4.47, 4.47 and 4.54 angstroms, respectively. Three halogen bonds were formed between the trifluoromethyl group of compounds and Ala-247 at a distance of 5.43 Å. Pi-sigma bonds and pi-pi stacked bonds were formed between the pyrone nuclei of Cys-12 and Gln-11 at distances of 4.80 and 6.95 Å, respectively. Pi anion bonds were formed 6.95 Å away from the triazole fragments of nuclei with Glu-254 of enzymes (Figure 8).
The binding affinity of novel compounds towards vinblastine binding sites of tubulin were in the range of −9.2 to −6. The designed compound b showed a binding energy of −9.2. The designed compound b binds with the target by forming two conventional hydrogen bonds with Gln-394 and Val-181 at distances of 4.42 and 2.56 Å, respectively. Two pi-alkyl bonds were formed between the coumarin nucleus and Pro-175 of the target at 5.37 Å. (Figure 9). On comparing the binding affinities between the colchicine binding site and vinblastine binding site of tubulin, it was found that compound k with 4-cyclopropyl-2-(trifluoromethyl)pyrimidinyl substitution showed binding affinity towards the colchicine binding site, while compound b with 5-fluoropyrimidinyl substitution showed binding affinity towards the vinblastine binding site of tubulin. The interactions of the compounds g, h, and i with the DHFR for protein are presented in Figure 5, Figure 6 and Figure 7.
By comparing the different forms of bond interactions between the protein and ligand among both tubulin targets, interatomic lengths for hydrogen bonds were found to be 2.56 and 5.40 Å, respectively. The hydrogen bonds in both the high- and low-resolution targets of tubulin were found to be at a distance of 4.4 Å. In addition, pi-interaction was identified in both tubulin proteins at a distance of 4.8 to 6.9 Å. Despite the fact that proteins 5J2T and 1SA0 have resolutions of 2.2 and 3.58 Å, respectively, the docking of the ligands showed the same distance range. This illustrates the ligand’s capability of binding to targets of various resolutions. The docking studies were carried out with human tubulin (e.g.,PDB id 6O5N) and DHFR (4KD7) for a comparative study. The results were quite surprising. The amino acid sequence was totally different for human tubulin and rat tubulin. Although the amino acid sequence is different between human and rat, the binding energy of designed compound b was found to be −9.0 kcal/mol for human tubulin and −9.4 kcal/mol for rat tubulin (Figure 10). The binding energies for the designed compounds were similar in human tubulin and rat tubulin. In addition, the docking with human DHFR enzyme PDB ID 4KD7 was also carried out for the compounds that exhibited best in the rat DHFR enzyme (PDB ID: 3SRQ). The compounds g, h and i exhibited good binding energies of −9.0, −10.1 and −8.9 kcal/mol, respectively, for human DHFR (Figure 11). The binding sites of the compounds g, h and i vary in both the rat and the human DHFR enzymes, but they bind effectively through hydrogen, van der Waals and pi-stacking interactions in both. By this comparison, we can conclude that the best compounds b and h can serve as the lead for anti-cancer activity in both rat and human breast cells.
The compounds were found to be active against three nuclear signaling pathways, namely aromatase, estrogen receptor alpha and estrogen receptor ligand-binding domain, which was determined by ProTox-II. Aromatase inhibitors work by inhibiting the enzyme aromatase. Aromatase converts the hormone androgen into small amounts of estrogen in the body. Thus, aromatase inhibitors reduce estrogen levels that stimulate the growth of hormone-receptor-positive breast cancer cells [37]. The effects of estrogen are largely mediated by estrogen receptors ER-α and ER-β, which are members of the nuclear receptor superfamily of transcription factors. Estrogen receptor alpha (ER-α) is expressed in approximately 65% of breast cancer cases [38]. Estrogen receptor α is mainly responsible for breast cancer initiation and progression. Since the predicted compounds act as ligands that selectively bind to the estrogen alpha receptor and inhibit estrogen-dependent proliferative activity, they are expected to show anticancer activity. The characterization of estrogen provided a molecular basis for the regulation of estrogen receptors and, thereby a basis to describe the mechanism of the hormone therapy in treating breast cancer [39]. Tamoxifen, a well-known anticancer agent, interferes with all three pathways in ProTox-II, similarly to the designed compounds. Tamoxifen is a stilbenoid. The designed compounds contain coumarin. In general, coumarins are biosynthesised from coumaric acid, which is also a stilbenoid. Thus, coumarins are structurally related to tamoxifen and both of them interfere with aromatase, estrogen receptor alpha and estrogen binding domain.
By the above facts, it was very clear that the predicted compounds could act as lead compounds for breast cancer treatment, and that they can act through all three nuclear signaling pathways. The prediction of key physicochemical, pharmacokinetic, drug-like and related parameters for one or multiple molecules can be performed by SwissADME. Thus, SwissADME studies for the designed compounds were carried out. Qualitative estimation of the class of solubility was conducted according to the following log S scale: insoluble < −10 < poorly < −6 < moderately < −4 < soluble < −2 < very < 0 < highly. The log S scale of the predicted compounds was found to be in the range of −6 to −2. Thus, the designed compounds were found to be moderately soluble to very soluble in water. The compounds were not permeable to the blood-brain barrier. This indicates that they were devoid of CNS side effects.
Cytochrome P4501A2 (CYP1A2) is a key enzyme in the cause of breast cancer (BC). It plays a role in activation of breast carcinogen, in the production of beneficial estrogen [2-hydroxyestrone (2-OHE1)] and in converting arachidonic acid (AAc) to epoxyeicosatrienoic acids (EETs), which have anti-inflammatory properties. All the designed compounds were inhibitors of CYP1A2 which would further enhance their anticancer activity [40]. CYP3A4 causes the oxidation of compounds that are usually used as chemotherapeutic agents for the treatment of osteosarcomas such as etoposide, ifosfamide, cyclophosphamide and doxorubicin, suggesting that the response to these drugs could be worse in tumors with high CYP3A expression, increasing the risk of metastasis. All the designed compounds were inhibitors of CYP1A2 which would further enhance the anticancer activities of other drugs, such as etoposide [41]. The synthetic accessibility (SA) score suggests to us the ease of synthesis. The score can be between 1 and 10, where 1 is very easy, while 10 denotes difficulty in synthesizing. The predicted compounds had scores of 3.16–3.71. Thus, these compounds can be synthesized easily. A bioavailability score of 0.55 or 0.56 means a compounds has good pharmacokinetic properties. All the designed compounds showed values of more than 0.55.
The overall results of the above research indicated that the designed compounds exhibited promising results in MCF 7 cell inhibition prediction. The ProTox-II results further confirmed their anticancer activity. The SwissADME results also indicated that the designed compounds are inhibitors of two important enzymes, CYP1A2 and CYP3A4, and thus are predicted to possess anticancer activity.
4. Materials and Methods
Multiple linear regression models by ordinary least squares were developed by “QSARINS”, carefully verified and validated in detail according to the chemometric approach. A series containing 28 compounds (Table 1) of uracil–coumarin based bifunctional molecular hybrids linked by 1,2,3-triazole moiety with MCF7 inhibitory values were selected from reported literature. pKi was calculated from observed Ki values and considered as the dependent variable.
4.1. Molecule Structure Preparation and 3D Geometry Optimization
The molecular structures were drawn by ACD/labs ChemSketch freeware 2017.2.116 and converted to mol2 format. Geometry optimization was performed by Avogadro V1.2.018 on adding hydrogens. MMFF94, Merck molecular force field, was employed, along with the steepest descent algorithm. The best conformer with global minimum energy was used throughout the study.
4.2. Data Setup
The molecular descriptor values for the compounds were calculated from PaDEL descriptor. All zero values, missing values and constant value (>50%) descriptors were excluded from variables. Descriptors with values greater than 0.85 were filtered out using pair-wise correlation. All twenty-eight compounds were divided into training set and prediction set in a 5:1 ratio. Many trials and models were developed, a few best models are presented in this manuscript.
4.3. Variable Selection and Model Calculation
QSARINS software considers all combinations of selected descriptors defined by user options. Descriptor selection relational to biological activities of molecules and Friedman’s “lack-of-fit” (LOF) function was calculated by genetic algorithm. LOF smoothness level is kept at the default level of 1.0. Along with the genetic algorithm, more combinations and maximum generations (user-defined value:2000) were explored by parameters including mutation probability (0.1), population size (200).
4.4. Model Validation
Internal validation and external validation, in addition to applicability domain of the model, were performed. Internal validation was performed by cross-validation leave-one-out (Q2LOO), cross-validation leave-many-out (Q2LMO), root mean squared error (RMSE), Y-scrambling, and external validation by Q2 F1, Q2 F2, and Q2 F3,22; CCC was applied on selected models. Q2LMO was repeated 2000 times with 30% of objects left from the training set each time. Y-scrambling was performed by 2000 iterations method in order to exclude chance correlation in the original model. R2 and Q2LOO of the model must be reasonably higher than scrambled ones, and RMSE of the model underprediction must be reasonably smaller than scrambled ones. The concordance correlation coefficient (CCCext) was analyzed. The leverage (hat) was calculated by hi = xi (XT X)-1 xTi (I = 1, 2, …m), where xi is the descriptor row-value of the query compound i, and m is the number of query compounds. X is n × p matrix of the training set, where n is the number of training set samples and p is the number of model descriptors. The leverage cut-off value h* is 3(p + 1)/n. Leverage greater than h* for the training set means that the sample is highly influential in determining the model, whereas in the test set (X outlier) the prediction is extrapolation of the model. Any compound with a standardized residual of more than 3σ (3 standard deviation units) is identified as a Y outlier.
4.5. Molecular Docking Study
Molecular docking protocols are widely used for predicting the binding affinities for a number of ligands. Intermediary steps, such as PDBQT files for protein and ligands preparation and grid box creation, were completed using the graphical user interface program AutoDock Tools (ADT). ADT assigned polar hydrogens, united atom Kollman charges, solvation parameters and fragmental volumes to the protein. AutoDock saved the prepared file in PDBQT format. AutoGrid was used for the preparation of the grid map using a grid box. The grid size was set to 60 × 60 × 60 xyz points with a grid spacing of 0.375 Å and grid center was designated at dimensions (x, y and z): −1.095, −1.554 and 3.894. A scoring grid is calculated from the ligand structure in order to minimize computation time. AutoDock Vina was employed for docking using protein and ligand information along with grid box properties in the configuration file. AutoDock Vina employs iterated local search global optimizer. During the docking procedure, both the protein and ligands are considered as rigid. Results less than 1.0 Å in positional root-mean-square deviation (RMSD) were clustered together and represented by the result with the most favorable free energy of binding. The pose with the lowest energy of binding or binding affinity was extracted and aligned with receptor structure for further analysis.
Hydrogen bonds and Gasteiger–Huckel charges were assigned to the protein of interest and designed compounds using Chimera software. The cofactors and water molecules were also eliminated from the protein. A molecular docking study was performed using PyRxAutodock Vina in the binding site of Staphylococcus aureus dihydrofolate reductase [PDB ID:3SRQ]. The grid box of dimensions in Å were: center (X, Y, Z) = (−2.5, 0.32, −21.67), dimensions (X, Y, Z) = (88.31, 88.31, 88.31), with an exhaustiveness of 8. The docking poses for protein–ligand interactions were chosen on the basis of docking scores. The pose with the highest docking score was selected. The binding interactions were developed using the Molegro molecular viewer software.
4.6. In Silico Studies
Effect of predicted compounds on nuclear signaling pathways was predicted using ProTox-II. The effect of compounds on aryl hydrocarbon receptor (AhR), androgen receptor (AR), androgen receptor ligand binding domain (AR-LBD), aromatase, estrogen receptor alpha (ER-α), estrogen receptor ligand b inding domain (ER-LBD) and peroxisome proliferator activated receptor gamma (PPAR-Gamma) nuclear signaling pathways were checked. SwissADME Web determines physicochemical, pharmacokinetic, drug-like and related parameters for multiple molecule; thus, SwissADME studies for the designed compounds were carried out.
5. Conclusions
In continuation of our research on coumarin derivatives [42,43], we report in this study the 2D-QSAR studies on pyrimidine-tethered coumarin–triazole derivatives. The models developed with 2D descriptors were found to be robust, stable and predictive. A study to discover the role of 2D descriptors in tubulin and dihydrofolate reductase inhibiting activity was carried out by developing a good statistical model. Model validation was performed internally and externally. The model was tested on designed compounds for predicting bioactivity. The best predicted active compounds were subjected to molecular docking studies for structural and interaction information supporting our QSAR model. The compounds c, d, h, i and j were found to have potential DHFR inhibitory activity and are being considered for further synthesis and biological screening.
Author Contributions
Conceptualization, A.K.S.; methodology, A.K.S.; software, R.N.; validation, R.N.; formal analysis, R.N.; investigation, R.N.; resources, A.S.; data curation, A.S.; writing—original draft preparation, A.S. and R.N.; writing—review and editing, R.R.B.; visualization, R.R.B.; supervision, A.B.S.; project administration, A.B.S.; funding acquisition, R.N. and R.R.B. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
The authors A.S. and R.N. would like to thank Gramatica, P. for providing license for QSARINS software and C.L. BaidMetha College of Pharmacy for providing necessary tools for carrying out the research. A.K.S., R.R.B and A.B.S would like to thank Sathyabhama Institute of Science and Technology, Dean’s office of College of Pharmacy and Health Sciences, Ajman University, UAE and Vignan Pharmacy College, Vadlamudi, Andhra Pradesh, India for the support extended towards the preparation of manuscript. R.R.B is thankful to Deanship of Research and Graduate Studies, Ajman University for aiding towards article processing charges.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Arnold, M.; Ferlay, J.; Henegouwen, M.I.V.B.; Soerjomataram, I. Global Burden of Oesophageal and Gastric Cancer by Histology and Subsite in 2018. Gut 2020, 69, 1564–1571. [Google Scholar] [CrossRef] [PubMed]
- Nigro, O.; Chini, C.; Proserpio, I. Molecularly Targeted Therapy for Advanced Gastrointestinal Noncolorectal Cancer Treatment: How to Choose? Past, Present, Future. Anti-Cancer Drugs 2021, 32, 593–601. [Google Scholar] [CrossRef] [PubMed]
- Skariyachan, S.; Muddebihalkar, A.G.; Badrinath, V.; Umashankar, B.; Eram, D.; Uttarkar, A.; Niranjan, V. Natural Epiestriol-16 Act as Potential Lead Molecule Against Prospective Molecular Targets of Multidrug Resistant Acinetobacter Baumannii-Insight from in Silico Modelling and in Vitro Investigations. Infect. Genet. Evol. 2020, 82, 104314. [Google Scholar] [CrossRef] [PubMed]
- Wright, W.C.; Chenge, J.; Wang, J.; Girvan, H.; Yang, L.; Chai, S.C.; Huber, A.D.; Wu, J.; Oladimeji, P.O.; Munro, A.W.; et al. Clobetasol Propionate Is a Heme-Mediated Selective Inhibitor of Human Cytochrome P450 3A5. J. Med. Chem. 2020, 63, 1415–1433. [Google Scholar] [CrossRef]
- Shah, F.; Mukherjee, P.; Gut, J.; Legac, J.; Rosenthal, P.J.; Tekwani, B.L.; Avery, M.A. Identification of Novel Malarial Cysteine Protease Inhibitors Using Structure-Based Virtual Screening of a Focused Cysteine Protease Inhibitor Library. J. Chem. Inf. Model. 2011, 51, 852–864. [Google Scholar] [CrossRef]
- Julias, J.G.; Pathak, V. Deoxyribonucleoside Triphosphate Pool Imbalances In Vivo Are Associated with an Increased Retroviral Mutation Rate. J. Virol. 1998, 72, 7941–7949. [Google Scholar] [CrossRef] [Green Version]
- Osborne, C.K.; Schiff, R.; Fuqua, S.A.; Shou, J. Estrogen Receptor: Current Understanding of Its Activation and Modulation. Clin. Cancer Res. 2001, 7, 4338–4342. [Google Scholar]
- Bhat, K.P.L.; Pezzuto, J.M. Natural Modulators of Estrogen Biosynthesis and Function as Chemopreventive Agents. Arch. Pharmacal Res. 2001, 24, 473–484. [Google Scholar] [CrossRef]
- Ferretti, G.; Bria, E.; Giannarelli, D.; Felici, A.; Papaldo, P.; Fabi, A.; Di Cosimo, S.; Ruggeri, E.M.; Milella, M.; Ciccarese, M.; et al. Second- and Third-Generation Aromatase Inhibitors as First-Line Endocrine Therapy in Postmenopausal Metastatic Breast Cancer Patients: A Pooled Analysis of the Randomised Trials. Br. J. Cancer 2006, 94, 1789–1796. [Google Scholar] [CrossRef]
- Champoux, J.J. DNA Topoisomerases: Structure, Function and Mechanism. Annu. Rev. Biochem. 2001, 70, 369–413. [Google Scholar] [CrossRef] [Green Version]
- Hall, A. The Cytoskeleton and Cancer. Cancer Metastasis Rev. 2009, 28, 5–14. [Google Scholar] [CrossRef] [PubMed]
- Fidock, D.A.; Rosenthal, P.J.; Croft, S.L.; Brun, R.; Nwaka, S. Antimalarial Drug Discovery: Efficacy Models for Compound Screening. Nat. Rev. Drug Discov. 2004, 3, 509–520. [Google Scholar] [CrossRef]
- Hawser, S.; Lociuro, S.; Islam, K. Dihydrofolate Reductase Inhibitors as Antibacterial Agents. Biochem. Pharmacol. 2006, 71, 941–948. [Google Scholar] [CrossRef]
- Rao, A.S.; Tapale, S.-R. A Study on Dihydrofolate Reductase and Its Inhibitors: A Review. Int. J. Pharm. Sci. Res. 2013, 4, 2535. [Google Scholar] [CrossRef]
- Baird, J.K. Effectiveness of Antimalarial Drugs. N. Eng. J. Med. 2005, 352, 1565–1577. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Mishra, R. Dihydrofolate Reductase Enzyme: A Potent Target for Antimalarial Research. Asian J. Cell Biol. 2005, 1, 48–58. [Google Scholar] [CrossRef]
- Liu, J.; Bolstad, D.B.; Smith, A.E.; Priestley, N.D.; Wright, D.L.; Anderson, A.C. Structure-Guided Development of Efficacious Antifungal Agents Targeting Candida Glabrata Dihydrofolate Reductase. Chem. Biol. 2008, 15, 990–996. [Google Scholar] [CrossRef] [Green Version]
- Singh, P.; Kaur, M.; Sachdeva, S. Mechanism Inspired Development of Rationally Designed Dihydrofolate Reductase Inhibitors as Anticancer Agents. J. Med. Chem. 2012, 55, 6381–6390. [Google Scholar] [CrossRef] [PubMed]
- Rana, R.M.; Rampogu, S.; Zeb, A.; Son, M.; Park, C.; Lee, G.; Yoon, S.; Baek, A.; Parameswaran, S.; Park, S.J.; et al. In Silico Study Probes Potential Inhibitors of Human Dihydrofolate Reductase for Cancer Therapeutics. J. Clin. Med. 2019, 8, 233. [Google Scholar] [CrossRef] [Green Version]
- Wróbel, A. Recent Design and Structure-Activity Relationship Studies on the Modifications of DHFR Inhibitors as Anticancer Agents. Curr. Med. Chem. 2021, 28, 910–939. [Google Scholar] [CrossRef]
- Schweitzer, B.I.; Dicker, A.; Bertino, J.R. Dihydrofolate Reductase as a Therapeutic Target. FASEB J. 1990, 4, 2441–2452. [Google Scholar] [CrossRef] [PubMed]
- Blakley, R.-L. Eukaryotic Dihydrofolate Reductase. Adv. Enzymol. Relat. Areas Mol. Biol. 1995, 70, 23–102. [Google Scholar] [PubMed]
- Heaslet, H.; Harris, M.; Fahnoe, K.; Sarver, R.; Putz, H.; Chang, J.; Subramanyam, C.; Barreiro, G.; Miller, J.R. Structural Comparison of Chromosomal and Exogenous Dihydrofolate Reductase from Staphylococcus Aureusin Complex with the Potent Inhibitor Trimethoprim. Proteins Struct. Funct. Bioinform. 2009, 76, 706–717. [Google Scholar] [CrossRef]
- Hong, W.; Wang, Y.; Chang, Z.; Yang, Y.; Pu, J.; Sun, T.; Kaur, S.; Sacchettini, J.C.; Jung, H.; Wong, W.L.; et al. The Identification of Novel Mycobacterium Tuberculosis DHFR Inhibitors and the Investigation of Their Binding Preferences by Using Molecular Modelling. Sci. Rep. 2015, 5, 15328. [Google Scholar] [CrossRef] [PubMed]
- Newman, D.J.; Cragg, G.M. Natural Products as Sources of New Drugs over the 30 Years from 1981 to 2010. J. Nat. Prod. 2012, 75, 311–335. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Over, B.; Wetzel, S.; Grütter, C.; Nakai, Y.; Renner, S.; Rauh, D.; Waldmann, H. Natural-Product-Derived Fragments for Fragment-Based Ligand Discovery. Nat. Chem. 2012, 5, 21–28. [Google Scholar] [CrossRef] [PubMed]
- Nielsen, J. Combinatorial Synthesis of Natural Products. Curr. Opin. Chem. Biol. 2002, 6, 297–305. [Google Scholar] [CrossRef]
- Clough, J.; Chen, S.; Gordon, E.M.; Hackbarth, C.; Lam, S.; Trias, J.; White, R.J.; Candiani, G.; Donadio, S.; Romanò, G.; et al. Combinatorial Modification of Natural Products: Synthesis and in Vitro Analysis of Derivatives of Thiazole Peptide Antibiotic GE2270 A: A-Ring Modifications. Bioorganic Med. Chem. Lett. 2003, 13, 3409–3414. [Google Scholar] [CrossRef]
- Nepali, K.; Sharma, S.; Sharma, M.; Bedi, P.; Dhar, K. Rational Approaches, Design Strategies, Structure Activity Relationship and Mechanistic Insights for Anticancer Hybrids. Eur. J. Med. Chem. 2014, 77, 422–487. [Google Scholar] [CrossRef]
- Mehndiratta, S.; Sharma, S.; Kumar, S.; Nepali, K. Molecular Hybrids with Anticancer Activity. Top. Anti-Cancer Res. 2015, 4, 383–454. [Google Scholar] [CrossRef]
- Sanduja, M.; Gupta, J.; Singh, H.; Pagare, P.P.; Rana, A. Uracil-Coumarin Based Hybrid Molecules as Potent Anti-Cancer and Anti-Bacterial Agents. J. Saudi Chem. Soc. 2020, 24, 251–266. [Google Scholar] [CrossRef]
- Gramatica, P.; Chirico, N.; Papa, E.; Cassani, S.; Kovarich, S. QSARINS: A New Software for the Development, Analysis, and Validation of QSAR MLR Models. J. Comput. Chem. 2013, 34, 2121–2132. [Google Scholar] [CrossRef]
- Gramatica, P.; Cassani, S.; Chirico, N. QSARINS-Chem: Insubria Datasets and New QSAR/QSPR Models for Environmental Pollutants in QSARINS. J. Comput. Chem. 2014, 35, 1036–1044. [Google Scholar] [CrossRef]
- Consonni, V.; Ballabio, D.; Todeschini, R. Evaluation of Model Predictive Ability by External Validation Techniques. J. Chemom. 2010, 24, 194–201. [Google Scholar] [CrossRef]
- Worachartcheewan, A.; Suvannang, N.; Prachayasittikul, S.; Prachayasittikul, V.; Nantasenamat, C. Probing the Origins of Aromatase Inhibitory Activity of Disubstituted Coumarins via QSAR and Molecular Docking. Excli. J. 2014, 13, 1259–1274. [Google Scholar] [CrossRef] [PubMed]
- Ancuceanu, R.; Dinu, M.; Neaga, I.; Laszlo, F.G.; Boda, D. Development of QSAR Machine Learning-Based Models to Forecast the Effect of Substances on Malignant Melanoma Cells. Oncol. Lett. 2019, 17, 4188–4196. [Google Scholar] [CrossRef]
- Khosrow-Khavar, F.; Filion, K.B.; Bouganim, N.; Suissa, S.; Azoulay, L. Aromatase Inhibitors and the Risk of Cardiovascular Outcomes in Women with Breast Cancer. Circulation 2020, 141, 549–559. [Google Scholar] [CrossRef]
- Jiang, Z.; Guo, J.; Shen, J.; Jin, M.; Xie, S.; Wang, L. The Role of Estrogen Receptor Alpha in Mediating Chemoresistance in Breast Cancer Cells. J. Exp. Clin. Cancer Res. 2012, 31, 42. [Google Scholar] [CrossRef] [Green Version]
- Bai, Z.; Gust, R. Breast cancer, estrogen receptor and ligands. Arch. Pharm. 2009, 342, 133–149. [Google Scholar] [CrossRef]
- Ayari, I.; Fedeli, U.; Saguem, S.; Hidar, S.; Khlifi, S.; Pavanello, S. Role of CYP1A2 Polymorphisms in Breast Cancer Risk in Women. Mol. Med. Rep. 2012, 7, 280–286. [Google Scholar] [CrossRef]
- Rodriguez-Antona, C.; Ingelmansundberg, M. Cytochrome P450 Pharmacogenetics and Cancer. Oncogene 2006, 25, 1679–1691. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Ramalakshmi, N.; Manimegalai, P.; Bhandare, R.R.; Kumar, S.A.; Shaik, A.B. 2D-Quantitative Structure Activity Relationship (QSAR) Modeling, Docking Studies, Synthesis and in-Vitro Evaluation of 1,3,4-Thiadiazole Tethered Coumarin Derivatives as Antiproliferative Agents. J. Saudi Chem. Soc. 2021, 25, 101279. [Google Scholar] [CrossRef]
- Ramalakshmi, N.; Chitra, S.; Manimegalai, P.; Arunkumar, S. Design, Synthesis, Docking and Biological Evaluation of Novel 4-Hydroxy Coumarin Derivatives. Curr. Comput. Aided Drug Des. 2021, 17, 201–213. [Google Scholar] [CrossRef] [PubMed]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

