Abstract
In this study, we evaluated the physicochemical properties related to the previously reported anticancer activity of a dataset comprising thirty 1,2-dihydropyridine derivatives. We utilized Principal Component Analysis (PCA) to identify the most significant influencing factors. The PCA analysis showed that the first two principal components accounted for 59.91% of the total variance, indicating a strong correlation between the molecules and specific descriptors. Among the 239 descriptors analyzed, 18 were positively correlated with anticancer activity, clustering with the 12 most active compounds based on their IC50 values. Six of these variables—LogP, Csp3, b_1rotN, LogS, TPSA, and lip_don—are related to drug-likeness potential. Thus, we then ranked the 12 compounds according to these six variables and excluded those violating the drug-likeness criteria, resulting in a shortlist of nine compounds. Next, we investigated the binding affinity of these nine shortlisted compounds with the use of molecular docking towards the PIM-1 Kinase enzyme (PDB: 2OBJ), which is overexpressed in various cancer cells. Compound 6 exhibited the best docking score among the docked compounds, with a docking score of −11.77 kcal/mol, compared to −12.08 kcal/mol for the reference PIM-1 kinase inhibitor, 6-(5-bromo-2-hydroxyphenyl)-2-oxo-4-phenyl-1,2-dihydropyridine-3-carbonitrile. To discover new PIM-1 kinase inhibitors, we designed nine novel compounds featuring hybrid structures of compound 6 and the reference inhibitor. Among these, compound 31 displayed the best binding affinity, with a docking score of −13.11 kcal/mol. Additionally, we performed PubChem database mining using the structure of compound 6 and the similarity search tool, identifying 16 structurally related compounds with various reported biological properties. Among these, compound 52 exhibited the best binding affinity, with a docking score of −13.03 kcal/mol. Finally, molecular dynamics (MD) studies were conducted to confirm the stability of the protein–ligand complexes obtained from docking the studied compounds to PIM-1 kinase, validating the potential of these compounds as PIM-1 kinase inhibitors.
1. Introduction
Derivatives with 1,2-dihydropyridine scaffold are attracting a lot of attention owing to the readiness of their organic synthesis using several types of multi-component reactions (MCRs) and their various reported biological activities [1,2]. In the current era, where a premium is put on speed, efficacy, and diversity in the drug discovery process, large libraries of these compounds can be established efficiently in a relatively short period of time through the aid of conventional and microwave-assisted MCRs, and have become available for high-throughput biological screening [3]. Several studies have reported on the many pharmacological activities of 1,2-dihydropyridine derivatives including antimicrobial [4], cardiotonic [5], antidepressant [6], and anticancer activities [7,8].
There is much interest in the anticancer activity of these compounds due to their ability to interfere with some biological targets that are significantly associated with the pathophysiological progress of cancer, such as the PIM-1 kinase (proto-oncogene serine/threonine-protein kinase), survivin, and phosphodiesterase family of enzymes [7,9,10]. PIM-1 is an enzyme responsible for the phosphorylation, and hence the activation, of a large number of endogenous substrates that control cell cycle, proliferation, and apoptosis, suggesting that the enzyme plays a central regulatory role in cell division, survival, and differentiation processes. The oncogenic potential of PIM-1 has been demonstrated in B-cell tumors, prostate cancer, and erythroleukemia. Recent studies proved that the expression of PIM-1 is elevated in colorectal and pancreatic cancers, and the most fierce triple-negative breast cancer [9,11].
Statistical machine-learning techniques have seen widespread use in various chemistry domains [12,13,14,15,16,17]. Advances in algorithms, data availability, and computational power have contributed to the remarkable growth of this field [14,17]. By utilizing carefully curated databases and extending high-throughput methods, supervised learning approaches have successfully established connections between the chemical structure of molecules and their physical properties. Moreover, these approaches are employed in several applications in chemical synthesis, such as enabling the simulation of structural transformations at a reduced cost compared to the classic standard first-principles techniques [13,15,16,18].
From the panel of unsupervised machine learning techniques, principal component analysis (PCA) has been extensively utilized for data size reduction in large analytical chemistry datasets [19,20,21,22]. Other studies have also employed PCA to explore environmental contexts [23,24,25,26]. In this study, we aimed to couple PCA with a dataset comprising 239 different electronic, physical, and chemical properties (descriptors) of thirty 1,2-dihydropyridine derivatives. These derivatives have previously demonstrated in vitro anticancer activity against human colorectal tumor cell lines (HT-29) [27]. The objective was to identify the descriptors directly linked to the biological activity and rank the compounds based on these identified descriptors. Additionally, molecular docking was employed to assess the binding affinity of these derivatives towards a potential protein target, PIM-1 kinase. The combined results were utilized to identify the most promising candidates in terms of drug-likeness, biological activity, and binding affinity to the target protein.
Furthermore, the compound that showed the best profile with respect to the previously mentioned analyses was utilized to propose nine novel 1,2-dihydropyridine derivatives, and was also employed in the mining of the PubChem database to identify small organic molecules with a similar chemical scaffold while retaining a reported curation effect. All the proposed and obtained compounds were subject to a second round of molecular docking to determine those with the best binding affinity towards the PIM1-Kinase. Finally, molecular dynamics simulations studies were conducted to confirm the stability of the ligand–protein complexes obtained by the top ranked compounds.
Table 1 provides an overview of the general scaffold of the 1,2-dihydropyridine derivatives employed in this study, along with their inhibitory activity regarding tumor cell growth.
Table 1.
The general scaffold and the chemical structures of thirty previously reported 1,2-dihydropyridine derivatives and their inhibitory tumor cell growth activity [27].
2. Results and Discussion
2.1. PCA
PCA is a statistical method that simplifies a large dataset by reducing the number of variables while keeping most of the important information. It converts the original variables into a new set of uncorrelated variables called principal components. These components are arranged so that the first few capture the most significant variations in the data.
The PCA bi-plot in Figure 1 provides a visual representation of how the different electronic, physical, and chemical properties of the thirty investigated 1,2-dihydropyridine derivatives (Table 1) relate to each other and to the principal components.
Figure 1.
PCA bi-plot of the 30 investigated molecules (individuals) following 239 variables (descriptors); the big red bullets present the individuals and the small white bullets present the factors.
The first two principal components (PCs) explain a significant portion of the total variance, accounting for 56.79% (34.28% for PC1 and 22.51% for PC2). This means that there is a good correlation among the different molecules studied. Looking at the variables (descriptors), it is clear that they are well distributed and aligned with the first two principal components.
In terms of the individuals (the investigated molecules), distinct clusters can be observed. It is intriguing that these clusters demonstrate a strong correlation with different factors, indicating varying degrees of influence on the individuals.
The grey cluster comprises molecules 9, 10, 19, 20, 29, and 30, showing a slight positive correlation along PC1 and a significant positive correlation along PC2. The blue cluster includes molecules 1, 2, 5, 6, 13, 14, 15, 16, 23, 24, 25, and 26, exhibiting a slight positive correlation along PC1 and a moderate negative correlation along PC2. The yellow cluster contains molecules 3, 4, 11, 12, 21, and 22, exhibiting a negative correlation along both PCs. Finally, the green cluster comprises molecules 7, 8, 17, 18, 27, and 28, demonstrating a pronounced negative correlation along PC1 and a slight positive correlation along PC2. These observations highlight the discrepancy among the four groups (clusters) of molecules regarding a wide range of electronic, chemical, and physical properties. This indicates that the individuals (molecules) gathered in each cluster are similar in terms of their chemical and physical properties, along with their biological activities.
To further explore the characteristics of the drug candidates, a more focused approach was employed. Another PCA analysis was conducted, exclusively incorporating descriptors that were found from the first PCA analysis to be strongly related to the physicochemical properties and drug likeness of the candidates. Figure 2a displays the second PCA bi-plot, in which the first two principal components (PCs) explain a significant portion of the total variance, accounting for 59.91% (30.14% for PC1 and 29.77% for PC2). Notably, the variance observed in this analysis is similar to that seen when including the entire dataset (Figure 1). These findings suggest a strong correlation between the molecules and these specific descriptors. This indicates the higher efficiency of the method used in this analysis and reveals more distinct differences among the variables. Consequently, comparing different molecules becomes more reliable.
Figure 2.
(a) PCA bi-plot of the 30 investigated molecules (individuals) following the 18 hand-picked variables (descriptors); the big red bullets present the variables, and the small black bullets present the individuals. (b) Contribution of the investigated factors toward the first two PCs.
Among the variables, ‘mr’ and ‘vol’ contributed the most to PC1, accounting for 14.46% and 13.83%, respectively (Figure 2b). ‘LogP’ and ‘TPSA’ also showed moderate contributions to PC1, accounting for 8.69% and 8.54%, respectively. For PC2, the highest contributions came from ‘MNDO_HOMO’ and ‘h_pKb’, accounting for 15.614% and 13.21%, respectively. ‘MNDO_LUMO’ and ‘h_pKa’ exhibit an average influence along PC2, contributing 10.96% and 10.15%, respectively. Interestingly, variables that showed a moderate to high correlation along PC1 demonstrated a low to no correlation along PC2 and vice versa. This suggests that the different trends observed among the molecules are independent of each other, and each set of clustered molecules is distinct and independent.
In Figure 2a, the blue cluster consists of molecules 1, 3, 5, 9, 11, 13, 15, 19, 21, 23, 25, and 29, and shows a strong positive correlation with variables such as solvation energy (E_sol), the number of hydrogen bond acceptors (a_acc), torsional energy (E_tor), and basicity (h_pKb). In contrast, the yellow cluster includes molecules 7, 17, and 27, which do not exhibit a high correlation with any specific variable. The green cluster comprises molecules 4, 8, 12, 18, 22, and 28, demonstrating a strong positive correlation with water solubility (logS), total polar surface area (TPSA), the energy of the most stable conformer (E_str), and the number of hydrogen bond donors (lip_don). The grey cluster consists of molecules 2, 6, 10, 14, 16, 20, 24, 26, and 30, which show a strong positive correlation with variables such as Van Der Waals energy (E_vdw), partition coefficient (logP), Van Der Waals volume (vol), the fraction of sp3-hybridized carbon atoms (Csp3), molar refractivity (mr), and the number of rotatable bonds (b_1rotN).
Remarkably, the compounds found in the grey cluster, along with compounds 8, 18, and 28 from the green cluster, exhibit potent anticancer activity against HT-29 colorectal tumor cells (IC50: <10 µM; IC50 for compound 24 = 10.5 µM), as discovered by Abdel-Fattah et al. [27]. These findings suggest that the variables highlighted in the grey and green clusters likely play a significant role in determining the biological activity of the compounds. ‘MNDO_HOMO’, ‘MNDO_LUMO’, and ‘h_pKa’ are positioned between the grey and green clusters, indicating their moderate influence on both clusters. ‘logS’ and ‘E’ were positioned on the negative sides of both PCs. This suggests that a lower ‘logS’ (indicating higher hydrophobicity) and a lower ‘E’ (energy) are associated with a higher probability of compounds exhibiting a potent effect against HT-29 tumor cells.
2.2. Drug Likeness
The PCA provided insights into the key variables (descriptors) that influence the anticancer activity of the compounds under investigation. These variables included ‘LogP’, ‘Csp3’, and ‘b_1rotN’, which have a significant impact on the compounds in the grey cluster, and ‘LogS’, ‘TPSA’, and ‘lip_don’, which strongly influence the compounds in the green cluster. These variables are closely associated with the drug likeness potential of newly designed drug candidates, as indicated in previous studies [28,29]. The biologically active compounds in the grey cluster (consisting of nine compounds) and green cluster (consisting of three compounds), were then ranked based on the standard values of the aforementioned descriptors, as shown in Table 2. This ranking helped to identify the compounds that not only exhibit potent in vitro anticancer activity against HT-29 colorectal tumor cells, but also show promising bioavailability and pharmacokinetic properties.
Table 2.
Drug-likeness data of the biologically active compounds gathered in the grey and green clusters.
According to Lipinski’s rule of five, compounds that have a molecular weight below 500, a partition coefficient (LogP) lower than 5, fewer than 5 hydrogen bond donors (lip_don), and fewer than 10 hydrogen bond acceptors (lip_acc) are more likely to exhibit good oral bioavailability [28]. Additionally, two more criteria could be considered according to Veber’s rule, which include a total polar surface area (TPSA) of less than 140 Å2, and a number of rotatable bonds (b_1rotN) fewer than 10 [29]. Another factor that affects drug likeness is the fraction of sp3-hybridized carbon atoms (Csp3), which is the fraction or the number of the sp3 carbon atoms out of the total carbon count. This parameter reflects the carbon saturation of molecules and characterizes the complexity of their spatial structure. Compounds with zero Csp3 are completely planar and tend to have poor absorption and bioavailability [30]. Similarly, water solubility (LogS) plays a significant role in the absorption of newly designed drugs across biological membranes. Drug candidates with a LogS value around −4 or higher are more likely to have good oral bioavailability [31].
Among the compounds listed in Table 2, compounds 10, 20, and 30 violated the criteria for ‘LogP’ and ‘Csp3’. Additionally, their ‘LogS’ values deviated significantly from −4 compared to the other compounds. Compounds 8, 18, and 28 violated only the ‘Csp3’ criterion. On the other hand, compounds 2, 6, 14, 16, 24, and 26 did not violate any of the investigated drug-likeness criteria, suggesting that they are expected to have favorable bioavailability in addition to their previously reported potent in vitro anticancer activity against HT-29 cells.
2.3. Molecular Docking
The PCA and drug likeness analyses played a crucial role in guiding our selection process for compounds that were to be investigated further through molecular docking. Based on the PCA results, we specifically targeted compounds that belonged to both the grey and green clusters, resulting in a total of 12 compounds. To prioritize our focus, we ranked these 12 compounds based on drug likeness-related descriptors. As a result, only nine compounds met the criteria and were selected for subsequent investigation using molecular docking, which helped to explore the binding affinities of the shortlisted compounds to the PIM-1 kinase (PDB ID: 2OBJ). This enzyme has been identified as a significant molecular target in cancer management and treatment. The overexpression of PIM-1 kinase has been associated with various cancer-related processes, including cell proliferation, cell cycle progression, apoptosis, invasion, and glycolysis [9]. Some 1,2-dihydropyridine derivatives were reported to have an appreciable inhibitory potential against PIM-1 kinase enzyme [9]. Among them was (6-(5-bromo-2-hydroxyphenyl)-2-oxo-4-phenyl-1,2-dihydropyridine-3-carbonitrile), which exhibited an IC50 value of 0.05 uM and was also co-crystallized and bound to the enzyme (PDB ID: 2OBJ). In the current study, we used this compound as a reference inhibitor of PIM-1 kinase in our in silico studies, so that we could conduct effective comparisons and prioritize the newly proposed compounds for further testing.
As depicted in Table 3, all nine docked 1,2-dihydropyridine derivatives demonstrated a favorable fit within the binding site of the PIM-1 kinase enzyme, exhibiting binding affinities comparable to that of the co-crystallized reference ligand (ref). The docking scores of the compounds ranged from −11.77 to −9.27 kcal/mol, while the co-crystallized ligand scored −12.08 kcal/mol. The compounds successfully recapitulated a significant number of the interactions observed with the co-crystallized ligand, including the arene-H hydrophobic interaction with Val52 and the H-bonding interactions with Lys67, Glu89, and Asp186. Noteworthy, the most significant interaction, as per Cheney et al., was the H-bond formed between the carbonyl group of the ligand’s pyridone ring with the Lys67 side chain in the binding site [9]. Additionally, the arene–H interaction between the Val52 side chain and the bromophenol moiety of the ligand confers extra stability to the complex. Figure 3 illustrates the 2D and 3D interactions between compound 6 and the amino acid residues within the binding site of the PIM-1 kinase.
Table 3.
Docking scores and amino acids interactions obtained upon docking the shortlisted 1,2-dihydropyridine derivatives to the PIM-1 kinase enzyme.
Figure 3.
The 2D (left) and 3D (right) interactions of compound 6 with the amino acid residues in the binding site of PIM-1 kinase.
Based on the docking results presented in Table 3, compounds 6, 14, and 16 exhibited the highest binding affinities, as evidenced by their lower docking scores, and exhibited profound anticancer activity against HT-29 cells as per their previously reported IC50 values of 0.7, 9.3, and 1.5 μM, respectively, along with appreciable drug likeness potential.
Furthermore, we utilized the chemical scaffold of compound 6 and that of the ligand inhibitor that was co-crystallized with PIM-1 kinase (PDB: 2OBJ) to design nine novel compounds (31–39) bearing the hybrid structural features of the previously mentioned two compounds, as shown in Table 4.
Table 4.
The general scaffold of the newly designed 1,2-dihydropyridine derivatives.
These nine proposed designs, derived from compound 6 and the co-crystallized ligand, were shown to fulfill the drug-likeness criteria according to Lipinski’s and Veber’s rules compared to the previous candidates listed in Table 2. Upon docking compounds 31–39 to the PIM-1 kinase enzyme (2OBJ), they were found to fit properly in the binding site, with docking score values ranging from −13.11 to −11.73 kcal/mol (Table 5). These compounds exhibited a similar amino acid interaction pattern to that of compound 6, indicating an appreciable PIM-1 kinase inhibitory potential. Compound 31 exhibited the best binding affinity to the target enzyme, with a docking score of −13.11 kcal/mol compared to the −12.08 kcal/mol obtained by the co-crystallized ligand inhibitor, as shown in Figure 4.
Table 5.
Docking scores and amino acid interactions obtained upon docking the proposed 1,2-dihydropyridine designs (31–39) to the PIM-1 kinase enzyme.
Figure 4.
The 2D (left) and 3D (right) interactions of compound 31 with the amino acid residues in the binding site of PIM-1 kinase.
We subsequently employed the similarity search tool in the PubChem database to identify other compounds with similar chemical scaffolds to compound 6 that also possess reported curation effects. This approach allowed us to investigate the PIM-1 kinase-inhibitory potential of existing structurally similar compounds. Such an investigation highlights the importance of database mining in drug discovery, as it enables the identification of new clinical applications for these known compounds.
In this context, we applied the fingerprint Tanimoto-based two-dimensional similarity search with a similarity threshold of 90%. Upon excluding the homologues of compound 6, sixteen compounds were finally retained (compounds 40–55). Table 6 illustrates the chemical structure of these compounds along with their reported curation effect(s).
Table 6.
The chemical structures of compounds 40–55 and their reported curation effect(s).
The sixteen selected compounds were subject to molecular docking to the PIM-1 Kinase enzyme (2OBJ). Similarly, these compounds were found to fit properly in the binding site of the target enzyme, with docking score values ranging from −13.03 to −8.28 kcal/mol (Table 7), with quite similar amino acid interactions patterns to compound 6, which indicates an appreciable PIM-1 kinase inhibitory potential as well. Compound 52 exhibited the best binding affinity to the enzyme, with a docking score of −13.03 kcal/mol compared to −12.08 kcal/mol for the co-crystallized ligand inhibitor, as shown in Figure 5.
Table 7.
Docking scores and amino acids interactions obtained upon the docking of compounds (40–55) to the PIM-1 kinase enzyme.
Figure 5.
The 2D (left) and 3D (right) interactions of compound 52 with the amino acid residues in the binding site of PIM-1 kinase.
2.4. Molecular Dynamics (MD)
To confirm the stability of the protein–ligand complexes obtained upon docking the studied compounds to PIM-1 kinase (2OBJ), we performed molecular dynamics studies on compounds 6 (which had the best docking score among the original thirty 1,2-dihydropyridine derivatives), 31 (which had the best docking score among the nine proposed designs), and 52 (which the best docking score among compounds with similar structures to the 1,2-dihydropyridine derivatives with curation effects), along with the co-crystallized reference ligand (ref). These compounds were identified as the top candidates based on the previous analyses.
Molecular dynamics simulations showed that the studied compounds afforded more H-bond interactions than those retained in the molecular docking. For example, compound 6 showed only two crucial interactions with Lys67 and Val52 upon docking; however, it showed more H-bond interactions with Leu44 and Glu89 (Figure 6e). This outcome is expected, as the MD simulations provide a more dynamic and realistic picture of the protein–ligand complex over time. These interactions can change as the complex undergoes conformational changes or obtains different degrees of flexibility, offering insights into the dynamics of H-bond formation when compared to what occurs in the docking simulations, which are often static and based on a limited number of conformations.
Figure 6.
Molecular dynamics simulation data analysis for 2OBJ-cpd6, 2OBJ-cpd31, 2OBJ-cpd52, and 2OBJ-ref complexes. (a–d) show the H-bonds, SASA, RMSD, RoG and for the 2OBJ-cpd6 (red), 2OBJ-cpd31 (green), 2OBJ-cpd52 (purple), and 2OBJ-ref (black) complexes versus the simulation time in ns. (e) shows the per-residue RMSF from the protein sequence of 2OBJ-cpd6 (red), 2OBJ-cpd31 (green), 2OBJ-cpd52 (purple), and 2OBJ-ref (black) complexes.
Figure 6 shows the MD simulation analysis of the protein–compound (cpd) complexes 2OBJ-cpd6 (red), 2OBJ-cpd31 (green), 2OBJ-cpd52 (purple), and 2OBJ-ref (black). The four complexes were shown to be stable based on an analysis of the total number of H-bonds (6a), SASA (6b), RMSD (6c), and RoG (6d) in the production phase of MD simulation. The results showed that the three compounds exhibited a stable binding mode in the binding site of the PIM-1 kinase, similarly to the co-crystallized ligand (ref), and maintained their initial docking poses. The root mean square deviation (RMSD) curves indicate equilibration of the four systems after 20 ns, with average RMSD values of 2.10, 2.04, 2.23, 2.48 Å for 2OBJ-cpd6, 2OBJ-cpd31, 2OBJ-cpd52, and 2OBJ-ref complexes, respectively. The RMSD values of the 2OBJ-cpd6, 2OBJ-cpd31, 2OBJ-cpd52 complexes were slightly lower than the reference, which means that these ligands would not disrupt the stability of the protein and the function of the enzyme would not be altered. Notably, it was observed that, at 70 ns, the RMSD value of the 2OBJ-ref complex showed a slight elevation, which may be attributed to the free rotation of its unsubstituted phenyl ring (as was seen upon visual inspection of the MD trajectory). Similar results were obtained by Akintemi et al. upon studying the molecular dynamic simulations of the complexes between Sulfonylurea Receptor 1 (SUR1) and the polyphenols Galagin and Quercetin [46]: Galagin possesses an unsubstituted phenyl group and is characterized by a higher RMSD in comparison with Quercetin, which has the same structure as Galagin with a substituted phenyl group. Additionally, the numbers of the radius of gyration (RoG), the solvent-accessible surface area (SASA), and hydrogen bonds (H-bonds) average at around 19.4 Å, 14,500 Å2, and 80, respectively. The per-residue root mean square fluctuations (RMSFs) for the enzyme chain (6e) were plotted for the four complexes. The RMSF curves of the four complexes showed an almost identical fluctuation pattern. The highest fluctuations occurred around the Val52 and Lys67, which are the crucial residues involved in the binding pattern of the co-crystallized ligand to the PIM-1 kinase.
Furthermore, free energy calculations (based on Van der Waals, and electrostatic interactions) for the four complexes were conducted based on the NAMD output using MolAICal version 1.3 [47,48], as follows:
where G = Evdw + EElec, with solvation correction included in the EElec term as described in the MolAICal methodology [47,48].
ΔG = Gcomplex − Gligand − Gprotein
Based on the ΔG analysis shown in Table 8, it was confirmed that the four complexes are stable, with complex 2OBJ-cpd31 being the most stable among them.
Table 8.
ΔG values of the four PIM-1 kinase–ligand complexes.
3. Methods
3.1. Computing the Descriptors of the 1,2-Dihydropyridine Derivatives
The chemical structures of the previously reported 1,2-dihydropyridine derivatives (Table 1) were drawn using the builder tool of the molecular operating environment software (Molecular Operating Environment (MOE), 2022.02 Chemical Computing Group ULC, 1010 Sherbooke St. West, Suite #910, Montreal, QC, Canada, H3A 2R7, MOE2022.v11.18.1) and compiled into one database. Then, all the 239 2D and 3D descriptors listed by MOE related to electronic, physical, and chemical properties were computed using the descriptor calculator tool of the software.
3.2. Principal Component Analysis (PCA)
The PCA methodology adopted for this study is the same as that used in the work by Younes and Grasset [24]. It was performed using XLSTAT 2014. PCA is the bedrock technique for dimensionality reductions and is commonly used in different scientific and engineering fields. It was employed to reveal a lower-dimensional pattern that is most likely representative of the whole dataset. This will help create models that best interpret the scientific concept at hand. From a statistical perspective, PCA is the mathematical presentation of singular value decomposition (SVD). It will, specifically, yield a new data-driven coordinates system that represents the statistical variations in the investigated dataset. In another sense, it reveals the highest variance based on projections of the given data [19,49].
3.3. Molecular Docking
The target protein PIM-1 kinase was downloaded from the protein databank (PDB ID: 2OBJ). Molecular Operating Environment (MOE), 2022.02 was used to perform the docking runs and visualize the docking poses and interactions. The protein structures were prepared with the help of the Quickprep tool in MOE. The chemical structure of the co-crystallized ligand was drawn using the MOE builder tool and added to the compounds database to be docked among the compounds for the purpose of docking protocol validation. The energy of all ligands was minimized, and they were prepared using the database wash tool to adjust their formal charges. The default docking protocol adopting the triangle matcher as a placement method and London dG as a scoring function was applied. The output docking poses were explored and ranked according to their scores and the interactions they exhibited with the amino acid residues at the binding site. Self-docking of the co-crystallized ligand showed root mean square deviation (RMSD) values of less than two [50].
3.4. Molecular Dynamics (MD) Simulations
MD simulations of the four complexes were performed using the NAMD 2.13 software installed on the HPC platform in the American University of the Middle East (AUM), Kuwait. The calculations were performed utilizing Amber10: EHT force field [Ref.] for the protein–ligand complex and the TIP3P water model. The complex was first minimized, followed by heating to 310 K, then equilibration (at 1 atm, and 310 K) for 200 ps, and finally the production phase (for 100 ns). A 0.1 M NaCl was added to the protein–water solution [51,52]. The periodic boundary condition was utilized with a cubic simulation box. After the simulation, the trajectories were clustered and analyzed, as will be shown in the results section using VMD 1.9.3 software and in-house codes [53]. A free energy calculation (based on Van der Waals and electrostatic interactions) of the four complexes was conducted based on NAMD output files using the MolAICal protocol [47,48].
4. Conclusions
In conclusion, our study successfully identified key physicochemical properties correlated with the anticancer activity of thirty 1,2-dihydropyridine derivatives using Principal Component Analysis (PCA). By focusing on descriptors related to drug-likeness, we refined our dataset to nine promising candidates. Among these, compound 6 demonstrated the best binding affinity towards the PIM-1 kinase enzyme in terms of its docking score value. Further exploration led to the design of nine novel compounds, with compound 31 exhibiting the highest binding affinity. Additionally, data mining using the PubChem database revealed structurally related compounds, with compound 52 showing significant binding affinity towards the PIM-1 kinase. Molecular dynamics studies confirmed the stability of the protein–ligand complexes, reinforcing the therapeutic potential of these compounds. The complex between compound 31 and the PIM-1 kinase was shown to be the most stable in terms of ΔG values. This comprehensive approach highlights the critical role of data mining in discovering new potential anticancer drugs, underscoring the effectiveness of integrating machine learning, database mining, and molecular modeling in the drug discovery process, and paving the way for the development of innovative anticancer therapies targeting PIM-1 kinase. Moving forward, we aim to synthesize the proposed compounds and conduct further bioassays to confirm their inhibitory potential against the target enzyme.
Author Contributions
Conceptualization, H.D., M.A.O.A. and K.Y.; methodology, H.D., M.A.O.A., M.A.-S. and K.Y.; software, H.D., M.A.-S. and M.A.O.A.; validation H.D., M.A.O.A. and K.Y.; formal analysis, H.D., M.A.O.A. and K.Y.; investigation, H.D., M.A.O.A. and K.Y.; writing—original draft preparation, H.D., M.A.O.A. and K.Y.; writing—review and editing, K.Y.; supervision, M.A.O.A. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Data is contained within the article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Drabu, S.; Kumar, N. Synthesis and Biological Screening of Substituted 2-Aminocyano Pyridines. Asian J. Chem. 2007, 19, 4957. [Google Scholar]
- Wang, Y.; Liu, G.; Reyes, J.C.P.; Duverna, R. One-Pot Synthesis of 3-Cyano-2-pyridones. J. Heterocycl. Chem. 2015, 52, 1185–1191. [Google Scholar] [CrossRef]
- Sunderhaus, J.D.; Martin, S.F. Applications of Multicomponent Reactions to the Synthesis of Diverse Heterocyclic Scaffolds. Chem. A Eur. J. 2009, 15, 1300–1308. [Google Scholar] [CrossRef] [PubMed]
- Alaa, A.-M.; El-Subbagh, H.I.; Kunieda, T. Lewis Acid-Promoted Transformation of 2-Alkoxypyridines into 2-Aminopyridines and Their Antibacterial Activity. Part 2: Remarkably Facile C–N Bond Formation. Bioorg. Med. Chem. 2005, 13, 4929–4935. [Google Scholar]
- Bekhit, A.A.; Baraka, A.M. Novel Milrinone Analogs of Pyridine-3-Carbonitrile Derivatives as Promising Cardiotonic Agents. Eur. J. Med. Chem. 2005, 40, 1405–1413. [Google Scholar] [CrossRef]
- Abdel-Latif, N.A. Synthesis and Antidepressant Activity of Some New Coumarin Derivatives. Sci. Pharm. 2005, 73, 193–216. [Google Scholar] [CrossRef]
- Murata, T.; Shimizu, K.; Narita, M.; Manganiello, V.C.; Tagawa, T. Characterization of Phosphodiesterase 3 in Human Malignant Melanoma Cell Line. Anticancer Res. 2002, 22, 3171–3174. [Google Scholar] [PubMed]
- Murata, T.; Shimada, M.; Sakakibara, S.; Yoshino, T.; Kadono, H.; Masuda, T.; Shimazaki, M.; Shintani, T.; Fuchikami, K.; Sakai, K. Discovery of Novel and Selective IKK-β Serine-Threonine Protein Kinase Inhibitors. Part 1. Bioorg. Med. Chem. Lett. 2003, 13, 913–918. [Google Scholar] [CrossRef]
- Cheney, I.W.; Yan, S.; Appleby, T.; Walker, H.; Vo, T.; Yao, N.; Hamatake, R.; Hong, Z.; Wu, J.Z. Identification and Structure–Activity Relationships of Substituted Pyridones as Inhibitors of Pim-1 Kinase. Bioorg. Med. Chem. Lett. 2007, 17, 1679–1683. [Google Scholar] [CrossRef]
- Wendt, M.D.; Sun, C.; Kunzer, A.; Sauer, D.; Sarris, K.; Hoff, E.; Yu, L.; Nettesheim, D.G.; Chen, J.; Jin, S. Discovery of a Novel Small Molecule Binding Site of Human Survivin. Bioorg. Med. Chem. Lett. 2007, 17, 3122–3129. [Google Scholar] [CrossRef]
- Chen, J.; Tang, G. PIM-1 Kinase: A Potential Biomarker of Triple-Negative Breast Cancer. OncoTargets Ther. 2019, 12, 6267. [Google Scholar] [CrossRef] [PubMed]
- Gasteiger, J.; Zupan, J. Neural Networks in Chemistry. Angew. Chem. Int. Ed. Engl. 1993, 32, 503–527. [Google Scholar] [CrossRef]
- Behler, J. First Principles Neural Network Potentials for Reactive Simulations of Large Molecular and Condensed Systems. Angew. Chem. Int. Ed. 2017, 56, 12828–12840. [Google Scholar] [CrossRef] [PubMed]
- Butler, K.T.; Davies, D.W.; Cartwright, H.; Isayev, O.; Walsh, A. Machine Learning for Molecular and Materials Science. Nature 2018, 559, 547–555. [Google Scholar] [CrossRef] [PubMed]
- Deringer, V.L.; Bernstein, N.; Bartók, A.P.; Cliffe, M.J.; Kerber, R.N.; Marbella, L.E.; Grey, C.P.; Elliott, S.R.; Csányi, G. Realistic Atomistic Structure of Amorphous Silicon from Machine-Learning-Driven Molecular Dynamics. J. Phys. Chem. Lett. 2018, 9, 2879–2885. [Google Scholar] [CrossRef] [PubMed]
- Sanchez-Lengeling, B.; Aspuru-Guzik, A. Inverse Molecular Design Using Machine Learning: Generative Models for Matter Engineering. Science 2018, 361, 360–365. [Google Scholar] [CrossRef]
- Aspuru-Guzik, A.; Baik, M.-H.; Balasubramanian, S.; Banerjee, R.; Bart, S.; Borduas-Dedekind, N.; Chang, S.; Chen, P.; Corminboeuf, C.; Coudert, F.X. Charting a Course for Chemistry. Nat. Chem. 2019, 11, 286–294. [Google Scholar] [CrossRef]
- Kononova, O.; Huo, H.; He, T.; Rong, Z.; Botari, T.; Sun, W.; Tshitoyan, V.; Ceder, G. Text-Mined Dataset of Inorganic Materials Synthesis Recipes. Sci. Data 2019, 6, 203. [Google Scholar] [CrossRef]
- Joliffe, I.T.; Morgan, B.J.T. Principal Component Analysis and Exploratory Factor Analysis. Stat. Methods Med. Res. 1992, 1, 69–95. [Google Scholar] [CrossRef]
- Younes, K.; Grasset, L. Comparison of Thermochemolysis and Classical Chemical Degradation and Extraction Methods for the Analysis of Carbohydrates, Lignin and Lipids in a Peat Bog. J. Anal. Appl. Pyrolysis 2018, 134, 61–72. [Google Scholar] [CrossRef]
- Younes, K.; Grasset, L. The Application of DFRC Method for the Analysis of Carbohydrates in a Peat Bog: Validation and Comparison with Conventional Chemical and Thermochemical Degradation Techniques. Chem. Geol. 2020, 545, 119644. [Google Scholar] [CrossRef]
- Ibrahimi, M.; Korichi, W.; Loqman, S.; Hafidi, M.; Ouhdouch, Y.; Lemee, L. Thermochemolysis–GC-MS as a Tool for Chemotaxonomy and Predation Monitoring of a Predatory Actinobacteria against a Multidrug Resistant Bacteria. J. Anal. Appl. Pyrolysis 2020, 145, 104740. [Google Scholar] [CrossRef]
- Younes, K.; Grasset, L. Analysis of Molecular Proxies of a Peat Core by Thermally Assisted Hydrolysis and Methylation-Gas Chromatography Combined with Multivariate Analysis. J. Anal. Appl. Pyrolysis 2017, 124, 726–732. [Google Scholar] [CrossRef]
- Younes, K.; Grasset, L. Carbohydrates as Proxies in Ombrotrophic Peatland: DFRC Molecular Method Coupled with PCA. Chem. Geol. 2022, 606, 120994. [Google Scholar] [CrossRef]
- Younes, K.; Laduranty, J.; Descostes, M.; Grasset, L. Molecular Biomarkers Study of an Ombrotrophic Peatland Impacted by an Anthropogenic Clay Deposit. Org. Geochem. 2017, 105, 20–32. [Google Scholar] [CrossRef]
- Korichi, W.; Ibrahimi, M.; Loqman, S.; Ouhdouch, Y.; Younes, K.; Lemée, L. Assessment of Actinobacteria Use in the Elimination of Multidrug-Resistant Bacteria of Ibn Tofail Hospital Wastewater (Marrakesh, Morocco): A Chemometric Data Analysis Approach. Environ. Sci. Pollut. Res. 2021, 28, 26840–26848. [Google Scholar] [CrossRef]
- Abdelf-Fattah, M.A.O.; El-Naggar, M.A.M.; Rashied, R.M.H.; Gary, B.D.; Piazza, G.A.; Abadi, A.H. Four-Component Synthesis of 1,2-Dihydropyridine Derivatives and Their Evaluation as Anticancer Agents. Med. Chem. 2012, 8, 392–400. [Google Scholar] [CrossRef]
- Lipinski, C.A.; Lombardo, F.; Dominy, B.W.; Feeney, P.J. Experimental and Computational Approaches to Estimate Solubility and Permeability in Drug Discovery and Development Settings. Adv. Drug Deliv. Rev. 2012, 64, 4–17. [Google Scholar] [CrossRef]
- Veber, D.F.; Johnson, S.R.; Cheng, H.-Y.; Smith, B.R.; Ward, K.W.; Kopple, K.D. Molecular Properties That Influence the Oral Bioavailability of Drug Candidates. J. Med. Chem. 2002, 45, 2615–2623. [Google Scholar] [CrossRef]
- Wei, W.; Cherukupalli, S.; Jing, L.; Liu, X.; Zhan, P. Fsp3: A New Parameter for Drug-Likeness. Drug Discov. Today 2020, 25, 1839–1845. [Google Scholar] [CrossRef]
- Sorkun, M.C.; Khetan, A.; Er, S. AqSolDB, a Curated Reference Set of Aqueous Solubility and 2D Descriptors for a Diverse Set of Compounds. Sci. Data 2019, 6, 143. [Google Scholar] [CrossRef]
- Method of Using Aminocyanopyridine Compounds as Mitogen Activated Protein Kinase-Activated Protein Kinase-2 Inhibitors—Patent EP-1569645-A2—PubChem. Available online: https://pubchem.ncbi.nlm.nih.gov/patent/EP-1569645-A2 (accessed on 23 June 2024).
- AID 540256—qHTS for Inhibitors of Binding or Entry into Cells for Lassa Virus—PubChem. Available online: https://pubchem.ncbi.nlm.nih.gov/bioassay/540256 (accessed on 23 June 2024).
- Use of Aminocyanopyridines as Mitogen-Activated Protein Kinase-Activated Protein Kinase-2 Inhibitors—Patent JP-2006512338-A—PubChem. Available online: https://pubchem.ncbi.nlm.nih.gov/patent/JP-2006512338-A (accessed on 23 June 2024).
- Coburn, C.A.; Holloway, M.K.; Stachel, S.J. 2-Aminopyridine Compounds Useful as Beta-Secretase Inhibitors for the Treatment of Alzheimer’s Disease. EP1807396A1, 26 June 2013. [Google Scholar]
- Murata, T.; Umeda, M.; Sakakibara, S.; Yoshiro, T.; Sato, H.; Masuda, T.; Koriyama, Y.; Shimada, M.; Shintani, T.; Kadono, H.; et al. Pyridine Derivatives. EA200700099A1, 31 August 2007. [Google Scholar]
- Boehringer, M.; Loeffler, B.M.; Peters, J.-U.; Riemer, C.; Weiss, P. Pyridine and Quinoline Derivatives. US20030195188A1, 5 October 2004. [Google Scholar]
- AID 1671174—Cornell—Phenotypic Assay to Identify Agents That Inhibit Growth of Mycobacterium Tuberculosis—PubChem. Available online: https://pubchem.ncbi.nlm.nih.gov/bioassay/1671174 (accessed on 23 June 2024).
- Albert, J.S.; Callaghan, O.; Campbell, J.; Carr, R.A.E.; Chessari, G.; Cowan, S.; Congreve, M.S.; Edwards, P.; Frederickson, M.; Murray, C.W.; et al. Substituted Aminopyridines and Uses Thereof 2008 US20080287399A1. Available online: https://patents.google.com/patent/US20080287399A1/en?oq=Patent:+US-20080287399-A1 (accessed on 23 June 2024).
- Shao, B.; Victory, S.; Ilyin, V.I.; Goehring, R.R.; Sun, Q.; Hogenkamp, D.; Hodges, D.D.; Islam, K.; Sha, D.; Zhang, C.; et al. Phenoxyphenyl Pyridines as Novel State-Dependent, High-Potency Sodium Channel Inhibitors. J. Med. Chem. 2004, 47, 4277–4285. [Google Scholar] [CrossRef]
- Amberg, W.; Netz, A.; Kling, A.; Ochse, M.; Lange, U.; Hutchins, C.W.; Garcia-Ladona, F.J.; Wernet, W. Heterocyclische Verbindungen und Ihre Verwendung als Bindungspartner Für 5-ht5-Rezeptoren. WO2007022946A1, 1 March 2007. [Google Scholar]
- Lowinger, T.B.; Murata, T.; Umeda, M.; Sakakibara, S.; Yoshino, T.; Sato, H.; Masuda, T.; Koriyama, Y.; Shimada, M.; Shintani, T.; et al. Pyridine Derivatives with Ikb-Kinase (Ikk-Beta) Inhibiting Activity. WO2002024679A1, 19 December 2007. [Google Scholar]
- Schirok, H.; Redlich, G.; Grande, Y.C.; Leineweber, K.; Bender, E.; Tinel, H.; Münter, K.; Kolkhof, P.; Himmel, H.; Straub, A. Substituierte 2-amino-3-Cyanopyridine als Inhibitoren des Natrium Calcium Austausches und Ihre Verwendung bei Kardiovaskulären Erkrankungen. WO2013144191A1, 27 March 2013. [Google Scholar]
- Calderone, V.; Minutolo, F.; Tuccinardi, T.; Testai, L.; Granchi, C.; Martelli, A.; Citi, V.; Gardinal, V.D.L.; Lenzi, G.; Leo, F.; et al. Fnew Activators of Sirt1 Enzyme for the Treatment of Cardiovascular and Cardiometabolic Pathologies 2019. WO2019162911A1. Available online: https://patents.google.com/patent/WO2019162911A1/en?oq=WO-2019162911-A1 (accessed on 23 June 2024).
- Schann, S.; Fer, M.; Mayer, S.; Doebelin, C. 5-Heteroaryl-Pyridin-2-Amine Confounds as Neuropeptide Ff Receptor Antagonists. EP4010328A1, 11 February 2021. [Google Scholar]
- Oluwafisayo Akintemi, E.; Kuben Govender, K.; Singh, T. Molecular Dynamics and Docking Investigation of Flavonol Aglycones against Sulfonylurea Receptor 1 (SUR1) for Anti–Diabetic Drug Design. ChemistrySelect 2024, 9, e202302488. [Google Scholar] [CrossRef]
- Bai, Q.; Tan, S.; Xu, T.; Liu, H.; Huang, J.; Yao, X. MolAICal: A Soft Tool for 3D Drug Design of Protein Targets by Artificial Intelligence and Classical Algorithm. Brief. Bioinform. 2020, 22, bbaa161. [Google Scholar] [CrossRef]
- Bai, Q. Research and Development of MolAICal for Drug Design via Deep Learning and Classical Programming. arXiv 2020, arXiv:2006.09747. [Google Scholar] [CrossRef]
- Kim, M.; Chang, J.W.; Park, K.; Yang, D.R. Comprehensive Assessment of the Effects of Operating Conditions on Membrane Intrinsic Parameters of Forward Osmosis (FO) Based on Principal Component Analysis (PCA). J. Membr. Sci. 2022, 641, 119909. [Google Scholar] [CrossRef]
- Hawkins, P.C.D.; Nicholls, A. Conformer Generation with OMEGA: Learning from the Data Set and the Analysis of Failures. J. Chem. Inf. Model. 2012, 52, 2919–2936. [Google Scholar] [CrossRef]
- Harrach, M.F.; Drossel, B. Structure and Dynamics of TIP3P, TIP4P, and TIP5P water near smooth and atomistic walls of different hydroaffinity. J. Chem. Phys. 2014, 140, 174501. [Google Scholar] [CrossRef]
- Mark, P.; Nilsson, L. Structure and Dynamics of the TIP3P, SPC, and SPC/E Water Models at 298 K. J. Phys. Chem. A 2001, 105, 9954–9960. [Google Scholar] [CrossRef]
- Humphrey, W.; Dalke, A.; Schulten, K. VMD: Visual Molecular Dynamics. J. Mol. Graph. 1996, 14, 33–38. [Google Scholar] [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

















