Virtual Screening of Argentinian Natural Products to Identify Anti-Cancer Aurora Kinase A Inhibitors: A Combined Machine Learning and Molecular Docking Approach †
Abstract
1. Introduction
2. Materials and Methods
2.1. Compounds Database
2.2. Machine Learning Models
2.3. Protein Preparation and Molecular Docking
2.4. Molecular Dynamics Simulations
3. Results and Discussion
3.1. Machine Learning Model Performance
3.2. Molecular Docking Results
3.3. Dynamic Stability and Conformational Behavior
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Hanahan, D.; Weinberg, R.A. Hallm. Cancer: Next Generation. Cell 2011, 144, 646–674. [Google Scholar] [CrossRef]
- Polverino, F.; Mastrangelo, A.; Guarguaglini, G. Contribution of AurkA/TPX2 Overexpression to Chromosomal Imbalances and Cancer. Cells 2024, 13, 1397. [Google Scholar] [CrossRef]
- McIntyre, P.J.; Collins, P.M.; Vrzal, L.; Birchall, K.; Arnold, L.H.; Mpamhanga, C.; Coombs, P.J.; Burgess, S.G.; Richards, M.W.; Winter, A.; et al. Characterization of Three Druggable Hot-Spots in the Aurora-A/TPX2 Interaction Using Biochemical, Biophysical, and Fragment-Based Approaches. ACS Chem. Biol. 2017, 12, 2906–2914. [Google Scholar] [CrossRef] [PubMed]
- Janeček, M.; Rossmann, M.; Sharma, P.; Emery, A.; Huggins, D.J.; Stockwell, S.R.; Stokes, J.E.; Tan, Y.S.; Almeida, E.G.; Hardwick, B.; et al. Allosteric Modulation of AURKA Kinase Activity by a Small-Molecule Inhibitor of Its Protein–Protein Interaction with TPX2. Sci. Rep. 2016, 6, 28528. [Google Scholar] [CrossRef] [PubMed]
- Newman, D.J.; Cragg, G.M. Natural Products as Sources of New Drugs over the Nearly Four Decades from 01/1981 to 09/2019. J. Nat. Prod. 2020, 83, 770–803. [Google Scholar] [CrossRef] [PubMed]
- Martínez Heredia, L.; Quispe, P.A.; Fernández, J.F.; Lavecchia, M.J. NaturAr: A Collaborative, Open-Source Database of Natural Products from Argentinian Biodiversity for Drug Discovery and Bioprospecting. J. Chem. Inf. Model. 2025, 65, 1889–1900. [Google Scholar] [CrossRef]
- Jusoh, A.S.; Remli, M.A.; Mohamad, M.S.; Cazenave, T.; Fong, C.S. How Generative Artificial Intelligence Can Transform Drug Discovery? Eur. J. Med. Chem. 2025, 295, 117825. [Google Scholar] [CrossRef]
- Kitchen, D.B.; Decornez, H.; Furr, J.R.; Bajorath, J. Docking and Scoring in Virtual Screening for Drug Discovery: Methods and Applications. Nat. Rev. Drug Discov. 2004, 3, 935–949. [Google Scholar] [CrossRef]
- NaturAr. Base de Datos de Productos Naturales de Argentina; Universidad Nacional de La Plata: La Plata, Argentina, 2025; Available online: https://naturar.quimica.unlp.edu.ar/es/ (accessed on 15 May 2025).
- RDKit. Open-Source Cheminformatics. 2023. Available online: http://www.rdkit.org (accessed on 20 May 2025).
- Kingma, D.P.; Welling, M. Auto-Encoding Variational Bayes. arXiv 2013, arXiv:1312.6114. [Google Scholar]
- Gómez-Bombarelli, R.; Wei, J.N.; Duvenaud, D.; Hernández-Lobato, J.M.; Sánchez-Lengeling, B.; Sheberla, D.; Aguilera-Iparraguirre, J.; Hirzel, T.D.; Adams, R.P.; Aspuru-Guzik, A. Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules. ACS Cent. Sci. 2018, 4, 268–276. [Google Scholar] [CrossRef]
- Wu, J.; Chen, Y.; Wu, J.; Zhao, D.; Huang, J.; Lin, M.; Wang, L. Large-Scale Comparison of Machine Learning Methods for Profiling Prediction of Kinase Inhibitors. J. Cheminform. 2024, 16, 13. [Google Scholar] [CrossRef] [PubMed]
- Berman, H.M.; Westbrook, J.; Feng, Z.; Gilliland, G.; Bhat, T.N.; Weissig, H.; Shindyalov, I.N.; Bourne, P.E. The Protein Data Bank. Nucleic Acids Res. 2000, 28, 235–242. [Google Scholar] [CrossRef] [PubMed]
- OpenEye Scientific Software. FRED v4.3.0.3: Flexible Docking for Structure-Based Drug Discovery; OpenEye Scientific: Santa Fe, NM, USA, 2020; Available online: https://www.eyesopen.com (accessed on 22 May 2025).
- McGann, M. FRED and HYBRID Docking Performance on Standardized Datasets. J. Comput.-Aided Mol. Des. 2012, 26, 897–906. [Google Scholar] [CrossRef] [PubMed]
- Phillips, J.C.; Braun, R.; Wang, W.; Gumbart, J.; Tajkhorshid, E.; Villa, E.; Chipot, C.; Skeel, R.D.; Kalé, L.; Schulten, K. Scalable Molecular Dynamics with NAMD. J. Comput. Chem. 2005, 26, 1781–1802. [Google Scholar] [CrossRef]
- Genheden, S.; Ryde, U. The MM/PBSA and MM/GBSA Methods to Estimate Ligand-Binding Affinities. Expert Opin. Drug Discov. 2015, 10, 449–461. [Google Scholar] [CrossRef]
- Rogers, D.; Hahn, M. Extended-Connectivity Fingerprints. J. Chem. Inf. Model. 2010, 50, 742–754. [Google Scholar] [CrossRef]
- He, T.; Caba, K.; Ballester, P.J. A Precise Comparison of Molecular Target Prediction Methods. Digit. Discov. 2025, 4, 2548–2558. [Google Scholar] [CrossRef]
- Bell, E.W.; Zhang, Y. DockRMSD: An Open-Source Tool for Atom Mapping and RMSD Calculation of Symmetric Molecules through Graph Isomorphism. J. Cheminform. 2019, 11, 40. [Google Scholar] [CrossRef]
- Tian, Y.-Y.; Tong, J.-B.; Liu, Y.; Tian, Y. QSAR Study, Molecular Docking and Molecular Dynamic Simulation of Aurora Kinase Inhibitors Derived from Imidazo[4,5-b]pyridine Derivatives. Molecules 2024, 29, 1772. [Google Scholar] [CrossRef]
- Bathula, S.; Sankaranarayanan, M.; Malgija, B.; Kaliappan, I.; Bhandare, R.R.; Shaik, A.B. 2-Amino Thiazole Derivatives as Prospective Aurora Kinase Inhibitors against Breast Cancer: QSAR, ADMET Prediction, Molecular Docking, and Molecular Dynamic Simulation Studies. ACS Omega 2023, 8, 44287–44311. [Google Scholar] [CrossRef]
- Siudem, P.; Szeleszczuk, Ł.; Paradowska, K. Searching for Natural Aurora a Kinase Inhibitors from Peppers Using Molecular Docking and Molecular Dynamics. Pharmaceuticals 2023, 16, 1539. [Google Scholar] [CrossRef]
- Beniwal, M.; Jain, N.; Jain, S.; Aggarwal, N. Design, Synthesis, Anticancer Evaluation and Docking Studies of Novel 2-(1-Isonicotinoyl-3-Phenyl-1H-Pyrazol-4-yl)-3-Phenylthiazolidin-4-one Derivatives as Aurora-A Kinase Inhibitors. BMC Chem. 2022, 16, 61. [Google Scholar] [CrossRef]



| Representation | Model | Train AUC | Test AUC | Train F1 | Test F1 | Train Precision | Test Precision |
|---|---|---|---|---|---|---|---|
| Morgan | SVM | 0.8736 | 0.6940 | 0.8756 | 0.8214 | 0.9246 | 0.8493 |
| Morgan | RF | 0.9999 | 0.6858 | 0.9982 | 0.8743 | 0.9988 | 0.8369 |
| Morgan | XGBoost | 0.9871 | 0.7204 | 0.9516 | 0.8179 | 0.9947 | 0.8608 |
| MACCS | SVM | 0.3405 | 0.4257 | 0.3536 | 0.3565 | 0.3169 | 0.3216 |
| MACCS | RF | 0.9991 | 0.5425 | 0.9933 | 0.8643 | 0.9963 | 0.8188 |
| MACCS | XGBoost | 0.9095 | 0.5594 | 0.8254 | 0.7074 | 0.9653 | 0.8321 |
| Hybrid | SVM | 0.9956 | 0.6949 | 0.9594 | 0.8208 | 0.9987 | 0.8411 |
| Hybrid | RF | 1.0000 | 0.6824 | 1.0000 | 0.8840 | 1.0000 | 0.8152 |
| Hybrid | XGBoost | 0.9626 | 0.7018 | 0.9094 | 0.7869 | 0.9816 | 0.8408 |
| NatID | SMILE Code | FRED Chemgauss4 Score | Interacting Residues of Aurora-A |
|---|---|---|---|
| 534 | CC(C)[C@H]1CCc2c(O)ccc3c2[C@H]1OC3=O | −9.9306 | Trp128, Phe133, Glu152, Ser155, Phe157, Leu159, Tyr197, Gly198, Ile209 |
| 1231 | CC(CCc1ccc(O)cc1)OC(=O)/C=C/c1ccc(O)c(O)c1 | −9.1135 | Trp128, Phe133, Glu152, Ser155, Phe157, Leu159, Ser186, His187, Arg189, Leu196, Tyr197, Ile209 |
| 533 | C=C(C)[C@H]1CCc2c(OC)ccc3c2[C@H]1OC3=O | −8.8393 | Trp128, Phe133, Glu152, Ser155, Phe157, Leu159, Tyr197, Ile209 |
| 1228 | C[C@@H](CCc1ccc(O)c(O)c1)OC(=O)/C=C/c1ccc(O)c(O)c1 | −8.7402 | Trp128, Phe133, Glu152, Ser155, Phe157, Leu159, Ser186, Arg195, Leu196, Tyr197, Ile209 |
| 1230 | CC(CCc1ccc(O)c(O)c1)OC(=O)/C=C/c1ccc(O)c(O)c1 | −8.7402 | Trp128, Phe133, Glu152, Ser155, Phe157, Leu159, Ser186, Arg195, Leu196, Tyr197, Ile209 |
| A9B * | C1=CC(=CC=C1C2=CC=C(O2)C(=O)NN)Cl | −5.6712 | Trp128, Phe133, Glu152, Phe157, Leu159, Arg195, Tyr197, Ile209 |
| Ligand | 534 | 1231 | 533 | A9B |
|---|---|---|---|---|
| Average RMSD (Å) | 4.403 ± 1.094 | 6.904 ± 2.323 | 4.637 ± 1.433 | 6.280 ± 0.926 |
| MM/PBSA (Kcal/mol) | −14.515 ± 2.757 | −16.561 ± 3.655 | −15.694 ± 2.279 | −13.479 ± 1.515 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cartagena, G.; Jadán, E.; Guarimata, J.D. Virtual Screening of Argentinian Natural Products to Identify Anti-Cancer Aurora Kinase A Inhibitors: A Combined Machine Learning and Molecular Docking Approach. Chem. Proc. 2025, 18, 44. https://doi.org/10.3390/ecsoc-29-26728
Cartagena G, Jadán E, Guarimata JD. Virtual Screening of Argentinian Natural Products to Identify Anti-Cancer Aurora Kinase A Inhibitors: A Combined Machine Learning and Molecular Docking Approach. Chemistry Proceedings. 2025; 18(1):44. https://doi.org/10.3390/ecsoc-29-26728
Chicago/Turabian StyleCartagena, Génesis, Evelin Jadán, and Juan Diego Guarimata. 2025. "Virtual Screening of Argentinian Natural Products to Identify Anti-Cancer Aurora Kinase A Inhibitors: A Combined Machine Learning and Molecular Docking Approach" Chemistry Proceedings 18, no. 1: 44. https://doi.org/10.3390/ecsoc-29-26728
APA StyleCartagena, G., Jadán, E., & Guarimata, J. D. (2025). Virtual Screening of Argentinian Natural Products to Identify Anti-Cancer Aurora Kinase A Inhibitors: A Combined Machine Learning and Molecular Docking Approach. Chemistry Proceedings, 18(1), 44. https://doi.org/10.3390/ecsoc-29-26728

