Integrative Computational Chemistry Approaches in Modern Drug Discovery: Advances in Docking, Pharmacophore Modeling, Molecular Dynamics, and Virtual Screening
Abstract
1. Introduction
2. Computational Approaches in Drug Discovery
3. Molecular Docking: Theory, Tools, and Applications
4. Pharmacophore Modeling: From Concept to Clinical Candidates
5. Molecular Dynamics (MD) Simulations: Capturing Flexibility and Dynamics
6. Case Studies of Computational Drug Discovery
7. Virtual Screening: Large-Scale Compound Prioritization
8. Future Directions and Role of AI in Drug Discovery
9. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Nagpure, N.; Raut, H.; Kamble, S.; Rasala, T. Redefining Preclinical Research Paradigms: AI-Driven Drug Discovery as a Transformative Approach to Accelerate Innovation, Improve Predictive Accuracy, and Reduce Reliance on Animal Testing. J. Drug Deliv. Ther. 2025, 15, 115–128. [Google Scholar] [CrossRef]
- Pathak, S.; Kushwaha, S.P.; Verma, S.; Deep, P.; Gupta, D. Advances in Computational Chemistry for Drug Discovery. J. Drug Discov. Health Sci. 2024, 1, 146–152. [Google Scholar] [CrossRef]
- Serrano-Morrás, Á.; Bertran-Mostazo, A.; Miñarro-Lleonar, M.; Comajuncosa-Creus, A.; Cabello, A.; Labranya, C.; Escudero, C.; Tian, T.V.; Khutorianska, I.; Radchenko, D.S.; et al. A bottom-up approach to find lead compounds in expansive chemical spaces. Commun. Chem. 2025, 8, 225. [Google Scholar] [CrossRef] [PubMed]
- Corrêa Veríssimo, G.; Salgado Ferreira, R.; Gonçalves Maltarollo, V. Ultra-Large Virtual Screening: Definition, Recent Advances, and Challenges in Drug Design. Mol. Inform. 2025, 44, e202400305. [Google Scholar] [CrossRef]
- Veríssimo, R.F.; Matias, P.H.F.; Barbosa, M.R.; Neto, F.O.S.; Neto, B.A.D.; de Oliveira, H.C.B. Integrating Machine Learning and SHAP Analysis to Advance the Rational Design of Benzothiadiazole Derivatives with Tailored Photophysical Properties. J. Chem. Inf. Model. 2025, 65, 7874–7886. [Google Scholar] [CrossRef]
- Bhagat, R.T.; Butle, S.R.; Khobragade, D.S.; Wankhede, S.B.; Prasad, C.C.; Mahure, D.S.; Armarkar, A.V. Molecular Docking in Drug Discovery. J. Pharm. Res. Int. 2021, 33, 46–58. [Google Scholar] [CrossRef]
- Khattri, R.B.; Morris, D.L.; Bilinovich, S.M.; Manandhar, E.; Napper, K.R.; Sweet, J.W.; Modarelli, D.A.; Leeper, T.C. Identifying Ortholog Selective Fragment Molecules for Bacterial Glutaredoxins by NMR and Affinity Enhancement by Modification with an Acrylamide Warhead. Molecules 2019, 25, 147. [Google Scholar] [CrossRef]
- Awoonor-Williams, E.; Dickson, C.J.; Furet, P.; Golosov, A.A.; Hornak, V. Leveraging Advanced In Silico Techniques in Early Drug Discovery: A Study of Potent Small-Molecule YAP-TEAD PPI Disruptors. J. Chem. Inf. Model. 2023, 63, 2520–2531. [Google Scholar] [CrossRef]
- Korb, O.; Finn, P.W.; Jones, G. The cloud and other new computational methods to improve molecular modelling. Expert Opin. Drug Discov. 2014, 9, 1121–1131. [Google Scholar] [CrossRef] [PubMed]
- Zhou, G.; Rusnac, D.V.; Park, H.; Canzani, D.; Nguyen, H.M.; Stewart, L.; Bush, M.F.; Nguyen, P.T.; Wulff, H.; Yarov-Yarovoy, V.; et al. An artificial intelligence accelerated virtual screening platform for drug discovery. Nat. Commun. 2024, 15, 7761. [Google Scholar] [CrossRef]
- Jiang, J.; Wang, G.; Li, D.; Hayes, N.; Jones, B.; Shi, Y.; Qiu, H.; Zhang, B.; Zhou, T.; Wei, G.W. Unexpected Applications of AlphaFold in Molecular Sciences. Annu. Rev. Biochem. 2026. ahead of print. [Google Scholar] [CrossRef]
- Zhao, N.; Wu, T.; Wang, W.; Zhang, L.; Gong, X. Review and comparative analysis of methods and advancements in predicting protein complex structure. Interdiscip. Sci. Comput. Life Sci. 2024, 16, 261–288. [Google Scholar] [CrossRef]
- Blake, J.; Laird, E. Chppter 30. Recent advances in virtual ligand screening. Annu. Rep. Med. Chem. 2003, 38, 305–314. [Google Scholar]
- Kumar, S.; Kumar, Y. Innovations in molecular docking: A detailed analysis of methodological developments and their applications in drug discovery. Int. J. Pharma Prof. Res. 2024, 15, 52–67. [Google Scholar] [CrossRef]
- Cele, F.N.; Ramesh, M.; Soliman, M.E. Per-residue energy decomposition pharmacophore model to enhance virtual screening in drug discovery: A study for identification of reverse transcriptase inhibitors as potential anti-HIV agents. Drug Des. Dev. Ther. 2016, 10, 1365–1377. [Google Scholar]
- Joseph-McCarthy, D.; Thomas, B.E.; Belmarsh, M.; Moustakas, D.; Alvarez, J.C. Pharmacophore-based molecular docking to account for ligand flexibility. Proteins 2003, 51, 172–188. [Google Scholar] [CrossRef] [PubMed]
- Mahrous, R.S.; Fathy, H.M.; Abu EL-Khair, R.M.; Omar, A.A.; Ibrahim, R.S. Molecular docking and pharmacophore modelling; A bridged explanation with emphasis on validation. J. Adv. Pharm. Sci. 2024, 1, 138–152. [Google Scholar] [CrossRef]
- Jiménez-Luna, J.; Grisoni, F.; Weskamp, N.; Schneider, G. Artificial intelligence in drug discovery: Recent advances and future perspectives. Expert Opin. Drug Discov. 2021, 16, 949–959. [Google Scholar] [CrossRef] [PubMed]
- Frank, Y.; Unger, R.; Senderowitz, H. Statistical analysis of sequential motifs at biologically relevant protein-protein interfaces. Comput. Struct. Biotechnol. J. 2024, 23, 1244–1259. [Google Scholar] [CrossRef]
- Sadybekov, A.V.; Katritch, V. Computational approaches streamlining drug discovery. Nature 2023, 616, 673–685. [Google Scholar] [CrossRef]
- Mihai, D.P.; Nitulescu, G.M. Computer-aided drug design and drug discovery. Pharmaceuticals 2025, 18, 436. [Google Scholar] [CrossRef]
- Friesner, R.A.; Banks, J.L.; Murphy, R.B.; Halgren, T.A.; Klicic, J.J.; Mainz, D.T.; Repasky, M.P.; Knoll, E.H.; Shelley, M.; Perry, J.K.; et al. Glide: A new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. J. Med. Chem. 2004, 47, 1739–1749. [Google Scholar] [CrossRef] [PubMed]
- Hollingsworth, S.A.; Dror, R.O. Molecular Dynamics Simulation for All. Neuron 2018, 99, 1129–1143. [Google Scholar] [CrossRef] [PubMed]
- Eberhardt, J.; Santos-Martins, D.; Tillack, A.F.; Forli, S. AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. J. Chem. Inf. Model. 2021, 61, 3891–3898. [Google Scholar] [CrossRef] [PubMed]
- Friesner, R.A.; Murphy, R.B.; Zhang, Y.; Xiong, Y.; Devlaminck, P.A.; Tubert-Brohman, I.; Jerome, S.V. Glide WS: Methodology and Initial Assessment of Performance for Docking Accuracy and Virtual Screening. J. Chem. Theory Comput. 2025, 21, 12696–12708. [Google Scholar] [CrossRef] [PubMed]
- Garbagnoli, M.; Linciano, P.; Listro, R.; Rossino, G.; Vasile, F.; Collina, S. Biophysical Assays for Investigating Modulators of Macromolecular Complexes: An Overview. ACS Omega 2024, 9, 17691–17705. [Google Scholar] [CrossRef]
- Dixon, S.L.; Smondyrev, A.M.; Knoll, E.H.; Rao, S.N.; Shaw, D.E.; Friesner, R.A. PHASE: A new engine for pharmacophore perception, 3D QSAR model development, and 3D database screening: 1. Methodology and preliminary results. J. Comput. Aided Mol. Des. 2006, 20, 647–671. [Google Scholar] [CrossRef]
- Lipinski, C.A.; Lombardo, F.; Dominy, B.W.; Feeney, P.J. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv. Drug Deliv. Rev. 2001, 46, 3–26. [Google Scholar] [CrossRef]
- Ewing, T.J.; Makino, S.; Skillman, A.G.; Kuntz, I.D. DOCK 4.0: Search strategies for automated molecular docking of flexible molecule databases. J. Comput. Aided Mol. Des. 2001, 15, 411–428. [Google Scholar] [CrossRef] [PubMed]
- Zhong, S.; Zhang, Y.; Xiu, Z. Rescoring ligand docking poses. Curr. Opin. Drug Discov. Dev. 2010, 13, 326–334. [Google Scholar]
- Labute, P. LowModeMD—Implicit low-mode velocity filtering applied to conformational search of macrocycles and protein loops. J. Chem. Inf. Model. 2010, 50, 792–800. [Google Scholar] [CrossRef]
- Klimenko, K. Computer-Aided Drug Design of Broad-Spectrum Antiviral Compounds. Ph.D. Thesis, Institut AV Bogatsky de Chimie Physique, Université de Strasbourg, Odessa, Ukraine, 2017. [Google Scholar]
- Huang, J.; Rauscher, S.; Nawrocki, G.; Ran, T.; Feig, M.; de Groot, B.L.; Grubmüller, H.; MacKerell, A.D., Jr. CHARMM36m: An improved force field for folded and intrinsically disordered proteins. Nat. Methods 2017, 14, 71–73. [Google Scholar] [CrossRef]
- Maier, J.A.; Martinez, C.; Kasavajhala, K.; Wickstrom, L.; Hauser, K.E.; Simmerling, C. ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. J. Chem. Theory Comput. 2015, 11, 3696–3713. [Google Scholar] [CrossRef] [PubMed]
- Roos, K.; Wu, C.; Damm, W.; Reboul, M.; Stevenson, J.M.; Lu, C.; Dahlgren, M.K.; Mondal, S.; Chen, W.; Wang, L.; et al. OPLS3e: Extending Force Field Coverage for Drug-Like Small Molecules. J. Chem. Theory Comput. 2019, 15, 1863–1874. [Google Scholar] [CrossRef] [PubMed]
- Irwin, J.J.; Tang, K.G.; Young, J.; Dandarchuluun, C.; Wong, B.R.; Khurelbaatar, M.; Moroz, Y.S.; Mayfield, J.; Sayle, R.A. ZINC20-A Free Ultralarge-Scale Chemical Database for Ligand Discovery. J. Chem. Inf. Model. 2020, 60, 6065–6073. [Google Scholar] [CrossRef] [PubMed]
- Mysinger, M.M.; Carchia, M.; Irwin, J.J.; Shoichet, B.K. Directory of useful decoys, enhanced (DUD-E): Better ligands and decoys for better benchmarking. J. Med. Chem. 2012, 55, 6582–6594. [Google Scholar] [CrossRef]
- Venkatraman, V.; Colligan, T.H.; Lesica, G.T.; Olson, D.R.; Gaiser, J.; Copeland, C.J.; Wheeler, T.J.; Roy, A. Drugsniffer: An open source workflow for virtually screening billions of molecules for binding affinity to protein targets. Front. Pharmacol. 2022, 13, 874746. [Google Scholar] [CrossRef]
- Gorgulla, C.; Boeszoermenyi, A.; Wang, Z.-F.; Fischer, P.D.; Coote, P.W.; Padmanabha Das, K.M.; Malets, Y.S.; Radchenko, D.S.; Moroz, Y.S.; Scott, D.A. An open-source drug discovery platform enables ultra-large virtual screens. Nature 2020, 580, 663–668. [Google Scholar] [CrossRef]
- Gaulton, A.; Hersey, A.; Nowotka, M.; Bento, A.P.; Chambers, J.; Mendez, D.; Mutowo, P.; Atkinson, F.; Bellis, L.J.; Cibrián-Uhalte, E.; et al. The ChEMBL database in 2017. Nucleic Acids Res. 2017, 45, D945–D954. [Google Scholar] [CrossRef]
- Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef]
- Lyskov, S.; Gray, J.J. The RosettaDock server for local protein-protein docking. Nucleic Acids Res. 2008, 36, W233–W238. [Google Scholar] [CrossRef]
- Amaro, R.E.; Baudry, J.; Chodera, J.; Demir, Ö.; McCammon, J.A.; Miao, Y.; Smith, J.C. Ensemble Docking in Drug Discovery. Biophys. J. 2018, 114, 2271–2278. [Google Scholar] [CrossRef] [PubMed]
- McNutt, A.T.; Francoeur, P.; Aggarwal, R.; Masuda, T.; Meli, R.; Ragoza, M.; Sunseri, J.; Koes, D.R. GNINA 1.0: Molecular docking with deep learning. J. Cheminform. 2021, 13, 43. [Google Scholar] [CrossRef]
- Cavasotto, C.N.; Adler, N.S.; Aucar, M.G. Quantum Chemical Approaches in Structure-Based Virtual Screening and Lead Optimization. Front. Chem. 2018, 6, 188. [Google Scholar] [CrossRef] [PubMed]
- Teng, H.; Wang, R.; Shen, Y.; Yuan, Y.; Kingsford, C. DTMol: Pocket-based molecular docking using diffusion transformers. bioRxiv 2025. [Google Scholar] [CrossRef]
- Shaker, B.; Barakat, K. Harnessing Deep Learning and Generative AI for Molecular Docking Simulations: Tools, Challenges, and Future Directions. In Molecular Docking in Biomedical Engineering and Computational Chemistry; IntechOpen: London, UK, 2025. [Google Scholar]
- Jin, Z.; Du, X.; Xu, Y.; Deng, Y.; Liu, M.; Zhao, Y.; Zhang, B.; Li, X.; Zhang, L.; Peng, C.; et al. Structure of M(pro) from SARS-CoV-2 and discovery of its inhibitors. Nature 2020, 582, 289–293. [Google Scholar] [CrossRef]
- Manglik, A.; Lin, H.; Aryal, D.K.; McCorvy, J.D.; Dengler, D.; Corder, G.; Levit, A.; Kling, R.C.; Bernat, V.; Hübner, H.; et al. Structure-based discovery of opioid analgesics with reduced side effects. Nature 2016, 537, 185–190. [Google Scholar] [CrossRef]
- IUPAC. Pharmacophore, 5.0.0 ed.; IUPAC, Ed.; International Union of Pure and Applied Chemistry (IUPAC): Research Triangle Park, NC, USA, 2025. [Google Scholar]
- Imrie, F.; Hadfield, T.E.; Bradley, A.R.; Deane, C.M. Deep generative design with 3D pharmacophoric constraints. Chem. Sci. 2021, 12, 14577–14589. [Google Scholar] [CrossRef] [PubMed]
- Trott, O.; Olson, A.J. AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 2010, 31, 455–461. [Google Scholar] [CrossRef] [PubMed]
- Verdonk, M.L.; Cole, J.C.; Hartshorn, M.J.; Murray, C.W.; Taylor, R.D. Improved protein–ligand docking using GOLD. Proteins Struct. Funct. Bioinform. 2003, 52, 609–623. [Google Scholar] [CrossRef]
- Wolber, G.; Langer, T. LigandScout: 3-D pharmacophores derived from protein-bound ligands and their use as virtual screening filters. J. Chem. Inf. Model. 2005, 45, 160–169. [Google Scholar] [CrossRef]
- Kim, S.; Chen, J.; Cheng, T.; Gindulyte, A.; He, J.; He, S.; Li, Q.; Shoemaker, B.A.; Thiessen, P.A.; Yu, B.; et al. PubChem 2023 update. Nucleic Acids Res. 2023, 51, D1373–D1380. [Google Scholar] [CrossRef] [PubMed]
- Ruddigkeit, L.; van Deursen, R.; Blum, L.C.; Reymond, J.L. Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17. J. Chem. Inf. Model. 2012, 52, 2864–2875. [Google Scholar] [CrossRef]
- Chen, L.; Cruz, A.; Ramsey, S.; Dickson, C.J.; Duca, J.S.; Hornak, V.; Koes, D.R.; Kurtzman, T. Hidden bias in the DUD-E dataset leads to misleading performance of deep learning in structure-based virtual screening. PLoS ONE 2019, 14, e0220113. [Google Scholar] [CrossRef] [PubMed]
- Brown, N.; Fiscato, M.; Segler, M.H.S.; Vaucher, A.C. GuacaMol: Benchmarking Models for de Novo Molecular Design. J. Chem. Inf. Model. 2019, 59, 1096–1108. [Google Scholar] [CrossRef] [PubMed]
- Polykovskiy, D.; Zhebrak, A.; Sanchez-Lengeling, B.; Golovanov, S.; Tatanov, O.; Belyaev, S.; Kurbanov, R.; Artamonov, A.; Aladinskiy, V.; Veselov, M.; et al. Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models. Front. Pharmacol. 2020, 11, 565644. [Google Scholar] [CrossRef]
- Hamelberg, D.; Mongan, J.; McCammon, J.A. Accelerated molecular dynamics: A promising and efficient simulation method for biomolecules. J. Chem. Phys. 2004, 120, 11919–11929. [Google Scholar] [CrossRef]
- Homeyer, N.; Gohlke, H. Free Energy Calculations by the Molecular Mechanics Poisson-Boltzmann Surface Area Method. Mol. Inform. 2012, 31, 114–122. [Google Scholar] [CrossRef]
- Chodera, J.D.; Noé, F. Markov state models of biomolecular conformational dynamics. Curr. Opin. Struct. Biol. 2014, 25, 135–144. [Google Scholar] [CrossRef]
- Mardt, A.; Pasquali, L.; Wu, H.; Noé, F. VAMPnets for deep learning of molecular kinetics. Nat. Commun. 2018, 9, 5. [Google Scholar] [CrossRef]
- Lyu, J.; Wang, S.; Balius, T.E.; Singh, I.; Levit, A.; Moroz, Y.S.; O’Meara, M.J.; Che, T.; Algaa, E.; Tolmachova, K.; et al. Ultra-large library docking for discovering new chemotypes. Nature 2019, 566, 224–229. [Google Scholar] [CrossRef]
- Ain, Q.U.; Aleksandrova, A.; Roessler, F.D.; Ballester, P.J. Machine-learning scoring functions to improve structure-based binding affinity prediction and virtual screening. Wiley Interdiscip. Rev. Comput. Mol. Sci. 2015, 5, 405–424. [Google Scholar] [CrossRef]
- Chen, P.; Ke, Y.; Lu, Y.; Du, Y.; Li, J.; Yan, H.; Zhao, H.; Zhou, Y.; Yang, Y. DLIGAND2: An improved knowledge-based energy function for protein-ligand interactions using the distance-scaled, finite, ideal-gas reference state. J. Cheminform. 2019, 11, 52. [Google Scholar] [CrossRef]
- Li, H.; Sze, K.H.; Lu, G.; Ballester, P.J. Machine-learning scoring functions for structure-based drug lead optimization. Wiley Comput. Mol. Sci. 2020, 10, e1465. [Google Scholar] [CrossRef]
- Shirts, M.R.; Mobley, D.L.; Brown, S.P. Free-energy calculations in structure-based drug design. In Drug Design: Structure- and Ligand-Based Approaches; Cambridge University Press: Cambridge, UK, 2010; Volume 1, pp. 61–86. [Google Scholar]
- Horn, H.W.; Swope, W.C.; Pitera, J.W.; Madura, J.D.; Dick, T.J.; Hura, G.L.; Head-Gordon, T. Development of an improved four-site water model for biomolecular simulations: TIP4P-Ew. J. Chem. Phys. 2004, 120, 9665–9678. [Google Scholar] [CrossRef]
- Sethi, A.; Agrawal, N.; Brezovsky, J. Impact of water models on the structure and dynamics of enzyme tunnels. Comput. Struct. Biotechnol. J. 2024, 23, 3946–3954. [Google Scholar] [CrossRef]
- Sun, Q.; Li, Y.J.; Ning, S.B. Investigating the molecular mechanisms underlying the co-occurrence of Parkinson’s disease and inflammatory bowel disease through the integration of multiple datasets. Sci. Rep. 2024, 14, 17028. [Google Scholar] [CrossRef] [PubMed]
- Desai, D.; Kantliwala, S.V.; Vybhavi, J.; Ravi, R.; Patel, H.; Patel, J. Review of AlphaFold 3: Transformative advances in drug design and therapeutics. Cureus 2024, 16, e63646. [Google Scholar] [CrossRef]
- Mak, K.-K.; Wong, Y.-H.; Pichika, M.R. Artificial intelligence in drug discovery and development. In Drug Discovery and Evaluation: Safety and Pharmacokinetic Assays; Springer: Berlin/Heidelberg, Germany, 2024; pp. 1461–1498. [Google Scholar]
- Zheng, Y.; Koh, H.Y.; Ju, J.; Yang, M.; May, L.T.; Webb, G.I.; Li, L.; Pan, S.; Church, G. Large language models for drug discovery and development. Patterns 2025, 6, 101346. [Google Scholar] [CrossRef] [PubMed]
- Averly, R.; Baker, F.N.; Watson, I.A.; Ning, X. LIDDIA: Language-based Intelligent Drug Discovery Agent. Proc. Conf. Empir. Methods Nat. Lang. Process. 2025, 2025, 12015–12039. [Google Scholar]
- Ge, F.; Zhu, J.; Zhang, L.; Xiao, H.; Bao, X.; Xie, F.; Chen, D.; Lu, Y.; Wang, Y.; Guan, Z. AutoBinder Agent: An MCP-Based Agent for End-to-End Protein Binder Design. arXiv 2026, arXiv:2602.00019. [Google Scholar]
- Pan, Q.; Xu, D.; Yao, J.X.; Ma, L.; Zhu, Z.; Ji, J. Frogent: An end-to-end full-process drug design agent. arXiv 2025, arXiv:2508.10760. [Google Scholar]
- Kiss, L. Development of Novel Assays and Application of Innovative Screening Approaches for Improving Hit Discovery Efficiency of G Protein-Coupled Receptor Targets. Ph.D. Dissertation, Semmelweis University, Budapest, Hungary, 2021. [Google Scholar]
- Velazquez-Campoy, A.; Claro, B.; Abian, O.; Höring, J.; Bourlon, L.; Claveria-Gimeno, R.; Ennifar, E.; England, P.; Chaires, J.B.; Wu, D.; et al. A multi-laboratory benchmark study of isothermal titration calorimetry (ITC) using Ca(2+) and Mg(2+) binding to EDTA. Eur. Biophys. J. 2021, 50, 429–451. [Google Scholar] [CrossRef]
- Wienken, C.J.; Baaske, P.; Rothbauer, U.; Braun, D.; Duhr, S. Protein-binding assays in biological liquids using microscale thermophoresis. Nat. Commun. 2010, 1, 100. [Google Scholar] [CrossRef]
- Gucwa, M.; Bijak, V.; Zheng, H.; Murzyn, K.; Minor, W. CheckMyMetal (CMM): Validating metal-binding sites in X-ray and cryo-EM data. IUCrJ 2024, 11, 871–877. [Google Scholar] [CrossRef]
- Roskoski, R., Jr. Rule of five violations among the FDA-approved small molecule protein kinase inhibitors. Pharmacol. Res. 2023, 191, 106774. [Google Scholar] [CrossRef] [PubMed]
- Reese, T.C.; Devineni, A.; Smith, T.; Lalami, I.; Ahn, J.M.; Raj, G.V. Evaluating physiochemical properties of FDA-approved orally administered drugs. Expert Opin. Drug Discov. 2024, 19, 225–238. [Google Scholar] [CrossRef] [PubMed]
- Hornberger, K.R.; Araujo, E.M.V. Physicochemical Property Determinants of Oral Absorption for PROTAC Protein Degraders. J. Med. Chem. 2023, 66, 8281–8287. [Google Scholar] [CrossRef]
- O’Donovan, D.H.; De Fusco, C.; Kuhnke, L.; Reichel, A. Trends in Molecular Properties, Bioavailability, and Permeability across the Bayer Compound Collection. J. Med. Chem. 2023, 66, 2347–2360. [Google Scholar] [CrossRef]
- Baell, J.B.; Holloway, G.A. New substructure filters for removal of pan assay interference compounds (PAINS) from screening libraries and for their exclusion in bioassays. J. Med. Chem. 2010, 53, 2719–2740. [Google Scholar] [CrossRef]
- Brenk, R.; Schipani, A.; James, D.; Krasowski, A.; Gilbert, I.H.; Frearson, J.; Wyatt, P.G. Lessons learnt from assembling screening libraries for drug discovery for neglected diseases. ChemMedChem 2008, 3, 435–444. [Google Scholar] [CrossRef] [PubMed]
- Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef]
- Fourches, D.; Muratov, E.; Tropsha, A. Trust, but Verify II: A Practical Guide to Chemogenomics Data Curation. J. Chem. Inf. Model. 2016, 56, 1243–1252. [Google Scholar] [CrossRef] [PubMed]
- Scantlebury, J.; Vost, L.; Carbery, A.; Hadfield, T.E.; Turnbull, O.M.; Brown, N.; Chenthamarakshan, V.; Das, P.; Grosjean, H.; von Delft, F.; et al. A Small Step Toward Generalizability: Training a Machine Learning Scoring Function for Structure-Based Virtual Screening. J. Chem. Inf. Model. 2023, 63, 2960–2974. [Google Scholar] [CrossRef]
- Chen, L.; Blay, V.; Ballester, P.J.; Houston, D.R. SCORCH2: A Generalized Heterogeneous Consensus Model for High-Enrichment Interaction-Based Virtual Screening. Adv. Sci. 2025, 12, e08318. [Google Scholar] [CrossRef] [PubMed]
- Durant, G.; Boyles, F.; Birchall, K.; Marsden, B.; Deane, C.M. Robustly interrogating machine learning-based scoring functions: What are they learning? Bioinformatics 2025, 41, btaf040. [Google Scholar] [CrossRef]





| Modality | Primary Goal | Typical Inputs | Key Outputs | Strengths | Main Limitations | Representative Tools | Citation(s) |
|---|---|---|---|---|---|---|---|
| Molecular docking | Predict binding mode and rank candidates | Prepared protein (X-ray/cryo-EM/AF2), curated ligands, microstates | Poses, interaction maps, docking scores | Fast triage; structure-aware hypotheses | Scoring/solvation approximations; limited receptor flexibility | AutoDock Vina, Glide, GOLD, DOCK | [22,29,30] |
| Pharmacophore modeling | Capture essential 3D features for activity | Active ligands and/or protein–ligand complexes | Feature hypotheses; 3D queries | Scaffold hopping; ultra-fast pre-filtering | Overfitting risk; conformer/feature quality sensitive | PHASE, LigandScout, MOE, PharmaGist | [27,31,32] |
| Molecular dynamics (MD) | Probe flexibility, water networks, pose stability | Protein–ligand complex, force field, solvent/ions | Trajectories, RMSD/RMSF, H-bond/water analyses; MM-GB/SA/FEP | Mechanistic insight; validates poses; supports ΔΔG | Sampling cost; FF/setup sensitivity | CHARMM36m, AMBER ff14SB, OPLS3e | [23,33,34,35] |
| Virtual screening (VS) | Prioritize hits from ultra-large libraries | ZINC/ChEMBL/PubChem/GDB; docking/LBVS filters | Ranked shortlists; clustered chemotypes | Scales to 108–109; cloud/HPC ready | Benchmark bias; hit confirmation required | VirtualFlow; LBVS + SBVS pipelines | [36,37,38,39] |
| Category | Tool/Resource | License | Notable Capabilities | Typical Scale | Citation |
|---|---|---|---|---|---|
| Docking engine | AutoDock Vina | Open source | Stochastic search; empirical scoring; multithreaded | 105–107 | [52] |
| Docking engine | Glide (HTVS/SP/XP) | Commercial | Hierarchical filters; pose refinement; XP scoring | 105–107 | [22] |
| Docking engine | GOLD | Commercial | GA search; side-chain flexibility; constraints | 105–106 | [53] |
| Docking engine | DOCK | Open source | Shape/grid; anchor-and-grow; modular | 105–107 | [29] |
| Protein–protein docking | RosettaDock (server) | Academic | Rigid-body + side-chain optimization; web access | Interface sets | [42] |
| Pharmacophore | PHASE | Commercial | Common-pharmacophore ID; 3D-QSAR | 106–108 (filter) | [27] |
| Pharmacophore | LigandScout | Commercial/academic | Feature extraction from complexes; 3D queries | 106–108 | [54] |
| Suite | MOE (+ LowModeMD) | Commercial | Conformers; pocket/feature tools; induced-fit aids | Project-scale | [31] |
| Orchestrator (VS) | VirtualFlow | Open source | Billion-scale docking; engine-agnostic; cloud/HPC | 108–109+ | [39] |
| Library | ZINC20/22 | Free | Purchasable make-on-demand; ready-to-dock | 109-class | [36] |
| Library | ChEMBL | Free | Assay-annotated activities; targets | Millions of activities | [40] |
| Library | PubChem | Free | Open compounds and bioassays | 100M+ compounds | [55] |
| Enumerated universe | GDB-17 | Free | 166B enumerated molecules | 1011+ | [56] |
| Structures | Protein Data Bank (PDB) | Free | Standardized macromolecular structures | 200k+ entries | [24,25] |
| Benchmark | DUD-E | Free | Actives/decoys for many targets | Benchmark sets | [37] |
| Benchmark critique | DUD-E bias analysis | Free | Analog/decoy bias diagnostics | Benchmark audit | [57] |
| Structure prediction | AlphaFold | Free for models | Near-experimental single-chain models | Proteome-scale | [41] |
| ML benchmarks | GuacaMol/MOSES | Open source | Generative design benchmarking | Model-dependent | [58,59] |
| Drug (Year) | Target/Indication | Primary Computational Technique(s) Credited | What the Pipeline Contributed (Very Brief) |
|---|---|---|---|
| Captopril (1981) | ACE/hypertension | Early structure-based design guided by carboxypeptidase-A models | Transition-state/active-site modeling yielded thiol-bearing inhibitors with oral activity |
| Zanamivir (1999) | Influenza neuraminidase/influenza | SBDD + docking on NA crystal structures | Rational modifications to sialic-acid scaffold to exploit NA catalytic site; first-in-class NA inhibitor |
| Saquinavir (1995) | HIV-1 protease/HIV | SBDD from protease–peptide complexes | Transition-state mimic designed from active-site geometry; launched first HIV PI |
| Indinavir (1996) | HIV-1 protease/HIV | SBDD with iterative docking/optimization | Optimized P1/P2 groups for S1/S2 subsites; improved oral PK |
| Ritonavir (1996) | HIV-1 protease/HIV | SBDD (crystallography-guided) | Potent PI that became PK booster after metabolic insights |
| Dorzolamide (1995) | Carbonic anhydrase II/glaucoma | SBDD | Active-site geometry (Zn2+ coordination) guided sulfonamide design |
| Tirofiban (1998) | Integrin αIIbβ3/ACS | SBDD/mimetic design from RGD-ligand structures | Crystal structures with tirofiban/eptifibatide informed small-molecule antagonist design |
| Aliskiren (2007) | Renin/hypertension | SBDD + modeling on renin structures | Non-peptidic scaffold engineered for S1/S3/S1′ pockets; oral renin inhibitor |
| Boceprevir (2011) | HCV NS3 protease/HCV | SBDD | Warhead and P1/P2 optimization for covalent reversible inhibition |
| Rivaroxaban (2011) | Factor Xa/anticoagulation | SBDD + crystallography | Structure-guided optimization and FXa co-crystal analysis supported binding-mode tuning |
| Baloxavir marboxil (2018) | Influenza cap-dependent endonuclease/influenza | SBDD on PA endonuclease | Metal-chelation pharmacophore and pocket mapping drove first-in-class CEN inhibitor |
| Domain | Method/Model | What It Modeled/ Optimized | Strengths | Limitations | Typical Use | Citation |
|---|---|---|---|---|---|---|
| Docking scoring (empirical) | GlideScore; AutoDock4/Vina | vdW, H-bonding, desolvation terms fit to data | Fast; good enrichment | Transferability limits | Primary SBVS ranking | [22] |
| Docking scoring (knowledge-based) | Statistical PMF/X-Score (generic) | Statistical atom–atom potentials | Simple; robust | Coarse physics | Complementary rescoring | [57,65,66] |
| Docking scoring (ML) | RF-Score; NNScore; GNINA CNN | Data-learned pose/affinity from structures | Captures nonlinearity; strong top-N | Needs curated data; bias risk | Rescoring top poses | [44] |
| Physics-like ranking | MM-PB/GBSA | Continuum solvation + force field | Interpretable; fast triage | Dielectric/entropy sensitive | Post-docking triage | [61] |
| Alchemical ΔΔG | FEP (RBFE) | Relative binding free energy | ~1 kcal·mol−1 resolution | Setup/sampling cost | Lead optimization | [67] |
| Alchemical ΔG analysis | Thermodynamic integration | Gradient-based alchemy (λ windows) | Rigorous; general | Complex setup | Tight Structure activity relationship decisions | [68] |
| Force field (proteins) | CHARMM36m | Folded and IDP proteins | Balanced backbone/IDP | χ issues for some residues | General MD | [33] |
| Force field (proteins) | AMBER ff14SB | Proteins | Updated side-chain/backbone | Needs matched ligand params | General MD | [34] |
| Force field (proteins/ligands) | OPLS3e | Drug-like ligands + proteins | Broad ligand coverage | Licensed | Lead optimization MD | [35] |
| Water models | TIP3P; TIP4P-Ew | Solvent representation | Standardized hydration | Model-specific limits | Routine MD | [69,70,71] |
| Direction | What AI/Tech Adds | Concrete Example(s) | Expected Impact |
|---|---|---|---|
| Ultra-large virtual screening (108–109+ molecules) | Cloud/HPC orchestration; adaptive scheduling | VirtualFlow enables billion-scale SBVS, modular docking stacks | Orders-of-magnitude expansion of search space; more novel chemotypes |
| AI-guided triage for VS | Learn from sparse docking to skip most of the library | Deep Docking cuts compute by ~50×; consensus/pose filters downstream | Same hit rate at fraction of cost/time |
| DL-rescoring and pose selection | CNN/GDL models refine ranks/poses post-docking | GNINA family improves top-n pose accuracy | Better early enrichment; fewer false positives |
| Structural coverage via AI | High-accuracy protein structures when experiments lack | AlphaFold proteome-scale structures | “Unlocks” SBDD for previously intractable targets |
| Bias-aware benchmarking | Detect spurious dataset signals in training/validation | DUD-E bias analysis cautions DL claims | More reliable, reproducible VS metrics |
| Omics-driven personalization | Match compounds to patient/pathway signatures | CMap/LINCS L1000 profiles (1.3M signatures) | Indication selection, MoA inference, repurposing |
| Cryo-EM + MD + ensemble docking | Multi-state targets and cryptic pockets | State-aware docking/MD on EM ensembles | Allosteric/drugging “undruggables” |
| Foundation models and generative design | Rapid de novo ideas under multi-param constraints | GuacaMol/MOSES benchmarks standardize eval | Tighter design-make-test loops |
| Stage | Item/Metric/Assay | Measures/Goal | Practical Notes/Action | When to Use | Citation |
|---|---|---|---|---|---|
| Benchmarking | EF1%, EF5%, BEDROC, ROC-AUC | Early recognition and global ranking | Emphasize EF/BEDROC for top-fraction testing | Retrospective method evaluation | [37] |
| Benchmark audit | DUD-E bias checks | Analog/decoy bias detection | Use bias-controlled splits; external sets | Before claiming generalization | [57] |
| Orthogonal confirmation | SPR | kon, koff, KD | Control surface artifacts; kinetics insight | Hit/lead confirmation | [78] |
| Orthogonal confirmation | ITC | ΔH, ΔS, KD, stoichiometry | Thermodynamics; gold standard | Characterize prioritized hits | [79] |
| Orthogonal confirmation | MST | KD in solution | Low sample; buffer-flexible | Cross-validate binding | [80] |
| Mode validation | X-ray/cryo-EM | Bound structure and binding mode | Deposit to PDB for reuse | Structural follow-up | [81] |
| Library filter | Rule-of-Five | Oral developability | Use as soft gate with medicinal review | Pre-screening | [82,83] |
| Library filter | Veber criteria | Permeability/bioavailability | PSA/rotor thresholds | Pre-screening | [84,85] |
| Library filter | PAINS | Remove assay-interference chemotypes | Avoid over-filtering true actives | Library assembly | [86] |
| Library filter | Brenk alerts | Remove problematic fragments | Combining with expert review | Library assembly | [87] |
| Data stewardship | FAIR principles | Reproducibility and reuse | Version inputs; share code/models | All stages | [88] |
| Curation | Chemogenomics checklists | Correct labels/states/microstates | Scripted, versioned prep | Before modeling/VS | [89] |
| Common pitfall | Mis-protonated residues/ligands | Spurious contacts/energies | Enumerate microstates; pKa review | Prep and docking/MD | [89] |
| Common pitfall | Ignored conserved waters | Missed bridges/affinity | Retain/map waters; test displacement | Docking triage | [23] |
| Common pitfall | Metal coordination errors | Unrealistic poses/instability | Add constraints; spot-check with QM/MM | Targets with metals | [61] |
| Common pitfall | Overfitting benchmarks | Inflated enrichment | Prospective tests; external sets | Method claims | [90,91,92] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Altharawi, A.; Alqahtani, S.M. Integrative Computational Chemistry Approaches in Modern Drug Discovery: Advances in Docking, Pharmacophore Modeling, Molecular Dynamics, and Virtual Screening. Pharmaceutics 2026, 18, 565. https://doi.org/10.3390/pharmaceutics18050565
Altharawi A, Alqahtani SM. Integrative Computational Chemistry Approaches in Modern Drug Discovery: Advances in Docking, Pharmacophore Modeling, Molecular Dynamics, and Virtual Screening. Pharmaceutics. 2026; 18(5):565. https://doi.org/10.3390/pharmaceutics18050565
Chicago/Turabian StyleAltharawi, Ali, and Safar M. Alqahtani. 2026. "Integrative Computational Chemistry Approaches in Modern Drug Discovery: Advances in Docking, Pharmacophore Modeling, Molecular Dynamics, and Virtual Screening" Pharmaceutics 18, no. 5: 565. https://doi.org/10.3390/pharmaceutics18050565
APA StyleAltharawi, A., & Alqahtani, S. M. (2026). Integrative Computational Chemistry Approaches in Modern Drug Discovery: Advances in Docking, Pharmacophore Modeling, Molecular Dynamics, and Virtual Screening. Pharmaceutics, 18(5), 565. https://doi.org/10.3390/pharmaceutics18050565

