Reviewing the Computational Landscape of Drug Repurposing: Evolution from Structure-Based Methods to LLM-Based Methods
Abstract
1. Introduction
Search Strategy and Selection Criteria
2. Methods
2.1. Biological Mechanism-Driven
2.1.1. Structure-Based
2.1.2. Omics-Based
2.1.3. Fuzzy Logic-Based
2.1.4. Adverse Event–Based
2.2. Network-Based Methods
2.2.1. Graph Mining
Graph Clustering
Random Walk
Graph Diffusion
Meta-Path
Semantic-Based
2.2.2. Matrix Factorization or Matrix Completion
2.3. Data-Driven
2.3.1. Text Mining-Based
2.3.2. Large Language Model-Based
Medical Specialized Language Models
Multi-Source Knowledge Integration Models
LLM Agent Collaborative Models
2.3.3. Hallucinations in Large Language Models: Risks and Impacts on Drug Repurposing
Concrete Hallucination Risks in Drug Repurposing
Propagation of Hallucinations Through the Drug Repurposing Pipeline
System-Level Barriers to Reliable Deployment
Mitigation Strategies and Limitations
2.4. Cross-Method Comparison
3. Data Sources, Evaluation Metrics and Validation Strategies
3.1. Biological Mechanism-Driven
3.1.1. Structure-Based
3.1.2. Omics-Based
3.1.3. Fuzzy Logic-Based
3.1.4. Adverse Event-Based
3.2. Network-Based Methods
3.2.1. Graph Mining
3.2.2. Matrix Factorization or Matrix Completion
3.3. Data-Driven
3.3.1. Text Mining-Based
3.3.2. Large Language Model-Based
3.4. Summary
4. Discussions
4.1. Future Directions for LLM-Based Methods
4.2. A Proposed Framework of Drug Repurposing
- Phase 1: High-throughput screening using network-based and data-driven methods.
- Phase 2: Mechanistic validation through biological mechanism-driven methods.
- Phase 3: Clinical translation and safety verification.
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
- The following abbreviations are used in this manuscript:
| LLM(s) | Large language model(s) |
| MD | Molecular dynamics |
| MR | Mendelian randomization |
| AE | Adverse event |
| MF | Matrix factorization |
| MC | Matrix completion |
References
- Yeu, Y.; Yoon, Y.; Park, S. Protein localization vector propagation: A method for improving the accuracy of drug repositioning. Mol. BioSyst. 2015, 11, 2096–2102. [Google Scholar] [CrossRef]
- Xue, H.; Li, J.; Xie, H.; Wang, Y. Review of drug repositioning approaches and resources. Int. J. Biol. Sci. 2018, 14, 1232. [Google Scholar] [CrossRef]
- Ashburn, T.T.; Thor, K.B. Drug repositioning: Identifying and developing new uses for existing drugs. Nat. Rev. Drug Discov. 2004, 3, 673–683. [Google Scholar] [CrossRef]
- Pushpakom, S.; Iorio, F.; Eyers, P.A.; Escott, K.J.; Hopper, S.; Wells, A.; Doig, A.; Guilliams, T.; Latimer, J.; McNamee, C.; et al. Drug repurposing: Progress, challenges and recommendations. Nat. Rev. Drug Discov. 2019, 18, 41–58. [Google Scholar] [CrossRef]
- Chen, Y.; Ung, C. Computer automated prediction of potential therapeutic and toxicity protein targets of bioactive compounds from Chinese medicinal plants. Am. J. Chin. Med. 2002, 30, 139–154. [Google Scholar] [CrossRef]
- Li, H.; Gao, Z.; Kang, L.; Zhang, H.; Yang, K.; Yu, K.; Luo, X.; Zhu, W.; Chen, K.; Shen, J.; et al. TarFisDock: A web server for identifying drug targets with docking approach. Nucleic Acids Res. 2006, 34, W219–W224. [Google Scholar] [CrossRef] [PubMed]
- Grinter, S.Z.; Liang, Y.; Huang, S.Y.; Hyder, S.M.; Zou, X. An inverse docking approach for identifying new potential anti-cancer targets. J. Mol. Graph. Modell. 2011, 29, 795–799. [Google Scholar] [CrossRef] [PubMed]
- Kitchen, D.B.; Decornez, H.; Furr, J.R.; Bajorath, J. Docking and scoring in virtual screening for drug discovery: Methods and applications. Nat. Rev. Drug Discov. 2004, 3, 935–949. [Google Scholar] [CrossRef] [PubMed]
- Huang, S.Y.; Grinter, S.Z.; Zou, X. Scoring functions and their evaluation methods for protein–ligand docking: Recent advances and future directions. Phys. Chem. Chem. Phys. 2010, 12, 12899–12908. [Google Scholar] [CrossRef]
- Cheng, T.; Li, Q.; Zhou, Z.; Wang, Y.; Bryant, S.H. Structure-based virtual screening for drug discovery: A problem-centric review. AAPS J. 2012, 14, 133–141. [Google Scholar] [CrossRef]
- Hurle, M.R.; Yang, L.; Xie, Q.; Rajpal, D.K.; Sanseau, P.; Agarwal, P. Computational drug repositioning: From data to therapeutics. Clin. Pharmacol. Ther. 2013, 93, 335–341. [Google Scholar] [CrossRef]
- Ellingson, S.R.; Smith, J.C.; Baudry, J. VinaMPI: Facilitating multiple receptor high-throughput virtual docking on high-performance computers. J. Comput. Chem. 2013, 34, 2212–2221. [Google Scholar] [CrossRef]
- Kharkar, P.S.; Warrier, S.; Gaud, R.S. Reverse docking: A powerful tool for drug repositioning and drug rescue. Future Med. Chem. 2014, 6, 333–342. [Google Scholar] [CrossRef] [PubMed]
- Kumar, S.; Ayyannan, S.R. Identification of new small molecule monoamine oxidase-B inhibitors through pharmacophore-based virtual screening, molecular docking and molecular dynamics simulation studies. J. Biomol. Struct. Dyn. 2023, 41, 6789–6810. [Google Scholar] [CrossRef] [PubMed]
- Chandel, V.; Sharma, P.P.; Raj, S.; Choudhari, R.; Rathi, B.; Kumar, D. Structure-based drug repurposing for targeting Nsp9 replicase and spike proteins of severe acute respiratory syndrome coronavirus 2. J. Biomol. Struct. Dyn. 2022, 40, 249–262. [Google Scholar] [CrossRef] [PubMed]
- Lv, X.; Wang, J.; Yuan, Y.; Pan, L.; Liu, Q.; Guo, J. In Silico drug repurposing pipeline using deep learning and structure based approaches in epilepsy. Sci. Rep. 2024, 14, 16562. [Google Scholar] [CrossRef]
- Sadeghi, M.; Miroliaei, M.; Ghanadian, M. Drug repurposing for diabetes mellitus: In Silico and In Vitro investigation of DrugBank database for α-glucosidase inhibitors. Int. J. Biol. Macromol. 2024, 270, 132164. [Google Scholar] [CrossRef]
- Li, Z.; Ding, Y.; Tuo, X.; Hu, J.; Zhang, T.; Zhou, X.; Liu, L.; Yang, S. Structure-based drug repurposing targeting pathogenic virus superfamily 1 helicase: An integrated multi-computational screening and bioactivity identification strategy. Chin. Chem. Lett. 2025, 36, 110737. [Google Scholar] [CrossRef]
- Kaur, D.; Chopra, M.; Saluja, D. Exploiting the Achilles’ heel of cancer through a structure-based drug-repurposing approach and experimental validation of top drugs using the TRAP assay. Mol. Divers. 2025, 29, 6459–6480. [Google Scholar] [CrossRef]
- Tang, B.; Wang, Y.; Jiang, X.; Thambisetty, M.; Ferrucci, L.; Johnell, K.; Hägg, S. Genetic variation in targets of antidiabetic drugs and Alzheimer disease risk: A Mendelian randomization study. Neurology 2022, 99, e650–e659. [Google Scholar] [CrossRef]
- Wang, X.; He, S.; Zhou, Z.; Bo, X.; Qi, D.; Fu, X.; Wang, Z.; Yang, J.; Wang, S. LINCS dataset-based repositioning of rosiglitazone as a potential anti-human adenovirus drug. Antivir. Res. 2020, 179, 104789. [Google Scholar] [CrossRef] [PubMed]
- Koudijs, K.K.; Böhringer, S.; Guchelaar, H.J. Validation of transcriptome signature reversion for drug repurposing in oncology. Brief. Bioinform. 2023, 24, bbac490. [Google Scholar] [CrossRef]
- Iorio, F.; Bosotti, R.; Scacheri, E.; Belcastro, V.; Mithbaokar, P.; Ferriero, R.; Murino, L.; Tagliaferri, R.; Brunetti-Pierri, N.; Isacchi, A.; et al. Discovery of drug mode of action and drug repositioning from transcriptional responses. Proc. Natl. Acad. Sci. USA 2010, 107, 14621–14626. [Google Scholar] [CrossRef]
- Iorio, F.; Rittman, T.; Ge, H.; Menden, M.; Saez-Rodriguez, J. Transcriptional data: A new gateway to drug repositioning? Drug Discov. Today 2013, 18, 350–357. [Google Scholar] [CrossRef] [PubMed]
- Ding, Y.; Tang, J.; Guo, F. Identification of drug–target interactions via fuzzy bipartite local model. Neural Comput. Appl. 2020, 32, 10303–10319. [Google Scholar] [CrossRef]
- Wang, Z.; He, M.; Liang, Z.; He, Y.; Dong, X. DiffFNN-Med: Task-Adaptive Fuzzy Neural Networks for Interpretable Medical Drug Recommendation. In Proceedings of the 2025 IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2025), Wuhan, China, 15–18 December 2025; pp. 5509–5516. [Google Scholar]
- Masoudi-Sobhanzadeh, Y.; Esmaeili, H.; Masoudi-Nejad, A. A fuzzy logic-based computational method for the repurposing of drugs against COVID-19. BioImpacts 2021, 12, 315. [Google Scholar] [CrossRef]
- Wang, F.S.; Chen, P.R.; Chen, T.Y.; Zhang, H.X. Fuzzy optimization for identifying anti-cancer targets with few side effects in constraint-based models of head and neck cancer. R. Soc. Open Sci. 2022, 9, 220633. [Google Scholar] [CrossRef]
- Wang, K.; Wan, M.; Wang, R.S.; Weng, Z. Opportunities for web-based drug repositioning: Searching for potential antihypertensive agents with hypotension adverse events. J. Med. Internet Res. 2016, 18, e4541. [Google Scholar] [CrossRef]
- Zaza, P.; Matthieu, R.; Jean-Luc, C.; Charles, K. Drug repurposing in Raynaud’s phenomenon through adverse event signature matching in the World Health Organization pharmacovigilance database. Br. J. Clin. Pharmacol. 2020, 86, 2217–2222. [Google Scholar] [CrossRef]
- Zamami, Y.; Niimura, T.; Kawashiri, T.; Goda, M.; Naito, Y.; Fukushima, K.; Ushio, S.; Aizawa, F.; Hamano, H.; Okada, N.; et al. Identification of prophylactic drugs for oxaliplatin-induced peripheral neuropathy using big data. Biomed. Pharmacother. 2022, 148, 112744. [Google Scholar] [CrossRef]
- Ko, M.; Oh, J.M.; Kim, I.W. Drug repositioning prediction for psoriasis using the adverse event reporting database. Front. Med. 2023, 10, 1159453. [Google Scholar] [CrossRef]
- Liu, Y.; Liu, Y.; Fan, R.; Kehriman, N.; Zhang, X.; Zhao, B.; Huang, L. Pharmacovigilance-based drug repurposing: Searching for putative drugs with hypohidrosis or anhidrosis adverse events for use against hyperhidrosis. Eur. J. Med. Res. 2023, 28, 95. [Google Scholar] [CrossRef]
- Wu, H.; Gao, L.; Dong, J.; Yang, X. Detecting overlapping protein complexes by rough-fuzzy clustering in protein-protein interaction networks. PLoS ONE 2014, 9, e91856. [Google Scholar] [CrossRef]
- Wu, C.; Gudivada, R.C.; Aronow, B.J.; Jegga, A.G. Computational drug repositioning through heterogeneous network clustering. BMC Syst. Biol. 2013, 7, S6. [Google Scholar] [CrossRef]
- Yu, L.; Huang, J.; Ma, Z.; Zhang, J.; Zou, Y.; Gao, L. Inferring drug-disease associations based on known protein complexes. BMC Med. Genom. 2015, 8, S2. [Google Scholar] [CrossRef]
- Lu, J.; Chen, L.; Yin, J.; Huang, T.; Bi, Y.; Kong, X.; Zheng, M.; Cai, Y.D. Identification of new candidate drugs for lung cancer using chemical–chemical interactions, chemical–protein interactions and a K-means clustering algorithm. J. Biomol. Struct. Dyn. 2016, 34, 906–917. [Google Scholar] [CrossRef]
- Luo, H.; Wang, J.; Li, M.; Luo, J.; Peng, X.; Wu, F.X.; Pan, Y. Drug repositioning based on comprehensive similarity measures and bi-random walk algorithm. Bioinformatics 2016, 32, 2664–2671. [Google Scholar] [CrossRef]
- Liu, H.; Song, Y.; Guan, J.; Luo, L.; Zhuang, Z. Inferring new indications for approved drugs via random walk on drug-disease heterogenous networks. BMC Bioinform. 2016, 17, 539. [Google Scholar] [CrossRef]
- Wang, Y.; Guo, M.; Ren, Y.; Jia, L.; Yu, G. Drug repositioning based on individual bi-random walks on a heterogeneous network. BMC Bioinform. 2019, 20, 547. [Google Scholar] [CrossRef]
- Zhou, X.; Dai, E.; Song, Q.; Ma, X.; Meng, Q.; Jiang, Y.; Jiang, W. In Silico drug repositioning based on drug-miRNA associations. Brief. Bioinform. 2020, 21, 498–510. [Google Scholar] [CrossRef]
- Xie, G.; Li, J.; Gu, G.; Sun, Y.; Lin, Z.; Zhu, Y.; Wang, W. BGMSDDA: A bipartite graph diffusion algorithm with multiple similarity integration for drug–disease association prediction. Mol. Omics 2021, 17, 997–1011. [Google Scholar] [CrossRef]
- Wang, G.; Chen, H.; Wang, H.; Fu, Y.; Shi, C.; Cao, C.; Hu, X. Heterogeneous graph contrastive learning with graph diffusion for drug repositioning. J. Chem. Inf. Model. 2025, 65, 5771–5784. [Google Scholar] [CrossRef]
- Wu, J.; Gan, W.; Yu, P.S. Graph diffusion network for drug-gene prediction. arXiv 2025, arXiv:2502.09335. [Google Scholar] [CrossRef]
- Wu, G.; Liu, J.; Yue, X. Prediction of drug-disease associations based on ensemble meta paths and singular value decomposition. BMC Bioinform. 2019, 20, 134. [Google Scholar] [CrossRef]
- Kawichai, T.; Suratanee, A.; Plaimas, K. Meta-path based gene ontology profiles for predicting drug-disease associations. IEEE Access 2021, 9, 41809–41820. [Google Scholar] [CrossRef]
- Wang, Y.; Song, J.; Dai, Q.; Duan, X. Hierarchical negative sampling based graph contrastive learning approach for drug-disease association prediction. IEEE J. Biomed. Health Inform. 2024, 28, 3146–3157. [Google Scholar] [CrossRef]
- Tian, Z.; Teng, Z.; Cheng, S.; Guo, M. Computational drug repositioning using meta-path-based semantic network analysis. BMC Syst. Biol. 2018, 12, 134. [Google Scholar] [CrossRef]
- Jia, X.; Sun, X.; Wang, K.; Li, M. DRGCL: Drug repositioning via semantic-enriched graph contrastive learning. IEEE J. Biomed. Health Inform. 2024. [Google Scholar] [CrossRef]
- Chen, B.; Ding, Y.; Wild, D.J. Assessing drug target association using semantic linked data. PLoS Comput. Biol. 2012, 8, e1002574. [Google Scholar] [CrossRef]
- Palma, G.; Vidal, M.E.; Raschid, L. Drug-target interaction prediction using semantic similarity and edge partitioning. In Proceedings of the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, 19–23 October 2014; pp. 131–146. [Google Scholar]
- Zhu, Q.; Tao, C.; Shen, F.; Chute, C.G. Exploring the pharmacogenomics knowledge base (PharmGKB) for repositioning breast cancer drugs by leveraging Web ontology language (OWL) and cheminformatics approaches. In Proceedings of the 19th Pacific Symposium on Biocomputing (PSB 2014), Kohala Coast, HI, USA, 3–7 January 2014; p. 172. [Google Scholar]
- Mullen, J.; Cockell, S.J.; Tipney, H.; Woollard, P.M.; Wipat, A. Mining integrated semantic networks for drug repositioning opportunities. PeerJ 2016, 4, e1558. [Google Scholar] [CrossRef]
- Luo, H.; Li, M.; Wang, S.; Liu, Q.; Li, Y.; Wang, J. Computational drug repositioning using low-rank matrix approximation and randomized algorithms. Bioinformatics 2018, 34, 1904–1912. [Google Scholar] [CrossRef]
- Zhang, W.; Yue, X.; Lin, W.; Wu, W.; Liu, R.; Huang, F.; Liu, F. Predicting drug-disease associations by using similarity constrained matrix factorization. BMC Bioinform. 2018, 19, 233. [Google Scholar] [CrossRef]
- Xuan, P.; Cao, Y.; Zhang, T.; Wang, X.; Pan, S.; Shen, T. Drug repositioning through integration of prior knowledge and projections of drugs and diseases. Bioinformatics 2019, 35, 4108–4119. [Google Scholar] [CrossRef]
- Yang, M.; Luo, H.; Li, Y.; Wu, F.X.; Wang, J. Overlap matrix completion for predicting drug-associated indications. PLoS Comput. Biol. 2019, 15, e1007541. [Google Scholar] [CrossRef]
- Zhang, W.; Xu, H.; Li, X.; Gao, Q.; Wang, L. DRIMC: An improved drug repositioning approach using Bayesian inductive matrix completion. Bioinformatics 2020, 36, 2839–2847. [Google Scholar] [CrossRef]
- Yang, M.; Wu, G.; Zhao, Q.; Li, Y.; Wang, J. Computational drug repositioning based on multi-similarities bilinear matrix factorization. Brief. Bioinform. 2021, 22, bbaa267. [Google Scholar] [CrossRef]
- Jamali, A.A.; Tan, Y.; Kusalik, A.; Wu, F.X. NTD-DR: Nonnegative tensor decomposition for drug repositioning. PLoS ONE 2022, 17, e0270852. [Google Scholar] [CrossRef]
- Papanikolaou, N.; Pavlopoulos, G.A.; Theodosiou, T.; Vizirianakis, I.S.; Iliopoulos, I. DrugQuest-a text mining workflow for drug association discovery. BMC Bioinform. 2016, 17, 182. [Google Scholar] [CrossRef]
- Lee, S.; Kim, D.; Lee, K.; Choi, J.; Kim, S.; Jeon, M.; Lim, S.; Choi, D.; Kim, S.; Tan, A.C.; et al. BEST: Next-generation biomedical entity search tool for knowledge discovery from biomedical literature. PLoS ONE 2016, 11, e0164680. [Google Scholar] [CrossRef]
- Jin, S.; Niu, Z.; Jiang, C.; Huang, W.; Xia, F.; Jin, X.; Liu, X.; Zeng, X. HeTDR: Drug repositioning based on heterogeneous networks and text mining. Patterns 2021, 2, 100307. [Google Scholar] [CrossRef]
- Tari, L.B.; Patel, J.H. Systematic drug repurposing through text mining. In Biomedical Literature Mining; Humana: New York, NY, USA, 2014; pp. 253–267. [Google Scholar]
- Yang, H.T.; Ju, J.H.; Wong, Y.T.; Shmulevich, I.; Chiang, J.H. Literature-based discovery of new candidates for drug repurposing. Brief. Bioinform. 2017, 18, 488–497. [Google Scholar] [CrossRef]
- Zhu, Y.; Jung, W.; Wang, F.; Che, C. Drug repurposing against Parkinson’s disease by text mining the scientific literature. Libr. Hi Tech 2020, 38, 741–750. [Google Scholar] [CrossRef]
- Ma, T.; Lin, X.; Li, T.; Li, C.; Chen, L.; Zhou, P.; Cai, X.; Yang, X.; Zeng, D.; Cao, D.; et al. Y-Mol: A multiscale biomedical knowledge-guided large language model for drug development. arXiv 2024, arXiv:2410.11550. [Google Scholar]
- Yan, C.; Grabowska, M.E.; Dickson, A.L.; Li, B.; Wen, Z.; Roden, D.M.; Michael Stein, C.; Embí, P.J.; Peterson, J.F.; Feng, Q.; et al. Leveraging generative AI to prioritize drug repurposing candidates for Alzheimer’s disease with real-world clinical validation. npj Digit. Med. 2024, 7, 46. [Google Scholar] [CrossRef]
- Gu, Y.; Xu, Z.; Yang, C. Empowering graph neural network-based computational drug repositioning with large language model-inferred knowledge representation. Interdiscip. Sci. Comput. Life Sci. 2025, 17, 698–715. [Google Scholar] [CrossRef] [PubMed]
- Schmitt, R.A.; Buelau, K.; Martin, L.; Huettl, C.; Schirner, M.; Stefanovski, L.; Ritter, P. Biological database mining for LLM-driven Alzheimer’s disease drug repurposing. bioRxiv 2024. [Google Scholar] [CrossRef]
- Liu, S.; Lu, Y.; Chen, S.; Hu, X.; Zhao, J.; Lu, Y.; Zhao, Y. Drugagent: Automating ai-aided drug discovery programming through llm multi-agent collaboration. In Proceedings of the 2nd AAAI Workshop on Foundation Models for Biological Discoveries (FMs4Bio 2025), Philadelphia, PA, USA, 4 March 2025. [Google Scholar]
- Inoue, Y.; Song, T.; Wang, X.; Luna, A.; Fu, T. Drugagent: Multi-agent large language model-based reasoning for drug-target interaction prediction. In Proceedings of the ICLR 2025 Workshop on Machine Learning for Genomics Explorations (MLGenX 2025), Singapore, 27 April 2025. [Google Scholar]
- Sirota, M.; Dudley, J.T.; Kim, J.; Chiang, A.P.; Morgan, A.A.; Sweet-Cordero, A.; Sage, J.; Butte, A.J. Discovery and preclinical validation of drug indications using compendia of public gene expression data. Sci. Transl. Med. 2011, 3, 96ra77. [Google Scholar] [CrossRef]
- Tanoli, Z.; Seemab, U.; Scherer, A.; Wennerberg, K.; Tang, J.; Vähä-Koskela, M. Exploration of databases and methods supporting drug repurposing: A comprehensive survey. Brief. Bioinform. 2021, 22, 1656–1678. [Google Scholar] [CrossRef] [PubMed]
- Lee, J.; Yoon, W.; Kim, S.; Kim, D.; Kim, S.; So, C.H.; Kang, J. BioBERT: A pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 2020, 36, 1234–1240. [Google Scholar] [CrossRef]
- Cheng, F.; Desai, R.J.; Handy, D.E.; Wang, R.; Schneeweiss, S.; Barabási, A.L.; Loscalzo, J. Network-based approach to prediction and population-based validation of in silico drug repurposing. Nat. Commun. 2018, 9, 2691. [Google Scholar] [CrossRef]
- Koromina, M.; Pandi, M.T.; Patrinos, G.P. Rethinking drug repositioning and development with artificial intelligence, machine learning, and omics. OMICS 2019, 23, 539–548. [Google Scholar] [CrossRef]
- Xiong, Z.; Huang, F.; Wang, Z.; Liu, S.; Zhang, W. A multimodal framework for improving in silico drug repositioning with the prior knowledge from knowledge graphs. IEEE/ACM Trans. Comput. Biol. Bioinform. 2021, 19, 2623–2631. [Google Scholar] [CrossRef]
- Tanoli, Z.; Vähä-Koskela, M.; Aittokallio, T. Artificial intelligence, machine learning, and drug repurposing in cancer. Expert Opin. Drug Discov. 2021, 16, 977–989. [Google Scholar] [CrossRef]
- Kim, Y.; Jung, Y.S.; Park, J.H.; Kim, S.J.; Cho, Y.R. Drug-disease association prediction using heterogeneous networks for computational drug repositioning. Biomolecules 2022, 12, 1497. [Google Scholar] [CrossRef]
- Wang, L.; Lu, Y.; Li, D.; Zhou, Y.; Yu, L.; Mesa Eguiagaray, I.; Campbell, H.; Li, X.; Theodoratou, E. The landscape of the methodology in drug repurposing using human genomic data: A systematic review. Brief. Bioinform. 2024, 25, bbad527. [Google Scholar] [CrossRef]
- Cummings, J.L.; Zhou, Y.; Van Stone, A.; Cammann, D.; Tonegawa-Kuji, R.; Fonseca, J.; Cheng, F. Drug repurposing for Alzheimer’s disease and other neurodegenerative disorders. Nat. Commun. 2025, 16, 1755. [Google Scholar] [CrossRef]
- Huang, S.Y.; Zou, X. An iterative knowledge-based scoring function to predict protein–ligand interactions: I. Derivation of interaction potentials. J. Comput. Chem. 2006, 27, 1866–1875. [Google Scholar] [CrossRef] [PubMed]
- Fan, S.; Geng, Q.; Pan, Z.; Li, X.; Tie, L.; Pan, Y.; Li, X. Clarifying off-target effects for torcetrapib using network pharmacology and reverse docking approach. BMC Syst. Biol. 2012, 6, 152. [Google Scholar] [CrossRef] [PubMed]
- Iorio, F.; Isacchi, A.; di Bernardo, D.; Brunetti-Pierri, N. Identification of small molecules enhancing autophagic function from drug network analysis. Autophagy 2010, 6, 1204–1205. [Google Scholar] [CrossRef]
- Pilipiec, P.; Liwicki, M.; Bota, A. Using machine learning for pharmacovigilance: A systematic review. Pharmaceutics 2022, 14, 266. [Google Scholar] [CrossRef] [PubMed]
- Blondel, V.D.; Guillaume, J.L.; Lambiotte, R.; Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. 2008, 2008, P10008. [Google Scholar] [CrossRef]
- Nepusz, T.; Yu, H.; Paccanaro, A. Detecting overlapping protein complexes in protein-protein interaction networks. Nat. Methods 2012, 9, 471–472. [Google Scholar] [CrossRef]
- Cowen, L.; Ideker, T.; Raphael, B.J.; Sharan, R. Network propagation: A universal amplifier of genetic associations. Nat. Rev. Genet. 2017, 18, 551–562. [Google Scholar] [CrossRef]
- Köhler, S.; Bauer, S.; Horn, D.; Robinson, P.N. Walking the interactome for prioritization of candidate disease genes. Am. J. Hum. Genet. 2008, 82, 949–958. [Google Scholar] [CrossRef]
- Page, L.; Brin, S.; Motwani, R.; Winograd, T. The pagerank citation ranking: Bring order to the web. In Proceedings of the 7th International World Wide Web Conference (WWW7), Brisbane, Australia, 14–18 April 1998; pp. 161–172. [Google Scholar]
- Vanunu, O.; Magger, O.; Ruppin, E.; Shlomi, T.; Sharan, R. Associating genes and protein complexes with disease via network propagation. PLoS Comput. Biol. 2010, 6, e1000641. [Google Scholar] [CrossRef]
- Gönen, M. Predicting drug–target interactions from chemical and genomic kernels using Bayesian matrix factorization. Bioinformatics 2012, 28, 2304–2310. [Google Scholar] [CrossRef] [PubMed]
- Gasteiger, J.; Weißenberger, S.; Günnemann, S. Diffusion improves graph learning. Proc. Adv. Neural Inf. Process. Syst. 2019, 32, 13366–13378. [Google Scholar]
- Sun, Y.; Han, J.; Yan, X.; Yu, P.S.; Wu, T. Pathsim: Meta path-based top-k similarity search in heterogeneous information networks. Proc. VLDB Endow. 2011, 4, 992–1003. [Google Scholar] [CrossRef]
- Shi, C.; Hu, B.; Zhao, W.X.; Yu, P.S. Heterogeneous information network embedding for recommendation. IEEE Trans. Knowl. Data Eng. 2018, 31, 357–370. [Google Scholar] [CrossRef]
- Mullen, J.; Cockell, S.J.; Woollard, P.; Wipat, A. An integrated data driven approach to drug repositioning using gene-disease associations. PLoS ONE 2016, 11, e0155811. [Google Scholar] [CrossRef]
- Gelfond, M.; Lifschitz, V. The Stable Model Semanticsfor Logic Programming. In Proceedings of the 5th International Conference and Symposium on Logic Programming (ICLP 1988), Seattle, WA, USA, 15–19 August 1988; pp. 1070–1080. [Google Scholar]
- Gelfond, M.; Lifschitz, V. Classical negation in logic programs and disjunctive databases. New Gener. Comput. 1991, 9, 365–385. [Google Scholar] [CrossRef]
- Bordes, A.; Usunier, N.; Garcia-Duran, A.; Weston, J.; Yakhnenko, O. Translating embeddings for modeling multi-relational data. Adv. Neural Inf. Process. Syst. 2013, 26, 2787–2795. [Google Scholar]
- Wang, Z.; Zhang, J.; Feng, J.; Chen, Z. Knowledge graph embedding by translating on hyperplanes. In Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI 2014), Québec City, QC, Canada, 27–31 July 2014; Volume 28, pp. 1112–1119. [Google Scholar]
- Swanson, D.R. Medical literature as a potential source of new knowledge. Bull. Med. Libr. Assoc. 1990, 78, 29. [Google Scholar]
- Wang, B.; Mezlini, A.M.; Demir, F.; Fiume, M.; Tu, Z.; Brudno, M.; Haibe-Kains, B.; Goldenberg, A. Similarity network fusion for aggregating data types on a genomic scale. Nat. Methods 2014, 11, 333–337. [Google Scholar] [CrossRef]
- Jiang, H.J.; Huang, Y.A.; You, Z.H. SAEROF: An ensemble approach for large-scale drug-disease association prediction by incorporating rotation forest and sparse autoencoder deep neural network. Sci. Rep. 2020, 10, 4972. [Google Scholar] [CrossRef]
- Olsen, E.A.; Weiner, M.S.; Delong, E.R.; Pinnell, S.R. Topical minoxidil in early male pattern baldness. J. Am. Acad. Dermatol. 1985, 13, 185–192. [Google Scholar] [CrossRef]
- Goldstein, I.; Lue, T.F.; Padma-Nathan, H.; Rosen, R.C.; Steers, W.D.; Wicker, P.A.; Sildenafil Study Group. Oral sildenafil in the treatment of erectile dysfunction. J. Urol. 2002, 167, 1197–1203. [Google Scholar] [CrossRef]
- Burn, J.; Gerdes, A.M.; Macrae, F.; Mecklin, J.P.; Moeslein, G.; Olschwang, S.; Eccles, D.; Evans, D.G.; Maher, E.R.; Bertario, L.; et al. Long-term effect of aspirin on cancer risk in carriers of hereditary colorectal cancer: An analysis from the CAPP2 randomised controlled trial. Lancet 2011, 378, 2081–2087. [Google Scholar] [CrossRef]
- Brown, T.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.D.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; et al. Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 2020, 33, 1877–1901. [Google Scholar]
- Merchant, A.; Batzner, S.; Schoenholz, S.S.; Aykol, M.; Cheon, G.; Cubuk, E.D. Scaling deep learning for materials discovery. Nature 2023, 624, 80–85. [Google Scholar] [CrossRef]
- Thirunavukarasu, A.J.; Ting, D.S.J.; Elangovan, K.; Gutierrez, L.; Tan, T.F.; Ting, D.S.W. Large language models in medicine. Nat. Med. 2023, 29, 1930–1940. [Google Scholar] [CrossRef]
- Team, G.; Georgiev, P.; Lei, V.I.; Burnell, R.; Bai, L.; Gulati, A.; Tanzer, G.; Vincent, D.; Pan, Z.; Wang, S.; et al. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context. arXiv 2024, arXiv:2403.05530. [Google Scholar] [CrossRef]
- Radford, A.; Kim, J.W.; Hallacy, C.; Ramesh, A.; Goh, G.; Agarwal, S.; Sastry, G.; Askell, A.; Mishkin, P.; Clark, J.; et al. Learning transferable visual models from natural language supervision. In Proceedings of the 38th International Conference on Machine Learning (ICML 2021), Virtual Event, 18–24 July 2021; pp. 8748–8763. [Google Scholar]
- Liu, P.; Yuan, W.; Fu, J.; Jiang, Z.; Hayashi, H.; Neubig, G. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. ACM Comput. Surv. 2023, 55, 195. [Google Scholar] [CrossRef]
- Shortliffe, E. Computer-Based Medical Consultations: MYCIN; Elsevier: Amsterdam, The Netherlands, 2012; Volume 2. [Google Scholar]
- Silver, D.; Hubert, T.; Schrittwieser, J.; Antonoglou, I.; Lai, M.; Guez, A.; Lanctot, M.; Sifre, L.; Kumaran, D.; Graepel, T.; et al. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science 2018, 362, 1140–1144. [Google Scholar] [CrossRef]
- Xi, Z.; Chen, W.; Guo, X.; He, W.; Ding, Y.; Hong, B.; Zhang, M.; Wang, J.; Jin, S.; Zhou, E.; et al. The rise and potential of large language model based agents: A survey. Sci. China Inf. Sci. 2025, 68, 121101. [Google Scholar] [CrossRef]
- Wang, L.; Ma, C.; Feng, X.; Zhang, Z.; Yang, H.; Zhang, J.; Chen, Z.; Tang, J.; Chen, X.; Lin, Y.; et al. A survey on large language model based autonomous agents. Front. Comput. Sci. 2024, 18, 186345. [Google Scholar] [CrossRef]
- Qian, C.; Liu, W.; Liu, H.; Chen, N.; Dang, Y.; Li, J.; Yang, C.; Chen, W.; Su, Y.; Cong, X.; et al. Chatdev: Communicative agents for software development. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL 2024), Bangkok, Thailand, 11–16 August 2024; pp. 15174–15186. [Google Scholar]
- Li, G.; Hammoud, H.; Itani, H.; Khizbullin, D.; Ghanem, B. Camel: Communicative agents for “mind" exploration of large language model society. Adv. Neural Inf. Process. Syst. 2023, 36, 51991–52008. [Google Scholar]
- Park, J.S.; O’Brien, J.; Cai, C.J.; Morris, M.R.; Liang, P.; Bernstein, M.S. Generative agents: Interactive simulacra of human behavior. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology (UIST 2023), San Francisco, CA, USA, 29 October–1 November 2023; pp. 1–22. [Google Scholar]
- Ziems, C.; Held, W.; Shaikh, O.; Chen, J.; Zhang, Z.; Yang, D. Can large language models transform computational social science? Comput. Linguist. 2024, 50, 237–291. [Google Scholar] [CrossRef]
- Zhang, Y.; Ren, S.; Wang, J.; Lu, J.; Wu, C.; He, M.; Liu, X.; Wu, R.; Zhao, J.; Zhan, C.; et al. Aligning Large Language Models with Humans: A Comprehensive Survey of ChatGPT’s Aptitude in Pharmacology: Y. Zhang et al. Drugs 2025, 85, 231–254. [Google Scholar] [CrossRef]
- Huang, L.; Yu, W.; Ma, W.; Zhong, W.; Feng, Z.; Wang, H.; Chen, Q.; Peng, W.; Feng, X.; Qin, B.; et al. A survey on hallucination in large language models: Principles, taxonomy, challenges, and open questions. ACM Trans. Inf. Syst. 2025, 43, 42. [Google Scholar] [CrossRef]
- Selmi, I. A Human–AI Co-Validation Framework to Mitigate LLM Hallucinations in Clinical Decision Support. Zenodo 2025, 9, 1–7. [Google Scholar]
- Berman, H.M.; Westbrook, J.; Feng, Z.; Gilliland, G.; Bhat, T.N.; Weissig, H.; Shindyalov, I.N.; Bourne, P.E. The protein data bank. Nucleic Acids Res. 2000, 28, 235–242. [Google Scholar] [CrossRef]
- Wishart, D.S.; Feunang, Y.D.; Guo, A.C.; Lo, E.J.; Marcu, A.; Grant, J.R.; Sajed, T.; Johnson, D.; Li, C.; Sayeeda, Z.; et al. DrugBank 5.0: A major update to the DrugBank database for 2018. Nucleic Acids Res. 2018, 46, D1074–D1082. [Google Scholar] [CrossRef]
- Kim, S.; Chen, J.; Cheng, T.; Gindulyte, A.; He, J.; He, S.; Li, Q.; Shoemaker, B.A.; Thiessen, P.A.; Yu, B.; et al. PubChem in 2021: New data content and improved web interfaces. Nucleic Acids Res. 2021, 49, D1388–D1395. [Google Scholar] [CrossRef]
- Gaulton, A.; Bellis, L.J.; Bento, A.P.; Chambers, J.; Davies, M.; Hersey, A.; Light, Y.; McGlinchey, S.; Michalovich, D.; Al-Lazikani, B.; et al. ChEMBL: A large-scale bioactivity database for drug discovery. Nucleic Acids Res. 2012, 40, D1100–D1107. [Google Scholar] [CrossRef]
- Amberger, J.S.; Bocchini, C.A.; Scott, A.F.; Hamosh, A. OMIM. org: Leveraging knowledge across phenotype–gene relationships. Nucleic Acids Res. 2019, 47, D1038–D1043. [Google Scholar] [CrossRef]
- Ashburner, M.; Ball, C.A.; Blake, J.A.; Botstein, D.; Butler, H.; Cherry, J.M.; Davis, A.P.; Dolinski, K.; Dwight, S.S.; Eppig, J.T.; et al. Gene ontology: Tool for the unification of biology. Nat. Genet. 2000, 25, 25–29. [Google Scholar] [CrossRef]
- Szklarczyk, D.; Kirsch, R.; Koutrouli, M.; Nastou, K.; Mehryary, F.; Hachilif, R.; Gable, A.L.; Fang, T.; Doncheva, N.T.; Pyysalo, S.; et al. The STRING database in 2023: Protein–protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 2023, 51, D638–D646. [Google Scholar] [CrossRef]
- Ferreira, L.G.; Dos Santos, R.N.; Oliva, G.; Andricopulo, A.D. Molecular docking and structure-based drug design strategies. Molecules 2015, 20, 13384–13421. [Google Scholar] [CrossRef]
- Ghahremanian, S.; Rashidi, M.M.; Raeisi, K.; Toghraie, D. Molecular dynamics simulation approach for discovering potential inhibitors against SARS-CoV-2: A structural review. J. Mol. Liq. 2022, 354, 118901. [Google Scholar] [CrossRef]
- Sobolev, O.V.; Afonine, P.V.; Moriarty, N.W.; Hekkelman, M.L.; Joosten, R.P.; Perrakis, A.; Adams, P.D. A global Ramachandran score identifies protein structures with unlikely stereochemistry. Structure 2020, 28, 1249–1258. [Google Scholar] [CrossRef]
- Batool, M.; Ahmad, B.; Choi, S. A structure-based drug discovery paradigm. Int. J. Mol. Sci. 2019, 20, 2783. [Google Scholar] [CrossRef]
- Choudhury, C.; Murugan, N.A.; Priyakumar, U.D. Structure-based drug repurposing: Traditional and advanced AI/ML-aided methods. Drug Discov. Today 2022, 27, 1847–1861. [Google Scholar] [CrossRef]
- Sudlow, C.; Gallacher, J.; Allen, N.; Beral, V.; Burton, P.; Danesh, J.; Downey, P.; Elliott, P.; Green, J.; Landray, M.; et al. UK biobank: An open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015, 12, e1001779. [Google Scholar] [CrossRef]
- Buniello, A.; MacArthur, J.A.L.; Cerezo, M.; Harris, L.W.; Hayhurst, J.; Malangone, C.; McMahon, A.; Morales, J.; Mountjoy, E.; Sollis, E.; et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019, 47, D1005–D1012. [Google Scholar] [CrossRef] [PubMed]
- Lamb, J.; Crawford, E.D.; Peck, D.; Modell, J.W.; Blat, I.C.; Wrobel, M.J.; Lerner, J.; Brunet, J.P.; Subramanian, A.; Ross, K.N.; et al. The Connectivity Map: Using gene-expression signatures to connect small molecules, genes, and disease. Science 2006, 313, 1929–1935. [Google Scholar] [CrossRef] [PubMed]
- Subramanian, A.; Narayan, R.; Corsello, S.M.; Peck, D.D.; Natoli, T.E.; Lu, X.; Gould, J.; Davis, J.F.; Tubelli, A.A.; Asiedu, J.K.; et al. A next generation connectivity map: L1000 platform and the first 1,000,000 profiles. Cell 2017, 171, 1437–1452. [Google Scholar] [CrossRef]
- Barrett, T.; Suzek, T.O.; Troup, D.B.; Wilhite, S.E.; Ngau, W.C.; Ledoux, P.; Rudnev, D.; Lash, A.E.; Fujibuchi, W.; Edgar, R. NCBI GEO: Mining millions of expression profiles—database and tools. Nucleic Acids Res. 2005, 33, D562–D566. [Google Scholar] [CrossRef] [PubMed]
- Weinstein, J.N.; Collisson, E.A.; Mills, G.B.; Shaw, K.R.; Ozenberger, B.A.; Ellrott, K.; Shmulevich, I.; Sander, C.; Stuart, J.M. The cancer genome atlas pan-cancer analysis project. Nat. Genet. 2013, 45, 1113–1120. [Google Scholar] [CrossRef]
- Bowden, J.; Davey Smith, G.; Haycock, P.C.; Burgess, S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 2016, 40, 304–314. [Google Scholar] [CrossRef]
- Burgess, S.; Butterworth, A.; Thompson, S.G. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet. Epidemiol. 2013, 37, 658–665. [Google Scholar] [CrossRef]
- Brown, A.S.; Patel, C.J. A review of validation strategies for computational drug repositioning. Brief. Bioinform. 2018, 19, 174–177. [Google Scholar] [CrossRef]
- Kanehisa, M.; Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000, 28, 27–30. [Google Scholar] [CrossRef] [PubMed]
- Schomburg, I.; Chang, A.; Schomburg, D. BRENDA, enzyme data and metabolic information. Nucleic Acids Res. 2002, 30, 47–49. [Google Scholar] [CrossRef] [PubMed]
- Tsherniak, A.; Vazquez, F.; Montgomery, P.G.; Weir, B.A.; Kryukov, G.; Cowley, G.S.; Gill, S.; Harrington, W.F.; Pantel, S.; Krill-Burger, J.M.; et al. Defining a cancer dependency map. Cell 2017, 170, 564–576. [Google Scholar] [CrossRef]
- Stelzer, G.; Rosen, N.; Plaschkes, I.; Zimmerman, S.; Twik, M.; Fishilevich, S.; Stein, T.I.; Nudel, R.; Lieder, I.; Mazor, Y.; et al. The GeneCards suite: From gene data mining to disease genome sequence analyses. Curr. Protoc. Bioinform. 2016, 54, 1–30. [Google Scholar] [CrossRef] [PubMed]
- Wilcoxon, F. Individual comparisons by ranking methods. In Breakthroughs in Statistics: Methodology and Distribution; Springer: New York, NY, USA, 1992; pp. 196–202. [Google Scholar]
- Friedman, M. The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J. Am. Stat. Assoc. 1937, 32, 675–701. [Google Scholar] [CrossRef]
- Armstrong, R.A. When to use the B onferroni correction. Ophthalmic Physiol. Opt. 2014, 34, 502–508. [Google Scholar] [CrossRef]
- Potter, E.; Reyes, M.; Naples, J.; Dal Pan, G. FDA adverse event reporting system (FAERS) essentials: A guide to understanding, applying, and interpreting adverse event data reported to FAERS. Clin. Pharmacol. Ther. 2025, 118, 567–582. [Google Scholar] [CrossRef]
- Brand, J.S.; Gauffin, O.; Sartori, D.; Fusaroli, M.; Sköld, H.; Bergvall, T.; Sandberg, L.; Wallberg, M.; Hjelmström, P.; Norén, G.N. VigiBase: Resource Profile Update with a Summary of Global Patterns and Trends in Adverse Event Reports for Medicines and Vaccines. Drug Saf. 2026, 49, 613. [Google Scholar] [CrossRef]
- Brown, E.G.; Wood, L.; Wood, S. The medical dictionary for regulatory activities (MedDRA). Drug Saf. 1999, 20, 109–117. [Google Scholar] [CrossRef] [PubMed]
- Hollingworth, S.; Kairuz, T. Measuring medicine use: Applying ATC/DDD methodology to real-world data. Pharmacy 2021, 9, 60. [Google Scholar] [CrossRef]
- Salvadores, M.; Alexander, P.R.; Musen, M.A.; Noy, N.F. BioPortal as a dataset of linked biomedical ontologies and terminologies in RDF. Semant. Web 2013, 4, 277–284. [Google Scholar] [CrossRef]
- Rothman, K.J.; Lanes, S.; Sacks, S.T. The reporting odds ratio and its advantages over the proportional reporting ratio. Pharmacoepidemiol. Drug Saf. 2004, 13, 519–523. [Google Scholar] [CrossRef]
- Bate, A.; Lindquist, M.; Orre, R.; Edwards, I.; Meyboom, R. Data-mining analyses of pharmacovigilance signals in relation to relevant comparison drugs. Eur. J. Clin. Pharmacol. 2002, 58, 483–490. [Google Scholar] [CrossRef] [PubMed]
- Fram, D.M.; Almenoff, J.S.; DuMouchel, W. Empirical Bayesian data mining for discovering patterns in post-marketing drug safety. In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2003), Washington, DC, USA, 24–27 August 2003; pp. 359–368. [Google Scholar]
- Järvelin, K.; Kekäläinen, J. Cumulated gain-based evaluation of IR techniques. ACM Trans. Inf. Syst. (TOIS) 2002, 20, 422–446. [Google Scholar] [CrossRef]
- Schober, P.; Boer, C.; Schwarte, L.A. Correlation coefficients: Appropriate use and interpretation. Anesth. Analg. 2018, 126, 1763–1768. [Google Scholar] [CrossRef]
- Saito, T.; Rehmsmeier, M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE 2015, 10, e0118432. [Google Scholar] [CrossRef] [PubMed]
- Dewulf, P.; Stock, M.; De Baets, B. Cold-start problems in data-driven prediction of drug–drug interaction effects. Pharmaceuticals 2021, 14, 429. [Google Scholar] [CrossRef]
- Davis, A.P.; Wiegers, T.C.; Johnson, R.J.; Sciaky, D.; Wiegers, J.; Mattingly, C.J. Comparative toxicogenomics database (CTD): Update 2023. Nucleic Acids Res. 2023, 51, D1257–D1262. [Google Scholar] [CrossRef]
- Giurgiu, M.; Reinhard, J.; Brauner, B.; Dunger-Kaltenbach, I.; Fobo, G.; Frishman, G.; Montrone, C.; Ruepp, A. CORUM: The comprehensive resource of mammalian protein complexes—2019. Nucleic Acids Res. 2019, 47, D559–D563. [Google Scholar] [CrossRef]
- Piñero, J.; Queralt-Rosinach, N.; Bravo, A.; Deu-Pons, J.; Bauer-Mehren, A.; Baron, M.; Sanz, F.; Furlong, L.I. DisGeNET: A discovery platform for the dynamical exploration of human diseases and their genes. Database 2015, 2015, bav028. [Google Scholar] [CrossRef]
- Huntley, R.; Dimmer, E.; Barrell, D.; Binns, D.; Apweiler, R. The gene ontology annotation (GOA) database. Nat. Preced. 2009. [Google Scholar] [CrossRef]
- Liu, T.; Lin, Y.; Wen, X.; Jorissen, R.N.; Gilson, M.K. BindingDB: A web-accessible database of experimentally determined protein–ligand binding affinities. Nucleic Acids Res. 2007, 35, D198–D201. [Google Scholar] [CrossRef]
- Thorn, C.F.; Klein, T.E.; Altman, R.B. PharmGKB: The pharmacogenomics knowledge base. In Pharmacogenomics: Methods and Protocols; Humana Press: Totowa, NJ, USA, 2013; pp. 311–320. [Google Scholar]
- Zarin, D.A.; Tse, T.; Williams, R.J.; Califf, R.M.; Ide, N.C. The ClinicalTrials. gov results database—Update and key issues. N. Engl. J. Med. 2011, 364, 852–860. [Google Scholar] [CrossRef]
- Brown, A.S.; Patel, C.J. A standard database for drug repositioning. Sci. Data 2017, 4, 170029. [Google Scholar] [CrossRef]
- Liang, X.; Zhang, P.; Yan, L.; Fu, Y.; Peng, F.; Qu, L.; Shao, M.; Chen, Y.; Chen, Z. LRSSL: Predict and interpret drug–disease associations based on data integration using sparse subspace learning. Bioinformatics 2017, 33, 1187–1196. [Google Scholar] [CrossRef]
- Günther, S.; Kuhn, M.; Dunkel, M.; Campillos, M.; Senger, C.; Petsalaki, E.; Ahmed, J.; Urdiales, E.G.; Gewiess, A.; Jensen, L.J.; et al. SuperTarget and Matador: Resources for exploring drug-target relationships. Nucleic Acids Res. 2007, 36, D919–D922. [Google Scholar] [CrossRef]
- McEntyre, J.; Lipman, D. PubMed: Bridging the information gap. CMAJ 2001, 164, 1317–1319. [Google Scholar]
- Clough, E.; Barrett, T. The gene expression omnibus database. In Statistical Genomics: Methods and Protocols; Humana Press: New York, NY, USA, 2016; pp. 93–110. [Google Scholar]
- Wang, Y.; Zhang, S.; Li, F.; Zhou, Y.; Zhang, Y.; Wang, Z.; Zhang, R.; Zhu, J.; Ren, Y.; Tan, Y.; et al. Therapeutic target database 2020: Enriched resource for facilitating research and early development of targeted therapeutics. Nucleic Acids Res. 2020, 48, D1031–D1041. [Google Scholar] [CrossRef]
- UniProt: The universal protein knowledgebase in 2023. Nucleic Acids Res. 2023, 51, D523–D531. [CrossRef] [PubMed]
- Edgar, R.; Domrachev, M.; Lash, A.E. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002, 30, 207–210. [Google Scholar] [CrossRef]
- Himmelstein, D.S.; Lizee, A.; Hessler, C.; Brueggeman, L.; Chen, S.L.; Hadley, D.; Green, A.; Khankhanian, P.; Baranzini, S.E. Systematic integration of biomedical knowledge prioritizes drugs for repurposing. eLife 2017, 6, e26726. [Google Scholar] [CrossRef]
- Handa, K.; Thomas, M.C.; Kageyama, M.; Iijima, T.; Bender, A. On the difficulty of validating molecular generative models realistically: A case study on public and proprietary data. J. Cheminf. 2023, 15, 112. [Google Scholar] [CrossRef]
- Kang, H.; Li, J.; Hou, L.; Xu, X.; Zheng, S.; Li, Q. Large Language Model–Enhanced Drug Repositioning Knowledge Extraction via Long Chain-of-Thought: Development and Evaluation Study. JMIR Med. Inform. 2025, 13, e77837. [Google Scholar] [CrossRef] [PubMed]
- Otsuka, Y.; Kaneko, M.; Narukawa, M. Factors associated with successful phase III trials for solid tumors: A systematic review. Contemp. Clin. Trials Commun. 2021, 24, 100855. [Google Scholar] [CrossRef] [PubMed]
- Jara, M.O.; Williams, R.O., III. The challenge of repurposing niclosamide: Considering pharmacokinetic parameters, routes of administration, and drug metabolism. J. Drug Deliv. Sci. Technol. 2023, 81, 104187. [Google Scholar] [CrossRef]
- World Health Organization. Report of the Technical Consultation on Innovative Clinical Trial Designs for Development of New TB Treatments; World Health Organization: Geneva, Switzerland, 2021. [Google Scholar]








| Method | Techniques | Key Advantages | Key Limitations | Application Scenarios |
|---|---|---|---|---|
| Structure-based [8,10,15,18,19] | Molecular docking; molecular dynamics (MD) simulations | Atomic-level mechanistic insights; cost-effective candidate prioritization | Dependence on high-quality 3D target structures; high computational demands; scoring function approximations | Candidate target structures are available; complex stability validation; toxicity pathway identification |
| Omics-based [20,21,22,24,73] | Mendelian randomization (genomic data); signature-based approaches (transcriptomic data) | Mode of action (MoA); causal evidence | Dependence on strong cis-variants; cell death signals confound cancer screens; population bias | Causal inference for drug-disease relationships; mechanism of action elucidation |
| Fuzzy logic-based [25,26,27,28] | Fuzzy logic: fuzzification, rule-based inference and defuzzification | Transform qualitative concepts into computable operations; filter outliers without data resampling; interpretable and aligned with clinical reasoning | Domain expertise to define membership functions; manual definition of if-then rules | Side effect quantification via fuzzy equality operators; multi-objective optimization |
| Adverse event (AE)-based [29,30,31,32,33,74] | AE-primary; AE-auxiliary | Human-centered; scalable hypothesis generation | Spontaneous reporting biases; multi-metric assessments and orthogonal validation | Inverse phenotype identification; clinical feasibility; filter out the side-effect drugs |
| Network-based [35,39,42,45,50,54,57] | Graph clustering, random walk, graph diffusion, meta-path, semantic-based; matrix factorization or matrix completion | Integrate multi-source heterogeneous data; systems-level perspective on biological networks | Heavy dependence on data quality and completeness; cold-start problem; lack interpretability; difficulty distinguishing positive vs. negative drug-disease associations | Predict unknown drug-disease associations; integrate heterogeneous biomedical data; identify functional modules and therapeutic communities |
| Text mining-based [61,62,63,64,65,66,75] | Lexical co-occurrence; semantic parsing; logical reasoning; feature extraction; pre-trained language models | Detect direct and indirect relationships beyond manual curation; transform textual data into actionable knowledge; rapid retrieval and intuitive exploration | Potential source biases in literature; dependency on entity recognition quality; require careful validation of extracted relationships | Literature-based candidate identification; disease-specific knowledge graph construction; indirect relationship inference |
| LLM-based [67,68,69,70,71,72] | Medical specialized language models; multi-source knowledge integration models; LLM agent collaborative models | Superior contextual semantic understanding; zero-shot task adaptation and knowledge transfer; cross-modal knowledge integration | Model hallucinations generate biologically implausible predictions; heavy dependency on training data quality and completeness | Multi-source data integration; high-throughput screening; hypothesis generation and mechanistic inference |
| Methods | Key Idea | Limitations |
|---|---|---|
| Graph Clustering [35,36,37,87,88] | Direct Clustering:Use Louvain or ClusterONE to directly extract drug-disease associations Protein Complex-Mediated: Use protein complexes as functional bridges to infer drug-disease associations indirectly via a tripartite network where clustering validates predictions by grouping drugs with known therapeutics Network-derived Feature-Based: Combine network-derived features with classical clustering (e.g., K-means) to prioritize candidates grouping with approved drugs | Direct Clustering: Heavily rely on existing gene annotationsProtein Complex-Mediated: Difficulty distinguishing positive or negative associations Network-derived Feature-Based: Highly rely on feature quality |
| Random Walk [38,39,40,41] | Simulate a walker moving through a biological network, where the final probability of reaching a node reflects its therapeutic potential (e.g., bidirectional or dual-perspective walks, individualized walk lengths) | Heavily on data quality and parameter tuning; Lack interpretability |
| Graph Diffusion [42,43,44,89,90,91,92,93,94] | Non-parametric: Use graph diffusion directly on biological networks to predict drug-disease associations without learnable parameters Embedding learning integration: Employ graph diffusion as a feature-processing step within graph neural networks to capture long-range dependencies and learn powerful node embeddings for classification Hard negative sampling: Leverage a graph diffusion network to generate hard negative samples for contrastive learning, enhancing the discrimination of predictive models | Non-parametric: Highly dependent on prior knowledge and parameter tuning Embedding learning integration: Model complexity; Lack interpretability Hard negative sampling: Training and computational complexity |
| Meta-path [45,46,47,95,96] | Construct a biological network; define meaningful meta-paths (e.g., Drug → Protein → Disease) or employ graph learning techniques to automatically weight and aggregate information from multiple paths; use meta-paths as features for machine learning model | Initial reliance on domain expertise to predefine relevant meta-paths |
| Semantic-based [2,49,50,51,52,53,97] | Construct a semantic network that integrates diverse biological data (e.g., drugs, targets, diseases) using ontologies and formal logic, use semantically meaningful elements (e.g., specific path patterns or subgraphs that represent well-defined biological relationships) to mine this network | Dependence on ontology completeness and data quality; High computational complexity of formal reasoning; Limited adaptability due to predefined semantic rules |
| Methods | Key Idea | Representative Methods | Prospects | Considerations |
|---|---|---|---|---|
| MF | Factorize drug-disease association matrix into a drug-feature matrix and a disease-feature matrix | DisDrugPred [56], MSBMF [59], SCMFDD [55], NTD-DR [60] | Interpretability | Cold-start problem, non-convex optimization |
| MC | Directly complete a low-rank matrix to approximate the known drug-disease association matrix | DRRS [54], OMC [57], DRIMC [58] | Alleviate cold-start problems | Computational complexity, dependence on similarity measures |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Mou, Z.; Tian, Z.; Jin, J.; Yu, H.; Huang, Y. Reviewing the Computational Landscape of Drug Repurposing: Evolution from Structure-Based Methods to LLM-Based Methods. Biomolecules 2026, 16, 830. https://doi.org/10.3390/biom16060830
Mou Z, Tian Z, Jin J, Yu H, Huang Y. Reviewing the Computational Landscape of Drug Repurposing: Evolution from Structure-Based Methods to LLM-Based Methods. Biomolecules. 2026; 16(6):830. https://doi.org/10.3390/biom16060830
Chicago/Turabian StyleMou, Zengyun, Zhiqing Tian, Jiaqi Jin, Heng Yu, and Yongzhen Huang. 2026. "Reviewing the Computational Landscape of Drug Repurposing: Evolution from Structure-Based Methods to LLM-Based Methods" Biomolecules 16, no. 6: 830. https://doi.org/10.3390/biom16060830
APA StyleMou, Z., Tian, Z., Jin, J., Yu, H., & Huang, Y. (2026). Reviewing the Computational Landscape of Drug Repurposing: Evolution from Structure-Based Methods to LLM-Based Methods. Biomolecules, 16(6), 830. https://doi.org/10.3390/biom16060830

