From In Silico Hypothesis to Validation: The Role of Real-World Evidence in the Preliminary Verification of AI-Generated Drug-Repositioning Candidates: A Comprehensive Review
Abstract
1. Introduction
2. Materials and Methods
3. Results
3.1. Alzheimer’s Disease
3.2. Parkinson’s Disease
3.3. COVID-19
3.4. Type 2 Diabetes
3.5. Neurological Diseases and Mental Disorders
3.6. Autoimmune Diseases
4. Discussion
4.1. Comparative Analysis of AI Methodologies and Validation Rigor
4.2. Methodological Synthesis
4.3. Limitations
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| ACEi | Angiotensin-Converting Enzyme inhibitors |
| AD | Alzheimer’s Disease |
| ADHD | Attention-Deficit/Hyperactivity Disorder |
| AI | Artificial Intelligence |
| ALS | Amyotrophic Lateral Sclerosis |
| AOR | Adjusted Odds Ratio |
| ARB | Angiotensin II Receptor Blockers |
| ARG | Alzheimer’s Risk Genes |
| ATE | Average Treatment Effect |
| ATSUD | Amphetamine-Type Stimulant Use Disorder |
| AUC | Area Under the curve |
| aSyn | Alpha-synuclein |
| BSL | Baseline |
| CD | Crohn’s Disease |
| CI | Confidence Interval |
| COX-2 | Cyclooxygenase-2 |
| DHP-CCB | Dihydropyridine Calcium Channel Blocker |
| DMF | Dimethyl Fumarate |
| DPP-4 | Dipeptidyl Peptidase-4 |
| ECT | Emulated Clinical Trials |
| EHR/EMR | Electronic Health Record/Electronic Medical Record |
| FDA | Food and Drug Administration |
| FMA | Frequentist Model Averaging |
| GReX | Gene Expression Profiling |
| GWAS | Genome-Wide Association Study |
| HbA1c | Glycated hemoglobin |
| HR | Hazard Ratio |
| IBD | Inflammatory Bowel Diseases |
| IMID | Immune-Mediated Inflammatory Diseases |
| IPTW | Inverse Probability of Treatment Weighting |
| IPW | Inverse Probability Weighting |
| LASSO | Least Absolute Shrinkage and Selection Operator |
| LTP | Long-Term Potentiation |
| MeSH | Medical Subject Headings |
| ML | Machine Learning |
| MPI | Modeling Path Importance |
| MS | Multiple Sclerosis |
| MSM | Marginal Structural Models |
| NLP | Natural Language Processing |
| OR | Odds Ratio |
| OUD | Opioid Use Disorders |
| PCA | Principal Component Analysis |
| PD | 1. Pharmacodynamic; 2. Parkinson’s Disease |
| PK | Pharmacokinetic |
| PSM | Propensity Score Matching |
| PSW | Propensity Score Weighting |
| RR | Relative Risk |
| RWE/RWD | Real-World Evidence/Real-World Data |
| SSL | Semi-Supervised Learning |
| T2DM | Type 2 Diabetes Mellitus |
| UC | Ulcerative Colitis |
| WHO | World Health Organization |
References
- Jang, G.; Lee, T.; Hwang, S.; Park, C.; Ahn, J.; Seo, S.; Hwang, Y.; Yoon, Y. PISTON: Predicting drug indications and side effects using topic modeling and natural language processing. J. Biomed. Inform. 2018, 87, 96–107. [Google Scholar] [CrossRef] [PubMed]
- Gori, D.N.P.; Alberca, L.N.; Talevi, A. Making the most effective use of available computational methods for drug repositioning. Expert Opin. Drug Discov. 2023, 18, 495–503. [Google Scholar] [CrossRef] [PubMed]
- Lee, S.; Jeon, S.; Kim, H.-S. A Study on Methodologies of Drug Repositioning Using Biomedical Big Data: A Focus on Diabetes Mellitus. Endocrinol. Metab. 2022, 37, 195–207. [Google Scholar] [CrossRef] [PubMed]
- Leite, M.L.; Costa, L.S.D.L.; Cunha, V.A.; Kreniski, V.; Filho, M.D.O.B.; da Cunha, N.B.; Costa, F.F. Artificial intelligence and the future of life sciences. Drug Discov. Today 2021, 26, 2515–2526. [Google Scholar] [CrossRef]
- Jang, D.; Lee, S.; Lee, J.; Kim, K.; Lee, D. Inferring new drug indications using the complementarity between clinical disease signatures and drug effects. J. Biomed. Inform. 2016, 59, 248–257. [Google Scholar] [CrossRef]
- Adamek, L.; Padiasek, G.; Zhang, C.; O’dWyer, I.; Capit, N.; Dormont, F.; Hernandez, R.; Bar-Joseph, Z.; Rufino, B. Identifying indications for novel drugs using electronic health records. Comput. Biol. Med. 2024, 183, 109158. [Google Scholar] [CrossRef]
- Denny, J.C.; Van Driest, S.L.; Wei, W.; Roden, D.M. The Influence of Big (Clinical) Data and Genomics on Precision Medicine and Drug Development. Clin. Pharmacol. Ther. 2017, 103, 409–418. [Google Scholar] [CrossRef]
- Long, J.M.; Holtzman, D.M. Alzheimer Disease: An Update on Pathobiology and Treatment Strategies. Cell 2019, 179, 312–339. [Google Scholar] [CrossRef]
- GBD 2016 Dementia Collaborators. Global, regional, and national burden of Alzheimer’s disease and other dementias, 1990–2016: A systematic analysis for the Global Burden of Disease Study 2016. Lancet Neurol. 2019, 18, 88–106. [Google Scholar] [CrossRef]
- Jeremic, D.; Navarro-Lopez, J.D.; Jimenez-Diaz, L. Clinical Benefits and Risks of Antiamyloid Antibodies in Sporadic Alzheimer Disease: Systematic Review and Network Meta-Analysis With a Web Application. J. Med. Internet Res. 2025, 27, e68454. [Google Scholar] [CrossRef]
- Taubes, A.; Nova, P.; Zalocusky, K.A.; Kosti, I.; Bicak, M.; Zilberter, M.Y.; Hao, Y.; Yoon, S.Y.; Oskotsky, T.; Pineda, S.; et al. Experimental and real-world evidence supporting the computational repurposing of bumetanide for APOE4-related Alzheimer’s disease. Nat. Aging 2021, 1, 932–947. [Google Scholar] [CrossRef]
- Fang, J.; Zhang, P.; Wang, Q.; Chiang, C.-W.; Zhou, Y.; Hou, Y.; Xu, J.; Chen, R.; Zhang, B.; Lewis, S.J.; et al. Artificial intelligence framework identifies candidate targets for drug repurposing in Alzheimer’s disease. Alzheimer’s Res. Ther. 2022, 14, 7. [Google Scholar] [CrossRef] [PubMed]
- Xiang, S.; Lawrence, P.J.; Peng, B.; Chiang, C.; Kim, D.; Shen, L.; Ning, X. Modeling Path Importance for Effective Alzheimer’s Disease Drug Repurposing. arXiv 2023, arXiv:2310.15211v2. [Google Scholar]
- Orlenko, A.; Venkatesan, M.; Shen, L.; Ritchie, M.D.; Wang, Z.P.; Obafemi-Ajayi, T.; Moore, J.H. Biologically Enhanced Machine Learning Model to uncover Novel Gene-Drug Targets for Alzheimer’s Disease. Pac. Symp. Biocomput. 2025, 30, 441–456. [Google Scholar] [CrossRef] [PubMed]
- Dorsey, E.R.; Bloem, B.R. The Parkinson Pandemic—A Call to Action. JAMA Neurol. 2018, 75, 9–10. [Google Scholar] [CrossRef] [PubMed]
- Athauda, D.; Foltynie, T. Drug Repurposing in Parkinson’s Disease. CNS Drugs 2018, 32, 747–761. [Google Scholar] [CrossRef] [PubMed]
- Visanji, N.P.; Madan, P.; Lacoste, A.M.B.; Buleje, I.; Han, Y.; Spangler, S.; Kalia, L.V.; Alford, S.H.; Marras, C. Using artificial intelligence to identify anti-hypertensives as possible disease modifying agents in Parkinson’s disease. Pharmacoepidemiol. Drug Saf. 2020, 30, 201–209. [Google Scholar] [CrossRef] [PubMed]
- Laifenfeld, D.; Yanover, C.; Ozery-Flato, M.; Shaham, O.; Rosen-Zvi, M.; Lev, N.; Goldschmidt, Y.; Grossman, I. Emulated Clinical Trials from Longitudinal Real-World Data Efficiently Identify Candidates for Neurological Disease Modification: Examples from Parkinson’s Disease. Front. Pharmacol. 2021, 12, 631584. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Gorenflo, M.P.; Gao, Z.; Davis, P.B.; Kaelber, D.; Xu, R. Amphetamine use and Parkinson’s disease: Integration of artificial intelligence prediction, clinical corroboration, and mechanism of action analyses. PLoS ONE 2025, 20, e0323761. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Wiersinga, W.J.; Rhodes, A.; Cheng, A.C.; Peacock, S.J.; Prescott, H.C. Pathophysiology, Transmission, Diagnosis, and Treatment of Coronavirus Disease 2019 (COVID-19): A Review. JAMA 2020, 324, 782–793. [Google Scholar] [CrossRef]
- World Health Organization. WHO Coronavirus (COVID-19) Dashboard. World Health Organization. Available online: https://data.who.int/dashboards/covid19/ (accessed on 29 November 2025).
- Voloudakis, G.; Lee, K.M.; Vicari, J.M.; Zhang, W.; Hoagland, D.; Venkatesh, S.; Bian, J.; Anyfantakis, M.; Wu, Z.; Rahman, S.; et al. A genetically based computational drug repurposing framework for rapid identification of candidate compounds: Application to COVID-19. medRxiv 2025, 2025, 25320348. [Google Scholar] [CrossRef]
- Nam, Y.; Lucas, A.; Yun, J.-S.; Lee, S.M.; Park, J.W.; Chen, Z.; Lee, B.; Ning, X.; Shen, L.; Verma, A.; et al. Development of complemented comprehensive networks for rapid screening of repurposable drugs applicable to new emerging disease outbreaks. J. Transl. Med. 2023, 21, 415. [Google Scholar] [CrossRef] [PubMed]
- Rahman, M.; Mahi, A.M.; Melamed, R.; Alam, M.A.U. Effects of Antidepressants on COVID-19 Outcomes: Retrospective Study on Large-Scale Electronic Health Record Data. Interact. J. Med. Res. 2023, 12, e39455. [Google Scholar] [CrossRef] [PubMed]
- Fico, G.; Isayeva, U.; De Prisco, M.; Oliva, V.; Solè, B.; Montejo, L.; Grande, I.; Arbelo, N.; Gomez-Ramiro, M.; Pintor, L.; et al. Psychotropic drug repurposing for COVID-19: A Systematic Review and Meta-Analysis. Eur. Neuropsychopharmacol. 2022, 66, 30–44. [Google Scholar] [CrossRef]
- Zheng, Y.; Ley, S.H.; Hu, F.B. Global aetiology and epidemiology of type 2 diabetes mellitus and its complications. Nat. Rev. Endocrinol. 2018, 14, 88–98. [Google Scholar] [CrossRef] [PubMed]
- Artasensi, A.; Pedretti, A.; Vistoli, G.; Fumagalli, L. Type 2 Diabetes Mellitus: A Review of Multi-Target Drugs. Molecules 2020, 25, 1987. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Koren, G.; Nordon, G.; Radinsky, K.; Shalev, V. Identification of repurposable drugs with beneficial effects on glucose control in type 2 diabetes using machine learning. Pharmacol. Res. Perspect. 2019, 7, e00529. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Brnabic, A.J.M.; Curtis, S.E.; Johnston, J.A.; Lo, A.; Zagar, A.J.; Lipkovich, I.; Kadziola, Z.; Murray, M.H.; Ryan, T. Incidence of type 2 diabetes, cardiovascular disease and chronic kidney disease in patients with multiple sclerosis initiating disease-modifying therapies: Retrospective cohort study using a frequentist model averaging statistical framework. PLoS ONE 2024, 19, e0300708. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Gao, Z.; Gorenflo, M.; Kaelber, D.C.; Monnier, V.M.; Xu, R. Drug repurposing for reducing the risk of cataract extraction in patients with diabetes mellitus: Integration of artificial intelligence-based drug prediction and clinical corroboration. Front. Pharmacol. 2023, 14, 1181711. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Mohs, R.C.; Greig, N.H. Drug discovery and development: Role of basic biological research. Alzheimers Dement. Transl. Res. Clin. Interv. 2017, 3, 651–657. [Google Scholar] [CrossRef]
- Gribkoff, V.K.; Kaczmarek, L.K. The need for new approaches in CNS drug discovery: Why drugs have failed, and what can be done to improve outcomes. Neuropharmacology 2017, 120, 11–19. [Google Scholar] [CrossRef] [PubMed]
- Paik, H.; Chung, A.-Y.; Park, H.-C.; Park, R.W.; Suk, K.; Kim, J.; Kim, H.; Lee, K.; Butte, A.J. Repurpose terbutaline sulfate for amyotrophic lateral sclerosis using electronic medical records. Sci. Rep. 2015, 5, 08580. [Google Scholar] [CrossRef] [PubMed]
- Toker, D.; Chiang, J.N.; Vespa, P.M.; Schnakers, C.; Monti, M.M. The Dipeptidyl Peptidase-4 Inhibitor Saxagliptin as a Candidate Treatment for Disorders of Consciousness: A Deep Learning and Retrospective Clinical Analysis. Neurocritical Care 2025, 43, 101–118. [Google Scholar] [CrossRef] [PubMed]
- Zhou, M.; Wang, Q.; Zheng, C.; Rush, A.J.; Volkow, N.D.; Xu, R. Drug repurposing for opioid use disorders: Integration of computational prediction, clinical corroboration, and mechanism of action analyses. Mol. Psychiatry 2021, 26, 5286–5296. [Google Scholar] [CrossRef]
- Gao, Z.; Winhusen, T.J.; Gorenflo, M.P.; Dorney, I.; Ghitza, U.E.; Kaelber, D.C.; Xu, R. Artificial intelligence-based drug repurposing with electronic health record clinical corroboration: A case for ketamine as a potential treatment for amphetamine-type stimulant use disorder. Addiction 2024, 120, 732–744. [Google Scholar] [CrossRef]
- Schett, G.; McInnes, I.B.; Neurath, M.F. Reframing Immune-Mediated Inflammatory Diseases through Signature Cytokine Hubs. N. Engl. J. Med. 2021, 385, 628–639. [Google Scholar] [CrossRef] [PubMed]
- Fugger, L.; Jensen, L.T.; Rossjohn, J. Challenges, Progress, and Prospects of Developing Therapies to Treat Autoimmune Diseases. Cell 2020, 181, 63–80. [Google Scholar] [CrossRef] [PubMed]
- Shakibfar, S.; Allin, K.H.; Jess, T.; Barbieri, M.A.; Battini, V.; Simoncic, E.; Kirchgesner, J.; Ulven, T.; Sessa, M. Drug Repurposing in Crohn’s Disease Using Danish Real-World Data. Pragmatic Obs. Res. 2024, 15, 17–29. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Bai, L.; Scott, M.K.D.; Steinberg, E.; Kalesinskas, L.; Habtezion, A.; Shah, N.H.; Khatri, P. Computational drug repositioning of atorvastatin for ulcerative colitis. J. Am. Med. Inform. Assoc. 2021, 28, 2325–2335. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Patrick, M.T.; Raja, K.; Miller, K.; Sotzen, J.; Gudjonsson, J.E.; Elder, J.T.; Tsoi, L.C. Drug Repurposing Prediction for Immune-Mediated Cutaneous Diseases using a Word-Embedding–Based Machine Learning Approach. J. Investig. Dermatol. 2019, 139, 683–691. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Mihelcic, J.R.; Barra, R.O.; Brooks, B.W.; Diamond, M.L.; Eckelman, M.J.; Gibson, J.M.; Guidotti, S.; Ikeda-Araki, A.; Kumar, M.; Maiga, Y.; et al. Celebrating the Success of Environmental Science & Technology Letters. Environ. Sci. Technol. Lett. 2023, 10, 961–962. [Google Scholar] [CrossRef]
- Kwon, H.; Ali, Z.A.; Wong, B.M. Harnessing Semi-Supervised Machine Learning to Automatically Predict Bioactivities of Per- and Polyfluoroalkyl Substances (PFASs). Environ. Sci. Technol. Lett. 2023, 10, 1017–1022. [Google Scholar] [CrossRef]
- Andronov, M.G.; Fedorov, M.V.; Sosnin, S. Exploring Chemical Reaction Space with Reaction Difference Fingerprints and Parametric t-SNE. ACS Omega 2021, 6, 30743–30751. [Google Scholar] [CrossRef]


| Study (Author/Year) | Applied Methodology and Approach | Identified Drugs/ Candidate Genes | Key Validation Results and Conclusions |
|---|---|---|---|
| Taubes et al. (2021) [11] | Precision medicine/Transcriptomic analysis: Focus on the APOE4 genotype; search for a drug reversing pathological gene expression (539-gene signature) | Bumetanide | In vivo (mice): Significant reduction in amyloid plaques (p < 0.05), restoration of LTP (p < 0.0001), and improvement in cognitive functions. RWE (humans > 65 y.o.): Reduction in AD incidence by 35–75% (statistically significant in two cohorts) |
| Fang et al. (2022) [12] | Artificial Intelligence (AI)/Network medicine 9: Bayesian algorithm integrating GWAS with multi-omic data | Pioglitazone, Febuxostat, Atenolol | Population: Pioglitazone significantly reduced AD risk vs. control (HR = 0.916) and vs. glipizide (HR = 0.921). In vitro: Pioglitazone reduces GSK3β and CDK5 kinase activity in microglia. |
| Xiang et al. (2023) [13] | MPI (Modeling Path Importance) method: Use of node representation learning (node embeddings) in interaction networks; comparison with the baseline (BSL) method | Nicotine, Etodolac (Harmful effect of trihexyphenidyl also detected) | Method comparison: MPI identified 20% more anti-AD drugs in the top 50 than BSL (including all 4 FDA-approved drugs). Validation (MarketScan): Nicotine (HR = 0.532) and etodolac (HR = 0.78) significantly reduced the risk of AD diagnosis. |
| Orlenko et al. (2025) [14] | Explainable ML/Epistatic analysis: XGBoost model, BitEpi algorithm, Propensity Score Matching (PSM), ADSP/AlzKB data | New genetic targets: CDC7, CDC42 | Prediction: Baseline model achieved 63.65% ROC AUC Discovery: Strong epistatic interactions identified in non-coding regions of new genes. The model based on CDC7 markers achieved 60.06% ROC AUC |
| Study (Author/Year) | Applied Methodology and Approach | Identified Drugs | Key Validation Results and Conclusions |
|---|---|---|---|
| Visanji et al. (2021) [17] | Computer analysis of literature (IBM Watson for Drug Discovery) to rank aSyn-inhibiting drugs + RWD verification based on IBM MarketScan (Cox models). | including ARB + DHP-CCB combination, ACEi + diuretics | A significant inverse relationship was demonstrated between the use of ARB and DHP-CCB combinations and PD diagnosis (HR = 0.55; p < 0.01) and between ACEi and diuretics (HR = 0.60; p < 0.01). Alpha-blockers increased the risk (HR = 1.81). |
| Laifenfeld et al. (2021) [18] | Framework for emulated clinical trials (ECT) on RWD from IBM MarketScan (n ≈ 120 million) and IBM Explorys (n > 60 million) databases. Causal inference (IPW, outcome models) was used. | Rasagiline, Zolpidem | Rasagiline significantly reduced the incidence of dementia by 7 percentage points (effect −0.07; p < 0.001 in both databases). Zolpidem reduced the risk of dementia compared to other psycholeptics. |
| Gorenflo et al. (2025) [19] | AI Knowledge Graph-Predict system (108,000 entities) + RWE verification in a retrospective cohort study based on TriNetX (patients with ADHD > 50 years of age). | Amphetamine | Significant reduction in PD risk: HR = 0.59 (2 years), HR = 0.55 (6 years). Strong effect in women (4th year HR = 0.24) and dose dependence (>5 mg: HR = 0.50). Eleven common signaling pathways confirmed. |
| Study (Author/Year) | Applied Methodology and Approach | Identified Drugs | Key Validation Results and Conclusions |
|---|---|---|---|
| Voloudakis et al. (2025) [22] |
Approach: Translational genomics/GReX.
Method: Integration of TWAS (17 tissues) with the LINCS L1000 signature library. | Imiquimod, Nelfinavir, Saquinavir, Everolimus, Azathioprine, Retinol, Nisoldipine. |
RWE Validation (VHA ~9 M): Confirmed the action of Azathioprine (OR = 0.69) and Retinol (OR = 0.81).
Conclusion: No drug was simultaneously confirmed in both RWE and in vitro models, highlighting the discrepancy between cellular and population models. |
| Rahman et al. (2023) [24] |
Approach: Causal Inference in RWE9.
Method: “Virtual clinical trial” using the N3C database (~12 M patients). Use of Propensity Score Weighting and Node2Vec embeddings. | Antidepressants (Analysis of a class of 16 drugs). |
RWE Validation: Demonstrated a complex impact of the drug class on hospitalization risk (ATE = −0.076 for the PSW method).
Conclusion: Identified drug subgroups with protective effects vs. those increasing risk, emphasizing the need for substance-level rather than class-level analysis. |
| Nam et al. (2023) [23] |
Approach: Network-based complementary linkage.
Method: Integration of a “backbone” knowledge network with new COVID-19 data. Graph-based semi-supervised learning (SSL). | 8 verified drugs: e.g., Methotrexate, Dexamethasone, Prednisolone, Simvastatin, Acetaminophen, Ibuprofen. |
RWE Validation (Penn Medicine ~160 k): 8 out of 30 (26.7%) AI-indicated candidates showed a significant association with clinical outcomes.
Conclusion: Successful identification of Dexamethasone validates the utility of network approaches even with incomplete pathogen data. |
| Fico et al. (2022) [25] |
Approach: Systematic Review and Meta-analysis.
Method: Synthesis of 29 studies: computational (in silico), preclinical, and clinical (observational/RWE). | Fluvoxamine (positive), Antipsychotics (negative). |
Quantitative Results: Fluvoxamine was linked to reduced mortality (OR = 0.15). Antipsychotics were linked to an increased risk of severe course (RR = 3.66).
Conclusion: Combining computational and observational data allows for identifying specific effective molecules (like fluvoxamine) despite the lack of a class-wide effect. |
| Study (Author/Year) | Applied Methodology and Approach | Identified Drugs | Key Validation Results and Conclusions |
|---|---|---|---|
| Koren et al. (2019) [28] | ML (decision trees, neural networks) on Big Data (Maccabi Health Services) + propensity score matching (PSM). | Alpha-1 receptor antagonists (e.g., tamsulosin, doxazosin). | Significantly higher therapeutic success rate (HbA1c < 6.5%) in the group treated with alpha-1 antagonists (61%) vs. control (53%) (p < 0.0004). |
| Brnabic et al. (2024) [29] | Causal inference (FMA) and ML on RWD (MarketScan) data. Cohort analysis of patients with MS. | Dimethyl fumarate (DMF) | DMF vs. teriflunomide: lower risk of developing type 2 diabetes (rHR = 0.65), heart attack (rHR = 0.59), and chronic kidney disease (rHR = 0.52). |
| Gao et al. (2023) [30] | KG-Predict AI system (knowledge graphs) + clinical verification of real-world evidence on the TriNetX platform (approx. 800,000 patients). | Aspirin, melatonin, ibuprofen, acetylcysteine. | Reduction of the risk of cataract surgery in diabetics. Aspirin (T2DM, 5 years): HR = 0.72. Melatonin (hyperglycemia): HR = 0.61. Acetylcysteine effective in T2DM (HR = 0.65) and hyperglycemia. |
| Study (Author/Year) | Applied Methodology and Approach | Identified Drugs | Key Validation Results and Conclusions |
|---|---|---|---|
| Paik et al. (2015) [33] | ClinDR: Algorithm integrating genomic data with “clinical signatures” (changes in lab results) from EMRs of 530,000 patients. Analysis of a bipartite graph of drug-disease associations. | Terbutaline sulfate |
|
| Toker et al. (2025) [34] | Deep Learning: Neural networks analyzing molecular structure (fingerprints). RWE: Retrospective cohort analysis of 4047 patients in a coma (UCLA Health database). | Saxagliptin |
|
| Zhou et al. (2021) [35] | DSEG: Predictive system based on a side effect-gene network and protein interactions. RWE: Validation based on IBM Watson Health (72.9 million patients). | Tramadol, Olanzapine, Mirtazapine, Bupropion, Atomoxetine |
|
| Gao et al. (2024) [36] | KG-Predict: Knowledge Graph model combined with Deep Learning. RWE: Validation in the TriNetX network (>100 million patients) using Target Trial Emulation. | Ketamine |
|
| Study (Author/Year) | Applied Methodology and Approach | Identified Drugs | Key Validation Results and Conclusions |
|---|---|---|---|
| Shakibfar et al. (2024) [39] | An approach based on machine learning (ML) and RWD analysis from Danish medical registries (population n = 9179). LASSO regression was used for feature selection and structural models (MSM) with IPTW weighting. | including fluticasone, fexofenadine, montelukast, glucosamine, glycerol triazotate, clopidogrel | Time-dependent analysis showed a significant reduction in the risk of surgery (surrogate for fibrosis). Sensitivity analysis confirmed strong protective evidence, particularly for fluticasone and glucosamine. |
| Bai et al. (2021) [40] | Multicohort transcriptomic analysis (definition of the UC gene signature) combined with clinical verification of RWE in two independent databases (STARR and Optum). | Atorvastatin | A significant association with reduced risk of colectomy was confirmed. In the STARR cohort (n = 827), the HR was 0.47 (p = 0.03), and in the Optum cohort (n = 7821), the HR was 0.66 (p = 0.03). |
| Patrick et al. (2019) [41] | NLP technique (word embedding) on >20 million PubMed abstracts for training the PLS-DA classification model. In silico validation on independent RNA-seq data. | including budesonide, hydroxychloroquine, leflunomide (for psoriasis) | The model achieved high predictive accuracy (AUROC = 0.93). Molecular drug targets showed significant enrichment in genes with altered expression (p < 1 × 10−6), validating the potential of NLP methods. |
| Evidence Tier | AI Methodology | RWE Validation Quality | Key Examples |
|---|---|---|---|
| Tier 1 (High) | Knowledge Graphs/Deep Learning | Causal inference (ECT, IPW, TTE), n > 1 M, multi-database validation | Laifenfeld et al. [18], Rahman et al. [24] |
| Tier 2 (Moderate) | Network analysis/Transcriptomics | Matched cohort studies (PSM/IPTW), n > 10 k | Taubes et al. [11], Koren et al. [28] |
| Tier 3 (Emerging) | NLP/Semantic Mining | Association studies, localized EMRs, n < 10 k | Patrick et al. [41], Bai et al. [40] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Gałuszewski, M.; Olszewski, J.; Jankowska, K.; Wójcik, K.; Bielecka-Wajdman, A. From In Silico Hypothesis to Validation: The Role of Real-World Evidence in the Preliminary Verification of AI-Generated Drug-Repositioning Candidates: A Comprehensive Review. J. Clin. Med. 2026, 15, 2801. https://doi.org/10.3390/jcm15072801
Gałuszewski M, Olszewski J, Jankowska K, Wójcik K, Bielecka-Wajdman A. From In Silico Hypothesis to Validation: The Role of Real-World Evidence in the Preliminary Verification of AI-Generated Drug-Repositioning Candidates: A Comprehensive Review. Journal of Clinical Medicine. 2026; 15(7):2801. https://doi.org/10.3390/jcm15072801
Chicago/Turabian StyleGałuszewski, Michał, Jan Olszewski, Karolina Jankowska, Krzysztof Wójcik, and Anna Bielecka-Wajdman. 2026. "From In Silico Hypothesis to Validation: The Role of Real-World Evidence in the Preliminary Verification of AI-Generated Drug-Repositioning Candidates: A Comprehensive Review" Journal of Clinical Medicine 15, no. 7: 2801. https://doi.org/10.3390/jcm15072801
APA StyleGałuszewski, M., Olszewski, J., Jankowska, K., Wójcik, K., & Bielecka-Wajdman, A. (2026). From In Silico Hypothesis to Validation: The Role of Real-World Evidence in the Preliminary Verification of AI-Generated Drug-Repositioning Candidates: A Comprehensive Review. Journal of Clinical Medicine, 15(7), 2801. https://doi.org/10.3390/jcm15072801

