Next Article in Journal
The Functional State of Thermoplasma acidophilum Pyruvate Kinase Relies on an Extra Carboxyl-Terminal Sequence
Previous Article in Journal
Anti-HIV and Antimicrobial Activity of 7-Hydrazino-8-hydroxyquinoline-Based Aromatic Hydrazones
Previous Article in Special Issue
Uridine, a Therapeutic Nucleoside, Exacerbates Alcoholic Liver Disease via SRC Kinase Activation: A Network Toxicology and Molecular Dynamics Perspective
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Machine Learning-Based Prediction of Rule Violations for Drug-Likeness Assessment in Peptide Molecules Using Random Forest Models

Medical College, Medical University of Varna, 84 Tzar Osvoboditel Str., 9002 Varna, Bulgaria
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2025, 26(17), 8407; https://doi.org/10.3390/ijms26178407
Submission received: 18 July 2025 / Revised: 22 August 2025 / Accepted: 28 August 2025 / Published: 29 August 2025
(This article belongs to the Special Issue Network Pharmacology: An Emerging Field in Drug Discovery)

Abstract

Peptide therapeutics often fall outside classical small-molecule heuristics, such as Lipinski’s Rule of Five (Ro5), motivating the development of adapted filters and data-driven approaches to early drug-likeness assessment. We curated >300 k drug (small and peptide) and non-drug molecules from PubChem, extracted key molecular descriptors with RDKit, and generated three rule-violation counters for Ro5, the peptide-oriented beyond-Ro5 (bRo5) extension, and Muegge’s criteria. Random Forest (RF) classifier and regressor models (with 10, 20, and 30 trees) were trained and evaluated. Predictions for 26 peptide test molecules were compared with those from SwissADME, Molinspiration, and manual calculations. Model metrics were uniformly high (Ro5 accuracy/precision/recall = 1.0; Muegge ≈ 0.99), indicating effective learning. Ro5 violation counts matched reference values for 23/26 peptides; the remaining cases differed by +1 violation, reflecting larger structures and platform limits. bRo5 predictions showed near-complete agreement with manual values; minor discrepancies occurred in isolated peptides. Muegge’s predictions were internally consistent but tended to underestimate SwissADME by ~1 violation in several molecules. Four peptides (ML13–16) satisfied bRo5 boundaries; three also fully met Ro5. RF models thus provide fast and reliable in silico filters for peptide drug-likeness and can support the prioritisation of orally developable candidates.
Keywords: Random Forest; machine learning; drug-likeness; peptides; Lipinski Rule of Five; beyond Rule of Five; Muegge’s rule; oral bioavailability Random Forest; machine learning; drug-likeness; peptides; Lipinski Rule of Five; beyond Rule of Five; Muegge’s rule; oral bioavailability

Share and Cite

MDPI and ACS Style

Lambev, M.; Dimitrova, D.; Mihaylova, S. Machine Learning-Based Prediction of Rule Violations for Drug-Likeness Assessment in Peptide Molecules Using Random Forest Models. Int. J. Mol. Sci. 2025, 26, 8407. https://doi.org/10.3390/ijms26178407

AMA Style

Lambev M, Dimitrova D, Mihaylova S. Machine Learning-Based Prediction of Rule Violations for Drug-Likeness Assessment in Peptide Molecules Using Random Forest Models. International Journal of Molecular Sciences. 2025; 26(17):8407. https://doi.org/10.3390/ijms26178407

Chicago/Turabian Style

Lambev, Momchil, Dimana Dimitrova, and Silviya Mihaylova. 2025. "Machine Learning-Based Prediction of Rule Violations for Drug-Likeness Assessment in Peptide Molecules Using Random Forest Models" International Journal of Molecular Sciences 26, no. 17: 8407. https://doi.org/10.3390/ijms26178407

APA Style

Lambev, M., Dimitrova, D., & Mihaylova, S. (2025). Machine Learning-Based Prediction of Rule Violations for Drug-Likeness Assessment in Peptide Molecules Using Random Forest Models. International Journal of Molecular Sciences, 26(17), 8407. https://doi.org/10.3390/ijms26178407

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop