Next Article in Journal
Nanodiamonds Co-Localize with Mycobacterium tuberculosis in Foamy Macrophages of Infected Mouse Lungs
Previous Article in Journal
Intranasal Immunization with Recombinant Hemagglutinin of Influenza A/H5 Virus Complexed with Novochizol Induces Virus-Neutralizing Antibodies and Protects Animals from Lethal Viral Challenge
Previous Article in Special Issue
Explainable Artificial Intelligence: A Perspective on Drug Discovery
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Predicting Blood–Brain Barrier Permeability from Experimental Data: An Interpretable and Externally Validated Machine Learning Framework

by
Saurabh Tiwari
1,*,
Katarzyna Mądra-Gackowska
2,
Marcin Gackowski
3,
Nokeun Park
1,* and
Łukasz Szeleszczuk
4
1
School of Materials Science and Engineering, Yeungnam University, Gyeongsan 38541, Republic of Korea
2
Department of Geriatrics, Ludwik Rydygier Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Torun, 9 Skłodowskiej Curie Str., 85-094 Bydgoszcz, Poland
3
Department of Toxicology and Bromatology, Ludwik Rydygier Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Torun, 2 Jurasza Str., 85-089 Bydgoszcz, Poland
4
Department of Organic and Physical Chemistry, Medical University of Warsaw, 1 Banacha Str., 02-097 Warsaw, Poland
*
Authors to whom correspondence should be addressed.
Pharmaceutics 2026, 18(6), 670; https://doi.org/10.3390/pharmaceutics18060670 (registering DOI)
Submission received: 24 April 2026 / Revised: 20 May 2026 / Accepted: 25 May 2026 / Published: 28 May 2026
(This article belongs to the Special Issue Recent Advances in Drug Delivery Using AI and Machine Learning)

Abstract

Background: The blood–brain barrier (BBB), which restricts the brain penetration of most small molecules and almost all biologics, continues to be a significant hurdle in the development of drugs for the central nervous system (CNS). During early-stage screening, a reliable computational prediction of BBB permeability, typically expressed as log BB, can help reduce the experimental load. Methods: We provide a well-validated machine learning system created solely using the B3DB experimental database, which includes 7807 chemicals with BBB+/BBB annotations and 1058 compounds with in vivo log BB values. Using the Mordred library, a carefully selected set of 40 two-dimensional chemical descriptors was calculated from SMILES notation without the use of artificial data augmentation. Stratified five-fold cross-validation was used to comprehensively benchmark the nine methods used in this study. Results: On a held-out test set (n = 212), gradient boosting produced the greatest regression performance, with R2 = 0.6043, RMSE = 0.4740 log units, and MAE = 0.3326, which is in line with the upper range recorded for experimental BBB datasets. On an internal test set (n = 1562), the corresponding classifier obtained an AUC-ROC of 0.9476 and a balanced accuracy of 0.8568; on an independent external validation set (n = 175), it achieved an AUC-ROC of 0.9137. Topological polar surface area was found by SHAP analysis to be the primary factor influencing BBB permeability, with lipophilicity and ionization-related characteristics being the second and third most important factors, respectively. Nonlinear relationships in accordance with accepted pharmacokinetic principles were validated using partial dependence analysis. Conclusion: This study provides a reliable technique for predicting BBB permeability in CNS drug discovery.
Keywords: blood–brain barrier; B3DB; gradient boosting; SHAP interpretability; machine learning; CNS drug design; mordred descriptors blood–brain barrier; B3DB; gradient boosting; SHAP interpretability; machine learning; CNS drug design; mordred descriptors
Graphical Abstract

Share and Cite

MDPI and ACS Style

Tiwari, S.; Mądra-Gackowska, K.; Gackowski, M.; Park, N.; Szeleszczuk, Ł. Predicting Blood–Brain Barrier Permeability from Experimental Data: An Interpretable and Externally Validated Machine Learning Framework. Pharmaceutics 2026, 18, 670. https://doi.org/10.3390/pharmaceutics18060670

AMA Style

Tiwari S, Mądra-Gackowska K, Gackowski M, Park N, Szeleszczuk Ł. Predicting Blood–Brain Barrier Permeability from Experimental Data: An Interpretable and Externally Validated Machine Learning Framework. Pharmaceutics. 2026; 18(6):670. https://doi.org/10.3390/pharmaceutics18060670

Chicago/Turabian Style

Tiwari, Saurabh, Katarzyna Mądra-Gackowska, Marcin Gackowski, Nokeun Park, and Łukasz Szeleszczuk. 2026. "Predicting Blood–Brain Barrier Permeability from Experimental Data: An Interpretable and Externally Validated Machine Learning Framework" Pharmaceutics 18, no. 6: 670. https://doi.org/10.3390/pharmaceutics18060670

APA Style

Tiwari, S., Mądra-Gackowska, K., Gackowski, M., Park, N., & Szeleszczuk, Ł. (2026). Predicting Blood–Brain Barrier Permeability from Experimental Data: An Interpretable and Externally Validated Machine Learning Framework. Pharmaceutics, 18(6), 670. https://doi.org/10.3390/pharmaceutics18060670

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop