Machine Learning Framework for Multi-Endpoint Quantum Dot Toxicity Prediction with Organoid Validation and Drug Target Discovery

Yang, Jiafu; Hu, Dayu; Xing, Pengcheng; Zhang, Yikai; Ye, Zongjian; Liu, Kehan; Xia, Jieyi; He, Jing; Qian, Yijing; Wu, Tianshu

doi:10.3390/toxics13110967

Open AccessArticle

Machine Learning Framework for Multi-Endpoint Quantum Dot Toxicity Prediction with Organoid Validation and Drug Target Discovery

by

Jiafu Yang

,

Dayu Hu

,

Pengcheng Xing

,

Yikai Zhang

,

Zongjian Ye

,

Kehan Liu

,

Jieyi Xia

,

Jing He

,

Yijing Qian

and

Tianshu Wu

^*

Key Laboratory of Environmental Medicine and Engineering of Ministry of Education, School of Public Health, Southeast University, Nanjing 210009, China

^*

Author to whom correspondence should be addressed.

Toxics 2025, 13(11), 967; https://doi.org/10.3390/toxics13110967

Submission received: 9 October 2025 / Revised: 5 November 2025 / Accepted: 7 November 2025 / Published: 10 November 2025

(This article belongs to the Section Human Toxicology and Epidemiology)

Download

Browse Figures

Versions Notes

Abstract

Quantum dots (QDs) possess unique optical and electronic properties, enabling wide applications in biomedicine and optoelectronics, but their nanoscale size and surface chemistry could pose potential toxicity risks. This study established a systematic, multi-endpoint framework for QD toxicity assessment. Physicochemical properties of various QDs and their multiple toxicity endpoints, including cell death, inflammation, and oxidative stress, were collected to build machine learning models (RF, XGBoost, KNN, SVM). The predictive toxic effects were then validated based on the brain organoid. Shapley Additive exPlanations (SHAP) analysis revealed that exposure dose and particle size were key cross-model drivers, while zeta potential and optical properties differentially affected specific toxicity endpoints. Integration of GEO-derived differentially expressed genes with protein–protein interaction networks and molecular docking showed that the proteasome inhibitor Carfilzomib is an efficient interventive drug because of its strongest binding to core targets. In this study, the framework of prediction, validation and intervention effectively evaluated multi-endpoint QD toxicity and provided a systematic approach for safety assessments and strategy developments of nanomaterials.

Keywords:

nanotoxicology; artificial intelligence; molecular docking; network pharmacology; organoids

Graphical Abstract

1. Introduction

Quantum Dots (QDs) are semiconductor nanomaterials exhibiting quantum confinement effects [1]. They were first synthesized in a glass matrix by Soviet physicist Alexei Ekimov in 1981. Typically, the size of QDs ranges from 1 to 10 nm [2]. Due to the three-dimensional spatial confinement of electron and hole movements, they form discrete, atom-like energy level structures, often referred to as “artificial atoms” [3,4,5]. Since the 1990s, QDs have demonstrated broad application prospects in fields such as fluorescence labeling, biological imaging, optoelectronic devices, and medical diagnostics, owing to their unique optical and electronic properties [6,7,8,9,10]. In terms of the market, the demand for QDs continues to grow. For example, the global QD healthcare market was estimated at USD 4 billion in 2021 and is projected to reach USD 8.6 billion over the next five years [11].

However, the nanoscale dimensions and surface chemical properties of QDs also pose potential biotoxicity risks. Numerous studies have indicated that exposure to QDs may trigger key toxicity endpoints, including oxidative stress, inflammatory responses, and cell death [12,13,14,15,16], For example, nitrogen-doped graphene quantum dots can disrupt intracellular calcium homeostasis by activating two calcium channels, thereby inducing ferroptosis and inflammatory responses in mouse hippocampal tissues and cultured microglial cells [14]. Research by Liang, Chen, and others has revealed that exposure to QDs can cause nervous system damage in Caenorhabditis elegans, accompanied by abnormal protein aggregation and neuronal injury [12,17]. Furthermore, exposure to QDs through various routes (e.g., skin contact, inhalation, gavage, and intravenous injection) may pose potential health hazards to organisms, including neurotoxicity, pulmonary toxicity, renal toxicity, hepatotoxicity, and other health risks [15,18,19,20,21,22].

The toxicity of QDs is complexly regulated by multiple factors. Previous experimental studies have shown that exposure dose, duration, and particularly physicochemical properties play crucial roles [23,24,25]. The composition of QDs, such as core materials (e.g., cadmium, selenium, zinc, graphene) [26,27,28,29], as well as their shape, particle size, charge, surface modification, and solubility, collectively influence their interactions with biomolecules and thereby determine their biological effects [24,30,31,32]. In recent years, with the advancement of big data and artificial intelligence, machine learning methods have been introduced into nanotoxicology research. By integrating the physicochemical properties of QDs with existing toxicity data, these methods can predict multi-endpoint toxicity and identify potential key influencing factors, significantly enhancing the efficiency and accuracy of toxicity assessment [33,34,35].

As a prevalent technique, machine learning (ML) plays a more significant role in various domains and diseases [36,37,38]. Despite the potential of machine learning in nanotoxicology, most current studies focus on single toxicity endpoints and lack experimental validation [39,40,41,42,43]. Existing in vitro and in vivo toxicity studies are often fragmented and have high experimental costs, making systematic predictions challenging. Given the potential biotoxicity risks associated with QDs, conducting systematic and comprehensive toxicity assessments is crucial for ensuring their safety in biomedical and industrial applications. Therefore, this study collected physicochemical properties and multiple toxicity endpoint indicators of QDs from existing research data to construct a machine learning-based multi-endpoint prediction system. The prediction results were validated through organoid experiments. Additionally, potential target genes were screened using GEO data, and network pharmacology analysis was conducted to identify potential intervention drugs, followed by molecular docking analysis to evaluate potential pharmacological intervention strategies (Figure 1). This study established a multi-module integrated framework of prediction–validation–intervention, providing a systematic approach for QD toxicity assessment and intervention. It also offers theoretical foundations and practical guidance for developing potential interventions against QD-induced damage, holding significant scientific research value and application prospects.

2. Materials and Methods

2.1. Data Collection, Imputation, and Multi-Model Training

Data on the physicochemical properties and toxicological outcomes of QDs were collected from 102 published reports. After screening for completeness and quality, 40 studies were finally included, providing 306 valid records from the 646 initially extracted. To control data quality, we set a maximum missing rate of 40% for each variable. This threshold was chosen as a balance between data completeness and reliability, since a stricter criterion would greatly reduce the dataset size, while a looser one could introduce excessive uncertainty. Samples or variables exceeding this threshold were excluded. For the remaining data, missing feature values were imputed using the K-nearest neighbors (KNN, k = 5) method to preserve the overall structure and distribution. Because the three biological endpoints (cell viability/death, inflammation, and oxidative stress) were binary, samples with known outcomes were used to train a Random Forest classifier, which was then applied to predict missing outcome values. The final dataset included ten physicochemical and exposure-related features (Size (H₂O-DLS), Size (TEM), Excitation peak, Emission peak, Zeta potential, Exposure duration, Exposure dose, Dose unit, Exposure route, and Post-exposure period) and three toxicity outcomes (Cell viability/death, Inflammation, and Oxidative stress).

The complete dataset was divided into training and testing sets using a 70/30 stratified split to maintain balanced class proportions. Given the limited sample size, we did not perform repeated random splits or multi-fold cross-validation for overall model evaluation. Instead, 3-fold cross-validation within the RandomizedSearchCV framework was used for hyperparameter tuning to ensure that model selection remained stable across folds. Seven supervised machine learning algorithms were then trained and compared, including Random Forest (RF), Extreme Gradient Boosting (XGBoost), K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Naive Bayes (NB), Logistic Regression (LR), and Multi-Layer Perceptron (MLP). The complete data processing and modeling workflow is summarized in Figure 2, and all features and outcome variables used in the analysis are listed in Table S1.

2.2. Hyperparameter Tuning of Machine Learning Models

2.2.1. K-Nearest Neighbors

For the KNN model, hyperparameters including the number of neighbors, weighting scheme, and distance metric were optimized using RandomizedSearchCV with 3-fold cross-validation, with ROC-AUC as the evaluation metric.

2.2.2. Logistic Regression

For the LR model, hyperparameter optimization was performed using RandomizedSearchCV with 3-fold cross-validation. The search space included the regularization strength C (0.01–10, continuous range), penalty type (L1 or L2), and solver (liblinear or lbfgs). Model performance was evaluated using the ROC-AUC score, and the optimal parameters were selected accordingly.

2.2.3. Naive Bayes

For the NB model, we used the GaussianNB algorithm. Since it has few hyperparameters, with var_smoothing being the main one, we tuned it using 3-fold cross-validation and selected the setting that achieved the best ROC-AUC performance.

2.2.4. Random Forest

For the RF model, hyperparameter optimization was performed using RandomizedSearchCV with 3-fold cross-validation. The search space included the number of trees (50–200), maximum depth (3–15), minimum samples for split (2–10), and minimum samples per leaf (1–5). The optimal model was selected based on the best ROC-AUC score.

2.2.5. Support Vector Machine

For the SVM model, hyperparameter optimization was carried out using RandomizedSearchCV with 3-fold cross-validation. The search space covered the regularization parameter C (0.1–10) and the kernel coefficient gamma (exponential distribution, scale = 0.1), with the RBF kernel fixed. The optimal configuration was determined based on the highest ROC-AUC score.

2.2.6. Extreme Gradient Boosting

For the XGBoost model, hyperparameter optimization was conducted using RandomizedSearchCV with 3-fold cross-validation. The search space included the number of trees, maximum depth, learning rate, subsample ratio, column sampling ratio, and gamma. The optimal model was selected based on the highest ROC-AUC score and was subsequently used for prediction and feature importance analysis.

2.2.7. Multi-Layer Perceptron

For the MLP model, hyperparameter optimization was conducted using RandomizedSearchCV with 3-fold cross-validation. The search space included the hidden layer structure, activation function, regularization strength (alpha), and initial learning rate. The optimal model was selected based on the highest ROC-AUC score and was subsequently used for prediction and feature importance analysis.

2.3. Machine Learning Model Evaluation

A standardized workflow was applied for all machine learning models in this study. For each outcome, the dataset was stratified to maintain the original class distribution and then split into training and test sets. Continuous features were standardized to remove scale differences, while categorical features were one-hot encoded to allow model input. Each model was trained on the training set, and hyperparameters were optimized using randomized search combined with cross-validation to select the best-performing configuration. Model performance was then evaluated on the test set using a comprehensive set of metrics, including accuracy, sensitivity, specificity, precision, F1 score, ROC-AUC, and PR-AUC, providing a robust assessment of predictive performance.

2.4. SHAP Feature Importance Analysis

SHAP values were calculated for all seven machine learning models to interpret their predictions. The training data were used as the background for computing SHAP values on the test set, ensuring that feature contributions were assessed relative to the original data distribution. For each feature, the mean absolute SHAP value was computed to quantify its impact, with continuous and categorical features handled appropriately. The results were visualized as bar plots, highlighting the relative importance of each feature for the different outcomes.

2.5. The Physicochemical Characterizations of QDs

CQDs, N-GQDs, and CdTe QDs were purchased from XFNANO Materials Tech Co., Ltd. (Nanjing, China; http://www.xfnano.com, accessed on 11 October 2024) and characterized for their physicochemical properties prior to use. The structural morphology of the QDs was examined by high-resolution transmission electron microscopy (HR-TEM, JEM-2100, JEOL, Tokyo, Japan), while their fluorescence lifetimes were measured using a time-resolved fluorescence spectrometer (Edinburgh FLSP980, UK). Hydrodynamic size and surface ζ-potential were determined with a Malvern Zetasizer Nano ZS (Zetasizer Nano-ZS90, Malvern Instruments, Worcestershire, UK). A summary of the physicochemical properties of the CQDs is provided in Table S2.

2.6. Generation and Quantum Dot Exposure of Human Brain Organoids

Human brain organoids were generated from human embryonic stem cells (H9 line) through stepwise induction. Briefly, cells were maintained in Essential 8 medium, and when they reached approximately 70% confluency, they were dissociated with Dispase to form embryoid bodies (EBs). The EBs were cultured in neural induction medium (NIM) supplemented with SB431542 and DMH1 for 7 days, then embedded in Matrigel to promote neuroepithelial formation, followed by culture in NIM without small-molecule inhibitors. Over time, the organoids gradually developed ventricular-like structures and neuroepithelial features, and by day 30, they exhibited stable morphological and differentiation characteristics and could be maintained in long-term culture. To evaluate the toxic effects of quantum dots, organoids at days 10, 20, and 30 were exposed for 24 h, after which lactate dehydrogenase (LDH) release and malondialdehyde (MDA) levels were measured to assess cell damage and oxidative stress. The assays for LDH and MDA were performed according to the manufacturer’s instructions (Solarbio, Beijing, China).

2.7. Identification and Functional Analysis of Potential Target Genes in Response to QD Exposure

Differentially expressed genes (DEGs) were obtained from GEO datasets and intersected to identify potential target genes. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses of these potential targets were performed using the Metascape platform (https://metascape.org, accessed on 26 August 2025). Protein–protein interaction (PPI) networks were constructed and analyzed using Cytoscape 3.10 to identify key hub genes. Six core target genes were selected based on network topology, and GO and KEGG enrichment analyses were further conducted on these core genes using Metascape.

2.8. Screening of Potential Therapeutics for QD-Induced Toxicity

Based on the six key target genes identified in this study, potential therapeutic agents were screened using the DGIdb Drug Targets 2024 dataset and the MAGMA Drugs and Diseases dataset via the Enrichr platform. Selected candidate drugs were subjected to molecular docking with the target proteins, and binding free energies were calculated using AutoDock Vina 1.2.7. Protein–ligand interactions, including hydrogen bond numbers and interaction patterns, were analyzed using the Protein–Ligand Interaction Profiler (PLIP) platform. Three-dimensional visualization of the complexes and annotation of binding sites, hydrogen bonds, and binding energies were performed using PyMol. It should be noted that AutoDock Vina 1.2.7 does not support boron parameterization; therefore, for Bortezomib, the boron atom was replaced with a carbon atom during docking. This approach is scientifically reasonable, as it does not significantly alter the overall molecular scaffold or binding mode, allowing the docking process to proceed smoothly and providing a reasonable prediction of ligand–protein binding trends.

2.9. Data Analysis and Statistics

Data are expressed as mean ± standard deviation (SD). Statistical analyses were performed using GraphPad Prism 9.0. Differences among groups were evaluated by one-way ANOVA, followed by Dunnett’s post hoc test for multiple comparisons. A p-value < 0.05 was considered statistically significant, with p-value < 0.01 and p-value < 0.001 indicating high and very high significance, respectively.

3. Results

3.1. Performance Comparison of Seven Machine Learning Models in Predicting Quantum Dot-Induced Cell Viability, Inflammation, and Oxidative Stress

In this study, we systematically compared the performance of seven mainstream machine learning models (RF, KNN, XGBoost, SVM, LR, NB, and MLP) in predicting three toxicity endpoints induced by quantum dots: cell death, inflammatory response, and oxidative stress (Table S3, Figure 3). For the prediction of cell viability and cell death, the RF model demonstrated the best performance, achieving an accuracy of 0.837, an F1 score of 0.854, an ROC-AUC of 0.928 (Figure 3c), and a PR-AUC of 0.947 (Figure 3d), indicating a high overall predictive capability. The KNN, XGBoost, and SVM models also exhibited good performance, with F1 scores of 0.816, 0.835, and 0.788, respectively, although there was a trade-off between sensitivity and specificity. For example, the NB model had a sensitivity of only 0.196 but a specificity as high as 0.927, suggesting its tendency to predict negative samples. Overall, RF and XGBoost outperformed the other models in terms of comprehensive performance metrics.

In the prediction of inflammatory response (Table S3, Figure 3), the XGBoost model performed the best, achieving an accuracy of 0.913, an F1 score of 0.949, a PR-AUC of 0.991, and an ROC-AUC of 0.956. The SVM and RF models also performed well, with F1 scores of 0.945 and 0.938, respectively. However, the NB model showed significantly lower accuracy and F1 score (0.337 and 0.358, respectively), indicating its insufficient ability to predict positive samples. Notably, the sensitivity of most models was close to or reached 1, suggesting their good ability to capture positive inflammatory samples, but the specificity metrics were relatively low, indicating limited predictive capability for negative samples.

In the prediction of oxidative stress (Table S3, Figure 3), the RF and XGBoost models also performed outstandingly, both achieving an accuracy of 0.902, with F1 scores of 0.942 and 0.943, respectively, and PR-AUCs of 0.966 and 0.962, respectively, demonstrating high discriminative ability between positive and negative samples. The KNN model followed closely, while the NB model, although exhibiting high specificity (0.941), had a sensitivity of only 0.16, resulting in poor overall performance. Overall, the RF and XGBoost models maintained high robustness in predicting the three toxicity endpoints, particularly excelling in capturing positive samples.

Comprehensive analysis revealed that the RF and XGBoost models demonstrated the best overall performance in predicting cell death, inflammatory response, and oxidative stress, leading in terms of accuracy, F1 score, ROC-AUC, and PR-AUC. The KNN and SVM models performed secondarily, while the MLP model showed moderate performance. Although the NB model exhibited performance in some metrics, its overall accuracy and F1 score were significantly lower, suggesting its unsuitability for comprehensive toxicity prediction in this dataset.

3.2. SHAP Analysis of the Physicochemical Properties and Exposure Conditions of Quantum Dots Under Different Toxicological Outcomes

In this section, we conducted SHAP value analysis on the relevant features of each model (Figure 4 and Figure S1). In the prediction of cell death induced by quantum dots (Figure 4a), the SHAP value analyses of the three best-performing models (KNN, RF, and XGBoost) all indicated that characterization parameters and exposure conditions jointly drove the prediction results. Specifically, the KNN model revealed that Zeta potential, emission peak, and hydrodynamic diameter (H₂O-DLS) in aqueous solution made the greatest contributions to the model’s output; the RF model emphasized the importance of particle size measured by transmission electron microscopy (TEM) and exposure dose; the XGBoost model further highlighted the dominant roles of exposure dose and particle size (both TEM and H₂O-DLS). These results indicate that cell death is closely associated with both the physicochemical properties of the quantum dots and the exposure dose, suggesting that variations in size, surface charge, or composition may significantly influence their cytotoxic effects.

In the prediction of inflammatory outcomes (Figure 4b), the KNN model similarly underscored the importance of Zeta potential and emission peak in aqueous solution; the RF model highlighted the contributions of exposure dose and particle size (TEM); and the XGBoost model reaffirmed that exposure dose was the most significant driving factor, followed by particle size and optical properties (excitation peak, emission peak). These results indicate that the inflammatory response is primarily influenced by the exposure dose, while physicochemical properties such as particle size and surface charge also play important regulatory roles. This suggests that both the amount of exposure and the intrinsic characteristics of the particles contribute to the observed pro-inflammatory effects.

In the prediction of oxidative stress (Figure 4c), the KNN model results showed that Zeta potential and hydrodynamic diameter were the primary influencing factors; the RF model emphasized the roles of exposure dose and Zeta potential; and the XGBoost model results further stressed the decisive contribution of exposure dose, accompanied by the auxiliary roles of particle size and optical characteristics. Overall, the occurrence of oxidative stress is closely related to the surface charge of quantum dots and the exposure dose.

In summary, the predictions for all three outcomes consistently highlighted exposure dose and particle size as major contributing factors across different models and toxicity endpoints. Meanwhile, Zeta potential and optical properties appeared to influence specific outcomes in distinct ways. These findings suggest that both the physicochemical characteristics of quantum dots and the exposure conditions together shape their observed biological toxicity.

3.3. Toxic Effects of Different Types of Quantum Dots in Brain Organoids and Machine Learning Validation

To assess the toxic effects of different types of QDs on brain organoids, we measured LDH release and MDA production. We found that CdTe QDs markedly increased LDH release in organoids cultured for 10 days (Figure 5a). Moreover, LDH levels rose in a clear dose-dependent manner as the concentration increased from 1.25 to 5 nM, indicating that higher exposure led to more pronounced cell damage. In brain organoids cultured for 20 days, exposure to CQDs also significantly promoted LDH release (Figure 5b), showing a concentration-dependent enhancement effect within the range of 25–100 μg/mL. Furthermore, in brain organoids cultured for 30 days, exposure to N-GQDs also led to a significant increase in LDH (Figure 5c), indicating that different types of QDs can all induce cell damage in brain organoids.

In addition to cell death markers, we assessed lipid peroxidation levels. The results demonstrated that CQDs, N-GQDs, and CdTe QDs could all significantly elevate MDA levels (Figure 5d–f). Among them, CQDs caused the maximum increase in MDA at 100 μg/mL; N-GQDs exhibited a strong pro-oxidative effect across all dose groups; while exposure to CdTe QDs resulted in a significant increase in MDA within the range of 2.5–5 nM. Overall, different types of QDs could induce marked cell damage and oxidative stress in brain organoids, with dose-dependent variations.

To further validate the reliability of the machine learning results, we systematically compared the experimental results of the three types of QDs with the predictive results of the KNN, RF, and XGBoost models (Table S4). The results showed that all three models were highly consistent with the experimental results in predicting toxicity in terms of cell viability and oxidative stress. Specifically, CQDs and N-GQDs were accurately predicted by all models to exhibit cytotoxic and oxidative stress effects; some predictive results for CdTe QDs showed slight discrepancies with the experimental results, but overall, they were in agreement. These findings indicate that machine learning models possess high accuracy and reliability in predicting QD toxicity and can corroborate actual experimental data from brain organoids.

3.4. Differential Gene Expression Screening and Functional Enrichment Analysis Induced by Quantum Dots

Subsequently, we screened for DEGs in four GEO datasets, with the results depicted in volcano plots (Figure S2a–d). In GSE96720, 5317 significantly upregulated genes and 6226 downregulated genes were identified; in GSE159776, there were 3418 upregulated genes and 3395 downregulated genes; in GSE89756 (3-day treatment), 1033 upregulated genes and 854 downregulated genes were detected; while in GSE89756 (21-day treatment), 1991 upregulated genes and 1674 downregulated genes were found. These findings indicate that different quantum dot treatments can induce significant gene expression changes under various experimental conditions.

To further integrate transcriptomic evidence, we performed an intersection analysis of the DEGs across the four datasets (Figure 6a). The results revealed that 82 genes exhibited consistent differential expression in all datasets, suggesting that these genes may be key targets for the toxic effects induced by quantum dots. KEGG pathway enrichment analysis was conducted on these 82 intersecting genes (Figure 6b). The results showed that these genes were primarily enriched in pathways such as spliceosome, RNA degradation, biosynthesis of amino acids, alanine, aspartate and glutamate metabolism, long-term depression, and renal cell carcinoma. These pathways are closely related to gene transcription regulation, energy metabolism, and neurological dysfunction.

GO functional annotation analysis (Figure 6c) demonstrated that, in terms of biological processes (BPs), the intersecting genes were significantly enriched in processes such as ribonucleoprotein complex biogenesis, ribosome subunit biogenesis, and RNA splicing. At the cellular component (CC) level, they were mainly associated with structures like the nucleolus, precursor bodies, and proteasome complexes. Regarding molecular functions (MFs), they were concentrated in functions such as RNA binding, ATP hydrolysis activity, and chromatin remodeling. The aforementioned results indicate that these key differentially expressed genes play crucial roles in RNA metabolism, protein synthesis, and cellular stress responses, potentially serving as the molecular basis for cell damage and neurotoxicity induced by quantum dots.

3.5. Drug Prediction and Molecular Docking Validation of Core Target Genes

After performing a protein–protein interaction (PPI) network analysis on the 82 intersecting differentially expressed genes (Figure 7a), six core target genes were identified, including IMP3, EXOSC9, PRPF31, NHP2, RSL24D1, and PSMC6. These core genes exhibited high connectivity within the network, suggesting their pivotal positions in the regulatory network of the intersecting genes. We conducted GO and KEGG enrichment analyses on these core genes, and the results revealed that they were primarily enriched in key biological processes and signaling pathways such as ribosome biogenesis, RNA processing, proteasome function, spliceosome, and RNA degradation (Figure 7b). This indicates that these targets play crucial roles in maintaining post-transcriptional regulation and protein homeostasis in cells.

To explore potential strategies to mitigate QD-induced toxicity, we performed drug target enrichment analysis using DGIdb and MAGMA datasets. This analysis aimed to identify drugs acting on the same targets as the core genes affected by QDs, providing candidates that may counteract the toxic effects of QDs. The results showed that these six potential targets were closely associated with proteasome inhibitors, such as Bortezomib and Carfilzomib (Figure 7c). These findings suggest that proteasome inhibitors may exert their effects by directly or indirectly regulating these core genes.

On this basis, we further conducted molecular docking verification. The docking results demonstrated that the candidate drugs could all form stable binding modes with the core targets (Figure 7d–f, Table S5). Among them, Carfilzomib exhibited the strongest binding affinity, with energy values significantly lower than −7.0 kcal/mol when binding to IMP3 (−7.347 kcal/mol), EXOSC9 (−7.317 kcal/mol), and PSMC6 (−7.100 kcal/mol). Specifically, Carfilzomib formed hydrogen bonds and hydrophobic interactions with amino acids such as GLU-476, TYR-544, and CYS-110 in the active pocket of IMP3; hydrogen bonds with ASN-271 and ARG-361 in the binding cavity of EXOSC9; and stable interactions with residues like LEU-303, LYS-383, and LEU-384 in PSMC6. These interactions enhanced the binding stability between the ligand and the protein. In contrast, Bortezomib exhibited slightly higher binding energies (−6.0 to −6.6 kcal/mol), while Metformin generally showed weaker binding energies (−4.0 to −5.1 kcal/mol). In summary, the database screening and molecular docking results consistently indicated that Carfilzomib had the strongest binding capacity to the core targets, outperforming Bortezomib and Metformin, suggesting that it may be the most promising regulatory molecule acting on the identified targets.

4. Discussion

This study systematically integrated machine learning prediction and brain organoid experimental validation, as well as transcriptomics and molecular docking analysis, to construct a multi-module integrated framework of “prediction–validation–intervention” for a systematic exploration of the toxic effects of QDs. The results showed that different types of QDs exhibited significant effects on three key toxic endpoints: cell death, inflammation, and oxidative stress. Moreover, machine learning models, particularly Random Forest (RF) and XGBoost, could stably capture these effects and were highly consistent with the results of brain organoid experiments. This outcome not only validated the application potential of machine learning in nanotoxicology research but also provided new insights for the rapid prediction and mechanism exploration of QD-related toxicity.

Firstly, the comparison results of machine learning models revealed that RF and XGBoost demonstrated the best overall performance in predicting different toxic endpoints, suggesting that these machine learning models could effectively handle the nonlinear and complex relationships between the physicochemical properties of QDs and their toxic effects [44]. This finding not only validated the robustness of machine learning models in small-sample, high-dimensional toxicological data but also offered methodological references for nanomaterial toxicity prediction. Further SHAP analysis revealed that exposure dose and particle size were common key factors across models for different toxic outcomes, while Zeta potential and optical properties exhibited differential effects across endpoints. This discovery indicated that the occurrence of QD toxicity was not driven by a single physicochemical property but rather resulted from a composite effect of multifactorial interactions. Dose and particle size might determine the initial interaction intensity between QDs and biological systems by influencing cell membrane penetration ability, in vivo distribution patterns, and metabolic clearance rates. In contrast, Zeta potential and optical properties could affect downstream oxidative stress and inflammatory pathways by regulating surface reactivity and energy transfer processes [25,45,46,47]. Compared with the prevailing conclusion in previous studies that smaller particle size and higher surface activity lead to stronger toxicity [48,49,50,51,52], this study further clarified the relative weights and action modes of different physicochemical factors in multiple toxic endpoints through model interpretability analysis. This not only enhanced the understanding of QD toxicity mechanisms but also provided ideas for regulating the biocompatibility of QDs through physicochemical parameters in the future.

Additionally, the differences in feature importance ranking among different machine learning models primarily stem from their algorithmic principles, capabilities in capturing feature interactions, and sensitivity to the distribution structure of features [53,54,55]. Although the KNN, RF, and XGBoost models all identify exposure dose and particle size as the core driving factors for quantum dot-induced cytotoxic responses in terms of overall trends, they exhibit significant differences in their emphasis on physicochemical property variables. The KNN model conducts classification based on local distance metrics between samples, relying more on feature clustering and gradient changes in multidimensional space. Consequently, it tends to highlight features such as Zeta potential and hydrodynamic diameter (H₂O-DLS) that significantly influence colloidal stability and dispersion state, as these factors directly determine the similarity structure among samples [56,57]. The RF model iteratively splits the feature space through multiple random decision trees, selecting variables that maximize information gain. Thus, at a global level, it is more prone to capturing features that can form clear threshold effects, such as exposure dose and transmission electron microscopy particle size (TEM) [58,59,60,61]. The XGBoost model, building upon RF, introduces a gradient boosting strategy to iteratively optimize residuals, thereby capturing high-order nonlinearities and feature interactions. As a result, it further reinforces the dominant roles of exposure dose and particle size (both TEM and H₂O-DLS) and identifies the synergistic contributions of optical parameters such as emission peak and excitation peak. This suggests that physicochemical properties may participate in regulating toxic effects by modulating energy band structures and electron excitation behaviors [62,63,64,65]. Overall, KNN reflects continuous effects dominated by local similarity, RF emphasizes globally separable discriminative features, and XGBoost integrates complex nonlinearities and interaction effects. The differences among these models do not arise from algorithmic inconsistencies but rather reflect their complementary capabilities in parsing feature spaces. Therefore, multi-model SHAP analysis not only enhances the robustness of the results but also reveals the complex mechanisms by which the physicochemical properties of quantum dots and exposure conditions collectively shape toxicity endpoints such as cell death, inflammation, and oxidative stress at different levels. This provides a more interpretable evidence base for understanding their multidimensional biological responses.

Brain organoid experiments provided direct experimental evidence. CdTe QDs, CQDs, and N-GQDs significantly induced cell damage and oxidative stress in a dose-dependent manner. This finding was similar to early in vitro neurotoxicity studies [12,17,18]. However, compared with two-dimensional cell models and model organisms, brain organoids could more realistically simulate the human neural tissue microenvironment, suggesting that organoids combined with machine learning could more accurately predict the neurotoxicity of QDs. Although there were slight discrepancies between some predicted results of CdTe QDs and experimental data, the overall consistency indicated that the model had good generalization ability across different types of QDs.

Bioinformatics analysis revealed that 82 intersecting differentially expressed genes were mainly enriched in pathways related to RNA metabolism, protein synthesis, and neural function. This was consistent with previous reports that nanomaterials could induce neurotoxicity and cell damage by interfering with RNA processing and energy metabolism [66,67,68,69]. These results also suggested that the toxic effects induced by QDs were not limited to oxidative stress and cell death but might further trigger neurotoxicity and systemic damage by interfering with RNA processing and energy metabolism pathways. The six core targets (IMP3, EXOSC9, PRPF31, NHP2, RSL24D1, PSMC6) screened from the protein–protein interaction (PPI) network indicated that QDs might induce downstream cytotoxic effects by regulating key nodes such as ribosome biogenesis, RNA splicing, and proteasome function. Compared with previous studies on the toxicity mechanisms of nanoparticles [15,18,19,20,21,22], this study provided more systematic molecular evidence across multiple endpoints and offered potential targets for subsequent drug intervention research.

Notably, the results of drug prediction and molecular docking analysis showed that proteasome inhibitors, especially Carfilzomib, had the lowest binding energy with core target genes, demonstrating the strongest binding ability. This suggested that proteasome inhibitors might be a class of potential intervention drugs for the transcriptional regulation and protein homeostasis imbalance induced by QDs. Of course, as an anticancer drug, Carfilzomib itself had strong side effects [70,71,72], so its application in QD toxicity intervention still required further validation and safety assessment. However, this result provided a directional reference for exploring nanomaterial toxicity intervention strategies.

Overall, the multi-module integrated framework established in this study could not only achieve multi-endpoint toxicity prediction of QDs but also reveal potential mechanisms by combining organoid experiments and transcriptomics, as well as explore intervention strategies. Compared with traditional single in vitro or in vivo experiments, this method significantly improved the systematicity and efficiency of toxicity assessment, providing a theoretical and methodological basis for future safety evaluation and intervention strategy development of QDs. However, this study was still limited by the types of QDs and the scale of data. Future research should incorporate more types of QDs and multi-dose, multi-endpoint, and multi-omics data to further enhance the generalization ability and prediction reliability of the models.

5. Conclusions

This study established a machine learning-based multi-endpoint toxicity prediction system for QDs, integrating brain organoid experiments, transcriptomic analysis, and molecular docking verification to form an integrated “prediction–validation–intervention” framework. The results demonstrated that different types of quantum dots could significantly induce cell death, inflammation, and oxidative stress, with their toxicity regulated by both dose and particle size. The RF and XGBoost models performed best in the predictions, and SHAP analysis revealed that exposure dose and particle size were the key driving factors. Transcriptomic and molecular docking analyses suggested that QDs might mediate toxicity through RNA metabolism and protein homeostasis, while the proteasome inhibitor Carfilzomib showed potential for intervention. This study not only provides an efficient and systematic approach for evaluating the toxicity of quantum dots but also offers a theoretical foundation and practical reference for future safety management and the development of potential intervention strategies for quantum dots, holding significant scientific and applied value for the biological safety assessment of nanomaterials.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/toxics13110967/s1, Table S1: Machine learning dataset: features and outcomes with types; Table S2: The summary of Physicochemical Characteristics of Experimental Quantum Dots; Table S3: Machine Learning Model Performance for Predicting Quantum Dot Toxicity Based on Physicochemical Properties; Table S4: Comparison of KNN, Random Forest, and XGBoost Predictions with Experimental Results; Table S5: Molecular Docking of Six Proteins (IMP3, EXOSC9, PRPF31, NHP2, RSL24D1, PSMC6) with Bortezomib, Carfilzomib, and Metformin; Figure S1: Mean SHAP values of LR (a), NB (b), SVM (c), and MLP (d) models for predicting quantum dot-induced cell death, inflammation, and oxidative stress; Figure S2: Volcano plots showing differentially expressed genes identified from four GEO datasets; Section S1: Biological rationale of the most promising candidate targets [73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90].

Author Contributions

Conceptualization, J.Y. and T.W.; methodology, J.Y.; validation, D.H., P.X. and Y.Z.; formal analysis, Z.Y. and K.L.; investigation, J.X., J.H. and Y.Q.; resources, T.W.; data curation, J.Y.; writing—original draft preparation, J.Y.; writing—review and editing, T.W.; visualization, J.Y.; supervision, T.W.; project administration, T.W.; funding acquisition, T.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (grant numbers 82574144, 82373617, 82103884) and the Supporting Program of Southeast University Zhishan Yong Scholar (grant number 2242025RCB0029).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

The authors acknowledge the use of AI-assisted tools (ChatGPT-4o) for language editing and manuscript structuring. All outputs were reviewed and finalized by the authors, who take full responsibility for the content.

Conflicts of Interest

The authors of this article declare that there are no conflicts of competing financial interest.

Abbreviations

The following abbreviations are used in this manuscript:

QDs	Quantum Dots
SHAP	Shapley Additive exPlanations
RF	Random Forest
XGBoost	Extreme Gradient Boosting
KNN	K-Nearest Neighbors
SVM	Support Vector Machine
NB	Naive Bayes
LR	Logistic Regression
MLP	Multi-Layer Perceptron
EBs	Embryoid bodies
NIM	Neural induction medium
LDH	Lactate dehydrogenase
MDA	Malondialdehyde
DEGs	Differentially expressed genes
GO	Gene Ontology
KEGG	Kyoto Encyclopedia of Genes and Genomes
PPI	Protein–protein interaction
PLIP	Protein–Ligand Interaction Profiler

References

Ekimov, A.I.; Onushchenko, A.A. Quantum Size Effect in the Optical-Spectra of Semiconductor Micro-Crystals. Soviet physics. Semiconductors 1982, 16, 775–778. [Google Scholar]
Chen, B.; Li, D.; Wang, F. InP Quantum Dots: Synthesis and Lighting Applications. Small 2020, 16, 2002454. [Google Scholar] [CrossRef] [PubMed]
Tisdale, W.A.; Zhu, X.Y. Artificial atoms on semiconductor surfaces. Proceedings of the National Academy of Sciences 2010, 108, 965–970. [Google Scholar] [CrossRef]
Oosterkamp, T.H.; Fujisawa, T.; van der Wiel, W.G.; Ishibashi, K.; Hijman, R.V.; Tarucha, S.; Kouwenhoven, L.P. Microwave spectroscopy of a quantum-dot molecule. Nature 1998, 395, 873–876. [Google Scholar] [CrossRef]
Dowling, J.P.; Gea-Banacloche, J. Atomic quantum dots. In Proceedings of the International Quantum Electronics Conference, QThF12, Anaheim, CA, USA, 8–13 May 1994. [Google Scholar]
Michalet, X.; Pinaud, F.F.; Bentolila, L.A.; Tsay, J.M.; Doose, S.; Li, J.J.; Sundaresan, G.; Wu, A.M.; Gambhir, S.S.; Weiss, S. Quantum Dots for Live Cells, in Vivo Imaging, and Diagnostics. Science 2005, 307, 538–544. [Google Scholar] [CrossRef]
Cheng, H.; Liu, Q.; Han, S.; Zhang, S.; Ouyang, X.; Wang, X.; Duan, Z.; Wei, H.; Zhang, X.; Ma, N.; et al. Highly Efficient Photothermal Conversion of Ti₃C₂Tx/Ionic Liquid Gel Pen Ink for Smoothly Writing Ultrasensitive, Wide-Range Detecting, and Flexible Thermal Sensors. ACS Appl. Mater. Interfaces 2020, 12, 37637–37646. [Google Scholar] [CrossRef]
Wang, C.W.; Wang, Q.J. Extending the detection limit: Innovations in infrared quantum dot photodetectors reaching up to 18 μm. Light Sci. Appl. 2024, 13, 154. [Google Scholar] [CrossRef]
Klimov, V.I.; Mikhailovsky, A.A.; Xu, S.; Malko, A.; Hollingsworth, J.A.; Leatherdale, C.A.; Eisler, H.J.; Bawendi, M.G. Optical Gain and Stimulated Emission in Nanocrystal Quantum Dots. Science 2000, 290, 314–317. [Google Scholar] [CrossRef]
Guo, D.; Xu, P.; Chen, D.; Wang, L.; Zhu, Y.; Zuo, Y.; Chen, B. Daunorubicin-Loaded CdTe QDs Conjugated with Anti-CD123 mAbs: A Novel Delivery System for Myelodysplastic Syndromes Treatment. Int. J. Nanomed. 2020, 15, 521–536. [Google Scholar] [CrossRef]
Abdellatif, A.A.H.; Tawfeek, H.M.; Younis, M.A.; Alsharidah, M.; Al Rugaie, O. Biomedical Applications of Quantum Dots: Overview, Challenges, and Clinical Potential. Int. J. Nanomed. 2022, 17, 1951–1970. [Google Scholar] [CrossRef] [PubMed]
Chen, M.; Chen, S.; Liu, K.; Ye, Z.; Qian, Y.; He, J.; Xia, J.; Xing, P.; Yang, J.; Wa Ng, Y.; et al. Putative Adverse Outcome Pathway for Parkinson’s Disease-like Symptoms Induced by Silicon Quantum Dots based on In Vivo/Vitro Approaches. ACS Nano 2024, 18, 25271–25289. [Google Scholar] [CrossRef]
Wu, T.; Liu, K.; Chen, S.; Ye, Z.; Xia, J.; He, J.; Xing, P.; Yang, J.; Qian, Y.; Chen, M. Pulmonary microbiota disruption by respiratory exposure to carbon quantum dots induces neuronal damages in mice. J. Hazard. Mater. 2025, 487, 137255. [Google Scholar] [CrossRef]
Wu, T.; Wang, X.; Cheng, J.; Liang, X.; Li, Y.; Chen, M.; Kong, L.; Tang, M. Nitrogen-doped graphene quantum dots induce ferroptosis through disrupting calcium homeostasis in microglia. Part. Fibre Toxicol. 2022, 19, 22. [Google Scholar] [CrossRef]
Yao, Y.; Wang, Z.; Huang, X.; Wei, T.; Liu, N.; Zou, L.; Niu, Y.; Hu, Y.; Fang, Q.; Wang, X.; et al. Adverse Outcome Pathway-Based Strategies to Mitigate Ag₂Se Quantum Dot-Induced Neurotoxicity. ACS Nano 2025, 19, 11029–11048. [Google Scholar] [CrossRef]
Wang, M.; Lan, S.; Zhang, W.; Jin, Q.; Du, H.; Sun, X.; He, L.; Meng, X.; Su, L.; Liu, G. Anti-Cancer Potency of Copper-Doped Carbon Quantum Dots Against Breast Cancer Progression. Int. J. Nanomed. 2024, 19, 1985–2004. [Google Scholar] [CrossRef] [PubMed]
Liang, X.; Wang, X.; Cheng, J.; Zhang, X.; Wu, T. Ag₂Se quantum dots damage the nervous system of nematode Caenorhabditis elegans. Bull. Environ. Contam. Toxicol. 2022, 109, 279–285. [Google Scholar] [CrossRef] [PubMed]
Chen, L.; Zheng, F.; Yang, P.; Chen, B.; Aguilar, Z.P.; Fu, F.; Xu, H. Effects of QDs exposure on the reproductive and embryonic developmental toxicity in mice at various pregnancy stages. Toxicol. Res. 2020, 9, 371–378. [Google Scholar] [CrossRef] [PubMed]
Fan, J.; Wang, S.; Zhang, X.; Chen, W.; Li, Y.; Yang, P.; Cao, Z.; Wang, Y.; Lu, W.; Ju, D. Quantum Dots Elicit Hepatotoxicity through Lysosome-Dependent Autophagy Activation and Reactive Oxygen Species Production. ACS Biomater. Sci. Eng. 2018, 4, 1418–1427. [Google Scholar] [CrossRef]
He, C.; Ruan, F.; Jiang, S.; Zeng, J.; Yin, H.; Liu, R.; Zhang, Y.; Huang, L.; Wang, C.; Ma, S.; et al. Black Phosphorus Quantum Dots Cause Nephrotoxicity in Organoids, Mice, and Human Cells. Small 2020, 16, 2001371. [Google Scholar] [CrossRef]
Wu, T.; Tang, M. Toxicity of quantum dots on respiratory system. Inhal. Toxicol. 2014, 26, 128–139. [Google Scholar] [CrossRef]
Yao, Y.; Zhang, T.; Tang, M. The DNA damage potential of quantum dots: Toxicity, mechanism and challenge. Environ. Pollut. 2023, 317, 120676. [Google Scholar] [CrossRef]
Hardman, R. A Toxicologic Review of Quantum Dots: Toxicity Depends on Physicochemical and Environmental Factors. Environ. Health Perspect. 2006, 114, 165–172. [Google Scholar] [CrossRef]
Sun, H.; Zhang, F.; Wei, H.; Yang, B. The effects of composition and surface chemistry on the toxicity of quantum dots. J. Mater. Chem. B 2013, 1, 6485. [Google Scholar] [CrossRef] [PubMed]
Gidwani, B.; Sahu, V.; Shukla, S.S.; Pandey, R.; Joshi, V.; Jain, V.K.; Vyas, A. Quantum dots: Prospectives, toxicity, advances and applications. J. Drug Deliv. Sci. Technol. 2021, 61, 102308. [Google Scholar] [CrossRef]
Hu, L.; Zhong, H.; He, Z. Toxicity evaluation of cadmium-containing quantum dots: A review of optimizing physicochemical properties to diminish toxicity. Colloids Surf. B Biointerfaces 2021, 200, 111609. [Google Scholar] [CrossRef] [PubMed]
Oh, E.; Liu, R.; Nel, A.; Gemill, K.B.; Bilal, M.; Cohen, Y.; Medintz, I.L. Meta-analysis of cellular toxicity for cadmium-containing quantum dots. Nat. Nanotechnol. 2016, 11, 479–486. [Google Scholar] [CrossRef]
Wang, X.; He, K.; Hu, Y.; Tang, M. A review of pulmonary toxicity of different types of quantum dots in environmental and biological systems. Chem.-Biol. Interact. 2022, 368, 110247. [Google Scholar] [CrossRef]
Cui, L.W.; Fan, L.Y.; Shen, Z.Y. Application Research Progress of Nanomaterial Graphene and its Derivative Complexes in Tumor Diagnosis and Therapy. Curr. Med. Chem. 2024, 31, 6436–6459. [Google Scholar] [CrossRef]
Gupta, J.; Vaid, P.K.; Priyadarshini, E.; Rajamani, P. Nano-bio convergence unveiled: Systematic review on quantum dots-protein interaction, their implications, and applications. Biophys. Chem. 2024, 310, 107238. [Google Scholar] [CrossRef]
Sukhanova, A.; Bozrova, S.; Gerasimovich, E.; Baryshnikova, M.; Sokolova, Z.; Samokhvalov, P.; Guhrenz, C.; Gaponik, N.; Karaulov, A.; Nabiev, I. Dependence of Quantum Dot Toxicity In Vitro on Their Size, Chemical Composition, and Surface Charge. Nanomaterials 2022, 12, 2734. [Google Scholar] [CrossRef]
Hoshino, A.; Fujioka, K.; Oku, T.; Suga, M.; Sasaki, Y.F.; Ohta, T.; Yasuhara, M.; Suzuki, K.; Yamamoto, K. Physicochemical Properties and Cellular Toxicity of Nanocrystal Quantum Dots Depend on Their Surface Modification. Nano Lett. 2004, 4, 2163–2169. [Google Scholar] [CrossRef]
Chen, S.; Wu, T. Progression and prospects of machine learning techniques in nanotoxicology: Riding the AI-driven wave. Toxicology Mechanisms and Methods 2025, 1–20. [Google Scholar] [CrossRef] [PubMed]
Singh, A.V.; Varma, M.; Laux, P.; Choudhary, S.; Datusalia, A.K.; Gupta, N.; Luch, A.; Gandhi, A.; Kulkarni, P.; Nath, B. Artificial intelligence and machine learning disciplines with the potential to improve the nanotoxicology and nanomedicine fields: A comprehensive review. Arch. Toxicol. 2023, 97, 963–979. [Google Scholar] [CrossRef]
Yousaf, I. AI and Machine Learning Approaches for Predicting Nanoparticles Toxicity The Critical Role of Physiochemical Properties. arXiv 2024, arXiv:2409.15322. [Google Scholar] [CrossRef]
Zhou, Z.Y.; Bai, N.; Zheng, W.J.; Ni, S.J. MultiOmics analysis of metabolic dysregulation and immune features in breast cancer. Int. Immunopharmacol. 2025, 152, 114376. [Google Scholar] [CrossRef]
Shang, Y.; Wang, X.; Su, S.; Ji, F.; Shao, D.; Duan, C.; Chen, T.; Liang, C.; Zhang, D.; Lu, H. Identifying of immune-associated genes for assessing the obesity-associated risk to the offspring in maternal obesity: A bioinformatics and machine learning. CNS Neurosci. Ther. 2024, 30, e14700. [Google Scholar] [CrossRef]
Zhang, B.; Liu, H.; Wu, F.; Ding, Y.; Wu, J.; Lu, L.; Bajpai, A.K.; Sang, M.; Wang, X. Identification of hub genes and potential molecular mechanisms related to drug sensitivity in acute myeloid leukemia based on machine learning. Front. Pharmacol. 2024, 15, 1359832. [Google Scholar] [CrossRef] [PubMed]
Xu, H.; Wang, X.; Zhang, X.; Cheng, J.; Zhang, J.; Chen, M.; Wu, T. A Deep Learning Analysis Reveals Nitrogen-Doped Graphene Quantum Dots Damage Neurons of Nematode Caenorhabditis elegans. Nanomaterials 2021, 11, 3314. [Google Scholar] [CrossRef] [PubMed]
Qi, L.; Yang, J.; Niu, Q.; Li, J. Exploring pesticide risk in autism via integrative machine learning and network toxicology. Ecotoxicol. Env. Saf. 2025, 297, 118233. [Google Scholar] [CrossRef]
Guo, W.; Liu, J.; Dong, F.; Song, M.; Li, Z.; Khan, M.K.H.; Patterson, T.A.; Hong, H. Review of machine learning and deep learning models for toxicity prediction. Exp. Biol. Med. 2023, 248, 1952–1973. [Google Scholar] [CrossRef]
Khokhlov, I.; Legashev, L.; Bolodurina, I.; Shukhman, A.; Shoshin, D.; Kolesnik, S. Prediction of Dynamic Toxicity of Nanoparticles Using Machine Learning. Toxics 2024, 12, 750. [Google Scholar] [CrossRef]
Zang, X.; Zhou, W.; Zhang, H.; Zang, X. Using Four Machine Learning Methods to Analyze the Association Between Polycyclic Aromatic Hydrocarbons and Visual Impairment in American Adults: Evidence from NHANES. Toxics 2024, 12, 789. [Google Scholar] [CrossRef]
Wang, X.; Wang, L.; Wang, S.; Ren, Y.; Chen, W.; Li, X.; Han, P.; Song, T. QuantumTox: Utilizing quantum chemistry with ensemble learning for molecular toxicity prediction. Comput. Biol. Med. 2023, 157, 106744. [Google Scholar] [CrossRef]
Zeng, S.; Tang, Q.; Xiao, M.; Tong, X.; Yang, T.; Yin, D.; Lei, L.; Li, S. Cell membrane-coated nanomaterials for cancer therapy. Mater. Today Bio 2023, 20, 100633. [Google Scholar] [CrossRef]
Yang, P.; Yang, L.; Kuang, H.; Xu, H. Research advances in characteristics of biotransport and biotransformation and toxicities of quantum dots in vivo. Chin. J. Pharmacol. Toxicol. 2015, 29, 1007–1013. [Google Scholar] [CrossRef]
Le, N.; Zhang, M.; Kim, K. Quantum Dots and Their Interaction with Biological Systems. Int. J. Mol. Sci. 2022, 23, 10763. [Google Scholar] [CrossRef]
Liu, W.; Liao, H.; Wei, M.; Junaid, M.; Chen, G.; Wang, J. Biological uptake, distribution and toxicity of micro(nano)plastics in the aquatic biota: A special emphasis on size-dependent impacts. TrAC Trends Anal. Chem. 2024, 170, 117477. [Google Scholar] [CrossRef]
Carnovale, C.; Bryant, G.; Shukla, R.; Bansal, V. Size, shape and surface chemistry of nano-gold dictate its cellular interactions, uptake and toxicity. Prog. Mater. Sci. 2016, 83, 152–190. [Google Scholar] [CrossRef]
Dolai, J.; Mandal, K.; Jana, N.R. Nanoparticle Size Effects in Biomedical Applications. ACS Appl. Nano Mater. 2021, 4, 6471–6496. [Google Scholar] [CrossRef]
Naseer, B.; Srivastava, G.; Qadri, O.S.; Faridi, S.A.; Islam, R.U.; Younis, K. Importance and health hazards of nanoparticles used in the food industry. Nanotechnol. Rev. 2018, 7, 623–641. [Google Scholar] [CrossRef]
Cheng, D.; Zheng, D.; Jiang, M.; Jin, Y.; Liu, R.; Zhou, Y.; Shen, J.; Tang, J.; Wang, F.; Tang, J.; et al. Inhibition of iron ion accumulation alleviates polystyrene nanoplastics-induced pulmonary fibroblast proliferation and activation. Int. Immunopharmacol. 2025, 164, 115367. [Google Scholar] [CrossRef]
He, Z.; Armaghani, D.J.; Masoumnezhad, M.; Khandelwal, M.; Zhou, J.; Murlidhar, B.R. A Combination of Expert-Based System and Advanced Decision-Tree Algorithms to Predict Air-Overpressure Resulting from Quarry Blasting. Nat. Resour. Res. 2020, 30, 1889–1903. [Google Scholar] [CrossRef]
Ju, W.; Xing, Z. A novel technology for unraveling the spatial risk of Natech disasters based on machine learning and GIS: A case study from the city of Changzhou, China. Earth Sci. Inform. 2024, 17, 5751–5770. [Google Scholar] [CrossRef]
Mustafa, R.; Ahmad, M.T. Internal Stability of Mechanically Stabilized Earth Wall Using Machine Learning Techniques. Transp. Infrastruct. Geotechnol. 2024, 11, 3204–3234. [Google Scholar] [CrossRef]
Hashida, S. Dispersibility of Colloidal Particles: Basic Theory about Zeta Potential Measurement. J. Adhes. Soc. Jpn. 2019, 55, 266–270. [Google Scholar] [CrossRef]
Inthajak, K.; Duanggate, C.; Uyyanonvara, B.; Makhanov, S.S.; Barman, S. Medical image blob detection with feature stability and KNN classification. In Proceedings of the 2011 Eighth International Joint Conference on Computer Science and Software Engineering (JCSSE), Nakhonpathom, Thailand, 11–13 May 2011; pp. 128–131. [Google Scholar] [CrossRef]
Roberts, N.; Smith, M.; Qi, J. Data engineering for predictive machine learning of stormwater infrastructure conditions. Eng. Appl. Artif. Intell. 2024, 133, 108195. [Google Scholar] [CrossRef]
Virnodkar, S.S.; Pachghare, V.K.; Patil, V.C.; Jha, S.K. Remote sensing and machine learning for crop water stress determination in various crops: A critical review. Precis. Agric. 2020, 21, 1121–1155. [Google Scholar] [CrossRef]
Amiri, A.; Peltier, N.; Goldberg, C.; Sun, Y.; Nathan, A.; Hiremath, S.; Mankodiya, K. WearSense: Detecting Autism Stereotypic Behaviors through Smartwatches. Healthcare 2017, 5, 11. [Google Scholar] [CrossRef]
Mistry, P.; Neagu, D.; Trundle, P.R.; Vessey, J.D. Using random forest and decision tree models for a new vehicle prediction approach in computational toxicology. Soft Comput. 2015, 20, 2967–2979. [Google Scholar] [CrossRef]
Zhang, Z.; Zhu, X.; Liu, D. Model of Gradient Boosting Random Forest Prediction. In Proceedings of the 2022 IEEE International Conference on Networking, Sensing and Control (ICNSC), Shanghai, China, 15–18 December 2022; pp. 1–6. [Google Scholar] [CrossRef]
Mohammed, B.; Hamza, C. A Robust Estimation of Blasting-Induced Flyrock Using Machine Learning Decision Tree Algorithms: Random Forest, Gradient Boosting Machine, and XGBoost. Min. Metall. Explor. 2025, 42, 1609–1624. [Google Scholar] [CrossRef]
Sheridan, R.P.; Wang, W.M.; Liaw, A.; Ma, J.; Gifford, E.M. Extreme Gradient Boosting as a Method for Quantitative Structure–Activity Relationships. J. Chem. Inf. Model. 2016, 56, 2353–2360. [Google Scholar] [CrossRef]
Wang, S.; Long, W.; Wei, L.; Cheng, W.; Chen, H.; Yang, J.; Fu, H. Nano effect fluorescence visual sensor based on Au-AgNCs: A novel strategy to identify the origin and growth year of Lilium bulbs. Food Chem. 2024, 441, 138353. [Google Scholar] [CrossRef]
Calé, A.; Elblová, P.; Andělová, H.; Lunova, M.; Lunov, O. Analyzing Molecular Determinants of Nanodrugs’ Cytotoxic Effects. Int. J. Mol. Sci. 2025, 26, 6687. [Google Scholar] [CrossRef]
Ge, D.; Du, Q.; Ran, B.; Liu, X.; Wang, X.; Ma, X.; Cheng, F.; Sun, B. The neurotoxicity induced by engineered nanomaterials. Int. J. Nanomed. 2019, 14, 4167–4186. [Google Scholar] [CrossRef]
Xu, S.; Pang, X.; Zhang, X.; Lv, Q.; Zhang, M.; Wang, J.; Ni, N.; Sun, X. Nanomaterials disrupting cell-cell junctions towards various diseases. Nano Res. 2023, 16, 7053–7074. [Google Scholar] [CrossRef]
Sun, J.; Peng, S.; Yang, Q.; Yang, J.; Dai, Y.; Xing, L. Microplastics/nanoplastics and neurological health: An overview of neurological defects and mechanisms. Toxicology 2025, 511, 154030. [Google Scholar] [CrossRef] [PubMed]
Barla, I.; Efentakis, P.; Lamprou, S.; Gavriatopoulou, M.; Dimopoulos, M.-A.; Terpos, E.; Andreadou, I.; Thomaidis, N.; Gikas, E. Metabolomics Point out the Effects of Carfilzomib on Aromatic Amino Acid Biosynthesis and Degradation. Int. J. Mol. Sci. 2023, 24, 13966. [Google Scholar] [CrossRef] [PubMed]
Forghani, P.; Rashid, A.; Sun, F.; Liu, R.; Li, D.; Lee, M.R.; Hwang, H.; Maxwell, J.T.; Mandawat, A.; Wu, R.; et al. Carfilzomib Treatment Causes Molecular and Functional Alterations of Human Induced Pluripotent Stem Cell–Derived Cardiomyocytes. J. Am. Heart Assoc. 2021, 10, e022247. [Google Scholar] [CrossRef] [PubMed]
Mendez-Lopez, M.; Besse, A.; Zuppinger, C.; Perez-Shibayama, C.; Gil-Cruz, C.; Florea, B.I.; De Martin, A.; Lütge, M.; Beckerova, D.; Klimovic, S.; et al. Carfilzomib-specific proteasome β5/β2 inhibition drives cardiotoxicity via remodeling of protein homeostasis and the renin-angiotensin-system. iScience 2025, 28, 113228. [Google Scholar] [CrossRef]
Gao, F.; Zhang, B.; Xiao, C.; Sun, Z.; Gao, Y.; Liu, C.; Dou, X.; Tong, H.; Wang, R.; Li, P.; et al. IGF2BP3 stabilizes SESN1 mRNA to mitigate oxidized low-density lipoprotein-induced oxidative stress and endothelial dysfunction in human umbilical vein endothelial cells by activating Nrf2 signaling. Prostaglandins Other Lipid Mediat. 2024, 172, 106832. [Google Scholar] [CrossRef]
Lv, L.; Wei, Q.; Zhang, J.; Dong, Y.; Shan, Z.; Chang, N.; Zhao, Y.; Bian, P.; Yi, Q. IGF2BP3 prevent HMGB1 mRNA decay in bladder cancer and development. Cell. Mol. Biol. Lett. 2024, 29, 39. [Google Scholar] [CrossRef]
Suvasini, R.; Shruti, B.; Thota, B.; Shinde, S.V.; Friedmann-Morvinski, D.; Nawaz, Z.; Prasanna, K.V.; Thennarasu, K.; Hegde, A.S.; Arivazhagan, A.; et al. Insulin Growth Factor-2 Binding Protein 3 (IGF2BP3) Is a Glioblastoma-specific Marker That Activates Phosphatidylinositol 3-Kinase/Mitogen-activated Protein Kinase (PI3K/MAPK) Pathways by Modulating IGF-2. J. Biol. Chem. 2011, 286, 25882–25890. [Google Scholar] [CrossRef]
Burns, D.T.; Donkervoort, S.; Müller, J.S.; Knierim, E.; Bharucha-Goebel, D.; Faqeih, E.A.; Bell, S.K.; AlFaifi, A.Y.; Monies, D.; Millan, F.; et al. Variants in EXOSC9 Disrupt the RNA Exosome and Result in Cerebellar Atrophy with Spinal Motor Neuronopathy. Am. J. Hum. Genet. 2018, 102, 858–873. [Google Scholar] [CrossRef] [PubMed]
Dabaj, I.; Hassani, A.; Burglen, L.; Qebibo, L.; Guerrot, A.-M.; Marret, S.; Tebani, A.; Bekri, S. Pontocerebellar Hypoplasia Type 1D: A Case Report and Comprehensive Literature Review. J. Clin. Med. 2022, 11, 4335. [Google Scholar] [CrossRef] [PubMed]
Sakamoto, M.; Iwama, K.; Sekiguchi, F.; Mashimo, H.; Kumada, S.; Ishigaki, K.; Okamoto, N.; Behnam, M.; Ghadami, M.; Koshimizu, E.; et al. Novel EXOSC9 variants cause pontocerebellar hypoplasia type 1D with spinal motor neuronopathy and cerebellar atrophy. J. Hum. Genet. 2020, 66, 401–407. [Google Scholar] [CrossRef]
Georgiou, M.; Atkinson, R.; Mozaffari-Jovin, S.; Lako, M. Progressive accumulation of cytoplasmic aggregates in PRPF31 retinal pigment epithelium cells interferes with cell survival. Clin. Transl. Discov. 2022, 2, e89. [Google Scholar] [CrossRef]
Wagle, A.S.; Vargas, M. Uncovering Pre-messenger RNA Splicing Mechanisms in Retinitis Pigmentosa. FASEB J. 2022, 36. [Google Scholar] [CrossRef]
Alnafakh, R.A.A.; Adishesh, M.; Button, L.; Saretzki, G.; Hapangama, D.K. Telomerase and Telomeres in Endometrial Cancer. Front. Oncol. 2019, 9, 344. [Google Scholar] [CrossRef] [PubMed]
Maliński, B.; Vertemara, J.; Faustini, E.; Ladenvall, C.; Norberg, A.; Zhang, Y.; von Castelmur, E.; Baliakas, P.; Tisi, R.; Cammenga, J.; et al. Novel pathological variants of NHP2 affect N-terminal domain flexibility, protein stability, H/ACA Ribonucleoprotein (RNP) complex formation and telomerase activity. Hum. Mol. Genet. 2023, 32, 2901–2912. [Google Scholar] [CrossRef]
Rembiałkowska, N.; Sędzik, M.; Kisielewska, M.; Łuniewska, W.; Sebastianka, K.; Molik, K.; Skinderowicz, K.; Kuźnicki, J.; Tunikowska, J.; Kulbacka, J. Telomere Maintenance and DNA Repair: A Bidirectional Relationship in Cancer Biology and Therapy. Cancers 2025, 17, 2284. [Google Scholar] [CrossRef]
Danilova, N.; Gazda, H.T. Ribosomopathies: How a common root can cause a tree of pathologies. Dis. Models Mech. 2015, 8, 1013–1026. [Google Scholar] [CrossRef]
Kang, J.; Brajanovski, N.; Chan, K.T.; Xuan, J.; Pearson, R.B.; Sanij, E. Ribosomal proteins and human diseases: Molecular mechanisms and targeted therapy. Signal Transduct. Target. Ther. 2021, 6, 323. [Google Scholar] [CrossRef]
Temaj, G.; Saha, S.; Dragusha, S.; Ejupi, V.; Buttari, B.; Profumo, E.; Beqa, L.; Saso, L. Ribosomopathies and cancer: Pharmacological implications. Expert. Rev. Clin. Pharmacol. 2022, 15, 729–746. [Google Scholar] [CrossRef]
Vadivel Gnanasundram, S.; Fåhraeus, R. Translation Stress Regulates Ribosome Synthesis and Cell Proliferation. Int. J. Mol. Sci. 2018, 19, 3757. [Google Scholar] [CrossRef] [PubMed]
Goldberg, A.L. Protein degradation and protection against misfolded or damaged proteins. Nature 2003, 426, 895–899. [Google Scholar] [CrossRef] [PubMed]
Marshall, R.S.; Vierstra, R.D. Dynamic Regulation of the 26S Proteasome: From Synthesis to Degradation. Front. Mol. Biosci. 2019, 6, 40. [Google Scholar] [CrossRef] [PubMed]
Amm, I.; Sommer, T.; Wolf, D.H. Protein quality control and elimination of protein waste: The role of the ubiquitin–proteasome system. Biochim. Biophys. Acta (BBA)-Mol. Cell Res. 2014, 1843, 182–196. [Google Scholar] [CrossRef]

Figure 1. Machine learning framework for multi-endpoint quantum dot (QD) toxicity prediction with organoid validation and drug target discovery. This schematic illustrates the overall workflow integrating data preprocessing, feature selection, model training and validation, SHAP-based feature interpretation, organoid validation, and drug target prediction. Abbreviations: SHAP, Shapley Additive exPlanations; MDA, malondialdehyde; LDH, lactate dehydrogenase; RF, random forest; XGB, eXtreme Gradient Boosting; SVM, support vector machine; LR, logistic regression; NB, naïve Bayes; MLP, multi-layer perceptron; KNN, k-nearest neighbors.

Figure 2. Workflow for developing seven machine learning models to predict quantum dot-induced toxicities based on physicochemical properties.

Figure 3. Comparison of seven machine learning models (RF, XGB, KNN, SVM, LR, NB, and MLP) in predicting quantum dot-induced cell death, inflammation, and oxidative stress based on accuracy, F1 score (a,b), ROC-AUC (c), and PR-AUC (d). Other evaluation metrics are provided in Table S3.

Figure 4. Mean SHAP values of the top three performing models for predicting quantum dot-induced cell death (a), inflammation (b), and oxidative stress (c).

Figure 5. LDH release in brain organoids cultured for 10 days induced by CdTe QDs (a), for 20 days induced by CQDs (b), and for 30 days induced by N-GQDs (c); MDA content elevation in 30-day cultured brain organoids following exposure to CQDs (d), N-GQDs (e), and CdTe QDs (f). A p-value < 0.05 was considered statistically significant (*), with p-value < 0.01 (**), p-value < 0.001 (***), and p-value < 0.0001 (****) indicating progressively stronger levels of significance.

Figure 6. (a) Intersection of differentially expressed genes (DEGs) from four GEO datasets; (b) KEGG pathway enrichment analysis; and (c) GO functional enrichment analysis of the 82 intersected DEGs.

Figure 7. (a) Protein–protein interaction (PPI) network of the 82 intersected DEGs. Six core target genes were identified based on network centrality. (b) GO and KEGG enrichment analyses of the six core target genes, highlighting their biological functions and pathways. (c) Drug target enrichment analysis of the six core genes using DGIdb Drug Targets 2024 and MAGMA Drugs and Diseases datasets. Molecular docking results showing interactions between Carfilzomib and three selected core targets, IMP3 (d), EXOSC9 (e), and PSMC6 (f). Docking scores indicate potential binding affinity.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yang, J.; Hu, D.; Xing, P.; Zhang, Y.; Ye, Z.; Liu, K.; Xia, J.; He, J.; Qian, Y.; Wu, T. Machine Learning Framework for Multi-Endpoint Quantum Dot Toxicity Prediction with Organoid Validation and Drug Target Discovery. Toxics 2025, 13, 967. https://doi.org/10.3390/toxics13110967

AMA Style

Yang J, Hu D, Xing P, Zhang Y, Ye Z, Liu K, Xia J, He J, Qian Y, Wu T. Machine Learning Framework for Multi-Endpoint Quantum Dot Toxicity Prediction with Organoid Validation and Drug Target Discovery. Toxics. 2025; 13(11):967. https://doi.org/10.3390/toxics13110967

Chicago/Turabian Style

Yang, Jiafu, Dayu Hu, Pengcheng Xing, Yikai Zhang, Zongjian Ye, Kehan Liu, Jieyi Xia, Jing He, Yijing Qian, and Tianshu Wu. 2025. "Machine Learning Framework for Multi-Endpoint Quantum Dot Toxicity Prediction with Organoid Validation and Drug Target Discovery" Toxics 13, no. 11: 967. https://doi.org/10.3390/toxics13110967

APA Style

Yang, J., Hu, D., Xing, P., Zhang, Y., Ye, Z., Liu, K., Xia, J., He, J., Qian, Y., & Wu, T. (2025). Machine Learning Framework for Multi-Endpoint Quantum Dot Toxicity Prediction with Organoid Validation and Drug Target Discovery. Toxics, 13(11), 967. https://doi.org/10.3390/toxics13110967

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Machine Learning Framework for Multi-Endpoint Quantum Dot Toxicity Prediction with Organoid Validation and Drug Target Discovery

Abstract

1. Introduction

2. Materials and Methods

2.1. Data Collection, Imputation, and Multi-Model Training

2.2. Hyperparameter Tuning of Machine Learning Models

2.2.1. K-Nearest Neighbors

2.2.2. Logistic Regression

2.2.3. Naive Bayes

2.2.4. Random Forest

2.2.5. Support Vector Machine

2.2.6. Extreme Gradient Boosting

2.2.7. Multi-Layer Perceptron

2.3. Machine Learning Model Evaluation

2.4. SHAP Feature Importance Analysis

2.5. The Physicochemical Characterizations of QDs

2.6. Generation and Quantum Dot Exposure of Human Brain Organoids

2.7. Identification and Functional Analysis of Potential Target Genes in Response to QD Exposure

2.8. Screening of Potential Therapeutics for QD-Induced Toxicity

2.9. Data Analysis and Statistics

3. Results

3.1. Performance Comparison of Seven Machine Learning Models in Predicting Quantum Dot-Induced Cell Viability, Inflammation, and Oxidative Stress

3.2. SHAP Analysis of the Physicochemical Properties and Exposure Conditions of Quantum Dots Under Different Toxicological Outcomes

3.3. Toxic Effects of Different Types of Quantum Dots in Brain Organoids and Machine Learning Validation

3.4. Differential Gene Expression Screening and Functional Enrichment Analysis Induced by Quantum Dots

3.5. Drug Prediction and Molecular Docking Validation of Core Target Genes

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI