Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (5,461)

Search Parameters:
Keywords = machine learning robustness

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
24 pages, 2544 KB  
Article
Perspectives of Machine Learning for Ligand-Field Analyses in Lanthanide-Based Single Molecule Magnets
by Zayan Ahsan Ali, Preeti Tewatia and Oliver Waldmann
Magnetochemistry 2026, 12(2), 19; https://doi.org/10.3390/magnetochemistry12020019 - 2 Feb 2026
Abstract
Lanthanide-based single-molecule magnets are promising candidates for potential applications. Their magnetism is governed by ligand-field splittings, which may require up to 27 ligand-field parameters for accurate modeling. Determining these parameters reliably from measured data is a major challenge, for which machine learning approaches [...] Read more.
Lanthanide-based single-molecule magnets are promising candidates for potential applications. Their magnetism is governed by ligand-field splittings, which may require up to 27 ligand-field parameters for accurate modeling. Determining these parameters reliably from measured data is a major challenge, for which machine learning approaches offer promising solutions. We provide an overview of these approaches and present our perspective on addressing the inverse problem relating experimental data to ligand-field parameters. Previously, a machine learning architecture combining a variational autoencoder (VAE) and an invertible neural network (INN) showed promise for analyzing temperature-dependent magnetic susceptibility data. In this work, the VAE-INN model is extended through data augmentation to enhance its tolerance to common experimental inaccuracies. Focusing on second-order ligand-field parameters, diamagnetic and molar-mass errors are incorporated by augmenting the training dataset with experimentally motivated error distributions. Tests on simulated experimental susceptibility curves demonstrate substantially improved prediction accuracy and robustness when the distributions correspond to realistic error ranges. When applied to the experimental susceptibility curve of the complex Al2IIIEr2III, the augmented VAE–INN recovers ligand-field solutions consistent with least-squares benchmarks. The proposed data augmentation thus overcomes a key limitation, bringing the ML approach closer to practical use for higher-order ligand-field parameters. Full article
Show Figures

Graphical abstract

20 pages, 3811 KB  
Article
Enhancing CYP3A4 Inhibition Prediction Using a Hybrid GNN–ML Model with Data Augmentation
by Somin Woo, Ju-Hyeok Jeon, Sangil Han, Changkyu Lee and Sang-Hyun Min
Pharmaceuticals 2026, 19(2), 258; https://doi.org/10.3390/ph19020258 - 2 Feb 2026
Abstract
Background/Objectives: Cytochrome P450 3A4 (CYP3A4) metabolizes approximately 30–50% of clinically used drugs; thus, accurate prediction of CYP3A4 inhibition is essential for early assessment of drug–drug interaction (DDI) risk and toxicity. This study evaluated an integrated artificial intelligence framework for predicting CYP3A4 inhibition [...] Read more.
Background/Objectives: Cytochrome P450 3A4 (CYP3A4) metabolizes approximately 30–50% of clinically used drugs; thus, accurate prediction of CYP3A4 inhibition is essential for early assessment of drug–drug interaction (DDI) risk and toxicity. This study evaluated an integrated artificial intelligence framework for predicting CYP3A4 inhibition (%) using a large, curated chemical dataset. Methods: A dataset of 23,713 compounds was compiled from the Korea Chemical Bank and multiple commercial and public databases. Vector-based machine learning (ML) models (LightGBM, XGBoost, CatBoost, and a weighted ML ensemble) and graph neural network (GNN) models (O-GNN with contrastive learning and manifold mixup (O-GNN + CL + Mixup), D-MPNN, GINE, and GATv2) were evaluated. Manifold mixup was applied during GNN training, and SMILES enumeration-based test-time augmentation was used at inference. The best-performing ML and GNN models were integrated using a weighted ensemble strategy. Model interpretability was examined using SHAP analysis for ML models and occlusion sensitivity analysis for O-GNN + CL + Mixup. Results: The weighted ML ensemble achieved the best performance among ML models (RMSE = 19.1031, Pearson correlation coefficient (PCC) = 0.7566); the O-GNN + CL + Mixup model performed the best among GNN models (RMSE = 20.1002, PCC = 0.7265). The hybrid model achieved improved predictive accuracy (RMSE = 19.0784, PCC = 0.7570). External validation on 100 newly generated experimental data points confirmed generalizability (Custom Metric = 0.8035). Conclusions: This study demonstrated that integrating ML and GNN models with data augmentation strategies improves the robustness and interpretability of CYP3A4 inhibition prediction and established a practical framework for metabolic screening and DDI risk assessment. Full article
(This article belongs to the Section Pharmaceutical Technology)
Show Figures

Graphical abstract

24 pages, 8118 KB  
Article
Hyperspectral Inversion of Apple Leaf Nitrogen Across Phenological Stages Based on an Optimized XGBoost Model
by Ruiqian Xi, Yanxia Gu, Haoyu Ren and Zhenhui Ren
Horticulturae 2026, 12(2), 184; https://doi.org/10.3390/horticulturae12020184 - 2 Feb 2026
Abstract
Precision monitoring of leaf nitrogen content (LNC) in fruit trees is critical for optimizing fertilization and fruit quality. In this study, 1120 apple-leaf samples spanning two phenological stages were collected. Characteristic wavelengths were selected using competitive adaptive reweighted sampling and the successive projection [...] Read more.
Precision monitoring of leaf nitrogen content (LNC) in fruit trees is critical for optimizing fertilization and fruit quality. In this study, 1120 apple-leaf samples spanning two phenological stages were collected. Characteristic wavelengths were selected using competitive adaptive reweighted sampling and the successive projection algorithm (CARS–SPA). To mitigate inefficient exploration during population initialization and iterations, we propose a collaborative enhancement strategy integrating Sobol-sequence sampling and elite opposition-based learning (EOBL), termed SEO, which simultaneously refines initialization and iterative updating in swarm-based optimization algorithms. Four machine learning algorithms were trained to construct cross-phenological-stage LNC inversion models. Results indicated characteristic wavelengths lay within the visible region. The combined SEO strategy improved search capability and efficiency, with SEO-BKA achieving the best performance. Consequently, the SEO-BKA-XGBoost model yielded the highest accuracy in the bloom and fruit-set stage (R2 = 0.883; RMSE = 0.124) and fruit-enlargement stage (R2 = 0.897; RMSE = 0.069). These findings provide robust technical support for LNC hyperspectral inversion in apple trees. Full article
(This article belongs to the Section Protected Culture)
Show Figures

Graphical abstract

19 pages, 1923 KB  
Article
A Novel Recurrent Neural Network Framework for Prediction and Treatment of Oncogenic Mutation Progression
by Rishab Parthasarathy and Achintya K. Bhowmik
AI 2026, 7(2), 54; https://doi.org/10.3390/ai7020054 - 2 Feb 2026
Abstract
Despite significant medical advancements, cancer remains the second leading cause of death in the US, causing over 600,000 deaths per year. One emerging field, pathway analysis, is promising but still relies on manually derived wet lab data, which is time-consuming to acquire. This [...] Read more.
Despite significant medical advancements, cancer remains the second leading cause of death in the US, causing over 600,000 deaths per year. One emerging field, pathway analysis, is promising but still relies on manually derived wet lab data, which is time-consuming to acquire. This work proposes an efficient, effective, end-to-end framework for Artificial Intelligence (AI)-based pathway analysis that predicts both cancer severity and mutation progression in order to recommend possible treatments. The proposed technique involves a novel combination of time-series machine learning models and pathway analysis. First, mutation sequences were isolated from The Cancer Genome Atlas (TCGA) Database. Then, a novel preprocessing algorithm was used to filter key mutations by mutation frequency. This data was fed into a Recurrent Neural Network (RNN) that predicted cancer severity. The model probabilistically used the RNN predictions, information from the preprocessing algorithm, and multiple drug-target databases to predict future mutations and recommend possible treatments. This framework achieved robust results and Receiver Operating Characteristic (ROC) curves (a key statistical metric) with accuracies greater than 60%, similar to existing cancer diagnostics. In addition, preprocessing played a key role in isolating a few hundred key driver mutations per cancer stage, consistent with current research. Heatmaps based on predicted gene frequency were also generated, highlighting key mutations in each cancer. Overall, this work is the first to propose an efficient, cost-effective end-to-end framework for projecting cancer prognosis and providing possible treatments without relying on expensive, time-consuming wet lab work. Full article
(This article belongs to the Special Issue Transforming Biomedical Innovation with Artificial Intelligence)
Show Figures

Figure 1

31 pages, 3609 KB  
Review
The Machine-Learning-Driven Transformation of Forest Biometrics: Progress and Pathways Ahead Review
by Markos Progios and Maria J. Diamantopoulou
Forests 2026, 17(2), 200; https://doi.org/10.3390/f17020200 - 2 Feb 2026
Abstract
Forest biometrics has emerged as one of the fastest-growing scientific disciplines within environmental sciences. Machine learning (ML), an increasingly essential approach that uses effective algorithms, has proven to be an accurate and cost-efficient solution to forest-related problems. Recently, ML methods have evolved, from [...] Read more.
Forest biometrics has emerged as one of the fastest-growing scientific disciplines within environmental sciences. Machine learning (ML), an increasingly essential approach that uses effective algorithms, has proven to be an accurate and cost-efficient solution to forest-related problems. Recently, ML methods have evolved, from traditional machine learning (TML) algorithms to more sophisticated approaches, such as deep learning (DL) and ensemble (ENS) methods. To uncover these developments, a structured review and analysis of 150 peer-reviewed studies was conducted, following a standardized workflow. The analysis reveals clear shifts in methodological adoption. During the most recent five-year period (2021–2025), DL and shallow neural network (SNN) methods dominated the literature, accounting for 37.5% of published studies, followed by ENS and TML methods, contributing 29.2% and 27.1%, respectively, presenting a marked increase in the utilization of artificial neural networks (ANNs) and related algorithms across the domains of forest biometrics. Nevertheless, overall trends indicate that the benefits of TML methods still need further exploration for ground-based received data. Advances in remote sensing and satellite data have brought large-scale remotely sensed data into environmental research, further boosting ML utilization. However, each field could be strengthened by implementing standardized evaluation metrics and broader geographic representation. In this way, robust and widely transferable modeling frameworks for forest ecosystems can be developed. At the same time, further research on algorithms and their applicability to natural resources proves a key component for comprehensive and sustainable forest management. Full article
(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)
Show Figures

Figure 1

27 pages, 4367 KB  
Article
MTFE-Net: A Deep Learning Vision Model for Surface Roughness Extraction Based on the Combination of Texture Features and Deep Learning Features
by Qiancheng Jin, Wangzhe Du, Huaxin Liu, Xuwei Li, Xiaomiao Niu, Yaxing Liu, Jiang Ji, Mingjun Qiu and Yuanming Liu
Metals 2026, 16(2), 179; https://doi.org/10.3390/met16020179 - 2 Feb 2026
Abstract
Surface roughness, critically measured by the Arithmetical Mean Roughness (Ra), is a vital determinant of workpiece functional performance. Traditional contact-based measurement methods are inefficient and unsuitable for online inspection. While machine vision offers a promising alternative, existing approaches lack robustness, and pure deep [...] Read more.
Surface roughness, critically measured by the Arithmetical Mean Roughness (Ra), is a vital determinant of workpiece functional performance. Traditional contact-based measurement methods are inefficient and unsuitable for online inspection. While machine vision offers a promising alternative, existing approaches lack robustness, and pure deep learning models suffer from poor interpretability. Therefore, MTFE-Net is proposed, which is a novel deep learning framework for surface roughness classification. The key innovation of MTFE-Net lies in its effective integration of traditional texture feature analysis with deep learning within a dual-branch architecture. The MTFE (Multi-dimensional Texture Feature Extraction) branch innovatively combines a comprehensive suite of texture descriptors including Gray-Level Co-occurrence Matrix (GLCM), gray-level difference statistic, first-order statistic, Tamura texture features, wavelet transform, and Local Binary Pattern (LBP). This multi-scale, multi-perspective feature extraction strategy overcomes the limitations of methods that focus on only specific texture aspects. These texture features are then refined using Multi-Head Self-Attention (MHA) mechanism and Mamba model. Experiments on a dataset of Q235 steel surfaces show that MTFE-Net achieves state-of-the-art performance with 95.23% accuracy, 94.89% precision, 94.67% recall and 94.74% F1-score, significantly outperforming comparable models. The results validate that the fusion strategy effectively enhances accuracy and robustness, providing a powerful solution for industrial non-contact roughness inspection. Full article
(This article belongs to the Section Computation and Simulation on Metals)
Show Figures

Figure 1

24 pages, 2957 KB  
Article
Development of a PM2.5 Emission Factor Prediction Model for Shrubs in the Xiao Xing’an Mountains Based on Coupling Effects of Physical Factors
by Tianbao Zhang, Xiaoying Han, Haifeng Gao, Hui Huang, Zhiyuan Wu, Yu Gu, Bingbing Lu and Zhan Shu
Forests 2026, 17(2), 199; https://doi.org/10.3390/f17020199 - 2 Feb 2026
Abstract
Over recent years, the intensity of forest fires has escalated, with wildfire-emitted pollutants exerting substantial impacts on the environment, ecosystems, and human well-being. This study developed a robust predictive framework to quantify wildfire-induced PM2.5 emission factors (EFs) using seven shrub species—Corylus [...] Read more.
Over recent years, the intensity of forest fires has escalated, with wildfire-emitted pollutants exerting substantial impacts on the environment, ecosystems, and human well-being. This study developed a robust predictive framework to quantify wildfire-induced PM2.5 emission factors (EFs) using seven shrub species—Corylus mandshurica, Eleutherococcus senticosus, Philadelphus schrenkii, Sorbaria sorbifolia, Syringa reticulata, Spiraea salicifolia, and Lonicera maackii. These species represent ecological cornerstones of Northeast Asian forests and hold global relevance as widely introduced or invasive taxa in North America and Europe. The novelty of this research lies in the integration of traditional statistical inference with machine learning to resolve the complex coupling between fuel traits and emissions. We conducted 1134 laboratory-controlled burns in the Liangshui National Nature Reserve, evaluating two continuous and three categorical variables. Initial screening via Analysis of Variance (ANOVA) and stepwise linear regression (Step-AIC) identified the primary drivers of emissions and revealed that interspecific differences among the seven shrubs did not significantly affect the EF (p = 0.0635). To ensure statistical rigor, a log-transformation was applied to the EF data to correct for right-skewness and heteroscedasticity inherent in raw observations. Linear Mixed-effects Models (LMMs) and Gradient Boosting Machines (GBMs) were subsequently employed to quantify factor effects and capture potential nonlinearities. The LMM results consistently identified burning type and plant part as the dominant determinants: smoldering combustion and leaf components exerted strong positive effects on PM2.5 emissions compared to flaming and branch components. Fuel load was positively correlated with emissions, while moisture content showed a significant negative effect. Notably, the model identified a significant negative quadratic effect for moisture content, indicating a non-linear inhibitory trend as moisture increases. While interspecific differences among the seven shrubs did not significantly affect EFs suggesting that physical fuel traits exert a more consistent influence than species-specific genetic backgrounds, complex interactions were captured. These include a negative synergistic effect between leaves and smoldering, and a positive interaction between moisture content and leaves that significantly amplified emissions. This research bridges the gap between physical fuel traits and chemical smoke production, providing a high-resolution tool for refining global biomass burning emission inventories and assisting international forest management in similar temperate biomes. Full article
Show Figures

Figure 1

19 pages, 4660 KB  
Article
Analysis of Grounding Schemes and Machine Learning-Based Fault Detection in Hybrid AC/DC Distribution System
by Zeeshan Haider, Shehzad Alamgir, Muhammad Ali, S Jarjees Ul Hassan and Arif Mehdi
Electricity 2026, 7(1), 11; https://doi.org/10.3390/electricity7010011 - 2 Feb 2026
Abstract
The increasing integration of hybrid AC/DC networks in modern power systems introduces new challenges in fault detection and grounding scheme design, necessitating advanced techniques for stable and reliable operation. This paper investigates fault detection and grounding schemes in hybrid AC/DC networks using a [...] Read more.
The increasing integration of hybrid AC/DC networks in modern power systems introduces new challenges in fault detection and grounding scheme design, necessitating advanced techniques for stable and reliable operation. This paper investigates fault detection and grounding schemes in hybrid AC/DC networks using a machine learning (ML) approach to enhance accuracy, speed, and adaptability. Traditional methods often struggle with the dynamic and complex nature of hybrid systems, leading to delayed or incorrect fault identification. To address this, we propose a data-driven ML framework that leverages features such as voltage, current, and frequency characteristics for real-time detection and classification of faults. Additionally, the effectiveness of various grounding schemes is analyzed under different fault conditions to ensure system stability and safety. Simulation results on a hybrid AC/DC test network demonstrate the superior performance of the proposed ML-based fault detection method compared to conventional techniques, achieving high precision, recall, and robustness against noise and varying operating conditions. The findings highlight the potential of ML in improving fault management and grounding strategy optimization for future hybrid power grids. Full article
Show Figures

Figure 1

53 pages, 3892 KB  
Systematic Review
Research Advances in Maize Crop Disease Detection Using Machine Learning and Deep Learning Approaches
by Thangavel Murugan, Nasurudeen Ahamed Noor Mohamed Badusha, Nura Shifa Musa, Eiman Mubarak Masoud Alahbabi, Ruqayyah Ali Ahmed Alyammahi, Abebe Belay Adege, Afedi Abdi and Zemzem Mohammed Megersa
Computers 2026, 15(2), 99; https://doi.org/10.3390/computers15020099 (registering DOI) - 2 Feb 2026
Abstract
Recent developments in machine learning (ML) and deep learning (DL) algorithms have introduced a new approach to the automatic detection of plant diseases. However, existing reviews of this field tend to be broader than maize-focused and do not offer a comprehensive synthesis of [...] Read more.
Recent developments in machine learning (ML) and deep learning (DL) algorithms have introduced a new approach to the automatic detection of plant diseases. However, existing reviews of this field tend to be broader than maize-focused and do not offer a comprehensive synthesis of how ML and DL methods have been applied to image-based detection of maize leaf disease. Following the PRISMA guidelines, this systematic review of 102 peer-reviewed papers published between 2017 and 2025 examined methods and approaches used to classify leaf images for detecting disease in maize plants. The 102 papers were categorized by disease type, dataset, task, learning approach, architecture, and metrics used to evaluate performance. The analysis results indicate that traditional ML methods, when combined with effective feature engineering, can achieve classification accuracies of approximately 79–100%, while DL, especially CNNs, provide consistent, superior classification performance on controlled benchmark datasets (up to 99.9%). Yet in “real field” conditions, many of these improvements typically decrease or disappear due to dataset bias, environmental factors, and limited evaluation. The review provides a comprehensive overview of emerging trends, performance trade-offs, and ongoing gaps in developing field-ready, explainable, reliable, and scalable maize leaf disease detection systems. Full article
Show Figures

Figure 1

22 pages, 1796 KB  
Article
Untargeted Metabolomics and Multivariate Data Processing to Reveal SARS-CoV-2 Specific VOCs for Canine Biodetection
by Diego Pardina Aizpitarte, Eider Larrañaga, Ugo Mayor, Ainhoa Isla, Jose Manuel Amigo and Luis Bartolomé
Chemosensors 2026, 14(2), 35; https://doi.org/10.3390/chemosensors14020035 - 2 Feb 2026
Abstract
The exceptional olfactory capabilities of trained detection dogs demonstrate high potential for identifying infectious diseases. However, safe and standardized canine training requires specific chemical targets rather than infectious biological samples. This study presents an analytical proof-of-concept combining untargeted metabolomics and machine learning (ML) [...] Read more.
The exceptional olfactory capabilities of trained detection dogs demonstrate high potential for identifying infectious diseases. However, safe and standardized canine training requires specific chemical targets rather than infectious biological samples. This study presents an analytical proof-of-concept combining untargeted metabolomics and machine learning (ML) to decode the specific odor profile of SARS-CoV-2 infection. Using headspace solid-phase microextraction gas chromatography coupled with time-of-flight mass spectrometry (HS-SPME-GC/MS-ToF), axillary sweat samples from 76 individuals (SARS-CoV-2 positive and negative) were analyzed. Data preprocessing and dimensionality reduction were performed to feed a Partial Least Squares-Discriminant Analysis (PLS-DA) model. The optimized model achieved an overall accuracy of 79%, with a specificity of 89% and sensitivity of 70% in external validation, identifying a specific panel of Volatile Organic Compounds (VOCs) as discriminant biomarkers. The optimized model achieved robust classification performance, effectively distinguishing infected individuals from healthy controls based solely on their volatilome. Six VOCs were found to be consistently presented in COVID-19-positive individuals. These compounds were proposed as candidate odor signatures for constructing artificial training aids to standardize and accelerate the training of detection dogs. This study establishes a framework where machine learning-driven metabolomic profiling directly informs biological sensor training, offering a novel synergy between ML and biological intelligence in disease detection. This study establishes a scalable computational framework to translate biological samples into chemical data, providing the scientific basis for designing safe, synthetic K9 training aids for future infectious disease outbreaks without the biosafety risks associated with handling live pathogens. Full article
Show Figures

Figure 1

29 pages, 1055 KB  
Article
An Interpretable Multi-Dataset Learning Framework for Breast Cancer Prediction Using Clinical and Biomedical Tabular Data
by Muhammad Ateeb Ather, Abdullah, Zulaikha Fatima, José Luis Oropeza Rodríguez and Grigori Sidorov
Computers 2026, 15(2), 97; https://doi.org/10.3390/computers15020097 (registering DOI) - 2 Feb 2026
Abstract
Despite the numerous advancements that have been made in the treatment and management of breast cancer, it continues to be a source of mortality in millions of female patients across the world each year; thus, there is a need for proper and reliable [...] Read more.
Despite the numerous advancements that have been made in the treatment and management of breast cancer, it continues to be a source of mortality in millions of female patients across the world each year; thus, there is a need for proper and reliable diagnostic assistance tools that are quite effective in the prediction of the disease in its early stages. In our research, in addition to the proposed framework, a comprehensive comparative assessment of traditional machine learning, deep learning, and transformer-based models has been performed to predict breast cancer in a multi-dataset environment. For the purpose of improving diversity and reducing any possible biases in the datasets, our research combined three datasets: breast cancer biopsy morphological (WDBC), biochemical and metabolic properties (Coimbra), and cytological attributes (WBCO), intended to expose the model to heterogeneous feature domains and evaluate robustness under distributional variation. Based on the thorough process conducted in our research involving traditional machine learning models, deep learning models, and transformers, a proposed hybrid architecture referred to as the FT-Transformer-Attention-LSTM-SVM framework has been designed and developed in our research that is compatible and well-suited for the processing and analysis of the given tabular biomedical datasets. The proposed design in the research has an effective performance of 99.90% accuracy in the primary test environment, an average mean accuracy of 99.56% in the 10-fold cross-validation process, and an accuracy of 98.50% in the WBCO test environment, with a considerable margin of significance less than 0.0001 in the paired two-sample t-test comparison process. In our research, we have performed the importance assessment in conjunction with the SHAP and LIME techniques and have demonstrated that its decisions are based upon important attributes such as the values of the attributes of radius, concavity, perimeter, compactness, and texture. Additionally, the research has conducted the ablation test and has proved the importance of the designed FT-Transformer. Full article
(This article belongs to the Special Issue Machine and Deep Learning in the Health Domain (3rd Edition))
Show Figures

Figure 1

33 pages, 3142 KB  
Article
Exploring Net Promoter Score with Machine Learning and Explainable Artificial Intelligence: Evidence from Brazilian Broadband Services
by Matheus Raphael Elero, Rafael Henrique Palma Lima, Bruno Samways dos Santos and Gislaine Camila Lapasini Leal
Computers 2026, 15(2), 96; https://doi.org/10.3390/computers15020096 (registering DOI) - 2 Feb 2026
Abstract
Despite the growing use of machine learning (ML) for analyzing service quality and customer satisfaction, empirical studies based on Brazilian broadband telecommunications data remain scarce. This is especially true for those who leverage publicly available nationwide datasets. To address this gap, this study [...] Read more.
Despite the growing use of machine learning (ML) for analyzing service quality and customer satisfaction, empirical studies based on Brazilian broadband telecommunications data remain scarce. This is especially true for those who leverage publicly available nationwide datasets. To address this gap, this study investigates customer satisfaction with broadband internet services in Brazil using supervised ML and explainable artificial intelligence (XAI) techniques applied to survey data collected by ANATEL between 2017 and 2020. Customer satisfaction was operationalized using the Net Promoter Score (NPS) reference scale, and three modifications in the scale were evaluated: (i) a binary model grouping ratings ≥ 8 as satisfied and ≤7 as dissatisfied (portion of the neutrals as satisfied and another as dissatisfied); (ii) a binary model excluding neutral responses (ratings 7–8) and retaining only detractors (≤6) and promoters (≥9); and (iii) a multiclass model following the original NPS categories (detractors, neutrals, and promoters). Nine ML classifiers were trained and validated on tabular data for each formulation. Model interpretability was addressed through SHAP and feature importance analysis using tree-based models. The results indicate that Histogram Gradient Boosting and Random Forest achieve the most robust and stable performance, particularly in binary classification scenarios. The analysis of neutral customers reveals classification ambiguity, showing scores of “7” tend toward dissatisfaction, while scores of “8” tend toward satisfaction. XAI analyses consistently identify browsing speed, billing accuracy, fulfillment of advertised service conditions, and connection stability as the most influential predictors of satisfaction. By combining predictive performance with model transparency, this study provides computational evidence for explainable satisfaction modeling and highlights the value of public regulatory datasets for reproducible ML research. Full article
Show Figures

Graphical abstract

22 pages, 1267 KB  
Article
Application of a Hybrid Explainable ML–MCDM Approach for the Performance Optimisation of Self-Compacting Concrete Containing Crumb Rubber and Calcium Carbide Residue
by Musa Adamu, Shrirang Madhukar Choudhari, Ashwin Raut, Yasser E. Ibrahim and Sylvia Kelechi
J. Compos. Sci. 2026, 10(2), 76; https://doi.org/10.3390/jcs10020076 (registering DOI) - 2 Feb 2026
Abstract
The combined incorporation of crumb rubber (CR) and calcium carbide residue (CCR) in self-compacting concrete (SCC) induces competing and nonlinear effects on its fresh and hardened properties, making the simultaneous optimisation of workability, strength, durability, and stability challenging. CR reduces density and enhances [...] Read more.
The combined incorporation of crumb rubber (CR) and calcium carbide residue (CCR) in self-compacting concrete (SCC) induces competing and nonlinear effects on its fresh and hardened properties, making the simultaneous optimisation of workability, strength, durability, and stability challenging. CR reduces density and enhances deformability and flow stability but adversely affects strength, whereas CCR improves particle packing, cohesiveness, and early-age strength up to an optimal replacement level. To systematically address these trade-offs, this study proposes an integrated multi-criteria decision-making (MCDM)–explainable machine learning–global optimisation framework for sustainable SCC mix design. A composite performance score encompassing fresh, mechanical, durability, and thermal indicators is constructed using a weighted MCDM scheme and learned through surrogate machine-learning models. Three learners—glmnet, ranger, and xgboost—are tuned using v-fold cross-validation, with xgboost demonstrating the highest predictive fidelity. Given the limited experimental dataset, bootstrap out-of-bag validation is employed to ensure methodological robustness. Model-agnostic interpretability, including permutation importance, SHAP analysis, and partial-dependence plots, provides physical transparency and reveals that CR and CCR exert strong yet opposing influences on the composite response, with CCR partially compensating for CR-induced strength losses through enhanced cohesiveness. Differential Evolution (DEoptim) applied to the trained surrogate identifies optimal material proportions within a continuous design space, favouring mixes with 5–10% CCR and limited CR content. Among the evaluated mixes, 0% CR–5% CCR delivers the best overall performance, while 20% CR–5% CCR offers a balanced strength–ductility compromise. Overall, the proposed framework provides a transparent, interpretable, and scalable data-driven pathway for optimising SCC incorporating circular materials under competing performance requirements. Full article
(This article belongs to the Special Issue Sustainable Cementitious Composites)
Show Figures

Figure 1

24 pages, 3245 KB  
Article
Experimental Data-Driven Machine Learning Analysis for Prediction of PCM Charging and Discharging Behavior in Portable Cold Storage Systems
by Raju R. Yenare, Chandrakant Sonawane, Anindita Roy and Stefano Landini
Sustainability 2026, 18(3), 1467; https://doi.org/10.3390/su18031467 - 2 Feb 2026
Abstract
The problem of the post-harvest loss of perishable products has been a loss facing food security, especially in areas that lack adequate cold chain facilities. This issue is directly connected with sustainability objectives because post-harvest losses are the major source of food wastage, [...] Read more.
The problem of the post-harvest loss of perishable products has been a loss facing food security, especially in areas that lack adequate cold chain facilities. This issue is directly connected with sustainability objectives because post-harvest losses are the major source of food wastage, unneeded energy use, and related greenhouse gas emissions. Cold storage with phase-change material (PCM) is a promising alternative, as it aims at stabilizing temperatures and enhancing energy consumption, but current analyses of performance have been conducted through experimental testing and computational fluid dynamic (CFD) simulations, which are precise but computationally expensive. To handle this drawback, the current work constructs a machine learning predictive model to predict the dynamics of charging and discharging temperature of PCM cold storage systems. Four regression models, namely Random Forest, Extreme Gradient Boosting (XGBoost), Support Vector Regression (SVR), and K-Nearest Neighbors (KNNs), were trained and tested on experimental datasets that were obtained for varying storage layouts. The various error and accuracy measures used to determine model performance comprised MSE, MAE, R2, MAPE, and percentage accuracy. The findings suggest that Random Forest provides the best accuracy during both the charging and the discharging process, with the highest R2 values of over 0.98 and with minimal mean absolute errors. The KNN model was competitive in the discharge process, especially in cases of consistent thermal recovery patterns, and XGBoost was consistent in layout accuracy. However, SVR had relatively lower robustness, particularly when using nonlinear charged dynamics. Among the evaluated models, the Random Forest algorithm demonstrated the highest predictive accuracy, achieving coefficients of determination (R2) exceeding 0.98 for both charging and discharging processes, with mean absolute errors below 0.6 °C during charging and 0.3 °C during discharging. This paper has proven that machine learning is an efficient surrogate to CFD and experimental-only methods and can be used to predict the thermal behavior of PCM quickly and precisely. The proposed framework will allow for developing cold storage systems based on energy efficiency, low costs, and sustainability, especially in the context of decentralized and resource-limited agricultural supply chains, with the help of quick and data-focused forecasting of PCM thermal behavior. Full article
Show Figures

Figure 1

17 pages, 304 KB  
Article
Bioinspired Deep Neural Networks for Predicting Income-Reporting Discontinuities in the Chilean Student Loan Program
by Yoslandy Lazo, Álex Paz, Broderick Crawford, Carlos Valle, Eduardo Rodriguez-Tello, Ricardo Soto, José Barrera-Garcia, Felipe Cisternas-Caneo and Benjamín López Cortés
Biomimetics 2026, 11(2), 98; https://doi.org/10.3390/biomimetics11020098 (registering DOI) - 1 Feb 2026
Abstract
This study addresses discontinuity prediction in income reporting within the Chilean student loan program, a critical event for credit risk management. Although the literature has incorporated machine learning models to anticipate non-compliance behavior, a gap remains in the development of methodologically robust evaluations [...] Read more.
This study addresses discontinuity prediction in income reporting within the Chilean student loan program, a critical event for credit risk management. Although the literature has incorporated machine learning models to anticipate non-compliance behavior, a gap remains in the development of methodologically robust evaluations that integrate nonlinear imputation, imbalance correction, and repeated validation across multiple partitions. To address this need, a complete pipeline was implemented on a dataset of 22,303 records, including MissForest imputation, SMOTE-based balancing, and a comparative assessment of a biologically inspired Deep Neural Network (DNN) and a Random Forest (RF) classifier used as a classical baseline model, evaluated across 35 stratified partitions. The results show that the bioinspired DNN, as the primary focus of this study, consistently outperforms the RF in metrics such as AUC (0.9991 vs 0.9709), F1-score (0.9966 vs 0.9497), and agreement measures, while also exhibiting lower variability across partitions. The interpretability analysis indicates that financial variables account for the greatest influence on predictions, whereas demographic variables contribute minimally. The study provides a replicable and robust methodology aligned with risk analysis practices in student credit contexts. Full article
Show Figures

Graphical abstract

Back to TopTop