MDPI - Publisher of Open Access Journals

21 pages, 5351 KB

Open AccessArticle

PSO-Based Ensemble Learning Enhanced with Explainable Artificial Intelligence for Breast Glandular Dose Estimation in Mammography

by Sevgi Ünal and Remzi Gürfidan

Appl. Sci. 2026, 16(5), 2514; https://doi.org/10.3390/app16052514 - 5 Mar 2026

Viewed by 231

Abstract

Objectives: This study aims to predict patient-specific Average Glandular Dose (AGD) in mammography using machine learning-based models to support personalised radiation dose optimisation and reduce unnecessary exposure during breast cancer screening. Methods: A retrospective dataset of 671 female patients who underwent full-field digital [...] Read more.

Objectives: This study aims to predict patient-specific Average Glandular Dose (AGD) in mammography using machine learning-based models to support personalised radiation dose optimisation and reduce unnecessary exposure during breast cancer screening. Methods: A retrospective dataset of 671 female patients who underwent full-field digital mammography between 2020 and 2024 was analysed. Right craniocaudal (CC) images were used to construct a structured dataset including mAs, kVp, compressed breast thickness, air kerma (k_air), half-value layer (HVL), and breast pattern. Five regression-based machine learning models (CatBoost, Gradient Boosting, Random Forest, Extra Trees, and AdaBoost) and their Particle Swarm Optimisation (PSO)-enhanced versions were evaluated. Model performance was assessed using MSE, RMSE, MAE, MAPE, and R². SHAP analysis was applied to interpret model predictions and determine variable importance. Results: PSO integration significantly reduced prediction errors, particularly in boosting-based models. The CatBoost + PSO model achieved the best performance (RMSE = 0.0100, MAPE ≈ 1.74%, R² = 0.9846), followed by the Gradient Boosting + PSO model (R² = 0.9787). PSO reduced RMSE and MAPE by approximately 55% and 52%, respectively. SHAP analysis identified k_air, breast thickness, and breast pattern as the most influential factors affecting AGD. Conclusions: Machine learning models enhanced with PSO, especially CatBoost + PSO, provide accurate and reliable patient-specific AGD predictions. The proposed approach enables rapid and clinically applicable dose estimation and highlights breast pattern as a critical parameter influencing glandular dose, supporting personalised radiation dose optimisation in mammography. Full article

► Show Figures

Figure 1

32 pages, 19818 KB

Open AccessArticle

An Interpretable Ensemble Machine Learning Framework for Predicting the Ultimate Flexural Capacity of BFRP-Reinforced Concrete Beams

by Sebghatullah Jueyendah and Elif Ağcakoca

Polymers 2026, 18(5), 601; https://doi.org/10.3390/polym18050601 - 28 Feb 2026

Viewed by 272

Abstract

Prediction of the ultimate moment capacity (Mu) of BFRP-reinforced concrete beams is complicated by nonlinear parameter interactions and the linear-elastic response of BFRP, reducing the accuracy of conventional design models. This study develops an optimized machine learning (ML) framework incorporating random forest, extra [...] Read more.

Prediction of the ultimate moment capacity (Mu) of BFRP-reinforced concrete beams is complicated by nonlinear parameter interactions and the linear-elastic response of BFRP, reducing the accuracy of conventional design models. This study develops an optimized machine learning (ML) framework incorporating random forest, extra trees, gradient boosting, adaboost, bagging, support vector regression, histogram-based gradient boosting, and ensemble voting and stacking strategies for reliable prediction of the Mu of BFRP-reinforced concrete beams. A comprehensive database of material, geometric, reinforcement, and BFRP mechanical parameters was analyzed, and model performance was evaluated using an 80/20 train–test split and 10-fold cross-validation based on R², RMSE, MAE, and MAPE. The stacking regressor demonstrated superior predictive performance, achieving an R² of 0.999 (RMSE = 0.590) in training and an R² of 0.988 (RMSE = 2.487) in testing, indicating excellent robustness and strong generalization capability in predicting Mu. Furthermore, interpretability analyses based on SHAP, PDP, ALE, and ICE demonstrate that span length (L) and beam depth (h) constitute the governing parameters in the prediction of Mu. Unlike prior studies focused mainly on predictive accuracy, this work proposes an optimized and interpretable stacking ensemble framework that integrates explainable AI with classical flexural mechanics for physically consistent and reliable prediction of the ultimate moment capacity of BFRP-reinforced concrete beams. Full article

(This article belongs to the Special Issue Fiber-Reinforced Polymer Composites: Progress and Prospects)

► Show Figures

Graphical abstract

26 pages, 3654 KB

Open AccessArticle

From Experiment to Prediction: Machine Learning Solutions for Concrete Strength Assessment with Steel Clamps

by Panumas Saingam, Burachat Chatveera, Gritsada Sua-Iam, Preeda Chaimahawan, Chisanuphong Suthumma, Panuwat Joyklad, Qudeer Hussain and Afaq Ahmad

Buildings 2026, 16(4), 851; https://doi.org/10.3390/buildings16040851 - 20 Feb 2026

Viewed by 224

Abstract

This study examines the confined compressive strength (Fcc) of circular, square, and rectangular column geometries under varying confinement conditions. Results indicate that circular columns have the highest Fcc values, exceeding those of square and rectangular shapes. Increased confinement through clamps significantly enhances compressive [...] Read more.

This study examines the confined compressive strength (Fcc) of circular, square, and rectangular column geometries under varying confinement conditions. Results indicate that circular columns have the highest Fcc values, exceeding those of square and rectangular shapes. Increased confinement through clamps significantly enhances compressive strength. Five machine learning models, Linear Regression, Decision Tree, Random Forest, AdaBoost, and Gradient Boosting, were used to predict Fcc based on geometric and confinement parameters. Linear Regression and Decision Tree models achieved moderate predictive performance, with R² values of 0.84 and 0.83, respectively, and relatively higher error measures (RMSE, MAE, and MAPE), indicating limited ability to capture complex nonlinear relationships in the data. In contrast, ensemble-based methods demonstrated superior performance. The Random Forest model improved the coefficient of determination to 0.90 while substantially reducing all error metrics, reflecting enhanced generalization through bagging. The boosting-based approaches yielded the best results, with AdaBoost achieving the highest R² value of 0.99 and the lowest RMSE, MAE, and MAPE among all models, followed closely by Gradient Boosting with an R² of 0.98. These results confirm that ensemble learning techniques, particularly boosting algorithms, yield more accurate and robust predictions than single learners for the problem studied. Data visualization techniques, including Regression Error Characteristic curves (REC) and SHapley Additive exPlanations (SHAP) value analysis, highlighted model performance and feature importance, emphasizing the roles of confinement and geometry in compressive strength. This research demonstrates the potential of machine learning to optimize structural engineering design and suggests further exploration of alternative shapes and confinement strategies to enhance structural integrity. Full article

(This article belongs to the Special Issue Emerging Trends in Machine Learning for Structural Engineering: Innovations and Applications)

► Show Figures

Figure 1

23 pages, 8017 KB

Open AccessArticle

Individual-Aware Gradient Boosting Regression for Visual Saliency Prediction of Damaged Regions in Ancient Murals

by Naiyu Xie, Yingchun Cao and Bowen Zhang

Appl. Sci. 2026, 16(4), 2055; https://doi.org/10.3390/app16042055 - 19 Feb 2026

Viewed by 274

Abstract

Murals are vital cultural heritage assets, yet many are increasingly threatened by long-term natural weathering, environmental erosion, and human-induced damage. Given limited conservation resources, an objective method for prioritizing restoration is urgently needed. This study proposes an Individual-Aware Gradient Boosting Regression (IA-GBR) approach [...] Read more.

Murals are vital cultural heritage assets, yet many are increasingly threatened by long-term natural weathering, environmental erosion, and human-induced damage. Given limited conservation resources, an objective method for prioritizing restoration is urgently needed. This study proposes an Individual-Aware Gradient Boosting Regression (IA-GBR) approach to predict the visual saliency of damaged mural regions by integrating physical damage characteristics, spatial location, and observer identity. We construct an eye-tracking dataset containing complete fixation records from multiple participants viewing diverse mural damage types. IA-GBR employs a two-level feature fusion strategy that combines damage, spatial, and individual features within a gradient boosting residual learning framework. The experimental results demonstrate that IA-GBR consistently outperforms baseline methods, including linear and ridge regression, SVR, decision trees, random forests, AdaBoost, and multilayer perceptrons. Feature importance analysis further reveals the relative contributions of individual differences, damage size, spatial position, and semantic factors to saliency formation. The proposed framework provides data-driven support for restoration prioritization and advances perception-aware saliency modeling in cultural heritage conservation. Full article

► Show Figures

Figure 1

17 pages, 6352 KB

Open AccessProceeding Paper

Comparative Evaluation of Machine Learning Classifiers for Breast Cancer Diagnosis: A Comprehensive Statistical Analysis

by Sambit Subhankar Das, Atal Mahaprasad, Neelamadhab Padhy, Srikant Misra, Rasmita Panigrahi, Pradeep Kumar Mahapatro and Dasaradha Arangi

Eng. Proc. 2026, 124(1), 35; https://doi.org/10.3390/engproc2026124035 - 15 Feb 2026

Viewed by 249

Abstract

Content/Background: Breast cancer is one of the most fatal cancers among women around the globe. The chances of surviving this cancer increase with early tumor detection, which is necessary for effective treatment. Traditional diagnostic techniques are ineffective and time-consuming, and they yield results [...] Read more.

Content/Background: Breast cancer is one of the most fatal cancers among women around the globe. The chances of surviving this cancer increase with early tumor detection, which is necessary for effective treatment. Traditional diagnostic techniques are ineffective and time-consuming, and they yield results that may be accurate or inaccurate. Therefore, our primary objective is to determine how a machine learning model can reduce diagnostic errors and provide accurate results. Objective: The main objective of this project is to build an ML-based classification model that can help doctors detect breast cancer early and more accurately. This project also aims to provide an interactive interface for easy access in healthcare settings. Materials/Methods: For this study, twelve machine learning classification algorithms are implemented and tested: Logistic Regression, K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Decision Tree, Random Forest, Gradient Boosting, XGBOOST, Naive Bayes, AdaBoosting, Light GBM, CatBoost, and the Artificial Neural Network (ANN). This study used the Wisconsin Breast Cancer Dataset (WBCD) from the UCI ML Repository. It contains 569 patient samples and 30 features. This dataset has the following features: Radius, Texture, Area, Perimeter, Smoothness, Compactness, Concavity, and Fractional Dimension. The target variable is diagnosis, which is categorized as malignant vs. benign. Results: The fifteen models were analyzed, evaluated, and compared using five performance metrics: Accuracy, Precision, Recall, F1-Score, and AUC-ROC. Among the evaluated models, CatBoost, LoGR, and AdaBoost outperformed the others, with an Accuracy of 97.%, Precision of 97%, Recall of 97%, and AUC-ROC score of 99%. The AUC-ROC is nearly 99%, and the model has a high ability to differentiate between malignant and benign tumors. Full article

(This article belongs to the Proceedings of The 6th International Electronic Conference on Applied Sciences)

► Show Figures

Figure 1

17 pages, 2175 KB

Open AccessArticle

Machine Learning Radiomics in Computed Tomography for Prediction of Tumor and Nodal Stages in Colorectal Cancer

by Lara de Souza Moreno, Tony Alexandre Medeiros da Silva, Mayra Veloso Ayrimoraes Soares, João Luiz Azevedo de Carvalho and Fabio Pittella-Silva

Cancers 2026, 18(3), 377; https://doi.org/10.3390/cancers18030377 - 26 Jan 2026

Viewed by 447

Abstract

Background/Objectives: Accurate preoperative TN staging is essential for guiding surgical and adjuvant treatment decisions in colorectal cancer (CRC), yet conventional imaging still faces limitations in reliably distinguishing early from advanced disease. This study aimed to evaluate whether CT-based radiomics combined with machine [...] Read more.

Background/Objectives: Accurate preoperative TN staging is essential for guiding surgical and adjuvant treatment decisions in colorectal cancer (CRC), yet conventional imaging still faces limitations in reliably distinguishing early from advanced disease. This study aimed to evaluate whether CT-based radiomics combined with machine learning can noninvasively predict both tumor (T) and nodal (N) stages of CRC, and to identify which feature groups most contribute to each task. Methods: Fifty-three patients (55 tumors) with histologically confirmed CRC who underwent preoperative contrast-enhanced CT were retrospectively analyzed. A total of 107 radiomic features were extracted using PyRadiomics version 3.1.0, including shape, first-order, and texture features. Multiple preprocessing strategies—z-score normalization, PCA, and SMOTE—were tested across 11 machine learning classifiers. Results: For T staging, logistic regression using shape-based features achieved a mean sensitivity of 0.721, a specificity of 0.68, a balanced accuracy of 0.70, and an AUC of 0.751. For N staging, the AdaBoost model using texture-based features achieved a sensitivity of 0.742, a specificity of 0.622, a balanced accuracy of 0.682, and an AUC of 0.750. Shape features predominantly contributed to T prediction, while texture matrices drove N prediction, reflecting morphological and microstructural correlates of invasiveness and lymphatic dissemination. Conclusions: CT-based radiomics can quantitatively capture both morphological and textural patterns of tumor behavior, providing a noninvasive framework for preoperative TN staging in CRC. Full article

(This article belongs to the Special Issue Radiomics and Imaging in Cancer Analysis)

► Show Figures

Graphical abstract

17 pages, 4604 KB

Open AccessArticle

Machine Learning Predictions of the Flexural Response of Low-Strength Reinforced Concrete Beams with Various Longitudinal Reinforcement Configurations

by Batuhan Cem Öğe, Muhammet Karabulut, Hakan Öztürk and Bulent Tugrul

Buildings 2026, 16(2), 433; https://doi.org/10.3390/buildings16020433 - 20 Jan 2026

Viewed by 379

Abstract

There are almost no studies that investigate the flexural behavior of existing reinforced concrete (RC) beams with insufficient concrete strength using machine learning methods. This study investigates the flexural response of low-strength concrete (LSC) RC beams reinforced exclusively with steel rebars, focusing on [...] Read more.

There are almost no studies that investigate the flexural behavior of existing reinforced concrete (RC) beams with insufficient concrete strength using machine learning methods. This study investigates the flexural response of low-strength concrete (LSC) RC beams reinforced exclusively with steel rebars, focusing on the effectiveness of three different longitudinal reinforcement configurations. Nine beams, each measuring 150 × 200 × 1100 mm and cast with C10-grade low-strength concrete, were divided into three groups according to their reinforcement layout: Group 1 (L2L) with two Ø12 mm rebars, Group 2 (L3L) with three Ø12 mm rebars, and Group 3 (F10L3L) with three Ø10 mm rebars. All specimens were tested under three-point bending to evaluate their load–deflection characteristics and failure mechanisms. The experimental findings were compared with ML approaches. To enhance predictive understanding, several ML regression models were developed and trained using the experimental datasets. Among them, the Light Gradient Boosting, K Neighbors Regressor and Adaboost Regressor exhibited the best predictive performance, estimating beam deflections with R² values of 0.89, 0.90, 0.94, 0.74, 0.84, 0.64, 0.70, 0.82, and 0.72, respectively. The results highlight that the proposed ML models effectively capture the nonlinear flexural behavior of RC beams and that longitudinal reinforcement configuration plays a significant role in the flexural performance of low-strength concrete beams, providing valuable insights for both design and structural assessment. Full article

(This article belongs to the Section Construction Management, and Computers & Digitization)

► Show Figures

Figure 1

15 pages, 6524 KB

Open AccessArticle

Applying the Ensemble and Metaheuristic Algorithm to Predict the Flexural Characteristics of Ice

by Chengxi Lu and Xiangyu Han

Materials 2026, 19(2), 333; https://doi.org/10.3390/ma19020333 - 14 Jan 2026

Viewed by 244

Abstract

The stability of ice structures in cold regions and polar environments has been increasingly challenged by global warming and climate change, making the accurate estimation of ice flexural properties essential. However, the flexural failure process of ice is highly complex, and the calculated [...] Read more.

The stability of ice structures in cold regions and polar environments has been increasingly challenged by global warming and climate change, making the accurate estimation of ice flexural properties essential. However, the flexural failure process of ice is highly complex, and the calculated flexural properties are influenced by multiple factors. Hence, several data-driven artificial intelligence models were developed to predict flexural strength, using classification and regression tree (CART), AdaBoost, and Random Forest methods, while the Elitist Ant System (EAS) was applied to optimize model parameters. The EAS procedure converged rapidly within ten iterations and effectively enhanced overall model performance. Compared with the single CART model, ensemble approaches exhibited higher prediction accuracy and better generalization, with AdaBoost achieving the best performance (R² = 0.736). Feature-importance analysis indicated that the testing method and specimen geometry had the greatest influence on the results, highlighting the importance of careful control of experimental conditions. The proposed ensemble–metaheuristic framework provides an efficient tool for predicting the mechanical behavior of ice and offers useful support for stability assessments of ice structures under changing climatic conditions. Full article

(This article belongs to the Special Issue Fracture and Fatigue of Materials Based on Machine Learning)

► Show Figures

Graphical abstract

10 pages, 1829 KB

Open AccessProceeding Paper

Machine Learning Based Agricultural Price Forecasting for Major Food Crops in India Using Environmental and Economic Factors

by P. Ankit Krishna, Gurugubelli V. S. Narayana, Siva Krishna Kotha and Debabrata Pattnayak

Biol. Life Sci. Forum 2025, 54(1), 7; https://doi.org/10.3390/blsf2025054007 - 12 Jan 2026

Viewed by 932

Abstract

The contemporary agricultural market is profoundly volatile, where agricultural prices are based on a complex supply chain, climatic irregularity or unscheduled market demand. Prices of crops need to be predicted in a reliable and timely manner for farmers, policy-makers and other stakeholders to [...] Read more.

The contemporary agricultural market is profoundly volatile, where agricultural prices are based on a complex supply chain, climatic irregularity or unscheduled market demand. Prices of crops need to be predicted in a reliable and timely manner for farmers, policy-makers and other stakeholders to take evidence-based decisions ultimately for the benefit towards sustainable agriculture and economic sustainability. Objective: The objective of this study is to develop and evaluate a comprehensive machine learning model for predicting agricultural prices incorporating logistic, economic and environmental considerations. It is the desire to make agriculture more profitable by building simple and accurate forecasting models. Methods: An assorted dataset was collected, which covers major factors to constitute the dataset of temperature, rainfall, fertiliser use, pest and disease attack level, cost of transportation, market demand-supply ratio and regional competitiveness. The data was subjected to pre-processing and feature extraction for quality control/quality assurance. Several machine learning models (Linear Regression, Support Vector Machines, AdaBoost, Random Forest, and XGBoost) were trained and evaluated using performance metrics such as R2 score, Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE). Results: Out of the model approaches that were analysed, predictive performance was superior for XGBoost (with an R2 Score of 0.94, RMSE of 12.8 and MAE of 8.6). To generate accurate predictions, the ability to account for complex non-linear relationships between market and environmental information was necessary. Conclusions: The forecast model of the XGBoost-based prediction system is reliable, of low complexity and widely applicable to large-scale real-time forecasting of agricultural monitoring. The model substantially reduces the uncertainty of price forecasting, and does so by including multivariate environmental and economic aspects that permit more profitable management practices in a schedule for future sustainable agriculture. Full article

(This article belongs to the Proceedings of The 3rd International Online Conference on Agriculture)

► Show Figures

Figure 1

23 pages, 3238 KB

Open AccessArticle

Agricultural Injury Severity Prediction Using Integrated Data-Driven Analysis: Global Versus Local Explainability Using SHAP

by Omer Mermer, Yanan Liu, Charles A. Jennissen, Milan Sonka and Ibrahim Demir

Safety 2026, 12(1), 6; https://doi.org/10.3390/safety12010006 - 8 Jan 2026

Viewed by 483

Abstract

Despite the agricultural sector’s consistently high injury rates, formal reporting is often limited, leading to sparse national datasets that hinder effective safety interventions. To address this, our study introduces a comprehensive framework leveraging advanced ensemble machine learning (ML) models to predict and interpret [...] Read more.

Despite the agricultural sector’s consistently high injury rates, formal reporting is often limited, leading to sparse national datasets that hinder effective safety interventions. To address this, our study introduces a comprehensive framework leveraging advanced ensemble machine learning (ML) models to predict and interpret the severity of agricultural injuries. We use a unique, manually curated dataset of over 2400 agricultural incidents from AgInjuryNews, a public repository of news reports detailing incidents across the United States. We evaluated six ensemble models, including Gradient Boosting (GB), eXtreme Grading Boosting (XGB), Light Gradient Boosting Machine (LightGBM), Adaptive Boosting (AdaBoost), Histogram-based Gradient Boosting Regression Trees (HistGBRT), and Random Forest (RF), for their accuracy in classifying injury outcomes as fatal or non-fatal. A key contribution of our work is the novel integration of explainable artificial intelligence (XAI), specifically SHapley Additive exPlanations (SHAP), to overcome the “black-box” nature of complex ensemble models. The models demonstrated strong predictive performance, with most achieving an accuracy of approximately 0.71 and an F1-score of 0.81. Through global SHAP analysis, we identified key factors influencing injury severity across the dataset, such as the presence of helmet use, victim age, and the type of injury agent. Additionally, our application of local SHAP analysis revealed how specific variables like location and the victim’s role can have varying impacts depending on the context of the incident. These findings provide actionable, context-aware insights for developing targeted policy and safety interventions for a range of stakeholders, from first responders to policymakers, offering a powerful tool for a more proactive approach to agricultural safety. Full article

(This article belongs to the Special Issue Farm Safety, 2nd Edition)

► Show Figures

Figure 1

22 pages, 1021 KB

Open AccessArticle

A Multiclass Machine Learning Framework for Detecting Routing Attacks in RPL-Based IoT Networks Using a Novel Simulation-Driven Dataset

by Niharika Panda and Supriya Muthuraman

Future Internet 2026, 18(1), 35; https://doi.org/10.3390/fi18010035 - 7 Jan 2026

Viewed by 515

Abstract

The use of resource-constrained Low-Power and Lossy Networks (LLNs), where the IPv6 Routing Protocol for LLNs (RPL) is the de facto routing standard, has increased due to the Internet of Things’ (IoT) explosive growth. Because of the dynamic nature of IoT deployments and [...] Read more.

The use of resource-constrained Low-Power and Lossy Networks (LLNs), where the IPv6 Routing Protocol for LLNs (RPL) is the de facto routing standard, has increased due to the Internet of Things’ (IoT) explosive growth. Because of the dynamic nature of IoT deployments and the lack of in-protocol security, RPL is still quite susceptible to routing-layer attacks like Blackhole, Lowered Rank, version number manipulation, and Flooding despite its lightweight architecture. Lightweight, data-driven intrusion detection methods are necessary since traditional cryptographic countermeasures are frequently unfeasible for LLNs. However, the lack of RPL-specific control-plane semantics in current cybersecurity datasets restricts the use of machine learning (ML) for practical anomaly identification. In order to close this gap, this work models both static and mobile networks under benign and adversarial settings by creating a novel, large-scale multiclass RPL attack dataset using Contiki-NG’s Cooja simulator. To record detailed packet-level and control-plane activity including DODAG Information Object (DIO), DODAG Information Solicitation (DIS), and Destination Advertisement Object (DAO) message statistics along with forwarding and dropping patterns and objective-function fluctuations, a protocol-aware feature extraction pipeline is developed. This dataset is used to evaluate fifteen classifiers, including Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), k-Nearest Neighbors (KNN), Random Forest (RF), Extra Trees (ET), Gradient Boosting (GB), AdaBoost (AB), and XGBoost (XGB) and several ensemble strategies like soft/hard voting, stacking, and bagging, as part of a comprehensive ML-based detection system. Numerous tests show that ensemble approaches offer better generalization and prediction performance. With overfitting gaps less than 0.006 and low cross-validation variance, the Soft Voting Classifier obtains the greatest accuracy of 99.47%, closely followed by XGBoost with 99.45% and Random Forest with 99.44%. Full article

(This article belongs to the Special Issue Industrial Internet of Things (IIoT): Trends and Technologies—2nd Edition)

► Show Figures

Graphical abstract

32 pages, 1816 KB

Open AccessArticle

Pragmatic Models for Detection of Hypertension Using Ballistocardiograph Signals and Machine Learning

by Sunil Kumar Prabhakar and Dong-Ok Won

Bioengineering 2026, 13(1), 43; https://doi.org/10.3390/bioengineering13010043 - 30 Dec 2025

Viewed by 464

Abstract

To identify hypertension, Ballistocardiograph (BCG) signals can be primarily utilized. The BCG signal must be thoroughly understood and interpreted so that its application in the classification process could become clearer and more distinct. Various unhealthy habits such as excess consumption of alcohol and [...] Read more.

To identify hypertension, Ballistocardiograph (BCG) signals can be primarily utilized. The BCG signal must be thoroughly understood and interpreted so that its application in the classification process could become clearer and more distinct. Various unhealthy habits such as excess consumption of alcohol and tobacco, accompanied by a lack of good diet and a sedentary lifestyle, lead to hypertension. Common symptoms of hypertension include chest pain, shortness of breath, blurred vision, mood swings, frequent urination, etc. In this work, two pragmatic models are proposed for the detection of hypertension using BCG signals and machine learning models. The first model uses K-means clustering, the maximum overlap discrete wavelet transform (MODWT) and the Empirical Wavelet Transform (EWT) techniques for feature extraction, followed by the Binary Tunicate Swarm Algorithm (BTSA) and Information Gain (IG) for feature selection, as well as two efficient hybrid classifiers such as the Hybrid AdaBoost–-Maximum Uncertainty Linear Discriminant Analysis (MULDA) classifier and the Hybrid AdaBoost–Random Forest (RF) classifier for the classification of BCG signals. The second model uses Principal Component Analysis (PCA), Kernel Principal Component Analysis (KPCA) and the Random Feature Mapping (RFM) technique for feature extraction, followed by IG and the Aquila Optimization Algorithm (AOA) for feature selection, as well as two versatile hybrid classifiers such as the Hybrid AutoRegressive Integrated Moving Average (ARIMA)–AdaBoost classifier and the Time-weighted Hybrid AdaBoost–Support Vector Machine (TW-HASVM) classifier for the classification of BCG signals. The proposed methodology was tested on a publicly available BCG dataset, and the best results were obtained when the KPCA feature extraction technique was used with the AOA feature selection technique and classified using the Hybrid ARIMA–AdaBoost classifier, reporting a good classification accuracy of 96.89%. Full article

(This article belongs to the Section Biosignal Processing)

► Show Figures

Figure 1

16 pages, 2489 KB

Open AccessArticle

Prediction of Breast Radiation Absorbed Dose Chest CT Examinations Using Machine Learning Techniques

by Sevgi Ünal, Remzi Gürfidan, Merve Gürsoy Bulut and Mustafa Fazıl Gelal

Tomography 2025, 11(12), 142; https://doi.org/10.3390/tomography11120142 - 16 Dec 2025

Viewed by 472

Abstract

Background/Objectives: The breast is a highly radiosensitive organ that is directly exposed to ionizing radiation during chest computed tomography (CT) examinations. Excessive radiation exposure increases the risk of radiation-induced malignancies, highlighting the importance of accurate and patient-specific dose estimation. This study aims [...] Read more.

Background/Objectives: The breast is a highly radiosensitive organ that is directly exposed to ionizing radiation during chest computed tomography (CT) examinations. Excessive radiation exposure increases the risk of radiation-induced malignancies, highlighting the importance of accurate and patient-specific dose estimation. This study aims to estimate the effective radiation dose absorbed by the breast during chest CT examinations using a machine learning (ML)-based personalized prediction approach. Methods: In this retrospective study, a total of 653 female patients who underwent both mammography and chest CT between 2020 and 2024 were included. A structured database was created incorporating demographic and anatomical parameters, including body weight, height, body mass index (BMI), and breast thickness (mm) obtained from mammography, along with dose length product (DLP) values from chest CT scans. Five regression-based ML algorithms—CatBoost, Gradient Boosting, Extra Trees, AdaBoost, and Random Forest—were implemented to predict breast radiation dose. Model performance was evaluated using Mean Squared Error (MSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and the Coefficient of Determination (R²). Results: Among the evaluated models, the CatBoost algorithm optimized with Particle Swarm Optimization (CatBoostPSO) achieved the best overall predictive performance, yielding the lowest MSE (0.3795), MAE (0.3846), and MAPE (4.37%), along with the highest R² value (0.9875). CatBoost and Gradient Boosting models demonstrated predictions most closely aligned with ground truth values, indicating that ensemble-based and dynamically optimized models are particularly effective for breast dose estimation. Conclusions: The proposed machine learning framework enables rapid, accurate, and clinically applicable estimation of breast radiation dose during chest CT examinations. This patient-specific approach has strong potential to support personalized radiation dose monitoring and optimization strategies, contributing to improved radiation safety in clinical practice. Full article

► Show Figures

Figure 1

23 pages, 2048 KB

Open AccessArticle

Robust Ensemble-Based Model and Web Application for Nitrogen Content Prediction in Hydrochar from Sewage Sludge

by Esraa Q. Shehab, Nadia Moneem Al-Abdaly, Mohammed E. Seno, Hamza Imran and Antonio Albuquerque

Water 2025, 17(24), 3468; https://doi.org/10.3390/w17243468 - 6 Dec 2025

Cited by 1 | Viewed by 571

Abstract

Hydrochar is a carbon-rich material produced through the hydrothermal carbonization (HTC) of wet biomass such as sewage sludge. Its nitrogen content is a critical quality parameter, influencing its suitability for use as a soil amendment and its potential environmental impacts. This study develops [...] Read more.

Hydrochar is a carbon-rich material produced through the hydrothermal carbonization (HTC) of wet biomass such as sewage sludge. Its nitrogen content is a critical quality parameter, influencing its suitability for use as a soil amendment and its potential environmental impacts. This study develops a high-accuracy ensemble machine learning framework to predict the nitrogen content of hydrochar derived from sewage sludge based on feedstock compositions and HTC process conditions. Four ensemble algorithms—Gradient Boosting Regression Trees (GBRTs), AdaBoost, Light Gradient Boosting Machine (LightGBM), and eXtreme Gradient Boosting (XGBoost)—were trained using an 80/20 train–test split and evaluated through standard statistical metrics. GBRT and XGBoost provided the best performance, achieving R² values of 0.993 and 0.989 and RMSE values of 0.169 and 0.213 during training, while maintaining strong predictive capabilities on the test dataset. SHAP analyses identified nitrogen content, ash content, and heating temperature as the most influential predictors of hydrochar nitrogen levels. Predicting nitrogen behaviour during HTC is environmentally relevant, as the improper management of nitrogen-rich hydrochar residues can contribute to nitrogen leaching, eutrophication, and disruption of aquatic biogeochemical cycles. The proposed ensemble-based modelling approach therefore offers a reliable tool for optimizing HTC operations, supporting sustainable sludge valorisation, and reducing environmental risks associated with nitrogen emissions. Full article

(This article belongs to the Section Water Quality and Contamination)

► Show Figures

Figure 1

27 pages, 11265 KB

Open AccessArticle

Using Machine Learning Methods to Predict Cognitive Age from Psychophysiological Tests

by Daria D. Tyurina, Sergey V. Stasenko, Konstantin V. Lushnikov and Maria V. Vedunova

Healthcare 2025, 13(24), 3193; https://doi.org/10.3390/healthcare13243193 - 5 Dec 2025

Viewed by 494

Abstract

Background/Objectives: This paper presents the results of predicting chronological age from psychophysiological tests using machine learning regressors. Methods: Subjects completed a series of psychological tests measuring various cognitive functions, including reaction time and cognitive conflict, short-term memory, verbal functions, and color and spatial [...] Read more.

Background/Objectives: This paper presents the results of predicting chronological age from psychophysiological tests using machine learning regressors. Methods: Subjects completed a series of psychological tests measuring various cognitive functions, including reaction time and cognitive conflict, short-term memory, verbal functions, and color and spatial perception. The sample included 99 subjects, 68 percent of whom were men and 32 percent were women. Based on the test results, 43 features were generated. To determine the optimal feature selection method, several approaches were tested alongside the regression models using MAE,

R^{2}

, and

C V_R^{2}

metrics. SHAP and Permutation Importance (via Random Forest) delivered the best performance with 10 features. Features selected through Permutation Importance were used in subsequent analyses. To predict participants’ age from psychophysiological test results, we evaluated several regression models, including Random Forest, Extra Trees, Gradient Boosting, SVR, Linear Regression, LassoCV, RidgeCV, ElasticNetCV, AdaBoost, and Bagging. Model performance was compared using the determination coefficient (

R^{2}

) and mean absolute error (MAE). Cross-validated performance (

C V_R^{2}

) was estimated via 5-fold cross-validation. To assess metric stability and uncertainty, bootstrapping (1000 resamples) was applied to the test set, yielding distributions of MAE and RMSE from which mean values and 95% confidence intervals were derived. Results: The study identified RidgeCV with winsorization and standardization as the best model for predicting cognitive age, achieving a mean absolute error of 5.7 years and an

R^{2}

of 0.60. Feature importance was evaluated using SHAP values and permutation importance. SHAP analysis showed that stroop_time_color and stroop_var_attempt_time were the strongest predictors, followed by several task-timing features with moderate contributions. Permutation importance confirmed this ranking, with these two features causing the largest performance drop when permuted. Partial dependence plots further indicated clear positive relationships between these key features and predicted age. Correlation analysis stratified by sex revealed that most features were significantly associated with age, with stronger effects generally observed in men. Conclusions: Feature selection revealed Stroop timing measures and task-related metrics from math and campimetry tests as the strongest predictors, reflecting core cognitive processes linked to aging. The results underscore the value of careful outlier handling, feature selection, and interpretable regularized models for analyzing psychophysiological data. Future work should include longitudinal studies and integration with biological markers to further improve clinical relevance. Full article

(This article belongs to the Special Issue AI-Driven Healthcare Insights)

► Show Figures

Figure 1

Search Results (358)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (358)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI