MDPI - Publisher of Open Access Journals

42 pages, 1422 KB

Open AccessArticle

Exploring Handwriting-Based Biomarkers for Alzheimer’s Disease: Identifying Discriminative Features and Tasks to Enhance Diagnostic Accuracy

by Cansu Akyürek Anacur, Asuman Günay Yılmaz and Bekir Dizdaroğlu

Diagnostics 2026, 16(5), 697; https://doi.org/10.3390/diagnostics16050697 - 26 Feb 2026

Viewed by 232

Abstract

Background/Objectives: This study proposes a comprehensive classification framework for the automatic detection of Alzheimer’s disease using handwriting data. An enriched feature space is constructed by combining 18 baseline features extracted from raw handwriting signals with 30 additional features derived from established handwriting analysis [...] Read more.

Background/Objectives: This study proposes a comprehensive classification framework for the automatic detection of Alzheimer’s disease using handwriting data. An enriched feature space is constructed by combining 18 baseline features extracted from raw handwriting signals with 30 additional features derived from established handwriting analysis studies, resulting in a total of 48 features. To enhance clinical practicality, a task reduction analysis is conducted by comparing the full dataset containing 25 handwriting tasks with a reduced dataset comprising 14 selected tasks. Methods: The proposed framework employs a two-stage evaluation strategy involving four feature selection methods (Random Forest Feature Importance, Extreme Gradient Boosting Feature Importance, L1 Regularization and Recursive Feature Elimination), three normalization techniques (Unnormalized, Min–Max and Z-Score), and five baseline machine learning classifiers (Random Forest, Logistic Regression, Multilayer Perceptron, XGBoost and Support Vector Machines). In the second stage, a dynamic ensemble learning strategy is introduced, where the most effective classifiers are adaptively selected for each cross-validation fold and integrated using soft and hard voting schemes. Results: The experimental results demonstrate that reducing the number of tasks leads to an improvement in average classification accuracy from 79.47% to 81.03%, while simultaneously decreasing training time and memory consumption by approximately 40% and 35%, respectively. The highest classification performance, achieving an accuracy of 94.20%, is obtained using the Hard Ensemble combined with L1-based feature selection. Conclusions: These findings highlight that the joint use of enriched feature representations, task reduction, and dynamic ensemble learning provides an effective and computationally efficient solution for handwriting-based Alzheimer’s disease detection. Full article

(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

► Show Figures

Figure 1

32 pages, 16166 KB

Open AccessArticle

A Multimodal Ensemble-Based Framework for Detecting Fake News Using Visual and Textual Features

by Muhammad Abdullah, Hongying Zan, Arifa Javed, Muhammad Sohail, Orken Mamyrbayev, Zhanibek Turysbek, Hassan Eshkiki and Fabio Caraffini

Mathematics 2026, 14(2), 360; https://doi.org/10.3390/math14020360 - 21 Jan 2026

Viewed by 744

Abstract

Detecting fake news is essential in natural language processing to verify news authenticity and prevent misinformation-driven social, political, and economic disruptions targeting specific groups. A major challenge in multimodal fake news detection is effectively integrating textual and visual modalities, as semantic gaps and [...] Read more.

Detecting fake news is essential in natural language processing to verify news authenticity and prevent misinformation-driven social, political, and economic disruptions targeting specific groups. A major challenge in multimodal fake news detection is effectively integrating textual and visual modalities, as semantic gaps and contextual variations between images and text complicate alignment, interpretation, and the detection of subtle or blatant inconsistencies. To enhance accuracy in fake news detection, this article introduces an ensemble-based framework that integrates textual and visual data using ViLBERT’s two-stream architecture, incorporates VADER sentiment analysis to detect emotional language, and uses Image–Text Contextual Similarity to identify mismatches between visual and textual elements. These features are processed through the Bi-GRU classifier, Transformer-XL, DistilBERT, and XLNet, combined via a stacked ensemble method with soft voting, culminating in a T5 metaclassifier that predicts the outcome for robustness. Results on the Fakeddit and Weibo benchmarking datasets show that our method outperforms state-of-the-art models, achieving up to 96% and 94% accuracy in fake news detection, respectively. This study highlights the necessity for advanced multimodal fake news detection systems to address the increasing complexity of misinformation and offers a promising solution. Full article

(This article belongs to the Special Issue Research on Machine Learning, Data Mining, Natural Language Processes, and Optimization Methods)

► Show Figures

Figure 1

31 pages, 1485 KB

Open AccessArticle

Explainable Multi-Modal Medical Image Analysis Through Dual-Stream Multi-Feature Fusion and Class-Specific Selection

by Naeem Ullah, Ivanoe De Falco and Giovanna Sannino

AI 2026, 7(1), 30; https://doi.org/10.3390/ai7010030 - 16 Jan 2026

Cited by 1 | Viewed by 749

Abstract

Effective and transparent medical diagnosis relies on accurate and interpretable classification of medical images across multiple modalities. This paper introduces an explainable multi-modal image analysis framework based on a dual-stream architecture that fuses handcrafted descriptors with deep features extracted from a custom MobileNet. [...] Read more.

Effective and transparent medical diagnosis relies on accurate and interpretable classification of medical images across multiple modalities. This paper introduces an explainable multi-modal image analysis framework based on a dual-stream architecture that fuses handcrafted descriptors with deep features extracted from a custom MobileNet. Handcrafted descriptors include frequency-domain and texture features, while deep features are summarized using 26 statistical metrics to enhance interpretability. In the fusion stage, complementary features are combined at both the feature and decision levels. Decision-level integration combines calibrated soft voting, weighted voting, and stacking ensembles with optimized classifiers, including decision trees, random forests, gradient boosting, and logistic regression. To further refine performance, a hybrid class-specific feature selection strategy is proposed, combining mutual information, recursive elimination, and random forest importance to select the most discriminative features for each class. This hybrid selection approach eliminates redundancy, improves computational efficiency, and ensures robust classification. Explainability is provided through Local Interpretable Model-Agnostic Explanations, which offer transparent details about the ensemble model’s predictions and link influential handcrafted features to clinically meaningful image characteristics. The framework is validated on three benchmark datasets, i.e., BTTypes (brain MRI), Ultrasound Breast Images, and ACRIMA Retinal Fundus Images, demonstrating generalizability across modalities (MRI, ultrasound, retinal fundus) and disease categories (brain tumor, breast cancer, glaucoma). Full article

(This article belongs to the Special Issue Digital Health: AI-Driven Personalized Healthcare and Applications)

► Show Figures

Figure 1

33 pages, 2607 KB

Open AccessArticle

Efficient Blended Models for Analysis and Detection of Neuropathic Pain from EEG Signals Using Machine Learning

by Sunil Kumar Prabhakar, Keun-Tae Kim and Dong-Ok Won

Bioengineering 2026, 13(1), 67; https://doi.org/10.3390/bioengineering13010067 - 7 Jan 2026

Viewed by 491

Abstract

Due to the damage happening in the nervous system, neuropathic pain occurs and it affects the quality of life of the patient to a great extent. Therefore, some clinical evaluations are required to assess the diagnostic outcomes precisely. A lot of information about [...] Read more.

Due to the damage happening in the nervous system, neuropathic pain occurs and it affects the quality of life of the patient to a great extent. Therefore, some clinical evaluations are required to assess the diagnostic outcomes precisely. A lot of information about the activities of the brain is provided by Electroencephalography (EEG) signals and neuropathic pain can be assessed and classified with the aid of EEG and machine learning. In this work, two approaches are proposed in terms of efficient blended models for the classification of neuropathic pain through EEG signals. In the first blended model, once the features are extracted using Discrete Wavelet Transform (DWT), statistical features, and Fuzzy C-Means (FCM) clustering techniques, the features are selected using Grey Wolf Optimization (GWO), Feature Correlation Clustering Technique (FCCT), F-test, and Bayesian Optimization Algorithm (BOA) and it is classified with the help of three hybrid classification models like Spider Monkey Optimization-based Gradient Boosting Machine (SMO-GBM) classifier, hybrid deep kernel learning with Support Vector Machine (DKL-SVM) classifier, and CatBoost classifier. In the second blended model, once the features are extracted, the features are selected using Hybrid Feature Selection—Majority Voting System (HFS-MVS), Hybrid Salp Swarm Optimization—Particle Swarm Optimization (SSO-PSO), Pearson Correlation Coefficient (PCC), and Mutual Information (MI) and it is classified with the help of three hybrid classification models like Partial Least Squares (PLS) variant classification models combined with Kernel-based SVM, ensemble classification model with soft voting strategy, and Extreme Gradient Boosting (XGBoost) classifier. The proposed blended models are evaluated on a publicly available dataset and the best results are shown when the FCM features are selected with SSO-PSO feature selection technique and classified with Polynomial Kernel-based PLS-SVM Classifier, reporting a high classification accuracy of 92.68% in this work. Full article

(This article belongs to the Section Biosignal Processing)

► Show Figures

Figure 1

22 pages, 1021 KB

Open AccessArticle

A Multiclass Machine Learning Framework for Detecting Routing Attacks in RPL-Based IoT Networks Using a Novel Simulation-Driven Dataset

by Niharika Panda and Supriya Muthuraman

Future Internet 2026, 18(1), 35; https://doi.org/10.3390/fi18010035 - 7 Jan 2026

Viewed by 523

Abstract

The use of resource-constrained Low-Power and Lossy Networks (LLNs), where the IPv6 Routing Protocol for LLNs (RPL) is the de facto routing standard, has increased due to the Internet of Things’ (IoT) explosive growth. Because of the dynamic nature of IoT deployments and [...] Read more.

The use of resource-constrained Low-Power and Lossy Networks (LLNs), where the IPv6 Routing Protocol for LLNs (RPL) is the de facto routing standard, has increased due to the Internet of Things’ (IoT) explosive growth. Because of the dynamic nature of IoT deployments and the lack of in-protocol security, RPL is still quite susceptible to routing-layer attacks like Blackhole, Lowered Rank, version number manipulation, and Flooding despite its lightweight architecture. Lightweight, data-driven intrusion detection methods are necessary since traditional cryptographic countermeasures are frequently unfeasible for LLNs. However, the lack of RPL-specific control-plane semantics in current cybersecurity datasets restricts the use of machine learning (ML) for practical anomaly identification. In order to close this gap, this work models both static and mobile networks under benign and adversarial settings by creating a novel, large-scale multiclass RPL attack dataset using Contiki-NG’s Cooja simulator. To record detailed packet-level and control-plane activity including DODAG Information Object (DIO), DODAG Information Solicitation (DIS), and Destination Advertisement Object (DAO) message statistics along with forwarding and dropping patterns and objective-function fluctuations, a protocol-aware feature extraction pipeline is developed. This dataset is used to evaluate fifteen classifiers, including Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), k-Nearest Neighbors (KNN), Random Forest (RF), Extra Trees (ET), Gradient Boosting (GB), AdaBoost (AB), and XGBoost (XGB) and several ensemble strategies like soft/hard voting, stacking, and bagging, as part of a comprehensive ML-based detection system. Numerous tests show that ensemble approaches offer better generalization and prediction performance. With overfitting gaps less than 0.006 and low cross-validation variance, the Soft Voting Classifier obtains the greatest accuracy of 99.47%, closely followed by XGBoost with 99.45% and Random Forest with 99.44%. Full article

(This article belongs to the Special Issue Industrial Internet of Things (IIoT): Trends and Technologies—2nd Edition)

► Show Figures

Graphical abstract

18 pages, 2708 KB

Open AccessArticle

Interpretable Ensemble Machine Learning for Liquefaction Risk Prediction

by Doszhan Tuzelbayev, Sung-Woo Moon, Minho Lee, Shynggys Abdialim, Elijah Adebayonle Aremu, Alfrendo Satyanaga and Jong Kim

Infrastructures 2025, 10(11), 304; https://doi.org/10.3390/infrastructures10110304 - 11 Nov 2025

Viewed by 682

Abstract

This paper presents a comprehensive machine learning (ML) framework for predicting liquefaction risk, a crucial aspect of seismic hazard assessment. A benchmark geotechnical dataset with multi-dimensional input features was used to evaluate several ML classifiers, followed by hyperparameter optimization through stratified 5-fold cross-validation. [...] Read more.

This paper presents a comprehensive machine learning (ML) framework for predicting liquefaction risk, a crucial aspect of seismic hazard assessment. A benchmark geotechnical dataset with multi-dimensional input features was used to evaluate several ML classifiers, followed by hyperparameter optimization through stratified 5-fold cross-validation. Optimized models were combined into a soft Voting Ensemble to enhance stability and accuracy of liquefaction potential prediction. The proposed ensemble model achieved a mean accuracy of 90.12% and a recall of 97.23%, outperforming individual models in most folds. The ensemble’s effectiveness was further evidenced by its precision-recall (PR) and receiver operating characteristic (ROC) curves, with areas under the curve (AUC) of 0.962 and 0.931, respectively—closely matching those of the Gradient Boosting classifier, indicating comparable discriminatory performance. Additionally, SHapley Additive exPlanations (SHAP) analysis was conducted on the ensemble model to assess contributions of each geotechnical inputs to the predictions, revealing that normalized shear wave velocity (

V_{S 1}

) as the most influential variable in liquefaction prediction. The proposed framework demonstrates a robust, interpretable, and performance-consistent approach for liquefaction risk assessment. Full article

(This article belongs to the Special Issue Advances in Artificial Intelligence for Geotechnical Engineering)

► Show Figures

Figure 1

25 pages, 2322 KB

Open AccessArticle

Enhancing Cyberattack Prevention Through Anomaly Detection Ensembles and Diverse Training Sets

by Faisal Saleem S Alraddadi, Luis F. Lago-Fernández and Francisco B. Rodríguez

Computers 2025, 14(11), 477; https://doi.org/10.3390/computers14110477 - 3 Nov 2025

Viewed by 978

Abstract

A surge in global connectivity has led to an increase in cyberattacks, creating a need for improved security. A promising area of research is using machine learning to detect these attacks. Traditional two-class machine learning models can be ineffective for real-time detection, as [...] Read more.

A surge in global connectivity has led to an increase in cyberattacks, creating a need for improved security. A promising area of research is using machine learning to detect these attacks. Traditional two-class machine learning models can be ineffective for real-time detection, as attacks often represent a minority of traffic (anomaly) and fluctuate with time. This comparative study uses an ensemble of one-class classification models. First, we employed an ensemble of autoencoders with randomly generated architectures to enhance the dynamic detection of attacks, enabling each model to learn distinct aspects of the data distribution. The term ‘dynamic’ reflects the ensemble’s superior responsiveness to different attack rates without the need for retraining, offering enhanced performance compared to a static average of individual models, which we refer to as the baseline approach. Second, for comparison with the ensemble of autoencoders, we employ an ensemble of isolation forests, which also improves dynamic attack detection. We evaluated our ensemble models using the NSL-KDD dataset, testing them without the need for retraining with varying attack ratios, and comparing the results with the baseline method. Then, we investigated the impact of training data overlap among ensemble components and its effect on the detection of extremely low attack rates. The objective is to train each model within the ensemble with the minimal amount of data necessary to detect malicious traffic across varying attack rates effectively. Based on the conclusions drawn from our initial study using the NSL-KDD dataset, we re-evaluated our strategy with a modern dataset, CIC_IoT-2023, which also achieved good performance in detecting various attack rates using an ensemble of simple autoencoder models. Finally, we have observed that when distributing normal traffic data among ensemble components with a small overlap, the results show enhanced overall performance. Full article

(This article belongs to the Special Issue Using New Technologies in Cyber Security Solutions (2nd Edition))

► Show Figures

Figure 1

21 pages, 1538 KB

Open AccessArticle

SarcoNet: A Pilot Study on Integrating Clinical and Kinematic Features for Sarcopenia Classification

by Muthamil Balakrishnan, Janardanan Kumar, Jaison Jacob Mathunny, Varshini Karthik and Ashok Kumar Devaraj

Diagnostics 2025, 15(19), 2513; https://doi.org/10.3390/diagnostics15192513 - 3 Oct 2025

Viewed by 969

Abstract

Background and Objectives: Sarcopenia is a progressive loss of skeletal muscle mass and function in elderly adults, posing a significant risk of frailty, falls, and morbidity. The current study designs and evaluates SarcoNet, a novel artificial neural network (ANN)-based classification framework developed in [...] Read more.

Background and Objectives: Sarcopenia is a progressive loss of skeletal muscle mass and function in elderly adults, posing a significant risk of frailty, falls, and morbidity. The current study designs and evaluates SarcoNet, a novel artificial neural network (ANN)-based classification framework developed in order to classify Sarcopenic from non-Sarcopenic subjects using a comprehensive real-time dataset. Methods: This pilot study involved 30 subjects, who were divided into Sarcopenic and non-Sarcopenic groups based on physician assessment. The collected dataset consists of thirty-one clinical parameters like skeletal muscle mass, which is collected using various equipment such as Body Composition Analyser, along with ten kinetic features which are derived from video-based gait analysis of joint angles obtained during walking on three terrain types such as slope, steps, and parallel path. The performance of the designed ANN-based SarcoNet was benchmarked against the traditional machine learning classifiers utilised including Support Vector Machine (SVM), k-Nearest Neighbours (k-NN), and Random Forest (RF), as well as hard and soft voting ensemble classifiers. Results: SarcoNet achieved the highest overall classification accuracy of about 94%, with a specificity and precision of about 100%, an F1-score of about 92.4%, and an AUC of 0.94, outperforming all other models. The incorporation of lower-limb joint kinetics such as knee flexion, extension, ankle plantarflexion and dorsiflexion significantly enhanced predictive capability of the model and thus reflecting the functional deterioration characteristic of muscles in Sarcopenia. Conclusions: SarcoNet provides a promising AI-driven solution in Sarcopenia diagnosis, especially in low-resource healthcare settings. Future work will focus on improving the dataset, validating the model across diverse populations, and incorporating explainable AI to improve clinical adoption. Full article

(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

► Show Figures

Figure 1

27 pages, 3355 KB

Open AccessArticle

ECO-HYBRID: Sustainable Waste Classification Using Transfer Learning with Hybrid and Enhanced CNN Models

by Sharanya Shetty, Saanvi Kallianpur, Roshan Fernandes, Anisha P. Rodrigues and Vijaya Padmanabha

Sustainability 2025, 17(19), 8761; https://doi.org/10.3390/su17198761 - 29 Sep 2025

Viewed by 1947

Abstract

Effective waste management is important for reducing environmental harm, improving recycling operations, and building urban sustainability. However, accurate waste classification remains a critical challenge, as many deep learning models struggle with diverse waste types. In this study, classification accuracy is enhanced using transfer [...] Read more.

Effective waste management is important for reducing environmental harm, improving recycling operations, and building urban sustainability. However, accurate waste classification remains a critical challenge, as many deep learning models struggle with diverse waste types. In this study, classification accuracy is enhanced using transfer learning, ensemble techniques, and custom architectures. Eleven pre-trained convolutional neural networks, including ResNet-50, EfficientNet variants, and DenseNet-201, were fine-tuned to extract meaningful patterns from waste images. To further improve model performance, ensemble strategies such as weighted averaging, soft voting, and stacking were implemented, resulting in a hybrid model combining ResNet-50, EfficientNetV2-M, and DenseNet-201, which outperformed individual models. In the proposed system, two specialized architectures were developed: EcoMobileNet, an optimized MobileNetV3 Large-based model incorporating Squeeze-and-Excitation blocks for efficient mobile deployment, and EcoDenseNet, a DenseNet-201 variant enhanced with Mish activation for improved feature extraction. The evaluation was conducted on a dataset comprising 4691 images across 10 waste categories, sourced from publicly available repositories. The implementation of EcoMobileNet achieved a test accuracy of 98.08%, while EcoDenseNet reached an accuracy of 97.86%. The hybrid model also attained 98.08% accuracy. Furthermore, the ensemble stacking approach yielded the highest test accuracy of 98.29%, demonstrating its effectiveness in classifying heterogeneous waste types. By leveraging deep learning, the proposed system contributes to the development of scalable, sustainable, and automated waste-sorting solutions, thereby optimizing recycling processes and minimizing environmental impact. Full article

(This article belongs to the Special Issue Smart Cities with Innovative Solutions in Sustainable Urban Future)

► Show Figures

Figure 1

18 pages, 725 KB

Open AccessArticle

Breast Cancer Prediction Using Rotation Forest Algorithm Along with Finding the Influential Causes

by Prosenjit Das, Proshenjit Sarker, Jun-Jiat Tiang and Abdullah-Al Nahid

Bioengineering 2025, 12(10), 1020; https://doi.org/10.3390/bioengineering12101020 - 25 Sep 2025

Viewed by 2098

Abstract

Breast cancer is a widespread disease involving abnormal (uncontrolled) growth of breast tissue cells along with the formation of a tumor and metastasis. Breast cancer cases occur mostly among women. Early detection and regular screening have significantly improved survival rates. This research classifies [...] Read more.

Breast cancer is a widespread disease involving abnormal (uncontrolled) growth of breast tissue cells along with the formation of a tumor and metastasis. Breast cancer cases occur mostly among women. Early detection and regular screening have significantly improved survival rates. This research classifies breast cancer and non-breast cancer cases using machine learning algorithms based on the Breast Cancer Coimbra dataset by optimizing the classifier performance and feature selection methodology. In addition, this research identifies the influential features responsible for BC classification by using diverse counterfactual explanations. The Rotation Forest classifier algorithm is used to classify breast cancer and non-breast cancer cases. The hyperparameters of this algorithm are optimized using the Optuna optimizer. Three wrapper-based feature selection techniques (Sequential Forward Selection, Sequential Backward Selection, and Exhaustive Feature Selection) are used to select the most relevant features. An ensemble environment is also created using the best feature subsets of these methods, incorporating both soft and hard voting strategies. Experimental results show that the hard voting strategy achieves an accuracy of 85.71%, F1-score of 83.87%, precision of 92.85%, and recall of 76.47%. In contrast, the soft voting strategy obtains an accuracy of 80.00%, F1-score of 77.42%, precision of 85.71%, and recall of 70.59%. These findings demonstrate that hard voting achieves noticeably better performance. The misclassification outcomes of both strategies are explored using Diverse Counterfactual Explanations, revealing that BMI and Glucose values are most influential in predicting correct classes, whereas the HOMA, Adiponectin, and Resistin values have little influence. Full article

(This article belongs to the Section Biosignal Processing)

► Show Figures

Figure 1

20 pages, 948 KB

Open AccessArticle

High-Accuracy Classification of Parkinson’s Disease Using Ensemble Machine Learning and Stabilometric Biomarkers

by Ana Carolina Brisola Brizzi, Osmar Pinto Neto, Rodrigo Cunha de Mello Pedreiro and Lívia Helena Moreira

Neurol. Int. 2025, 17(9), 133; https://doi.org/10.3390/neurolint17090133 - 26 Aug 2025

Cited by 1 | Viewed by 1942

Abstract

Background: Accurate differentiation of Parkinson’s disease (PD) from healthy aging is crucial for timely intervention and effective management. Postural sway abnormalities are prominent motor features of PD. Quantitative stabilometry and machine learning (ML) offer a promising avenue for developing objective markers to [...] Read more.

Background: Accurate differentiation of Parkinson’s disease (PD) from healthy aging is crucial for timely intervention and effective management. Postural sway abnormalities are prominent motor features of PD. Quantitative stabilometry and machine learning (ML) offer a promising avenue for developing objective markers to support the diagnostic process. This study aimed to develop and validate high-performance ML models to classify individuals with PD and age-matched healthy older adults (HOAs) using a comprehensive set of stabilometric parameters. Methods: Thirty-seven HOAs (mean age 70 ± 6.8 years) and 26 individuals with idiopathic PD (Hoehn and Yahr stages 2–3, on medication; mean age 66 years ± 2.9 years), all aged 60–80 years, participated. Stabilometric data were collected using a force platform during quiet stance under eyes-open (EO) and eyes-closed (EC) conditions, from which 34 parameters reflecting the time- and frequency-domain characteristics of center-of-pressure (COP) sway were extracted. After data preprocessing, including mean imputation for missing values and feature scaling, three ML classifiers (Random Forest, Gradient Boosting, and Support Vector Machine) were hyperparameter-tuned using GridSearchCV with three-fold cross-validation. An ensemble voting classifier (soft voting) was constructed from these tuned models. Model performance was rigorously evaluated using 15 iterations of stratified train–test splits (70% train and 30% test) and an additional bootstrap procedure of 1000 iterations to derive reliable 95% confidence intervals (CIs). Results: Our optimized ensemble voting classifier achieved excellent discriminative power, distinguishing PD from HOAs with a mean accuracy of 0.91 (95% CI: 0.81–1.00) and a mean Area Under the ROC Curve (AUC ROC) of 0.97 (95% CI: 0.92–1.00). Importantly, feature analysis revealed that anteroposterior sway velocity with eyes open (V-AP) and total sway path with eyes closed (TOD_EC, calculated using COP displacement vectors from its mean position) are the most robust and non-invasive biomarkers for differentiating the groups. Conclusions: An ensemble ML approach leveraging stabilometric features provides a highly accurate, non-invasive method to distinguish PD from healthy aging and may augment clinical assessment and monitoring. Full article

(This article belongs to the Section Movement Disorders and Neurodegenerative Diseases)

► Show Figures

Graphical abstract

30 pages, 2687 KB

Open AccessArticle

A Multimodal Framework for Advanced Cybersecurity Threat Detection Using GAN-Driven Data Synthesis

by Nikolaos Peppes, Emmanouil Daskalakis, Theodoros Alexakis and Evgenia Adamopoulou

Appl. Sci. 2025, 15(15), 8730; https://doi.org/10.3390/app15158730 - 7 Aug 2025

Cited by 4 | Viewed by 2860

Abstract

Cybersecurity threats are becoming increasingly sophisticated, frequent, and diverse, posing a major risk to critical infrastructure, public trust, and digital economies. Traditional intrusion detection systems often struggle with detecting novel or rare attack types, particularly when data availability is limited or heterogeneous. The [...] Read more.

Cybersecurity threats are becoming increasingly sophisticated, frequent, and diverse, posing a major risk to critical infrastructure, public trust, and digital economies. Traditional intrusion detection systems often struggle with detecting novel or rare attack types, particularly when data availability is limited or heterogeneous. The current study tries to address these challenges by proposing a unified, multimodal threat detection framework that leverages the combination of synthetic data generation through Generative Adversarial Networks (GANs), advanced ensemble learning, and transfer learning techniques. The research objective is to enhance detection accuracy and resilience against zero-day, botnet, and image-based malware attacks by integrating multiple data modalities, including structured network logs and malware binaries, within a scalable and flexible pipeline. The proposed system features a dual-branch architecture: one branch uses a CNN with transfer learning for image-based malware classification, and the other employs a soft-voting ensemble classifier for tabular intrusion detection, both trained on augmented datasets generated by GANs. Experimental results demonstrate significant improvements in detection performance and false positive reduction, especially when multimodal outputs are fused using the proposed confidence-weighted strategy. The findings highlight the framework’s adaptability and practical applicability in real-world intrusion detection and response systems. Full article

(This article belongs to the Special Issue Data Mining and Machine Learning in Cybersecurity)

► Show Figures

Figure 1

28 pages, 3513 KB

Open AccessArticle

AI-Driven Anomaly Detection in Smart Water Metering Systems Using Ensemble Learning

by Maria Nelago Kanyama, Fungai Bhunu Shava, Attlee Munyaradzi Gamundani and Andreas Hartmann

Water 2025, 17(13), 1933; https://doi.org/10.3390/w17131933 - 27 Jun 2025

Cited by 1 | Viewed by 3359

Abstract

Water, the lifeblood of our planet, sustains ecosystems, economies, and communities. However, climate change and increasing hydrological variability have exacerbated global water scarcity, threatening livelihoods and economic stability. According to the United Nations, over 2 billion people currently live in water-stressed regions, a [...] Read more.

Water, the lifeblood of our planet, sustains ecosystems, economies, and communities. However, climate change and increasing hydrological variability have exacerbated global water scarcity, threatening livelihoods and economic stability. According to the United Nations, over 2 billion people currently live in water-stressed regions, a figure expected to rise significantly by 2030. To address this urgent challenge, this study proposes an AI-driven anomaly detection framework for smart water metering networks (SWMNs) using machine learning (ML) techniques and data resampling methods to enhance water conservation efforts. This research utilizes 6 years of monthly water consumption data from 1375 households from Location A, Windhoek, Namibia, and applies support vector machine (SVM), decision tree (DT), random forest (RF), and k-nearest neighbors (kNN) models within ensemble learning strategies. A significant challenge in real-world datasets is class imbalance, which can reduce model reliability in detecting abnormal patterns. To address this, we employed data resampling techniques including random undersampling (RUS), SMOTE, and SMOTEENN. Among these, SMOTEENN achieved the best overall performance for individual models, with the RF classifier reaching an accuracy of 99.5% and an AUC score of 0.998. Ensemble learning approaches also yielded strong results, with the stacking ensemble achieving 99.6% accuracy, followed by soft voting at 99.2% and hard voting at 98.1%. These results highlight the effectiveness of ensemble methods and advanced sampling techniques in improving anomaly detection under class-imbalanced conditions. To the best of our knowledge, this is the first study to explore and evaluate the combined use of ensemble learning and resampling techniques for ML-based anomaly detection in SWMNs. By integrating artificial intelligence into water systems, this work lays the foundation for scalable, secure, and efficient smart water management solutions, contributing to global efforts in sustainable water governance. Full article

(This article belongs to the Special Issue AI, Machine Learning and Digital Twin Applications in Water)

► Show Figures

Figure 1

22 pages, 8644 KB

Open AccessArticle

Privacy-Preserving Approach for Early Detection of Long-Lie Incidents: A Pilot Study with Healthy Subjects

by Riska Analia, Anne Forster, Sheng-Quan Xie and Zhiqiang Zhang

Sensors 2025, 25(12), 3836; https://doi.org/10.3390/s25123836 - 19 Jun 2025

Cited by 2 | Viewed by 1702

Abstract

(1) Background: Detecting long-lie incidents—where individuals remain immobile after a fall—is essential for timely intervention and preventing severe health consequences. However, most existing systems focus only on fall detection, neglect post-fall monitoring, and raise privacy concerns, especially in real-time, non-invasive applications; (2) Methods: [...] Read more.

(1) Background: Detecting long-lie incidents—where individuals remain immobile after a fall—is essential for timely intervention and preventing severe health consequences. However, most existing systems focus only on fall detection, neglect post-fall monitoring, and raise privacy concerns, especially in real-time, non-invasive applications; (2) Methods: This study proposes a lightweight, privacy-preserving, long-lie detection system utilizing thermal imaging and a soft-voting ensemble classifier. A low-resolution thermal camera captured simulated falls and activities of daily living (ADL) performed by ten healthy participants. Human pose keypoints were extracted using MediaPipe, followed by the computation of five handcrafted postural features. The top three classifiers—automatically selected based on cross-validation performance—formed the soft-voting ensemble. Long-lie conditions were identified through post-fall immobility monitoring over a defined period, using rule-based logic on posture stability and duration; (3) Results: The ensemble model achieved high classification performance with accuracy, precision, recall, and an F1 score of 0.98. Real-time deployment on a Raspberry Pi 5 demonstrated the system is capable of accurately detecting long-lie incidents based on continuous monitoring over 15 min, with minimal posture variation; (4) Conclusion: The proposed system introduces a novel approach to long-lie detection by integrating privacy-aware sensing, interpretable posture-based features, and efficient edge computing. It demonstrates strong potential for deployment in homecare settings. Future work includes validation with older adults and integration of vital sign monitoring for comprehensive assessment. Full article

(This article belongs to the Special Issue Sensing and Signal Processing Technologies for Outpatient Monitoring and Rehabilitation)

► Show Figures

Figure 1

24 pages, 4557 KB

Open AccessArticle

Advanced Multi-Level Ensemble Learning Approaches for Comprehensive Sperm Morphology Assessment

by Abdulsamet Aktas, Taha Cap, Gorkem Serbes, Hamza Osman Ilhan and Hakkı Uzun

Diagnostics 2025, 15(12), 1564; https://doi.org/10.3390/diagnostics15121564 - 19 Jun 2025

Cited by 5 | Viewed by 1473

Abstract

Introduction: Fertility is fundamental to human well-being, significantly impacting both individual lives and societal development. In particular, sperm morphology—referring to the shape, size, and structural integrity of sperm cells—is a key indicator in diagnosing male infertility and selecting viable sperm in assisted reproductive [...] Read more.

Introduction: Fertility is fundamental to human well-being, significantly impacting both individual lives and societal development. In particular, sperm morphology—referring to the shape, size, and structural integrity of sperm cells—is a key indicator in diagnosing male infertility and selecting viable sperm in assisted reproductive technologies such as in vitro fertilisation (IVF) and intracytoplasmic sperm injection (ICSI). However, traditional manual evaluation methods are highly subjective and inconsistent, creating a need for standardized, automated systems. Objectives: This study aims to develop a robust and fully automated sperm morphology classification framework capable of accurately identifying a wide range of morphological abnormalities, thereby minimizing observer variability and improving diagnostic support in reproductive healthcare. Methods: We propose a novel ensemble-based classification approach that combines convolutional neural network (CNN)-derived features using both feature-level and decision-level fusion techniques. Features extracted from multiple EfficientNetV2 variants are fused and classified using Support Vector Machines (SVM), Random Forest (RF), and Multi-Layer Perceptron with Attention (MLP-Attention). Decision-level fusion is achieved via soft voting to enhance robustness and accuracy. Results: The proposed ensemble framework was evaluated using the Hi-LabSpermMorpho dataset, which contains 18 distinct sperm morphology classes. The fusion-based model achieved an accuracy of 67.70%, significantly outperforming individual classifiers. The integration of multiple CNN architectures and ensemble techniques effectively mitigated class imbalance and enhanced the generalizability of the model. Conclusions: The presented methodology demonstrates a substantial improvement over traditional and single-model approaches in automated sperm morphology classification. By leveraging ensemble learning and multi-level fusion, the model provides a reliable and scalable solution for clinical decision-making in male fertility assessment. Full article

(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

► Show Figures

Figure 1

Search Results (57)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (57)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI