MDPI - Publisher of Open Access Journals

34 pages, 7649 KB

Open AccessArticle

SMOTE-Data-Augmented Machine Learning for Enhancing Individual Tree Biomass Estimation Using UAV LiDAR

by Sina Jarahizadeh and Bahram Salehi

Remote Sens. 2026, 18(5), 729; https://doi.org/10.3390/rs18050729 - 28 Feb 2026

Viewed by 278

Estimating individual tree Above-Ground Biomass (AGB) is essential for assessing ecological functions and carbon storage in both forest and urban environments. Traditional field-based methods, such as plot measurements, are costly and impractical for large-scale applications. However, satellite- and aerial-based techniques lack the spatial [...] Read more.

Estimating individual tree Above-Ground Biomass (AGB) is essential for assessing ecological functions and carbon storage in both forest and urban environments. Traditional field-based methods, such as plot measurements, are costly and impractical for large-scale applications. However, satellite- and aerial-based techniques lack the spatial resolution for individual-tree-level analysis. Unmanned Aerial Vehicle (UAV) Light Detection and Ranging (LiDAR) data, combined with machine learning (ML), offers a powerful alternative for detailed tree structure measurement and AGB estimation. Leveraging advances in deep-learning-based individual tree detection and geometric structure estimation including Height (H), Surface Area (SA), Volume (V), and Crown Width (CW), this study develops ML regression models for estimating individual tree AGB. We explore three objectives: (1) evaluating four regression models including Random Forest (RF), Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), and Feed-Forward Neural Network (FFNN); (2) sensitivity assessment of different geometric feature combinations on model accuracy; and (3) improving model robustness using Synthetic Minority Over-sampling Technique (SMOTE) data augmentation for addressing imbalanced data. Results show that the RF model outperforms others that achieved the lowest RMSE and most balanced residual distribution. CW was the strongest single predictor of AGB and, in combination with H, yielded to the most accurate results. This combination improved RMSE and R² by 14.2% and 89.3% with respect to single-variable-based models. The integration of SMOTE and RF further improved model performance since it lowered RMSE by 225.6 kg (~22.1%) and increased R² by 0.76 (~49.0%). This was particularly evident in underrepresented low and high AGB ranges. The proposed RF-SMOTE approach is a cost-effective and scalable approach for generating high-quality ground truth data to enable large-scale satellite-based biomass estimation and help forest carbon accounting and planning in cities and forests. Full article

(This article belongs to the Special Issue UAV Applications for Forest Management: Wood Volume, Biomass, and Mapping (Second Edition))

► Show Figures

Figure 1

20 pages, 580 KB

Open AccessArticle

A Maturation-Aware Machine Learning Framework for Screening the Nutritional Status of Adolescents

by Hatem Ghouili, Zouhaier Farhani, Narimen Yousfi, Halil İbrahim Ceylan, Amel Dridi, Andrea de Giorgio, Nicola Luigi Bragazzi, Noomen Guelmami, Ismail Dergaa and Anissa Bouassida

Nutrients 2026, 18(4), 660; https://doi.org/10.3390/nu18040660 - 17 Feb 2026

Viewed by 519

Abstract

Background: Malnutrition in adolescents remains a significant public health issue worldwide, with undernutrition and overweight often coexisting. Accurate nutritional screening during adolescence is complicated by variability in biological maturation and class imbalance, particularly among underweight adolescents. Objective: This study aims to develop and [...] Read more.

Background: Malnutrition in adolescents remains a significant public health issue worldwide, with undernutrition and overweight often coexisting. Accurate nutritional screening during adolescence is complicated by variability in biological maturation and class imbalance, particularly among underweight adolescents. Objective: This study aims to develop and validate machine learning models for classifying the nutritional status of adolescents, accounting for class imbalance and biological maturation, and to evaluate model stability and variable importance at different stages of peak height velocity (PHV). Methods: In this cross-sectional study, 4232 adolescents aged 11 to 18 years were recruited from nine educational institutions in Tunisia. Their nutritional status was classified according to the International Obesity Task Force (IOTF) BMI thresholds into three categories: underweight (14.4%), normal weight (68.3%), and overweight (17.2%). Ten anthropometric, behavioral, and maturation-related predictors were analyzed. Six supervised machine learning algorithms were evaluated using a 70/30 stratified split between training and test sets, with five-fold cross-validation. Class imbalance was addressed by ROSE combined with cost-sensitive learning. Model performance was assessed using accuracy, Cohen’s kappa coefficient, macro F1 score, sensitivity, specificity, and AUC. Results: The cost-sensitive Random Forest (RF) model achieved the best overall performance, with an accuracy of 0.830, a macro F1 score of 0.767, a macro-AUC of 0.921, and a macro- sensitivity of 0.743. The class-specific sensitivities were 0.70 (underweight), 0.91 (normal weight), and 0.62 (overweight), with no major misclassification between the extreme categories. Performance remained stable across the different maturation phases (accuracy from 0.823 to 0.839), with optimal discrimination in the pre-PHV (macro-AUC = 0.936; sensitivity for underweight = 0.82) and post-PHV (macro-AUC = 0.931) periods. Body mass was the main predictor (importance = 1.00), followed by waist circumference (0.34–0.53). The importance of age for classifying underweight increased significantly from the pre-PHV (0.10) to the post-PHV (0.75) period. A two-stage hierarchical model further improved underweight detection (stage 1 AUC = 0.911; sensitivity = 0.732). Conclusions: A cost-sensitive RF model, combined with ROSE, provides robust classification of adolescents’ nutritional status maturation, significantly improving underweight detection while preserving overall accuracy. This approach is particularly well-suited to public health screening in schools as a first-stage assessment that requires clinical confirmation and promotes a maturation-aware interpretation of nutritional risk among adolescents. Full article

(This article belongs to the Special Issue Nutrition-Based Counseling and Interventions for Chronic Disease Prevention)

► Show Figures

Graphical abstract

28 pages, 2032 KB

Open AccessArticle

Addressing Class Imbalance in Fetal Health Classification: Rigorous Benchmarking of Multi-Class Resampling Methods on Cardiotocography Data

by Zainab Subhi Mahmood Hawrami, Mehmet Ali Cengiz and Emre Dünder

Diagnostics 2026, 16(3), 485; https://doi.org/10.3390/diagnostics16030485 - 5 Feb 2026

Viewed by 550

Abstract

Background/Objectives: Fetal health is essential in prenatal care, influencing both maternal and fetal outcomes. Cardiotocography (CTG) monitors uterine contractions and fetal heart rate, yet manual interpretation exhibits significant inter-examiner variability. Machine learning offers automated alternatives; however, class imbalance in CTG datasets where [...] Read more.

Background/Objectives: Fetal health is essential in prenatal care, influencing both maternal and fetal outcomes. Cardiotocography (CTG) monitors uterine contractions and fetal heart rate, yet manual interpretation exhibits significant inter-examiner variability. Machine learning offers automated alternatives; however, class imbalance in CTG datasets where pathological cases constitute less than 10% leads to poor detection of minority classes. This study aims to provide the first systematic benchmark comparing five resampling strategies across seven classifier families for multi-class CTG classification, evaluated using imbalance-aware metrics rather than overall accuracy alone. Methods: Seven machine learning models were employed: Naïve Bayes (NB), Random Forest (RF), Linear Discriminant Analysis (LDA), k-Nearest Neighbors (KNN), Linear Support Vector Machine (SVM), Multinomial Logistic Regression (MLR), and Multi-Layer Perceptron (MLP). To address class imbalance, we evaluated the original unbalanced dataset (base) and five resampling methods: SMOTE, BSMOTE, ADASYN, NearMiss, and SCUT. Performance was evaluated on a held-out test set using Balanced Accuracy (BACC), Macro-F1, the Macro-Matthews Correlation Coefficient (Macro-MCC), and Macro-Averaged ROC-AUC. We also report per-class ROC curves. Results: Among all models, RF proved most reliable. Training on the original distribution (base) yielded the highest BACC (0.9118), whereas RF combined with BSMOTE provided the strongest class-balanced performance (Macro-MCC = 0.8533, Macro-F1 = 0.9073) with a near-perfect ROC-AUC (approximately 0.986–0.989). Overall, resampling effects proved model dependent. While some classifiers achieved optimal performance on the natural class distribution, oversampling techniques, particularly SMOTE and BSMOTE, demonstrated significant improvements in minority class discrimination and class-balanced metrics across multiple model families. Notably, certain models benefited substantially from resampling, exhibiting enhanced Macro-F1, BACC, and minority class recall without sacrificing overall accuracy. Conclusions: These findings establish robust, model-agnostic baselines for CTG-based fetal health screening. They highlight that strategic oversampling can translate improved minority class discrimination into clinically meaningful performance gains, supporting deployment in cost-sensitive and threshold-aware clinical settings. Full article

(This article belongs to the Special Issue Artificial Intelligence in Biomedical Diagnostics and Analysis 2025)

► Show Figures

Figure 1

20 pages, 3081 KB

Open AccessArticle

Fractional-Order Bioimpedance Modelling for Early Detection of Tissue Freezing in Cryogenic and Thermal Medical Applications

by Noelia Vaquero-Gallardo, Herminio Martínez-García and Oliver Millán-Blasco

Sensors 2026, 26(2), 603; https://doi.org/10.3390/s26020603 - 15 Jan 2026

Viewed by 498

Abstract

Cryotherapy and radiofrequency (RF) treatments modulate tissue temperature to induce therapeutic effects; however, improper application can result in thermal injury. Traditional temperature-based monitoring methods rely on multiple thermal sensors whose accuracy strongly depends on their number and spatial positioning, often failing to detect [...] Read more.

Cryotherapy and radiofrequency (RF) treatments modulate tissue temperature to induce therapeutic effects; however, improper application can result in thermal injury. Traditional temperature-based monitoring methods rely on multiple thermal sensors whose accuracy strongly depends on their number and spatial positioning, often failing to detect early tissue crystallization. This study introduces a fractional order bioimpedance modelling framework for the early detection of tissue freezing during cryogenic and thermal medical treatments, with the feasibility and effectiveness of this approach having been reported in our prior publications. While bioimpedance spectroscopy itself is a well-est. The corresponablished technique in biomedical engineering, its novel application to predict and identify premature freezing events provides a new pathway for safe and efficient energy-based therapies. Fractional-order models derived from the Cole family accurately reproduce the complex electrical behavior of biological tissues using fewer parameters than classical integer-order models, thus reducing both hardware requirements and computational cost. Experimental impedance data from human abdominal, gluteal, and femoral regions were modelled to extract fractional parameters that serve as sensitive indicators of phase-transition onset. The results demonstrate that the proposed approach enables real-time identification of freezing-induced electrical transitions, offering a physiologically grounded alternative to conventional temperature-based monitoring. Furthermore, the fractional order bioimpedance method exhibits high reproducibility and selectivity, and its analytical figures of merit, including the limits of detection and quantification, support its use for reliable real-time tissue monitoring and early injury detection. Overall, the proposed fractional order bioimpedance framework enhances both safety and control precision in cryogenic and thermal medical applications. Full article

(This article belongs to the Special Issue Feature Papers in Biosensors Section 2025)

► Show Figures

Figure 1

20 pages, 3268 KB

Open AccessArticle

Portable Electronic Olfactometer for Non-Invasive Screening of Canine Ehrlichiosis: A Proof-of-Concept Study Using Machine Learning

by Silvana Valentina Durán Cotrina, Cristhian Manuel Durán Acevedo and Jeniffer Katerine Carrillo Gómez

Vet. Sci. 2026, 13(1), 88; https://doi.org/10.3390/vetsci13010088 - 15 Jan 2026

Viewed by 431

Abstract

Canine ehrlichiosis, caused by Ehrlichia canis, represents a relevant challenge in veterinary medicine, particularly in resource-limited settings where access to laboratory-based diagnostics may be constrained. This pilot and exploratory study aimed to evaluate the feasibility of a portable electronic olfactometer as a [...] Read more.

Canine ehrlichiosis, caused by Ehrlichia canis, represents a relevant challenge in veterinary medicine, particularly in resource-limited settings where access to laboratory-based diagnostics may be constrained. This pilot and exploratory study aimed to evaluate the feasibility of a portable electronic olfactometer as a non-invasive screening approach, based on the analysis of volatile organic compounds (VOCs) present in breath, saliva, and hair samples from dogs. Signals were acquired using an array of eight metal-oxide (MOX) gas sensors (MQ and TGS series). After preprocessing, principal component analysis (PCA) was applied for dimensionality reduction, and the resulting features were analyzed using supervised machine-learning classifiers, including AdaBoost, support vector machines (SVM), k-nearest neighbors (k-NN), and Random Forests (RF). A total of 38 dogs (19 PCR-confirmed infected cases and 19 controls) were analyzed, generating 114 samples evenly distributed across the three biological matrices. Among the evaluated models, SVM showed the most consistent performance, particularly for saliva samples, achieving an accuracy, sensitivity, and precision of 94.7% (AUC = 0.964). In contrast, breath and hair samples showed lower discriminative performance. Given the limited sample size and the exploratory nature of the study, these results should be interpreted as preliminary; nevertheless, they suggest that electronic olfactometry may represent a complementary, low-cost, non-invasive screening tool for future research on canine ehrlichiosis, rather than a standalone diagnostic method. Full article

(This article belongs to the Special Issue AI, Deep Learning and Machine Learning in Veterinary Clinical Applications)

► Show Figures

Graphical abstract

18 pages, 1993 KB

Open AccessArticle

Prediction, Uncertainty Quantification, and ANN-Assisted Operation of Anaerobic Digestion Guided by Entropy Using Machine Learning

by Zhipeng Zhuang, Xiaoshan Liu, Jing Jin, Ziwen Li, Yanheng Liu, Adriano Tavares and Dalin Li

Entropy 2025, 27(12), 1233; https://doi.org/10.3390/e27121233 - 5 Dec 2025

Viewed by 594

Abstract

Anaerobic digestion (AD) is a nonlinear and disturbance-sensitive process in which instability is often induced by feedstock variability and biological fluctuations. To address this challenge, this study develops an entropy-guided machine learning framework that integrates parameter prediction, uncertainty quantification, and entropy-based evaluation of [...] Read more.

Anaerobic digestion (AD) is a nonlinear and disturbance-sensitive process in which instability is often induced by feedstock variability and biological fluctuations. To address this challenge, this study develops an entropy-guided machine learning framework that integrates parameter prediction, uncertainty quantification, and entropy-based evaluation of AD operation. Using six months of industrial data (~10,000 samples), three models—support vector machine (SVM), random forest (RF), and artificial neural network (ANN)—were compared for predicting biogas yield, fermentation temperature, and volatile fatty acid (VFA) concentration. The ANN achieved the highest performance (accuracy = 96%, F1 = 0.95, root mean square error (RMSE) = 1.2 m³/t) and also exhibited the lowest prediction error entropy, indicating reduced uncertainty compared to RF and SVM. Feature entropy and permutation analysis consistently identified feed solids, organic matter, and feed rate as the most influential variables (>85% contribution), in agreement with the RF importance ranking. When applied as a real-time prediction and decision-support tool in the plant (“sensor → prediction → programmable logic controller (PLC)/operation → feedback”), the ANN model was associated with a reduction in gas-yield fluctuation from approximately ±18% to ±5%, a decrease in process entropy, and an improvement in operational stability of about 23%. Techno-economic and life-cycle assessments further indicated a 12–15 USD/t lower operating cost, 8–10% energy savings, and 5–7% CO₂ reduction compared with baseline operation. Overall, this study demonstrates that combining machine learning with entropy-based uncertainty analysis offers a reliable and interpretable pathway for more stable and low-carbon AD operation. Full article

(This article belongs to the Special Issue Entropy in Machine Learning Applications, 2nd Edition)

► Show Figures

Figure 1

35 pages, 5223 KB

Open AccessArticle

Physics-Based Machine Learning for Vibration Mitigation by Open Buried Trenches

by Luís Pereira, Luís Godinho, Fernando G. Branco, Paulo da Venda Oliveira, Pedro Alves Costa and Aires Colaço

Appl. Sci. 2025, 15(21), 11609; https://doi.org/10.3390/app152111609 - 30 Oct 2025

Viewed by 666

Abstract

Mitigating ground vibrations from sources like vehicles and construction operations poses significant challenges, often relying on computationally intensive numerical methods such as Finite Element Methods (FEM) or Boundary Element Methods (BEM) for analysis. This study addresses these limitations by developing and evaluating Machine [...] Read more.

Mitigating ground vibrations from sources like vehicles and construction operations poses significant challenges, often relying on computationally intensive numerical methods such as Finite Element Methods (FEM) or Boundary Element Methods (BEM) for analysis. This study addresses these limitations by developing and evaluating Machine Learning (ML) methodologies for the rapid and accurate prediction of Insertion Loss (IL), a critical parameter for assessing the effectiveness of open trenches as vibration barriers. A comprehensive database was systematically generated through high-fidelity numerical simulations, capturing a wide range of geometric, elastic, and physical configurations of a stratified geotechnical system. Three distinct ML strategies—Artificial Neural Networks (ANN), Support Vector Machines (SVM), and Random Forests (RF)—were initially assessed for their predictive capabilities. Subsequently, a Meta-RF stacking ensemble model was developed, integrating the predictions of these base methods. Model performance was rigorously evaluated using complementary statistical metrics (RMSE, MAE, NMAE, R), substantiated by in-depth statistical analyses (normality tests, Bootstrap confidence intervals, Wilcoxon tests) and an analysis of input parameter sensitivity. The results clearly demonstrate the high efficacy of Machine Learning (ML) in accurately predicting IL across diverse, realistic scenarios. While all models performed strongly, the RF and the Meta-RF stacking ensemble models consistently emerged as the most robust and accurate predictors. They exhibited superior generalization capabilities and effectively mitigated the inherent biases found in the ANN and SVM models. This work is intended to function as a proof-of-concept and offers promising avenues for overcoming the significant computational costs associated with traditional simulation methods, thereby enabling rapid design optimization and real-time assessment of vibration mitigation measures in geotechnical engineering. Full article

► Show Figures

Figure 1

21 pages, 2200 KB

Open AccessArticle

Segmented vs. Non-Segmented Heart Sound Classification: Impact of Feature Extraction and Machine Learning Models

by Ceyda Boz and Yucel Kocyigit

Appl. Sci. 2025, 15(20), 11047; https://doi.org/10.3390/app152011047 - 15 Oct 2025

Cited by 2 | Viewed by 1145

Abstract

Cardiovascular diseases remain a leading cause of mortality worldwide, emphasizing the importance of early diagnosis. Heart sound analysis offers a non-invasive avenue for detecting cardiac abnormalities. This study systematically evaluates the effect of segmentation on phonocardiogram (PCG) classification performance. Unlike conventional fixed-window or [...] Read more.

Cardiovascular diseases remain a leading cause of mortality worldwide, emphasizing the importance of early diagnosis. Heart sound analysis offers a non-invasive avenue for detecting cardiac abnormalities. This study systematically evaluates the effect of segmentation on phonocardiogram (PCG) classification performance. Unlike conventional fixed-window or HSMM-based methods, a data-adaptive segmentation approach combining Shannon energy and Otsu thresholding is proposed. After segmentation, features are extracted using Empirical Mode Decomposition (EMD) and Mel-Frequency Cepstral Coefficients (MFCCs), followed by classification with k-Nearest Neighbor (kNN), Support Vector Machine (SVM), and Random Forest (RF). Experiments on the PhysioNet/CinC 2016 and Pascal datasets revealed that segmentation markedly enhances classification accuracy. The optimal results were achieved using kNN with segmented EMD features, attaining 99.97% accuracy, 99.98% sensitivity, and 99.96% specificity; segmented MFCC features also provided high accuracy (99.37%). In contrast, non-segmented models yielded substantially lower performance. Principal Component Analysis (PCA) is applied for dimensionality reduction, preserving classification efficiency while minimizing computational cost. These findings demonstrate the critical importance of effective segmentation in heart sound classification and establish the proposed Shannon–Otsu-based method as a robust, interpretable, and resource-efficient tool for automated cardiac diagnostics. Using annotated PhysioNet recordings, segmentation achieved ~90% sensitivity for S1/S2 detection. A limitation is the absence of full segment annotations in the Pascal dataset, which prevents comprehensive timing-error evaluation. Full article

► Show Figures

Figure 1

16 pages, 3508 KB

Open AccessArticle

Reconfigurable Multi-Channel Gas-Sensor Array for Complex Gas Mixture Identification and Fish Freshness Classification

by He Wang, Dechao Wang, Hang Zhu and Tianye Yang

Sensors 2025, 25(19), 6212; https://doi.org/10.3390/s25196212 - 7 Oct 2025

Cited by 2 | Viewed by 3573

Abstract

Oxide semiconductor gas sensors are widely used due to their low cost, rapid response, small footprint, and ease of integration. However, in complex gas mixtures their selectivity is often limited by inherent cross-sensitivity. To address this, we developed a reconfigurable sensor-array system that [...] Read more.

Oxide semiconductor gas sensors are widely used due to their low cost, rapid response, small footprint, and ease of integration. However, in complex gas mixtures their selectivity is often limited by inherent cross-sensitivity. To address this, we developed a reconfigurable sensor-array system that supports up to 12 chemiresistive sensors with four- or six-electrode configurations, independent thermal control, and programmable gas paths. As a representative case study, we designed a customized array for fish-spoilage biomarkers, intentionally leveraging the cross-sensitivity and broad-spectrum responses of metal-oxide sensors. Following principal component analysis (PCA) preprocessing, we evaluated convolutional neural network (CNN), random forest (RF), and particle swarm optimization–tuned support vector machine (PSO-SVM) classifiers. The RF model achieved 94% classification accuracy. Subsequent channel optimization (correlation analysis and feature-importance assessment) reduced the array from 12 to 8 sensors and improved accuracy to 96%, while simplifying the system. These results demonstrate that deliberately leveraging cross-sensitivity within a carefully selected array yields an information-rich odor fingerprint, providing a practical platform for complex gas-mixture identification and food-freshness assessment. Full article

(This article belongs to the Section Chemical Sensors)

► Show Figures

Figure 1

13 pages, 1292 KB

Open AccessArticle

Development and Internal Validation of Machine Learning Algorithms to Predict 30-Day Readmission in Patients Undergoing a C-Section: A Nation-Wide Analysis

by Audrey Andrews, Nadia Islam, George Bcharah, Hend Bcharah and Misha Pangasa

J. Pers. Med. 2025, 15(10), 476; https://doi.org/10.3390/jpm15100476 - 2 Oct 2025

Viewed by 1202

Abstract

Background/Objectives: Cesarean section (C-section) is a common surgical procedure associated with an increased risk of 30-day postpartum hospital readmissions. This study utilized machine learning (ML) to predict readmissions using a nationwide database. Methods: A retrospective analysis of the National Surgical Quality [...] Read more.

Background/Objectives: Cesarean section (C-section) is a common surgical procedure associated with an increased risk of 30-day postpartum hospital readmissions. This study utilized machine learning (ML) to predict readmissions using a nationwide database. Methods: A retrospective analysis of the National Surgical Quality Improvement Project (2012–2022) included 54,593 patients who underwent C-sections. Random Forests (RF) and Extreme Gradient Boosting (XGBoost) models were developed and compared to logistic regression (LR) using demographic, preoperative, and perioperative data. Results: Of the cohort, 1306 (2.39%) patients were readmitted. Readmitted patients had higher rates of being of African American race (17.99% vs. 9.83%), diabetes (11.03% vs. 8.19%), and hypertension (11.49% vs. 4.68%) (p < 0.001). RF achieved the highest performance (AUC = 0.737, sensitivity = 72.03%, specificity: 61.33%), and a preoperative-only RF model achieved a sensitivity of 83.14%. Key predictors included age, BMI, operative time, white blood cell count, and hematocrit. Conclusions: ML effectively predicts C-section readmissions, supporting early identification and interventions to improve patient outcomes and reduce healthcare costs. Full article

(This article belongs to the Special Issue Advances in Prenatal Diagnosis and Maternal Fetal Medicine)

► Show Figures

Figure 1

25 pages, 3236 KB

Open AccessArticle

A Wearable IoT-Based Measurement System for Real-Time Cardiovascular Risk Prediction Using Heart Rate Variability

by Nurdaulet Tasmurzayev, Bibars Amangeldy, Timur Imankulov, Baglan Imanbek, Octavian Adrian Postolache and Akzhan Konysbekova

Eng 2025, 6(10), 259; https://doi.org/10.3390/eng6100259 - 2 Oct 2025

Cited by 1 | Viewed by 3616

Abstract

Cardiovascular diseases (CVDs) remain the leading cause of global mortality, with ischemic heart disease (IHD) being the most prevalent and deadly subtype. The growing burden of IHD underscores the urgent need for effective early detection methods that are scalable and non-invasive. Heart Rate [...] Read more.

Cardiovascular diseases (CVDs) remain the leading cause of global mortality, with ischemic heart disease (IHD) being the most prevalent and deadly subtype. The growing burden of IHD underscores the urgent need for effective early detection methods that are scalable and non-invasive. Heart Rate Variability (HRV), a non-invasive physiological marker influenced by the autonomic nervous system (ANS), has shown clinical relevance in predicting adverse cardiac events. This study presents a photoplethysmography (PPG)-based Zhurek IoT device, a custom-developed Internet of Things (IoT) device for non-invasive HRV monitoring. The platform’s effectiveness was evaluated using HRV metrics from electrocardiography (ECG) and PPG signals, with machine learning (ML) models applied to the task of early IHD risk detection. ML classifiers were trained on HRV features, and the Random Forest (RF) model achieved the highest classification accuracy of 90.82%, precision of 92.11%, and recall of 91.00% when tested on real data. The model demonstrated excellent discriminative ability with an area under the ROC curve (AUC) of 0.98, reaching a sensitivity of 88% and specificity of 100% at its optimal threshold. The preliminary results suggest that data collected with the “Zhurek” IoT devices are promising for the further development of ML models for IHD risk detection. This study aimed to address the limitations of previous work, such as small datasets and a lack of validation, by utilizing real and synthetically augmented data (conditional tabular GAN (CTGAN)), as well as multi-sensor input (ECG and PPG). The findings of this pilot study can serve as a starting point for developing scalable, remote, and cost-effective screening systems. The further integration of wearable devices and intelligent algorithms is a promising direction for improving routine monitoring and advancing preventative cardiology. Full article

(This article belongs to the Special Issue Advanced Artificial Intelligence Techniques for Disease Prediction, Diagnosis and Management)

► Show Figures

Figure 1

28 pages, 11275 KB

Open AccessArticle

Field-Scale Rice Yield Prediction in Northern Coastal Region of Peru Using Sentinel-2 Vegetation Indices and Machine Learning Models

by Isabel Jarro-Espinal, José Huanuqueño-Murillo, Javier Quille-Mamani, David Quispe-Tito, Lia Ramos-Fernández, Edwin Pino-Vargas and Alfonso Torres-Rua

Agriculture 2025, 15(19), 2054; https://doi.org/10.3390/agriculture15192054 - 30 Sep 2025

Cited by 2 | Viewed by 1858

Abstract

Accurate rice yield prediction is essential for optimizing water management and supporting decision-making in agricultural systems, particularly in arid environments where irrigation efficiency is critical. This study assessed five machine learning algorithms—Multiple Linear Regression (MLR), Support Vector Regression (SVR, linear and RBF), Partial [...] Read more.

Accurate rice yield prediction is essential for optimizing water management and supporting decision-making in agricultural systems, particularly in arid environments where irrigation efficiency is critical. This study assessed five machine learning algorithms—Multiple Linear Regression (MLR), Support Vector Regression (SVR, linear and RBF), Partial Least Squares Regression (PLSR), Random Forest (RF), and Extreme Gradient Boosting (XGBoost)—for plot-scale rice yield estimation using Sentinel-2 vegetation indices (VIs) during the 2022 and 2023 seasons in the Chancay–Lambayeque Valley, Peru. VIs sensitive to canopy vigor, water status, and structure were derived in Google Earth Engine and optimized via Sequential Forward Selection to identify the most relevant predictors per phenological stage. Models were trained and validated against field yields using leave-one-out cross-validation (LOOCV). Intermediate stages (Flowering, Milk, Dough) yielded the strongest relationships, with water-sensitive indices (NDMI, MSI) consistently ranked as key predictors. MLR and PLSR achieved the highest generalization (R²_CV up to 0.68; RMSE_CV ≈ 1.3 t ha⁻¹), while RF and XGBoost showed high training accuracy but lower validation performance, indicating overfitting. Model accuracy decreased in 2023 due to climatic variability and limited satellite observations. Findings confirm that Sentinel-2–based VI modeling offers a cost-effective, scalable alternative to UAV data for operational rice yield monitoring, supporting water resource management and decision-making in data-scarce agricultural regions. Full article

(This article belongs to the Section Artificial Intelligence and Digital Agriculture)

► Show Figures

Figure 1

23 pages, 8928 KB

Open AccessArticle

Dynamic Fracture Strength Prediction of HPFRC Using a Feature-Weighted Linear Ensemble Approach

by Xin Cai, Yunmin Wang, Yihan Zhao, Liye Chen and Jifeng Yuan

Materials 2025, 18(17), 4097; https://doi.org/10.3390/ma18174097 - 1 Sep 2025

Cited by 1 | Viewed by 851

Abstract

Owing to its excellent crack resistance and durability, High-Performance Fiber-Reinforced Concrete (HPFRC) has been extensively applied in engineering structures exposed to extreme loading conditions. The Mode I dynamic fracture strength of HPFRC under high-strain-rate conditions exhibits significant strain-rate sensitivity and nonlinear response characteristics. [...] Read more.

Owing to its excellent crack resistance and durability, High-Performance Fiber-Reinforced Concrete (HPFRC) has been extensively applied in engineering structures exposed to extreme loading conditions. The Mode I dynamic fracture strength of HPFRC under high-strain-rate conditions exhibits significant strain-rate sensitivity and nonlinear response characteristics. However, existing experimental methods for strength measurement are limited by high costs and the absence of standardized testing protocols. Meanwhile, conventional data-driven models for strength prediction struggle to achieve both high-precision prediction and physical interpretability. To address this, this study introduces a dynamic fracture strength prediction method based on a feature-weighted linear ensemble (FWL) mechanism. A comprehensive database comprising 161 sets of high-strain-rate test data on HPFRC fracture strength was first constructed. Key modeling variables were then identified through correlation analysis and an error-driven feature selection approach. Subsequently, six representative machine learning models (KNN, RF, SVR, LGBM, XGBoost, MLPNN) were employed as base learners to construct two types of ensemble models, FWL and Voting, enabling a systematic comparison of their performance. Finally, the predictive mechanisms of the models were analyzed for interpretability at both global and local scales using SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) methods. The results demonstrate that the FWL model achieved optimal predictive performance on the test set (R² = 0.908, RMSE = 2.632), significantly outperforming both individual models and the conventional ensemble method. Interpretability analysis revealed that strain rate and fiber volume fraction are the primary factors influencing dynamic fracture strength, with strain rate demonstrating a highly nonlinear response mechanism across different ranges. The integrated prediction framework developed in this study offers the combined advantages of high accuracy, robustness, and interpretability, providing a novel and effective approach for predicting the fracture behavior of HPFRC under high-strain-rate conditions. Full article

(This article belongs to the Special Issue Advances in Machine Learning for the Prediction of Construction Materials Properties)

► Show Figures

Figure 1

24 pages, 1094 KB

Open AccessArticle

Machine Learning-Based Surrogate Ensemble for Frame Displacement Prediction Using Jackknife Averaging

by Zhihao Zhao, Jinjin Wang and Na Wu

Buildings 2025, 15(16), 2872; https://doi.org/10.3390/buildings15162872 - 14 Aug 2025

Cited by 1 | Viewed by 1323

Abstract

High-fidelity finite element analysis (FEA) plays a key role in structural engineering by enabling accurate simulation of displacement, stress, and internal forces under static loads. However, its high computational cost limits applicability in real-time control, iterative design, and large-scale uncertainty quantification. Surrogate modeling [...] Read more.

High-fidelity finite element analysis (FEA) plays a key role in structural engineering by enabling accurate simulation of displacement, stress, and internal forces under static loads. However, its high computational cost limits applicability in real-time control, iterative design, and large-scale uncertainty quantification. Surrogate modeling provides a computationally efficient alternative by learning input–output mappings from precomputed simulations. Yet, the performance of individual surrogates is often sensitive to data distribution and model assumptions. To enhance both accuracy and robustness, we propose a model averaging framework based on Jackknife Model Averaging (JMA) that integrates six surrogate models: polynomial response surfaces (PRSs), support vector regression (SVR), radial basis function (RBF) interpolation, eXtreme Gradient Boosting (XGB), Light Gradient Boosting Machine (LGBM), and Random Forest (RF). Three ensembles are formed: JMA1 (classical models), JMA2 (tree-based models), and JMA3 (all models). JMA assigns optimal convex weights using cross-validated out-of-fold errors without a meta-learner. We evaluate the framework on the Static Analysis Dataset with over 300,000 FEA simulations. Results show that JMA consistently outperforms individual models in root mean squared error, mean absolute error, and the coefficient of determination, while also producing tighter, better-calibrated conformal prediction intervals. These findings support JMA as an effective tool for surrogate-based structural analysis. Full article

(This article belongs to the Special Issue Emerging Trends in Machine Learning for Structural Engineering: Innovations and Applications)

► Show Figures

Figure 1

28 pages, 6267 KB

Open AccessArticle

Detection of Pine Wilt Disease Using a VIS-NIR Slope-Based Index from Sentinel-2 Data

by Jian Guo, Ran Kang, Tianhe Xu, Caiyun Deng, Li Zhang, Siqi Yang, Guiling Pan, Lulu Si, Yingbo Lu and Hermann Kaufmann

Forests 2025, 16(7), 1170; https://doi.org/10.3390/f16071170 - 16 Jul 2025

Cited by 1 | Viewed by 969

Abstract

Pine wilt disease (PWD), caused by Bursaphelenchus xylophilus Steiner & Buhrer (pine wood nematodes, PWN), impacts forest carbon sequestration and climate change. However, satellite-based PWD monitoring is challenging due to the limited spatial resolution of Sentinel’s MSI sensor, which reduces its sensitivity to [...] Read more.

Pine wilt disease (PWD), caused by Bursaphelenchus xylophilus Steiner & Buhrer (pine wood nematodes, PWN), impacts forest carbon sequestration and climate change. However, satellite-based PWD monitoring is challenging due to the limited spatial resolution of Sentinel’s MSI sensor, which reduces its sensitivity to subtle biochemical alterations in foliage. We have, therefore, developed a slope product index (SPI) for effective detection of PWD using single-date satellite imagery based on spectral gradients in the visible and near-infrared (VNIR) range. The SPI was compared against 15 widely used vegetation indices and demonstrated superior robustness across diverse test sites. Results show that the SPI is more sensitive to changes in chlorophyll content in the PWD detection, even under potentially confounding conditions such as drought. When integrated into Random Forest (RF) and Back-Propagation Neural Network (BPNN) models, SPI significantly improved classification accuracy, with the multivariate RF model achieving the highest performance and univariate with SPI in BPNN. The generalizability of SPI was validated across test sites in distinct climate zones, including Zhejiang (accuracyZ_Mean = 88.14%) and Shandong (accuracyS_Mean = 78.45%) provinces in China, as well as Portugal. Notably, SPI derived from Sentinel-2 imagery in October enables more accurate and timely PWD detection while reducing field investigation complexity and cost. Full article

(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)

► Show Figures

Graphical abstract

Search Results (63)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (63)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI